Download Document

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Statistics wikipedia , lookup

History of statistics wikipedia , lookup

Probability interpretations wikipedia , lookup

Probability wikipedia , lookup

Transcript
Probability and
Information Theory
Random Variables
• A random variable is a variable that can take on different values
randomly.
•
•
•
•
a description of the states that are possible
Denoted as a lower case letter
discrete or continuous
Ex) P(x=‘yes’)
Probability Distributions
• A probability distribution is a description of how likely a random
variable or set of random variables is to take on each of its possible
states.
Discrete Variables and Probability Mass
Functions
• Probability mass function (PMF)
• A probability distribution over discrete variables may be described using a
probability mass function (PMF)
• maps from a state of a random variable to the probability of that random
variable taking on that state
• P(x=x) : random variable x 가 x 상태(값)을 가질 확률로 매핑
Discrete Variables and Probability Mass
Functions
• Joint probability distribution
• P(x=x, y=y) denotes the probability that x=x and y=y simultaneously.
• P(x, y)
Discrete Variables and Probability Mass
Functions
Continuous Variables and Probability
Density Functions
• Probability Density Function (PDF)
Marginal Probability
• The probability distribution over just a subset of them.
Conditional Probability
• The probability of some event, given that some other event has
happened
The Chain Rule of Conditional Probabilities
• Any joint probability distribution over many random variables may be
decomposed into conditional distributions over only one variable
Chain rule
Independence and Conditional
Independence
• Two random variables x and y are independent
• Conditionally independent
Expectation
• The expectation or expected value of some function f(x) with respect
to a probability distribution P(x)
• the average or mean value that f takes on when x is drawn from P
Expectation
• Expectations are linear
Variance
• a measure of how much the values of a function of a random variable x
vary as we sample different values of x from its probability
distribution
Covariance
• Gives some sense of how much two values are linearly related to each
other
Bernoulli Distribution
• a distribution over a single binary random variable
Multinoulli Distribution
• The multinoulli or categorical distribution is a distribution over a
single discrete variable with k different states, where k is finite.
• parametrized by a vector p ∈[0,1]k−1, where pi gives the probability of
the i-th state. The final k-th state’s probability is given by 1− 1Tp.
Gaussian Distribution
• The most commonly used distribution over real numbers
Gaussian Distribution
Gaussian Distribution
• Multivariate normal distribution
Exponential distribution
• In the context of deep learning, we often want to have a probability
distribution with a sharp point at x= 0
Laplace distribution
Mixtures of Distributions
• A mixture distribution is made up of several component distributions.
Bayes’ Rule