User:Egm6936.f10/Probability concepts
[edit] Probability concepts and notations
djvu notes: Probability, distribution, density
[edit] Events(Samples, Outcomes)
Event (or sample or outcome) is a subset of results of random experiments, which designated by
.
For example, the result of tossing a coin once should be either head or tail, i.e.,
head or
tail.
If we toss the coin twice, then
{head, tail} or
{head head} or
{tail head} or
{tail tail}.
[edit] Algebra of Events
In some ways, the algebra of events share some similarities with the algebra of real numbers, with intersection (
) corresponding to multiplication (
), complement (
) to subtraction (
or
) and union (
) to addition (
).
The union of
events
is the set collecting all points in all those events
.
Notation:
-
, or 
(1)
The intersection of
events
is the set collecting all points belonging to all those events
.
We call they are disjoint if the intersection of sets is empty.
Intersection has the associative property.
Notation:
-
, or 
(2)
-
=
(3)
The complement of a event
in
is the set collecting all points in
but not in the event
. Generally, we can have two kinds of complement: relative complement and absolute complement.
Relative complement of
with respect to
is the set of points in
but not in
. If union of all sets
considered to be
, the absolute complement of
is the set of points in
, but not in
.
Notation:
-

(4)
For schematic representations of union, intersection and complement, we can use Venn Diagram.
[edit] Sample space (Outcome space)
Sample space(Statistical Theory) or outcome space(Probability Theory) is a collection of all possible outcomes (or events or samples) of random experiments, which denoted by
.
For example, in the coin - tossing experiment, a coin is tossed once, the outcome space
={heads, tails}. If we tossing twice, the outcome space
{ {head, tail}, {head head}, {tail head}, {tail tail} }.
( Xiu 2010, p.9[1];, Shao 2007, p.1[2].).
[edit] Sigma-Field
Sigma-Field is a collection of subsets of a sample space
(not necessary all), which denoted by
. For instance,
in the coin - tossing experiment.
Three conditions that the sigma-field must satisfy:
Non-empty:
and
;
Given
, then
;
Given
,
,...
, then
and
.
i.e., sum or union of any subsets of
is a subset of
.
( Xiu 2010, p.10[1];, Shao 2007, p.2[2].)
Note:
is called a " sigma-field " or " sigma-algebra ", written as
-field or
-algebra.
is mnemonic for " S ", and " Sum ", due to property.
[edit] Probability
Probability is used to measure the likelihood of the occurrence of certain event (or outcome). Probability of an event
belonging to an element
is a non-negative number (or measure), which is mathematically denoted by
-

(5)
For example, in the coin - tossing experiment,
,
,
.
[edit] Algebra of Probability
The complement of an event
is the event not
(that is, the event of
not occurring); its probability is given by
. As an example, the chance of not rolling a six on a six-sided die is 1 – (chance of rolling a six)=
.
If both events
and
occur on a single performance of an experiment, this is called the intersection or joint of
and
, denoted as
. If two events,
and
are independent, then the joint probability is
-

(6)
For example, if two coins are tossed, the chance of both being heads is
.
If either event
or event
or both events occur on a single performance of an experiment this is called the union of the events
and
denoted as
.
If two events are mutually exclusive, then the probability of either occurring is
-

(7)
For example, the chance of rolling a 2 or 3 or 5 on a six-sided die is
.
If the events are not mutually exclusive then
-

(8)
[edit] Random variable and vector
Intuitively, Random Variable is used to designate a random outcome (event or sample) in a random experiment, usually denoted in capital letters, e.g.,
is a random variable. It's a numerical description of the outcome of an experiment.
Formally, it is a mapping from a probability space to the real numbers, which is measurable.
-


(9)
Where
is event space
endowed with
- algebra
,
is set of real numbers
endowed with " Borel
- algebra "
(sigma-algebra of finite open subsets of
). (Shao 2007, p.7[2].)
= ( arbitrary ) number selected to represent each event
in
. For Example, typically, in the coin - tossing experiment, we can use number 1 to designate the heads and 0 for the tails, i.e.,
,
. But it is also possible to select, event though not a good choice, since not as mnemonic as
,
,
.
Example:
![\mathcal B = \sigma \Big( \{(a,b]: a,b \in \mathbb R \} \Big)](http://upload.wikimedia.org/wikiversity/en/math/a/4/c/a4c8f53b2939be7bb51141662f91a6f5.png)
Set of finite open intervals in 
This choice of
allows for the probability of
, i.e.,
.
(Xiu 2010, p.11[1])
In the turbulent flows case, the sample space
can be thought of as a set of repeat experiments(samples) to verify, say, a hypothesis or observations on a given flow.
where
is the total number of repeated experiments, e.g., until the standard deviation is small enough compared to the mean.
The
th velocity component(a random variable) at
in experiment
is
.
A random vector
is composed of real-valued random variables
. A typical n-dimensional random vector can be represented as
.
【Theorem 1】:
Let
be a Gaussian random vector with distribution
and let
be an
matrix. Then
has an
distribution.
In case events were already representable by real numbers, then the event space is already
. It's then not necessary to mention
, but directly
. An example of such variable is a velocity component in a turbulent flow.(Pope 2000[3])
[edit]
Probability distribution
Probability Distribution is a function that describes the probability of a random variable taking certain values.
-

(10)
In practice, only to refer to an open interval in 
-
![\displaystyle
{[(a,\, b] \in \mathcal B] \mapsto [P_X((a,\, b]) \in \mathbb R_{0}^{+}]}](//upload.wikimedia.org/wikiversity/en/math/a/3/8/a38cec4ff6cfa13a0aaacd3a95dbdff6.png)
(11)
-
![\displaystyle
P_X((a,b]) \equiv P_X(X( \omega) \in (a,b])](//upload.wikimedia.org/wikiversity/en/math/a/f/e/afefe815cb0707cb9c6f5009728452fc.png)
(12)
i.e., the probability that
.
[edit] 
Cumulative distribution function ( CDF )
Cumulative Distribution Function (CDF), or only Distribution Function, describes the probability a real-valued random variable
with a given probability distribution will be found at a value less than or equal to x. Intuitively, it is the "area so far" function of the probability distribution.
-
![\displaystyle
F_X(x) := P_X((-\infty, x]) = P_X(X \le x)](//upload.wikimedia.org/wikiversity/en/math/6/f/a/6fad4783d589239f2fabcc9fc0226de9.png)
(13)
For random vectors,
-
![\displaystyle
F_X(\mathbf x) := P_X((-\infty, \mathbf x]) = P_X(X_1 \le x_1, X_1 \le x_2,...,X_n \le x_n), \mathbf x = (x_1, x_2,...,x_n) \in \mathbb R^n](//upload.wikimedia.org/wikiversity/en/math/4/2/b/42bca4abf3e36d29c4a666112de2ecb3.png)
(14)
Normal(Gaussian) Distribution
-
![\displaystyle
F_X(x) = \frac12\left[\, 1 + \operatorname{erf} \left(\displaystyle \frac{x}{\sqrt{2}} \right) \right])](//upload.wikimedia.org/wikiversity/en/math/7/a/c/7ac39f26a60bc43a08bae9d890c74921.png)
(15)
[edit]
Probability density function ( PDF )
Probability Density Function (PDF), or density of a continuous random variable is a function that describes the relative likelihood for this random variable to occur at a given point. The probability for the random variable to fall within a particular region is given by the integral of this variable’s density over the region. The probability density function is nonnegative everywhere, and its integral over the entire space is equal to one.
-

(24)
-

(25)
-

(26)
For random vectors,
-

(27)
and
-

(28)
If a vector
has density
, then all its subsets have a density, called marginal densities.
-

(29)
Normal(Gaussian) distribution
-
![\displaystyle
f_X(x) = \frac{1}{\sqrt{2\pi\sigma^2}}{\rm exp}\left[{-\frac{(x-\mu)^2}{2\sigma^2}}\right]](//upload.wikimedia.org/wikiversity/en/math/a/a/0/aa081d3683258cc8ccce825c14d0e212.png)
(30)
Binomial distribution
-

(31)
Poisson distribution
-

(32)
Note:
Notation
and 
In recent literature, an uppercase letter, e.g.,
, is used to designate a random variable , whereas the corresponding lowercase letter, e.g.,
, is used to designate the real variable that is the upper bound of
.
Kolmogorov, 1933[4]; Famous work influencing subsequent mathematical probability and statistics works.
-

(33)
(Kolmogorov, 1933 p.24[4])
[edit] Expectations and Moments
The expectation, also mean value or the first moment, of a random variable
is defined as:
For continuous distribution, with the probability density function
,
-
![\displaystyle
\mu_X = \mathbb E[X] := \int_{-\infty}^{+\infty}xf_X(x)dx](//upload.wikimedia.org/wikiversity/en/math/2/b/4/2b4d5cbb8be29907c7c6bb37ff9ab2a2.png)
(31)
The expectation of
, where
is a real-valued function, is:
-
![\displaystyle
\mathbb E[g(X)] := \int_{-\infty}^{+\infty}g(X)f_X(x)dx](//upload.wikimedia.org/wikiversity/en/math/e/0/2/e020a3c796a7468ca2d0589f80dbab46.png)
(32)
The
th moment of random variable
, where
, is:
-
![\displaystyle
\mathbb E[X^k] := \int_{-\infty}^{+\infty}x^kf_X(x)dx](//upload.wikimedia.org/wikiversity/en/math/8/f/b/8fb859471bfb5f0ddeee3f38ce32b53f.png)
(33)
The variance of random variable
,
, is:
-
![\displaystyle
\sigma_X^2 = var(X) := \int_{-\infty}^{+\infty}(x-\mu_X)^2f_X(x)dx =: \mathbb E[(X-\mu_X)^2]](//upload.wikimedia.org/wikiversity/en/math/a/c/0/ac0f7953a4c7752a8a205d5153e33123.png)
(34)
For discrete distribution, with the probability
, the definition of above terms are:
-

(35)
-
![\displaystyle
\mathbb E[g(X)] := \sum_{n=1}^{+\infty}g(X_{n})p_{n}](//upload.wikimedia.org/wikiversity/en/math/a/0/2/a02759f919dd19511d21342e41882301.png)
(36)
-
![\displaystyle
\mathbb E[X^k] := \sum_{n=1}^{+\infty}x_{n}^{k}p_{n}](//upload.wikimedia.org/wikiversity/en/math/a/7/4/a7444d526ebc8c618f0ef6dc28b51687.png)
(37)
-
![\displaystyle
\sigma_X^2 = var(X) := \sum_{n=1}^{+\infty}(x_{n}-\mu_X)^2p_{n} =: \mathbb E[(X-\mu_X)^2]](//upload.wikimedia.org/wikiversity/en/math/1/4/5/145b2101c4921f8923a9d7e1b2bd33b5.png)
(38)
From equation (34)&(38), we can deduce that
-
![\displaystyle
\sigma_X^2 = \mathbb E[(X-\mu_X)^2] = \mathbb E[X^2 - 2X\mu_X + \mu_X^2] = \mathbb E[X^2] - \mathbb E[2X\mu_X] + \mathbb E[\mu_X^2] = \mathbb E[X^2] - \mu_X^2](//upload.wikimedia.org/wikiversity/en/math/5/e/8/5e87d0fb614512300380115fc937fd08.png)
(39)
Variance equals the mean of the square minus the square of the mean.
The expectation of a random vector
is
-
![\displaystyle
\mu_{\mathbf X}=\mathbb E[\mathbf X] = (\mathbb E[X_1],\mathbb E[X_2],...,\mathbb E[X_n]).](//upload.wikimedia.org/wikiversity/en/math/4/0/7/40776de3b09d8a4645eeada1e6fa49cb.png)
(40)
A very frequently used quantity of random vector is the covariance matrix, which is defined as
-

(41)
where
is the covariance of
and
. We have
.
[edit] Convergence modes
Given a sequence of random variables
, we defined following convergence modes.
Convergence in distribution, 
For all continuous points
of the distribution function
, if we have the relation
-

(42)
is satisfied.
Convergence in distribution is a weak convergence.
Convergence in probability, 
If the probability of the difference between
and
, with
larger than any positive
tends to zero, then we call
converges to
in probability,which can be written as
.
-

(43)
Convergence in probability implies convergence in distribution. The converse is true if and only if
for some constant
.
Almost sure Convergence, 
Convergence, 

[edit] Questions&Answers
[edit] References
- ↑ 1.0 1.1 1.2 Xiu, D., Numerical methods for stochastic computations: A spectral method approach, Princeton University Press, 2010
- ↑ 2.0 2.1 2.2 Shao, J., Mathematical statistics, 2nd edition, Springer, 2007
- ↑ S. B. Pope, Turbulent Flows, 1st edition, Cambridge University Press, 2000
- ↑ 4.0 4.1 A.N. Kolmogorov, Foundations of the theory of Probability, Chelsea Publishing Co., 1933.
, or 
, or 
=








Probability distribution
![\displaystyle
{[(a,\, b] \in \mathcal B] \mapsto [P_X((a,\, b]) \in \mathbb R_{0}^{+}]}](http://upload.wikimedia.org/wikiversity/en/math/a/3/8/a38cec4ff6cfa13a0aaacd3a95dbdff6.png)
![\displaystyle
P_X((a,b]) \equiv P_X(X( \omega) \in (a,b])](http://upload.wikimedia.org/wikiversity/en/math/a/f/e/afefe815cb0707cb9c6f5009728452fc.png)

![\displaystyle
F_X(x) := P_X((-\infty, x]) = P_X(X \le x)](http://upload.wikimedia.org/wikiversity/en/math/6/f/a/6fad4783d589239f2fabcc9fc0226de9.png)
![\displaystyle
F_X(\mathbf x) := P_X((-\infty, \mathbf x]) = P_X(X_1 \le x_1, X_1 \le x_2,...,X_n \le x_n), \mathbf x = (x_1, x_2,...,x_n) \in \mathbb R^n](http://upload.wikimedia.org/wikiversity/en/math/4/2/b/42bca4abf3e36d29c4a666112de2ecb3.png)
![\displaystyle
F_X(x) = \frac12\left[\, 1 + \operatorname{erf} \left(\displaystyle \frac{x}{\sqrt{2}} \right) \right])](http://upload.wikimedia.org/wikiversity/en/math/7/a/c/7ac39f26a60bc43a08bae9d890c74921.png)
Probability density function ( PDF )





![\displaystyle
f_X(x) = \frac{1}{\sqrt{2\pi\sigma^2}}{\rm exp}\left[{-\frac{(x-\mu)^2}{2\sigma^2}}\right]](http://upload.wikimedia.org/wikiversity/en/math/a/a/0/aa081d3683258cc8ccce825c14d0e212.png)



![\displaystyle
\mu_X = \mathbb E[X] := \int_{-\infty}^{+\infty}xf_X(x)dx](http://upload.wikimedia.org/wikiversity/en/math/2/b/4/2b4d5cbb8be29907c7c6bb37ff9ab2a2.png)
![\displaystyle
\mathbb E[g(X)] := \int_{-\infty}^{+\infty}g(X)f_X(x)dx](http://upload.wikimedia.org/wikiversity/en/math/e/0/2/e020a3c796a7468ca2d0589f80dbab46.png)
![\displaystyle
\mathbb E[X^k] := \int_{-\infty}^{+\infty}x^kf_X(x)dx](http://upload.wikimedia.org/wikiversity/en/math/8/f/b/8fb859471bfb5f0ddeee3f38ce32b53f.png)
![\displaystyle
\sigma_X^2 = var(X) := \int_{-\infty}^{+\infty}(x-\mu_X)^2f_X(x)dx =: \mathbb E[(X-\mu_X)^2]](http://upload.wikimedia.org/wikiversity/en/math/a/c/0/ac0f7953a4c7752a8a205d5153e33123.png)

![\displaystyle
\mathbb E[g(X)] := \sum_{n=1}^{+\infty}g(X_{n})p_{n}](http://upload.wikimedia.org/wikiversity/en/math/a/0/2/a02759f919dd19511d21342e41882301.png)
![\displaystyle
\mathbb E[X^k] := \sum_{n=1}^{+\infty}x_{n}^{k}p_{n}](http://upload.wikimedia.org/wikiversity/en/math/a/7/4/a7444d526ebc8c618f0ef6dc28b51687.png)
![\displaystyle
\sigma_X^2 = var(X) := \sum_{n=1}^{+\infty}(x_{n}-\mu_X)^2p_{n} =: \mathbb E[(X-\mu_X)^2]](http://upload.wikimedia.org/wikiversity/en/math/1/4/5/145b2101c4921f8923a9d7e1b2bd33b5.png)
![\displaystyle
\sigma_X^2 = \mathbb E[(X-\mu_X)^2] = \mathbb E[X^2 - 2X\mu_X + \mu_X^2] = \mathbb E[X^2] - \mathbb E[2X\mu_X] + \mathbb E[\mu_X^2] = \mathbb E[X^2] - \mu_X^2](http://upload.wikimedia.org/wikiversity/en/math/5/e/8/5e87d0fb614512300380115fc937fd08.png)
![\displaystyle
\mu_{\mathbf X}=\mathbb E[\mathbf X] = (\mathbb E[X_1],\mathbb E[X_2],...,\mathbb E[X_n]).](http://upload.wikimedia.org/wikiversity/en/math/4/0/7/40776de3b09d8a4645eeada1e6fa49cb.png)


