# Probability

## Basic Rules

- **Union:** $$p(A \lor B) = p(A) + p(B) - p(A \land B)$$

- **Joint:** $$p(A, B) = p(A \land B) = p(A|B)\,p(B)$$ $$p(A) = \displaystyle\sum_b p(A, B = b) = \displaystyle\sum_b p(A|B = b)\,p(B = b)$$

- **Conditional:** $$p(A|B) = \dfrac{p(A, B)}{p(B)} \quad \text{if } p(B) > 0$$

- **Bayes' Rule:** (a numerical sanity check follows this list) $$p(X = x|Y = y) = \dfrac{p(X = x, Y = y)}{p(Y = y)} = \dfrac{p(X = x)\,p(Y = y|X = x)}{\sum_{x'} p(X = x')\,p(Y = y|X = x')}$$

- **Unconditional Independence:** $$p(X, Y) = p(X)\,p(Y)$$

- **Conditional Independence:** $$p(X, Y|Z) = p(X|Z)\,p(Y|Z)$$
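
A minimal sketch of the sum rule and Bayes' rule on a made-up disease-test example (all numbers here are hypothetical, chosen only for illustration):

```python
# Hypothetical disease-test example: apply the sum rule and Bayes' rule.
p_disease = 0.01                 # p(D = 1), prior
p_pos_given_disease = 0.95       # p(T = 1 | D = 1)
p_pos_given_healthy = 0.05       # p(T = 1 | D = 0)

# Marginal p(T = 1) via the sum rule: sum_d p(T = 1 | D = d) p(D = d)
p_pos = (p_pos_given_disease * p_disease
         + p_pos_given_healthy * (1 - p_disease))

# Bayes' rule: p(D = 1 | T = 1) = p(T = 1 | D = 1) p(D = 1) / p(T = 1)
p_disease_given_pos = p_pos_given_disease * p_disease / p_pos
print(f"p(disease | positive test) = {p_disease_given_pos:.3f}")  # ~0.161
```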

## Density function

For continuous data

### Examples

  • Uniform Distribution
  • Normal/Gaussian Distribution
  • Exponential Distribution
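
For instance, with numpy and scipy (assumed dependencies), the standard normal density can be evaluated and numerically checked to integrate to 1:

```python
# A small sketch: evaluate the standard normal density and check that
# it integrates to ~1 over a wide interval.
import numpy as np
from scipy import stats

xs = np.linspace(-6, 6, 10_001)
pdf = stats.norm.pdf(xs, loc=0.0, scale=1.0)   # Gaussian density values

dx = xs[1] - xs[0]
print(np.sum(pdf) * dx)   # Riemann-sum approximation of the integral, ~1.0
```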

## Mass function

For discrete data

### Examples

  • Binomial probability mass function
  • Poisson probability mass function
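
For instance, with scipy.stats (an assumed dependency), individual pmf values can be evaluated directly:

```python
# A small sketch: binomial and Poisson probability mass functions.
from scipy import stats

# p(X = 3) for X ~ Binomial(n = 10, p = 0.5)
print(stats.binom.pmf(3, n=10, p=0.5))   # ~0.117

# p(X = 2) for X ~ Poisson(rate 4)
print(stats.poisson.pmf(2, mu=4))        # ~0.147
```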

## Mean and Variance

### Mean ($$\mu$$)

Discrete: $$E[X] \triangleq \displaystyle\sum_{x \in \mathcal{X}} x \, p(x)$$

Continuous: $$E[X] \triangleq \displaystyle\int_{\mathcal{X}} x \, p(x) \, dx$$

### Variance ($$\sigma^2$$)

$$Var[X] \triangleq E[(X - \mu)^2] = E[X^2] - \mu^2$$

Rearranging gives the useful identity:

$$E[X^2] = \mu^2 + \sigma^2$$

$$std[X] \triangleq \sqrt{Var[X]}$$
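
A minimal numpy sketch (with made-up values and probabilities) that checks these definitions, including the identity $$E[X^2] = \mu^2 + \sigma^2$$:

```python
# Mean, variance, and std of a small discrete distribution (hypothetical values).
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0])
p = np.array([0.1, 0.2, 0.3, 0.4])          # probabilities, must sum to 1

mu = np.sum(x * p)                          # E[X]
var = np.sum((x - mu) ** 2 * p)             # E[(X - mu)^2]
e_x2 = np.sum(x ** 2 * p)                   # E[X^2]

print(mu, var, np.sqrt(var))                # mean, variance, std
print(np.isclose(e_x2, mu ** 2 + var))      # True: E[X^2] = mu^2 + sigma^2
```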

## Percentiles and Moments

### Percentiles

The x-th percentile of a dataset is the value below which x% of the data lies
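
For example, with numpy (an assumed dependency), a percentile of a sample can be computed directly:

```python
# A minimal sketch: the 90th percentile of a sample.
import numpy as np

data = np.random.default_rng(0).normal(loc=0.0, scale=1.0, size=10_000)
p90 = np.percentile(data, 90)   # value below which ~90% of the data falls
print(p90)                      # close to 1.28 for a standard normal
```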

### Moments

Moments are simple summary calculations in mathematical statistics. They can be used to find a probability distribution's mean, variance, and skewness.

For a discrete set of values $$x_1, x_2, x_3, \dots, x_n$$, the $$s$$-th (raw) moment is given by the formula:

$$(x_1^s + x_2^s + x_3^s + ... + x_n^s)/n$$

The first moment is simply the mean

#### Moments about the mean

$$((x_1-\mu)^s + (x_2-\mu)^s + (x_3-\mu)^s + ... + (x_n-\mu)^s)/n$$

  1. The first moment about the mean is zero
  2. The second moment about the mean is the variance
  3. The third moment about the mean, normalized by $$\sigma^3$$, gives the skewness
  4. The fourth moment about the mean, normalized by $$\sigma^4$$, gives the kurtosis

The same definitions apply to continuous random variables, with the sums replaced by integrals over the density
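
Assuming numpy is available, here is a minimal sketch computing raw and central moments of a sample, matching the formulas above:

```python
# Raw moments and moments about the mean, estimated from a sample.
import numpy as np

x = np.random.default_rng(0).normal(loc=2.0, scale=3.0, size=100_000)

def raw_moment(x, s):
    return np.mean(x ** s)                  # (x_1^s + ... + x_n^s) / n

def central_moment(x, s):
    return np.mean((x - x.mean()) ** s)     # moment about the mean

print(raw_moment(x, 1))        # first raw moment ~ mean (~2.0)
print(central_moment(x, 1))    # first central moment ~ 0
print(central_moment(x, 2))    # second central moment ~ variance (~9.0)
print(central_moment(x, 3))    # third central moment, related to skewness (~0)
```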

## Covariance and Correlation

Covariance and correlation describe the degree to which two random variables (or sets of random variables) tend to deviate from their expected values in similar ways

### Covariance

$$Cov(X, Y) = \sigma_{XY} = E[(X - E[X])(Y - E[Y])]$$ $$Cov(X, Y) = E[XY] - E[X]E[Y]$$

If $$x$$ is a $$d$$-dimensional random vector, its covariance matrix is defined to be the following symmetric, positive semi-definite matrix:

$$Cov[x] = E[(x - E[x])(x - E[x])^T] = \begin{pmatrix} var[X_1] & cov[X_1, X_2] & \cdots & cov[X_1, X_d] \\ cov[X_2, X_1] & var[X_2] & \cdots & cov[X_2, X_d] \\ \vdots & \vdots & \ddots & \vdots \\ cov[X_d, X_1] & cov[X_d, X_2] & \cdots & var[X_d] \end{pmatrix}$$
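
As an illustration (hypothetical data, assuming numpy is available), the scalar covariance and the full covariance matrix can be estimated from samples:

```python
# Sample covariance and covariance matrix with numpy.
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=10_000)
y = 2.0 * x + rng.normal(size=10_000)    # y depends linearly on x plus noise

# Cov(X, Y) = E[XY] - E[X]E[Y], estimated from the sample
cov_xy = np.mean(x * y) - np.mean(x) * np.mean(y)
print(cov_xy)                            # ~2.0, since Cov(X, 2X + noise) = 2 Var(X)

# Covariance matrix of the 2-dimensional vector (X, Y):
# diagonal entries are variances, off-diagonal entries are covariances
print(np.cov(np.stack([x, y])))
```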

A covariance can be any real number, so its magnitude is unbounded. Sometimes it is more convenient to work with a normalized measure that has a finite upper bound.

### Correlation

$$Corr(X, Y) = \dfrac{Cov(X, Y)}{\sigma_X \sigma_Y}$$

$$-1 \le Corr(X, Y) \le 1$$

If there is an exact linear relationship $$Y = aX + b$$ between $$X$$ and $$Y$$, then $$Corr(X, Y) = 1$$ when $$a > 0$$ (and $$-1$$ when $$a < 0$$)

If $$X$$ and $$Y$$ are independent, meaning $$p(X, Y) = p(X)p(Y)$$, then $$Cov(X, Y) = 0$$ and hence $$Corr(X, Y) = 0$$, so they are uncorrelated. The converse is not true: uncorrelated does not imply independent
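
A small sketch (hypothetical data, assuming numpy) illustrating both points: a linear relationship gives correlation near $$\pm 1$$, while $$Y = X^2$$ with symmetric $$X$$ is dependent but uncorrelated:

```python
# Correlation of a linear relationship vs. an uncorrelated-but-dependent pair.
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=100_000)

# Y = 2X + small noise: strong positive linear relationship -> correlation near 1
y = 2.0 * x + 0.1 * rng.normal(size=100_000)
print(np.corrcoef(x, y)[0, 1])     # ~0.999

# Y = X^2 with symmetric X: clearly dependent, but Cov(X, X^2) = E[X^3] = 0
y2 = x ** 2
print(np.corrcoef(x, y2)[0, 1])    # ~0 -> uncorrelated, yet not independent
```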