distribution-is-all-you-need
distribution-is-all-you-need is the basic distribution probability tutorial for most common distribution focused on Deep learning using python library.
Overview of distribution probability
data:image/s3,"s3://crabby-images/b590e/b590e8d4417d26bad8ccace78dfd213a9e742737" alt=""
distribution probabilities and features
- Uniform distribution(continuous), code
- Uniform distribution has same probaility value on [a, b], easy probability.
data:image/s3,"s3://crabby-images/78b5b/78b5b8383d7fe169c3a2b70855dd919f82760bb8" alt=""
- Bernoulli distribution(discrete), code
- Bernoulli distribution is not considered about prior probability P(X). Therefore, if we optimize to the maximum likelihood, we will be vulnerable to overfitting.
- We use binary cross entropy to classify binary classification. It has same form like taking a negative log of the bernoulli distribution.
data:image/s3,"s3://crabby-images/65453/6545354af6b60432bfd97e5f880b484b001478cb" alt=""
- Binomial distribution(discrete), code
- Binomial distribution with parameters n and p is the discrete probability distribution of the number of successes in a sequence of n independent experiments.
- Binomial distribution is distribution considered prior probaility by specifying the number to be picked in advance.
data:image/s3,"s3://crabby-images/14ec3/14ec37fb21febe7aaedbea2bb28a8e9541ceccf2" alt=""
- Multi-Bernoulli distribution, Categorical distribution(discrete), code
- Multi-bernoulli called categorical distribution, is a probability expanded more than 2.
- cross entopy has same form like taking a negative log of the Multi-Bernoulli distribution.
data:image/s3,"s3://crabby-images/92d5d/92d5d6bd387a4d69c80be565b12ea8d8f6807615" alt=""
- Multinomial distribution(discrete), code
- The multinomial distribution has the same relationship with the categorical distribution as the relationship between Bernoull and Binomial.
data:image/s3,"s3://crabby-images/89133/891339228ab56bca9e97a4966877a6ca1ddd2b63" alt=""
- Beta distribution(continuous), code
- Beta distribution is conjugate to the binomial and Bernoulli distributions.
- Using conjucation, we can get the posterior distribution more easily using the prior distribution we know.
- Uniform distiribution is same when beta distribution met special case(alpha=1, beta=1).
data:image/s3,"s3://crabby-images/d7e6a/d7e6a4ff467e5bb0dac1102cc91d663762cf672f" alt=""
- Dirichlet distribution(continuous), code
- Dirichlet distribution is conjugate to the MultiNomial distributions.
- If k=2, it will be Beta distribution.
data:image/s3,"s3://crabby-images/09768/09768fc26370614eef8900b4eb6c3770014bbaa1" alt=""
- Gamma distribution(continuous), code
- Gamma distribution will be beta distribution, if
Gamma(a,1) / Gamma(a,1) + Gamma(b,1)
is same with Beta(a,b)
.
- The exponential distribution and chi-squared distribution are special cases of the gamma distribution.
data:image/s3,"s3://crabby-images/3a87a/3a87ad14bf97606cd1913ffcf8c4243eec38b6ed" alt=""
- Exponential distribution(continuous), code
- Exponential distribution is special cases of the gamma distribution when alpha is 1.
data:image/s3,"s3://crabby-images/b7060/b70605aa200701eebb4cec71fb8dc4a8a17e40ae" alt=""
- Gaussian distribution(continuous), code
- Gaussian distribution is a very common continuous probability distribution
data:image/s3,"s3://crabby-images/96dd4/96dd415975ed57d4567d9d07d45221f0f2c8a4c9" alt=""
- Normal distribution(continuous), code
- Normal distribution is standarzed Gaussian distribution, it has 0 mean and 1 std.
data:image/s3,"s3://crabby-images/a2a9e/a2a9e67714c323aad9367185351525f570a337a2" alt=""
- Chi-squared distribution(continuous), code
- Chi-square distribution with k degrees of freedom is the distribution of a sum of the squares of k independent standard normal random variables.
- Chi-square distribution is special case of Beta distribution
data:image/s3,"s3://crabby-images/b582c/b582cd64f4d857a41b3bac258ac308bcaeab48fd" alt=""
- Student-t distribution(continuous), code
- The t-distribution is symmetric and bell-shaped, like the normal distribution, but has heavier tails, meaning that it is more prone to producing values that fall far from its mean.
data:image/s3,"s3://crabby-images/88fe7/88fe7179b682f19cc688cf84db9f7d2470dc25e7" alt=""
Author
If you would like to see the details about relationship of distribution probability, please refer to this.
data:image/s3,"s3://crabby-images/984b3/984b320336a25c11f148f3415aa01a4940cbc8fc" alt=""
- Tae Hwan Jung @graykode, Kyung Hee Univ CE(Undergraduate).
- Author Email : [email protected]
- If you leave the source, you can use it freely.