About the chart:
The list on the left-hand side displays the names of the 76 probability distributions (19 discrete distributions given by the rectangular boxes and 57 continuous distributions given by the rectangular boxes with the rounded corners) present in the chart. Hovering your mouse over the name of a distribution highlights the distribution on the chart, along with its related distributions. Depending on the size of your browser window, you might have to adjust the display to find the distribution you are looking for. You may scroll the chart window or zoom in and out with the + and - buttons as needed.
Each distribution on the chart, when clicked, links to a document showing detailed information about the distribution, including alternate functional forms of the distribution and the distribution's mean, variance, skewness, and kurtosis.
What is a univariate distribution?
A univariate probability distribution is used to assign a probability to various outcomes of a random experiment. A random experiment is one whose outcome can not be predicted with certainty prior to conducting the experiment. When the set of all possible outcomes to a random experiment is countable or a countable infinity, the probability distribution can be described by a probability mass function and the associated random variable is discrete. Otherwise, the probability distribution can be described by a probability density function and the associated random variable is continuous. A mix of these two cases is known as a mixed discrete-continuous distribution. Illustrations of a probability mass function in the case of rolling a pair of fair dice and summing the outcomes on the up faces and a probability density function in the case of the well-known normal distribution can be seen by clicking here.
A univariate probability distribution is the probability distribution of a single random variable. This is in contrast to a bivariate or multivariate probability distribution, which defines the probability distribution of two or more random variables.
What do the arrows mean?
Solid lines represent special cases and transformations from one distribution to another.
Dashed arrows are used for asymptotic relationships, typically as the limit as one or more parameters approach the boundary of the parameter space.
Dotted arrows represent Bayesian relationships.
Upon interacting with the chart, outbound arrows are highlighted in yellow pointing away from the selected distribution. Incoming arrows and related distributions are highlighted in white. Placing the cursor over an arrow turns the arrow blue. Clicking the arrow reveals a .pdf file that contains a proof when one exists. The accompanying transformation or parameterization will be highlighted next to the arrow.
What do the letters just below the distribution names indicate?
- C: The convolution property (C) indicates that sums of independent random variables having this particular distribution come from the same distribution family.
- F: The forgetfulness property (F), more commonly known as the memoryless property, indicates that the conditional distribution of a random variable is identical to the unconditional distribution.
- I: The inverse property (I) indicates that the reciprocal of a random variable of this type comes from the same distribution family.
- L: The linear combination property (L) indicates that the linear combinations of independent random variables having this particular distribution come from the same distribution family.
- M: The minimum property (M) indicates that the smallest of independent and identically distributed random variables from a distribution comes from the same distribution family.
- P: The product property (P) indicates that the product of independent random variables having this particular distribution comes from the same distribution family.
- R: The residual property (R) indicates that the conditional distribution of a random variable left-truncated at a value in its support belongs to the same distribution family as the unconditional distribution.
- S: The scaling property (S) implies that any positive real constant times a random variable having this distribution comes from the same distribution family.
- V: The variate generation property (V) indicates that the inverse cumulative distribution function of a continuous random variable can be expressed in closed form. For a discrete random variable, this property indicates that a variate can be generated in an O(1) algorithm that does not cycle through the support values or rely on a special property.
- X: The maximum property (X) indicates that the largest of independent and identically distributed random variables from a distribution comes from the same distribution family.
Placing the cursor over a letter for a property turns the letter blue. Clicking the property reveals a .pdf file that contains a proof when one exists.
What is the meaning of the parameters associated with the univariate probability distributions?
Parameters are used to enhance the flexibility of a univariate probability distribution. The normal distribution with its bell-shaped probability
density function, for example, might be an appropriate probability model the annual return for a stock index or the diameter of a ball bearing by adjusting
the values of its parameters.
Generally speaking, there are three types of parameters associated with a continuous distribution. A location parmeter shifts the probability density function to the left or to the right along the horizontal axis. A scale parameter contracts or expands the scale associated with the horizontal axis of the probability density function. A shape parameter changes the shape of the probability density function. An example of a location parameter is the mean of a normal random variable; an example of a scale parameter is the standard deviation of a normal random variable; an example of a shape parameter is the degrees of freedom of a t random variable.
Are there errors on the chart?
Yes. The chart is basically identical to that which was published in The American Statistician. In writing the proofs for some of the properties and
relationships, we have uncovered errors. In addition, we were unable to complete some of the proofs. They are listed by categories below.
- Distributions that don't belong on the chart
- The Gamma-normal distribution is a bivariate distribution
- Properties
- Incorrect properties:
Standard Cauchy (S)
Standard Wald (S)
von Mises (S)
- Unproven properties:
Cauchy (C)
Cauchy (I)
InverseGaussian (L)
Lognormal (P)
- Potential missing properties:
Inverse Gaussian (S)
Uniform (S)
- Relationships
- Incorrect relationships:
Beta-binomial ---> Negative hypergeometric [should be a = n1, b = n3 - n1, n = n2 via Jean Peyhardi]
- Unproven relationships:
Doubly noncentral F ---> Noncentral F
Generalized gamma ---> Lognormal
Hypoexponential ---> Erlang
Inverse Gaussian ---> Chi-square
Inverse Gaussian ---> Standard normal
Normal ---> Noncentral chi-square
Pascal ---> Normal [should be mu = n (1 - p) / p on the chart]
Pascal ---> Poisson
- Potential missing relationships:
- Wrong parameter values:
Standard Uniform ---> Logistic-Exponential
- Plots on the distribution page would be helpful: Polya, Power series
Are there other univariate distributions not on the chart?
Yes. We were not able to squeeze all of them onto the chart. Here is a partial list of distributions that are not among the 76 given on the chart:
- Amoroso distribution
- ARGUS distribution
- Asymmetric Laplace distribution
- Balding-Nichols distribution
- Bates distribution
- Benini distribution
- Benktander Type I distribution
- Benktander Type II distribution
- Beta-exponential distribution
- Beta prime distribution
(proof by Hans Brunner and Stefano Bettelli of a limiting result)
- Beta-rectangular distribution
- Beta prime distribution
- Birthday distribution
- Borel distribution
- Burr distribution
- Cantor distribution
- Categorical distribution
- Circular uniform distribution
- Compound Poisson distribution
- Conway-Maxwell-Poisson distribution
- Coupon collector distribution
- Dagum distribution
- Davis distribution
- de la Vallee-Poussin distribution
- Delaporte distribution
- Dickman distribution
- Dirac delta distribution
- Discrete phase-type distribution
- Elliptical distribution
- Exponential family
- Exponentiated Weibull distribution
Mudholkar, G.S., Srivastava, D.K., and Freimer, M. (1995), The Exponentiated Weibull Family: A Reanalysis of the Bus-Motor-Failure Data, Technometrics, Vol. 37, No. 4, 436-445.
- Exponential-logarithmic distribution
- Extended negative binomial distribution
- Fiducial distribution
- Fisher's Z distribution
- Flory-Schulz distribution
- Folded normal distribution
- Frechet distribution
- Gamma-exponential distribution
- Gamma/Gompertz distribution
- Gauss-Kuzmin distribution
- Gaussian q-distribution
- Generalized beta distribution
- Generalized extreme value (Fisher-Tippett) distribution
- Generalized hyperbolic distribution
- Generalized inverted exponential distribution
- Generalized inverse Gaussian distribution
- Generalized normal distribution
- Generalized Pareto distribution
- Generalized Rayleigh distribution
- Geometric stable (Linnik) distribution
- Gilbert sine distribution
- Grand unified distribution
- Half logistic distribution
- Half normal distribution
- Holtsmark distribution
- Hotelling's T-squared distribution
- Hyper-Erlang distribution
- Inverse chi-squared distribution
- Inverse Weibull distribution
- Irwin-Hall (uniform sum) distribution
- Iwueze's distribution
- Johnson SU distribution
- Kent distribution
- Kolmogorov-Smirnov distribution
- Kumaraswamy distribution
- Kummer beta distribution
- Laha distribution
- Landau distribution
- Levy distribution
- Location scale family
- Log Cauchy distribution
- Log gamma distribution (alternative parameterization)
- Log Laplace distribution
- Logarithmic series distribution
- Logit-normal distribution
- Marchenko-Pastur distribution
- Matrix-exponential distribution
- Maximum entropy distribution
- Maxwell-Boltzmann distribution
- Maxwell-Juttner distribution
- Metalog distribution
- Mittag-Leffler distribution
- Mixture distribution
- Nakagami distribution
- Natural exponential family
- Noncentral beta distribution
- Noncentral hypergeometric distribution
Liao, J.G., Rosen, O. (2001), Fast and Stable Algorithms for Computing and Sampling from the Noncentral Hypergeometric Distribution, The American Statistician, Vol. 55, No. 4, 366-369.
- Normal inverse Gaussian distribution
- Panjer recursion
- Parabolic fractal distributions
- Pearson system of distributions
- PERT distribution
- Phase-type distribution
- Poisson-binomial distribution
- Poly-Weibull distribution
- Polya-Eggenberger distribution
- Prentice distribution
- Q-exponential distribution
- Q-Gaussian distribution
- Q-Weibull distribution
- Quantile functions
- Quantile-parametrized distributions
- Rademacher distribution
- Raised cosine distribution
- Reciprocal distribution
- Rectified Gaussian distribution (mixed discrete-continuous)
- Relativistic Breit-Wigner distribution
- Rice distribution
- Scaled inverse chi-squared distribution
- Seba curves
- Shifted Gompertz distribution
- Shifted log logistic distribution
- Sinh-Arcsinh distribution
- Skellam distribution
- Skewed generalized t distribution
- Skew normal distribution
- Slash distribution
- Soliton distribution
- Stable distribution
- Theta distribution
- Tracy-Widom distribution
- Truncated normal distribution
- Tukey g-and-h distribution 1;
Tukey g-and-h distribution 2
- Tukey lambda distribution
- Tukey studentized range distribution
- Tweedie distribution
- Type I Gumbel distribution
- Type II Gumbel distribution
- U-quadratic distribution
- Variance gamma distribution
- Voigt distribution
- Wigner semicircle distribution
- Wilk's lambda distribution
- Wrapped distribution
- Wrapped asymmetirc Laplace distribution
- Wrapped Cauchy distribution
- Wrapped exponential distribution
- Wrapped Levy distribution
- Wrapped normal distribution
- Yule-Simon distribution
- Zero inflated Poisson (ZIP) distribution
- Zipf-Mandelbrot distribution
Are there other relationships between univariate distributions?
Some relationships did not fit on the chart because the chart needed to be a planar graph. Other relationships involve the combination of two random variables to create a third. Examples include
- A scaled F random variable converges to a normal random variable as the degrees of freedom go to infinity.
(See proof by Hans Brunner and Stefano Bettelli.)
- The ratio of a standard normal random variable Z to the square root of an independent chi-square(n) random
variable divided by its degrees of freedom has the t distribution with n degrees of freedom.
- Let Z be standard normal. Let W be unit exponential. Let Z and W be independent. Then mu + sigma * sqrt(W) * Z has a symmetric Laplace distribution with mean mu and standard deviation sigma. The parameterization differs from that given in the chart. See Geraci and Borja, "The Laplace Distribution", Significance, Volume 15, Issue 5, October 2018, 10-11.
- Let Y be Gompertz. Then exp(-Y) is Weibull and exp(Y) is inverse Weibull. See page 179 of Kleiber, Christian; Kotz, Samuel (2003). Statistical Size Distributions in Economics and Actuarial Sciences, Wiley.
- Let X be lognormal. Let Y be lognormal. Then X/Y also has the lognormal distribution. However, there is no quotient property, so this relationship does not appear on the chart.
- Let X1, X2, ... , Xn be iid U(0, 1). Let Y be n * min{X1, X2, ... , Xn}. The limiting distribution of Y as n -> infinity is exponential(1).
- (Multiplicative CLT) Let X1, X2, ... , Xn be mutually independent random variables with finite mean and finite standard deviation. The limiting distribution of the product of these random variables is lognormal as n -> infinity.
Is there more information available?
Clicking on a distributon's name will download a .pdf file that includes the cumulative distribution function, survivor function, hazard function, cumulative
hazard function, inverse distribution function, and (where applicable) the moments and moment generating function. For even more information, see the "Links" tab. A nice on-line compendium is given by Gavin Crooks
Who developed the interactive graphic website?
The main developers were Lawrence Leemis, Daniel Luckett, Austin Powell, Jackie Taber,
and Peter Vermeer (all from
the
COR (Computational Operations Research) program
at The College of William & Mary). Other contributors are:
Tim Adams,
Stefano Bettelli,
Ruben Becerril Borja,
Hans Brunner,
Ryan Carpenter,
Erin Catlett,
Danny Cogut,
Chad Conrad,
Laura Decena,
Evan Elsaesser,
Luke Godwin--Jones,
Matt Goldman,
Matt Hanson,
Yan Hao,
Billy Kaczynski,
Andrew Mashchak,
Maximilian Matthe,
James McCormack,
Lee McDaniel,
Lauren Merrill,
Erik Olsen,
Scott Percic,
Jean Peyhardi,
Eduardo García-Portugúes,
Robin Ryder,
Evan Saltzman,
Bruce Schmeiser,
Linus Schrage,
Christian Schröder,
Erich Schubert,
Witold Siedlaczek,
Maciej Swat,
Chris Thoma,
Melissa Tilashalski,
Erik Vargo,
Chris Weld,
Jeff Yang,
Vincent Yannello,
Hang Yu, and
Andrei Zorine.
Support also came from the National Science Foundation.
What if I found something wrong, something missing, or want to contribute a proof?
Questions and concerns may be directed to leemis (AT) math (doTT) wm (d0T) edu