About the chart:
The list on the left-hand side displays the names of the 76 probability distributions (19 discrete distributions given by the rectangular boxes and 57 continuous distributions given by the rectangular boxes with the rounded corners) present in the chart. Hovering your mouse over the name of a distribution highlights the distribution on the chart, along with its related distributions. Depending on the size of your browser window, you might have to adjust the display to find the distribution you are looking for. You may scroll the chart window or zoom in and out with the + and - buttons as needed.
Each distribution on the chart, when clicked, links to a document showing detailed information about the distribution, including alternate functional forms of the distribution and the distribution's mean, variance, skewness, and kurtosis.
What is a univariate distribution?
A univariate probability distribution is used to assign a probability to various outcomes of a random experiment. A random experiment is one whose outcome can not be predicted with certainty prior to conducting the experiment. When the set of all possible outcomes to a random experiment is countable or a countable infinity, the probability distribution can be described by a probability mass function and the associated random variable is discrete. Otherwise, the probability distribution can be described by a probability density function and the associated random variable is continuous. A mix of these two cases is known as a mixed discrete-continuous distribution. Illustrations of a probability mass function in the case of rolling a pair of fair dice and summing the outcomes on the up faces and a probability density function in the case of the well-known normal distribution can be seen by clicking here.
A univariate probability distribution is the probability distribution of a single random variable. This is in contrast to a bivariate or multivariate probability distribution, which defines the probability distribution of two or more random variables.
What do the arrows mean?
Solid lines represent special cases and transformations from one distribution to another.
Dashed arrows are used for asymptotic relationships, typically as the limit as one or more parameters approach the boundary of the parameter space.
Dotted arrows represent Bayesian relationships.
Upon interacting with the chart, outbound arrows are highlighted in yellow pointing away from the selected distribution. Incoming arrows and related distributions are highlighted in white. Placing the cursor over an arrow turns the arrow blue. Clicking the arrow reveals a .pdf file that contains a proof when one exists. The accompanying transformation or parameterization will be highlighted next to the arrow.
What do the letters just below the distribution names indicate?
- C: The convolution property (C) indicates that sums of independent random variables having this particular distribution come from the same distribution family.
- F: The forgetfulness property (F), more commonly known as the memoryless property, indicates that the conditional distribution of a random variable is identical to the unconditional distribution.
- I: The inverse property (I) indicates that the reciprocal of a random variable of this type comes from the same distribution family.
- L: The linear combination property (L) indicates that the linear combinations of independent random variables having this particular distribution come from the same distribution family.
- M: The minimum property (M) indicates that the smallest of independent and identically distributed random variables from a distribution comes from the same distribution family.
- P: The product property (P) indicates that the product of independent random variables having this particular distribution comes from the same distribution family.
- R: The residual property (R) indicates that the conditional distribution of a random variable left-truncated at a value in its support belongs to the same distribution family as the unconditional distribution.
- S: The scaling property (S) implies that any positive real constant times a random variable having this distribution comes from the same distribution family.
- V: The variate generation property (V) indicates that the inverse cumulative distribution function of a continuous random variable can be expressed in closed form. For a discrete random variable, this property indicates that a variate can be generated in an O(1) algorithm that does not cycle through the support values or rely on a special property.
- X: The maximum property (X) indicates that the largest of independent and identically distributed random variables from a distribution comes from the same distribution family.
Placing the cursor over a letter for a property turns the letter blue. Clicking the property reveals a .pdf file that contains a proof when one exists.
What is the meaning of the parameters associated with the univariate probability distributions?
Parameters are used to enhance the flexibility of a univariate probability distribution. The normal distribution with its bell-shaped probability
density function, for example, might be an appropriate probability model the annual return for a stock index or the diameter of a ball bearing by adjusting
the values of its parameters.
Generally speaking, there are three types of parameters associated with a continuous distribution. A location parmeter shifts the probability density function to the left or to the right along the horizontal axis. A scale parameter contracts or expands the scale associated with the horizontal axis of the probability density function. A shape parameter changes the shape of the probability density function. An example of a location parameter is the mean of a normal random variable; an example of a scale parameter is the standard deviation of a normal random variable; an example of a shape parameter is the degrees of freedom of a t random variable.
Are there errors on the chart?
Yes. The chart is basically identical to that which was published in The American Statistician. In writing the proofs for some of the properties and
relationships, we have uncovered errors. In addition, we were unable to complete some of the proofs. They are listed by categories below.
- Distributions that don't belong on the chart
- The Gamma-normal distribution is a bivariate distribution
- Incorrect properties:
Standard Cauchy (S)
Standard Wald (S)
von Mises (S)
- Unproven properties:
- Potential missing properties:
Inverse Gaussian (S)
- Incorrect relationships:
Beta-binomial ---> Negative hypergeometric [should be a = n1, b = n3 - n1, n = n2 via Jean Peyhardi]
- Unproven relationships:
Doubly noncentral F ---> Noncentral F
Generalized gamma ---> Lognormal
Hypoexponential ---> Erlang
Inverse Gaussian ---> Chi-square
Inverse Gaussian ---> Standard normal
Normal ---> Noncentral chi-square
Pascal ---> Normal [should be mu = n (1 - p) / p on the chart]
Pascal ---> Poisson
- Potential missing relationships:
- Wrong parameter values:
Standard Uniform ---> Logistic-Exponential
- Plots on the distribution page would be helpful: Polya, Power series
Are there other univariate distributions not on the chart?
Yes. Here is a partial list:
- Amoroso distribution
- Beta-exponential distribution
- Beta prime distribution
- Birthday distribution
- Burr distribution
- Categorical distribution
- Coupon collector distribution
- Dagum distribution
- de la Vallee-Poussin distribution
- Dickman distribution
- Exponentiated Weibull distribution
Mudholkar, G.S., Srivastava, D.K., and Freimer, M. (1995), The Exponentiated Weibull Family: A Reanalysis of the Bus-Motor-Failure Data, Technometrics, Vol. 37, No. 4, 436-445.
- Exponential logarithmic distribution
- Folded normal distribution
- Frechet distribution
- Gamma-exponential distribution
- Generalized beta distribution
- Generalized extreme value (Fisher-Tippett) distribution
- Generalized inverted exponential distribution
- Generalized inverse Gaussian distribution
- Generalized Rayleigh distribution
- Geometric stable (Linnik) distribution
- Gilbert sine distribution
- Grand unified distribution
- Half normal distribution
- Inverse Weibull distribution
- Irwin-Hall (uniform sum) distribution
- Johnson distribution
- Kolmogorov-Smirnov distribution
- Kummer beta distribution
- Laha distribution
- Landau distribution
- Location scale family
- Logarithmic series distribution
- Maxwell-Boltzmann distribution
- Noncentral hypergeometric distribution
Liao, J.G., Rosen, O. (2001), Fast and Stable Algorithms for Computing and Sampling from the Noncentral Hypergeometric Distribution, The American Statistician, Vol. 55, No. 4, 366-369.
- Pearson system of distributions
- Prentice distribution
- Quantile functions
- Rademacher distribution
- Rice distribution
- Seba curves
- Skewed generalized t distribution
- Stable distribution
- Theta distribution
- Tracy-Widom distribution
- Truncated normal distribution
- Tukey lambda distribution
- Tukey studentized range distribution
- U-quadratic distribution
- Wigner semicircle distribution
Is there more information available?
Clicking on a distributon's name will download a .pdf file that includes the cumulative distribution function, survivor function, hazard function, cumulative
hazard function, inverse distribution function, and (where applicable) the moments and moment generating function. For even more information, see the "Links" tab. A nice on-line compendium is given by Gavin Crooks
Who developed the interactive graphic website?
The main developers were Lawrence Leemis, Daniel Luckett, Austin Powell, Jackie Taber,
and Peter Vermeer (all from
COR (Computational Operations Research) program
at The College of William & Mary). Other contributors are:
Ruben Becerril Borja,
Vincent Yannello, and
Support also came from the National Science Foundation.
What if I found something wrong, something missing, or want to contribute a proof?
Questions and concerns may be directed to leemis (AT) math (doTT) wm (d0T) edu