| Tukey lambda distribution | |||
|---|---|---|---|
|
Probability density function | |||
| Notation | Tukey(λ) | ||
| Parameters | λ ∈ ℝ — shape parameter | ||
| Support |
x ∈ [ − 1 /λ, 1 /λ ] if λ > 0 , x ∈ ℝ if λ ≤ 0 . | ||
| CDF |
(general case) (special case exact solution) | ||
| Mean | |||
| Median | 0 | ||
| Mode | 0 | ||
| Variance |
| ||
| Skewness | |||
| Excess kurtosis |
| ||
| Entropy | [1] | ||
| CF | [2] | ||
Formalized by John Tukey, the Tukey lambda distribution is a continuous, symmetric probability distribution defined in terms of its quantile function. It is typically used to identify an appropriate distribution (see the comments below) and not used in statistical models directly.
The Tukey lambda distribution has a single shape parameter, λ, and as with other probability distributions, it can be transformed with a location parameter, μ, and a scale parameter, σ. Since the general form of probability distribution can be expressed in terms of the standard distribution, the subsequent formulas are given for the standard form of the function.
Quantile function
editFor the standard form of the Tukey lambda distribution, the quantile function, (i.e. the inverse function to the cumulative distribution function) and the quantile density function, are
For most values of the shape parameter, λ, the probability density function (PDF) and cumulative distribution function (CDF) must be computed numerically. The Tukey lambda distribution has a simple, closed form for the CDF and / or PDF only for a few exceptional values of the shape parameter, for example: λ ∈ { 2, 1, 1 /2, 0 } (see uniform distribution [ cases λ = 1 and λ = 2 ] and the logistic distribution [ case λ = 0 ].
However, for any value of λ both the CDF and PDF can be tabulated for any number of cumulative probabilities, p, using the quantile function Q to calculate the value x, for each cumulative probability p, with the probability density given by 1/q, the reciprocal of the quantile density function. As is the usual case with statistical distributions, the Tukey lambda distribution can readily be used by looking up values in a prepared table.
Moments
editThe Tukey lambda distribution is symmetric around zero, therefore the expected value of this distribution, if it exists, is equal to zero. The variance exists for λ > − 1 /2 , and except when λ = 0 , is given by the formula
More generally, the n-th order moment is finite when λ > −1 /n and is expressed (except when λ = 0 ) in terms of the beta function Β(x,y) :
Due to symmetry of the density function, all moments of odd orders, if they exist, are equal to zero.
L-moments
editDifferently from the central moments, L-moments can be expressed in a closed form. For the th L-moment, is given by[3]
The first six L-moments can be presented as follows:[3]
Comments
edit
The Tukey lambda distribution is actually a family of distributions that can approximate a number of common distributions. For example,
λ ≈ −1 approx. Cauchy C( 0, π ) λ = 0 exactly logistic λ ≈ 0.14 approx. normal N( 0, 2.142± ) λ = 1 /2 strictly concave (-shaped) λ = 1 exactly uniform U( −1, +1 ) λ = 2 exactly uniform U( − 1 /2 , + 1 /2 )
The most common use of this distribution is to generate a Tukey lambda PPCC plot of a data set. Based on the value for λ with best correlation, as shown on the PPCC plot, an appropriate model for the data is suggested. For example, if the best-fit of the curve to the data occurs for a value of λ at or near 0.14, then empirically the data could be well-modeled with a normal distribution. Values of λ less than 0.14 suggests a heavier-tailed distribution.
A milepost at λ = 0 (logistic) would indicate quite fat tails, with the extreme limit at λ = −1 , approximating Cauchy and small sample versions of the Student's t. That is, as the best-fit value of λ varies from thin tails at 0.14 towards fat tails −1, a bell-shaped PDF with increasingly heavy tails is suggested. Similarly, an optimal curve-fit value of λ greater than 0.14 suggests a distribution with exceptionally thin tails (based on the point of view that the normal distribution itself is thin-tailed to begin with; the exponential distribution is often chosen as the exemplar of tails intermediate between fat and thin).
Except for values of λ approaching 0 and those below, all the PDF functions discussed have finite support, between −1 /|λ| and +1 / |λ| .
Since the Tukey lambda distribution is a symmetric distribution, the use of the Tukey lambda PPCC plot to determine a reasonable distribution to model the data only applies to symmetric distributions. A histogram of the data should provide evidence as to whether the data can be reasonably modeled with a symmetric distribution.[4]
Generalization
editThe generalized lambda distribution (GLD) generalizes the lambda distribution by splitting the occurrences of λ in the quantile into two shape parameters (accounts for asymmetry), a scale parameter, and adds a location parameter. It is defined by the quantile function[5]
The inverse function is the cumulative distribution function. This distribution also has (rather complex) expressions for μ, σ, and the moments.[5]
The above parameterization is the original form proposed by Ramberg and Schmeiser; some choices of λ3,4 yield invalid quantile functions (a quantile function should not be negative).[6]: (Theorem 1.3.33) A more commonly used form is FMKL (1988) which is valid as long as λ2 > 0;. There is also a variant of FMKL that adds skewness. The other alternative parameterization is GPD of van Staden and Loots (2009), which has simpler expressions for L-moments and a simple rule for parameter validity.[7]
The GLD was developed by Ramberg and Schmeiser in 1974 for generating random numbers for Monte Carlo simulation. Other researchers found it useful for fitting various measurements in the natural sciences, finance, and economy.[8] Its use spread to the field of statistical quality control and procedures are available for using it in the calculation of process capability indices.[5]
References
edit- ^ Vasicek, Oldrich (1976). "A test for normality based on sample entropy". Journal of the Royal Statistical Society. Series B. 38 (1): 54–59. doi:10.1111/j.2517-6161.1976.tb01566.x.
- ^ Shaw, W.T.; McCabe, J. (2009), "Monte Carlo sampling given a characteristic function: Quantile mechanics in momentum space", arXiv:0903.1592 [q-fin.CP]
- ^ a b Karvanen, Juha; Nuutinen, Arto (2008). "Characterizing the generalized lambda distribution by L-moments". Computational Statistics & Data Analysis. 52 (4): 1971–1983. arXiv:math/0701405. doi:10.1016/j.csda.2007.06.021. S2CID 939977.
- ^ Joiner, Brian L.; Rosenblatt, Joan R. (1971). "Some properties of the range in samples from Tukey's symmetric lambda distributions". Journal of the American Statistical Association. 66 (334): 394–399. doi:10.2307/2283943. JSTOR 2283943.
- ^ a b c Pal, Surajit (31 December 2004). "Evaluation of Nonnormal Process Capability Indices using Generalized Lambda Distribution". Quality Engineering. 17 (1): 77–85. doi:10.1081/QEN-200028709.
- ^ . ISBN 978-1-040-19883-4.
{{cite book}}: Missing or empty|title=(help) - ^ King, Robert (2025-10-14). "gld: Estimation and Use of the Generalised (Tukey) Lambda Distribution, version 2.6.8" (PDF). – Chapters GeneralisedLambdaDistribution and gl.check.lambda.
- ^ Dedduwakumara, Dilanka S.; Prendergast, Luke A.; Staudte, Robert G. (1 January 2019). "A simple and efficient method for finding the closest generalized lambda distribution to a specific model". Cogent Mathematics & Statistics. 6 (1) 1602929. doi:10.1080/25742558.2019.1602929.
External links
edit- "Tukey-Lambda distribution". Gallery of Distributions. Engineering Statistics Handbook. US NIST Information Technology Laboratory. 1.3.6.6.15. EDA 366F.
This article incorporates public domain material from the National Institute of Standards and Technology










