Cumulant
Encyclopedia : C : CU : CUM : Cumulant
- 1 Cumulants of a random variable
- 2 Some properties of cumulants
- 2.1 Invariance and equivariance
- 2.2 Homogeneity
- 2.3 Additivity
- 2.4 Cumulants and moments
- 2.5 Cumulants and set-partitions
- 3 Cumulants of particular probability distributions
- 4 Joint cumulants
- 5 History
- 6 Formal cumulants
- 7 One well-known example
- 8 Cumulants of a polynomial sequence of binomial type
- 9 Free cumulants
- 10 See also
- 11 External references
Cumulants of a random variable
In probability theory and statistics, a random variable X has an expected value μ = E(X) and a variance σ2 = E((X-μ)2). These are the first two cumulants: μ = κ1 and σ2 = κ2.The cumulants κn are defined by the cumulant-generating function:
- [g(t)=\log(E\left(\exp(tX)\right))=\sum_^\infty\frac=\mu t + \frac + ...\,]
- [g'(t)=\sum_^\infty\frac t^n}=\mu + \sigma^2 t + ...\,]
- κ1 = μ = g ' (0),
- κ2 = σ2 = g ' ' (0),
- κn = g(n) (0) .
Some properties of cumulants
Invariance and equivariance
The first cumulant is shift-equivariant; all of the others are shift-invariant. To state this less tersely, denote by κn(X) the nth cumulant of the probability distribution of the random variable X. The statement is that if c is constant then κ1(X + c) = κ1(X) + c and κn(X + c) = κn(X) for n ≥ 2, i.e., c is added to the first cumulant, but all higher cumulants are unchanged.Homogeneity
The nth cumulant is homogeneous of degree n, i.e. if c is any constant, then
- [\kappa_n(cX)=c^n\kappa_n(X).]
Additivity
If X and Y are independent random variables then κn(X + Y) = κn(X) + κn(Y).Cumulants and moments
The moment-generating function is:
- [1+\sum_^\infty \frac=\exp\left(\sum_^\infty \frac\right)].
The cumulants are related to the moments by the following recursion formula:
- [\kappa_n=\mu'_n-\sum_^\kappa_k \mu_'.]
- [\mu'_1=\kappa_1\,]
- [\mu'_2=\kappa_2+\kappa_1^2\,]
- [\mu'_3=\kappa_3+3\kappa_2\kappa_1+\kappa_1^3\,]
- [\mu'_4=\kappa_4+4\kappa_3\kappa_1+3\kappa_2^2+6\kappa_2\kappa_1^2+\kappa_1^4\,]
- [\mu'_5=\kappa_5+5\kappa_4\kappa_1+10\kappa_3\kappa_2+10\kappa_3\kappa_1^2+15\kappa_2^2\kappa_1+10\kappa_2\kappa_1^3+\kappa_1^5\,]
- [\mu'_6=\kappa_6+6\kappa_5\kappa_1+15\kappa_4\kappa_2+15\kappa_4\kappa_1^2+10\kappa_3^2+60\kappa_3\kappa_2\kappa_1+20\kappa_3\kappa_1^3+15\kappa_2^3+45\kappa_2^2\kappa_1^2+15\kappa_2\kappa_1^4+\kappa_1^6\,]
The coefficients are precisely those that occur in Faà di Bruno's formula.
Cumulants and set-partitions
These polynomials have a remarkable combinatorial interpretation: the coefficients count certain partitions of sets. A general form of these polynomials is
- [\mu'_n=\sum_\prod_\kappa_]
- π runs through the list of all partitions of a set of size n;
- "B ∈ π" means B is one of the "blocks" into which the set is partitioned; and
- |B| is the size of the set B.
Cumulants of particular probability distributions
- For the constant random variable, X = μ, the derivative of the cumulant generating function is g '(t) = μ. The cumulants are κ1 = μ, and κn = 0 for n = 2, 3, 4, ...
- For the Bernoulli distribution, (number of successes in one trial), with expectation p, the derivative of the cumulant generating function is g '(t) = ((p−1−1) e−t + 1)−1.
- The cumulants satisfy a recursion formula: [\kappa_1=p,\,\kappa_=p(1-p).\,]
- For the binomial distribution, (number of successes in n independent trials with probability p of success on each trial), every cumulant is just n times the cumulant of the same order, of the Bernoulli distribution. The derivative of the cumulant generating function is g '(t) = n((p−1−1) e−t + 1)−1. The first cumulants are μ = g '(0) = np and σ2 = g ' '(0) = μ(1−p). The special case n = 1 is the Bernoulli distribution. Substituting p=μn−1 gives g '(t) = ((μ−1 − n−1) e−t + n−1)−1. The limiting case n−1 = 0 is the Poisson distribution g '(t) = μet.
- For the geometric distribution, (number of failures before one success), the derivative of the cumulant generating function is g '(t) = ((1−p)−1e−t−1)−1.
- For the negative binomial distribution, (number of failures before n successes), the derivative of the cumulant generating function is g '(t) = n((1−p)−1e−t−1)−1. The special case n = 1 is the Geometric distribution. The first cumulants are μ = g '(0) = n((1−p)−1−1)−1 = n(p−1−1), and σ2 = g ' '(0) = μp−1. The limiting case n−1 ´= 0 is the Poisson distribution.
- For the Poisson distribution, the derivative of the cumulant generating function is g '(t) = λet. All cumulants are equal to the parameter: κn = λ for n=1,2,3,...
- :g '(t) = μ (ε e−t − ε + 1)−1.
- :g ' '(t) = (g '(t)) (1 + (ε−1 − 1) et)−1
- For the normal distribution with expected value μ and variance σ2, the derivative of the cumulant generating function is g '(t) = μ+σ2t. The cumulants are κ1=μ, κ2=σ2, and κn=0 for n>2. The special case σ2=0 is the constant random variable X=μ.
- The cumulants of the uniform distribution on the interval [−1, 0] are κn = Bn/n, where Bn is the nth Bernoulli number.
Joint cumulants
The joint cumulant of several random variables X1, ..., Xn is
- [\kappa(X_1,\dots,X_n)=\sum_\pi\prod_(|B|-1)!(-1)^E\left(\prod_X_i\right)]
- [\kappa(X,Y,Z)=E(XYZ)-E(XY)E(Z)-E(XZ)E(Y)-E(YZ)E(X)+2E(X)E(Y)E(Z).\,]
The combinatorial meaning of the expression of moments in terms of cumulants is easier to understand than that of cumulants in terms of moments:
- [E(X_1\cdots X_n)=\sum_\pi\prod_\kappa(X_i : i \in B).]
- [E(XYZ)=\kappa(X,Y,Z)+\kappa(X,Y)\kappa(Z)+\kappa(X,Z)\kappa(Y)+\kappa(Y,Z)\kappa(X)+\kappa(X)\kappa(Y)\kappa(Z).\,]
- [\kappa(X+Y,Z_1,Z_2,\dots)=\kappa(X,Z_1,Z_2,\dots)+\kappa(Y,Z_1,Z_2,\dots).\,]
- [\operatorname(X+Y)=\operatorname(X)+2\operatorname(X,Y)+\operatorname(Y)\,]
- [\kappa_n(X+Y)=\sum_^n \kappa(\,\underbrace_,\underbrace_).\,]
Conditional cumulants and the law of total cumulance
The law of total expectation and the law of total variance generalize naturally to conditional cumulants. The case n = 3, expressed in the language of (central) moments rather than that of cumulants, says
- [\mu_3(X)=E(\mu_3(X\mid Y))+\mu_3(E(X\mid Y))+3\,\operatorname(E(X\mid Y),\operatorname(X\mid Y)).]
In general, we have
- [\kappa(X_1,\dots,X_n)=\sum_\pi \kappa(\kappa(X_\mid Y),\dots,\kappa(X_\mid Y))]
- the sum is over all partitions π of the set of indices, and
- π1, ..., πb are all of the "blocks" of the partition π; the expression κ(Xπk) indicates that the joint cumulant of the random variables whose indices are in that block of the partition.
History
Cumulants were first introduced by the Danish astronomer, actuary, mathematician, and statistician Thorvald N. Thiele (1838 - 1910) in 1889. Thiele called them half-invariants. They were first called cumulants in a 1931 paper, The derivation of the pattern formulae of two-way partitions from those of simpler patterns, Proceedings of the London Mathematical Society, Series 2, v. 33, pp. 195-208, by the great statistical geneticist Sir Ronald Fisher and the statistician John Wishart, eponym of the Wishart distribution. The historian Stephen Stigler has said that the name cumulant was suggested to Fisher in a letter from Harold Hotelling. In another paper published in 1929, Fisher had called them cumulative moment functions.Formal cumulants
More generally, the cumulants of a sequence , not necessarily the moments of any probability distribution, are given by
- [1+\sum_^\infty m_n t^n/n!=\exp\left(\sum_^\infty\kappa_n t^n/n!\right)]
One well-known example
In combinatorics, the nth Bell number is the number of partitions of a set of size n. All of the cumulants of the sequence of Bell numbers are equal to 1. The Bell numbers are the moments of the Poisson distribution with expected value 1.Cumulants of a polynomial sequence of binomial type
For any sequence of scalars in a field of characteristic zero, being considered formal cumulants, there is a corresponding sequence of formal moments, given by the polynomials above. For those polynomials, construct a polynomial sequence in the following way. Out the polynomial
- [\begin\mu'_6= &\kappa_6+6\kappa_5\kappa_1+15\kappa_4\kappa_2+15\kappa_4\kappa_1^2+10\kappa_3^2+60\kappa_3\kappa_2\kappa_1 \\ \\& +20\kappa_3\kappa_1^3+15\kappa_2^3+45\kappa_2^2\kappa_1^2+15\kappa_2\kappa_1^4+\kappa_1^6\end]
- [\beginp_6(x)= &(\kappa_6)\,x+(6\kappa_5\kappa_1+15\kappa_4\kappa_2+10\kappa_3^2)\,x^2+(15\kappa_4\kappa_1^2+60\kappa_3\kappa_2\kappa_1+15\kappa_2^3)\,x^3 \\ \\& +(45\kappa_2^2\kappa_1^2)\,x^4+(15\kappa_2\kappa_1^4)\,x^5 +(\kappa_1^6)\,x^6\end]
This sequence of polynomials is of binomial type. In fact, no other sequences of binomial type exist; every polynomial sequence of binomial type is completely determined by its sequence of formal cumulants.
Free cumulants
In the identity
- [E(X_1\cdots X_n)=\sum_\pi\prod_\kappa(X_i : i\in B)]
The ordinary cumulants of degree higher than 2 of the normal distribution are zero. The free cumulants of degree higher than 2 of the Wigner semicircle distribution are zero. This is one respect in which the role of the Wigner distribution in free probability theory is analogous to that of the normal distribution in conventional probability theory.
See also
External references
- , [Cumulant] at MathWorld.
From Wikipedia, the Free Encyclopedia. Original article here. Support Wikipedia by contributing or donating.
All text is available under the terms of the GNU Free Documentation License See Wikipedia Copyrights for details.
