Opentopia Directory Encyclopedia Tools

Yule-Simon distribution

Encyclopedia : Y : YU : YUL : Yule-Simon distribution


\,] for [\rho>3\,]| kurtosis =[\rho+3+\frac \,] for [\rho>4\,]| entropy =| mgf =[\frac\;_2F_1(1,1; \rho+2; e^t)\,e^t \,]| char =[\frac\;_2F_1(1,1; \rho+2; e^)\,e^ \,]| }} In probability and statistics, the Yule-Simon distribution is a discrete probability distribution named after Udny Yule and Herbert Simon. Simon originally called it the Yule distribution.

The probability mass function of the Yule-Simon(ρ) distribution is

[f(k;\rho) = \rho\,\mathrm(k, \rho+1), \,]
for integer [k \geq 1] and real [\rho > 0], where [\mathrm] is the beta function. Equivalently the pmf can be written in terms of the falling factorial as

[ f(k;\rho) = \frac}} ,\,]
where [\Gamma] is the gamma function. Thus, if [\rho] is an integer,

[ f(k;\rho) = \frac .\,]
The probability mass function f has the property that for sufficiently large k we have

[ f(k;\rho) \approx \frac} \propto \frac} .\,]
This means that the tail of the Yule-Simon distribution is a realization of Zipf's law: [f(k;\rho)] can be used to model, for example, the relative frequency of the [k]th most frequent word in a large collection of text, which according to Zipf's law is inversely proportional to a (typically small) power of [k].

Occurrence

The Yule-Simon distribution arises as a continuous mixture of geometric distributions. Specifically, assume that [W] follows an exponential distribution with scale [1/\rho] or rate [\rho]:

[W \sim \mathrm(\rho)\,]
[h(w;\rho) = \rho \, \exp(-\rho\,w)\,]
Then a Yule-Simon distributed variable [K] has the following geometric distribution:

[K \sim \mathrm(\exp(-W))\,]
The pmf of a geometric distribution is

[g(k; p) = p \, (1-p)^\,]
for [k\in\]. The Yule-Simon pmf is then the following exponential-geometric mixture distribution:

[f(k;\rho) = \int_0^ \,\,\, g(k;\exp(-w))\,h(w;\rho)\,dw\,]

Generalizations

Simon also hinted at a two-parameter generalization of the Yule-Simon distribution, in which the beta function is replaced by an incomplete beta function. The probability mass function of the generalized Yule-Simon(ρ, α) distribution is defined as

[ f(k;\rho,\alpha) = \frac} \; \mathrm_(k, \rho+1) , \,]
with [0 \leq \alpha < 1]. For [\alpha = 0] the ordinary Yule-Simon(ρ) distribution is obtained as a special case.

Plot of the Yule-Simon(1) distribution (red) and its asymptotic Zipf law (blue)
Enlarge
Plot of the Yule-Simon(1) distribution (red) and its asymptotic Zipf law (blue)

References

Probability distributions  [ view][ talk][ edit] 
Univariate Multivariate
Discrete: BernoullibinomialBoltzmanncompound PoissondegeneratedegreeGauss-Kuzmingeometrichypergeometriclogarithmicnegative binomialparabolic fractalPoissonRademacherSkellamuniformYule-SimonzetaZipfZipf-Mandelbrot Ewensmultinomial
Continuous: BetaBeta primeCauchychi-squareexponentialexponential powerFfadingFisher's zFisher-TippettGammageneralized extreme valuegeneralized hyperbolicgeneralized inverse GaussianHotelling's T-squarehyperbolic secanthyper-exponentialhypoexponentialinverse chi-squareinverse gaussianinverse gammaKumaraswamyLandauLaplaceLévyLévy skew alpha-stablelogisticlog-normalMaxwell-BoltzmannMaxwell speednormal (Gaussian)ParetoPearsonpolarraised cosineRayleighrelativistic Breit-WignerRiceStudent's ttriangulartype-1 Gumbeltype-2 GumbeluniformVoigtvon MisesWeibullWigner semicircle DirichletKentmatrix normalmultivariate normalvon Mises-FisherWigner quasiWishart
Miscellaneous: Cantorconditionalexponential family • infinitely divisible • location-scale familymarginalmaximum entropyphase-typeposteriorpriorquasisampling

 


From Wikipedia, the Free Encyclopedia. Original article here. Support Wikipedia by contributing or donating.
All text is available under the terms of the GNU Free Documentation License See Wikipedia Copyrights for details.

Search Titles
0123456789
ABCDEFGHIJ
KLMNOPQRST
UVWXYZ?

E-mail this article to:

Personal Message: