Opentopia Directory Encyclopedia Tools

Gini coefficient

Encyclopedia : G : GI : GIN : Gini coefficient


Graphical representation of the Gini coefficient
Graphical representation of the Gini coefficient

The Gini coefficient is a measure of inequality of a distribution, defined as the ratio of area between the Lorenz curve of the distribution and the curve of the uniform distribution, to the area under the uniform distribution. It is often used to measure income inequality. It is a number between 0 and 1, where 0 corresponds to perfect equality (i.e. everyone has the same income) and 1 corresponds to perfect inequality (i.e. one person has all the income, while everyone else has zero income). It was developed by the Italian statistician Corrado Gini and published in his 1912 paper "Variabilità e mutabilità" ("Variability and Mutability"). The Gini coefficient is equal to half of the relative mean difference. The Gini index is the Gini coefficient expressed as a percentage, and is equal to the Gini coefficient multiplied by 100.

While the Gini coefficient is mostly used to measure income inequality, it can also be used to measure wealth inequality. This use requires that no one has a negative net wealth.

Calculation

The Gini coefficient is defined as a ratio of the areas on the Lorenz curve diagram. If the area between the line of perfect equality and Lorenz curve is A, and the area under the Lorenz curve is B, then the Gini coefficient is A/(A+B). Since A+B = 0.5, the Gini coefficient, G = 2A = 1-2B. If the Lorenz curve is represented by the function Y = L(X), the value of B can be found with integration and:

[G = 1 - 2\,\int_0^1 L(X) dX ]
In some cases, this equation can be applied to calculate the Gini coefficient without direct reference to the Lorenz curve. For example:
[G = \frac(n+1 - 2\,\frac^n \; (n+1-i)y_i}^n y_i})\,]
[G = 1 - \frac^n \; f(y_i)(S_+S_i)}]
where:
[S_i = \Sigma_^i \; f(y_j)\,y_j\,] and [S_0 = 0\,]
[G = 1 - \frac\int_0^\infty (1-F(y))^2dy]
Since the Gini coefficient is half the relative mean difference, it can also be calculated using formulas for the relative mean difference.

For a random sample S consisting of values yi, i = 1 to n, that are indexed in non-decreasing order ( yiyi+1), the statistic:

[G(S) = \frac(n+1 - 2\,\frac^n \; (n+1-i)y_i}^n y_i})\,]
is a consistent estimator of the population Gini coefficient, but is not, in general, unbiased. Like the relative mean difference, there does not exist a sample statistic that is in general an unbiased estimator of the population Gini coefficient. Confidence intervals for the population Gini coefficient can be calculated using bootstrap techniques.

Sometimes the entire Lorenz curve is not known, and only values at certain intervals are given. In that case, the Gini coefficient can be approximated by using various technigues for interpolating the missing values of the Lorenz curve. If ( X k , Yk ) are the known points on the Lorenz curve, with the X k indexed in increasing order ( X k - 1 < X k ), so that:

If the Lorenz curve is approximated on each interval as a line between consecutive points, then the area B can be approximated with trapezoids and:
[G_1 = 1 - \sum_^ (X_ - X_) (Y_ + Y_)]
is the resulting approximation for G. More accurate results can be obtained using other methods to approximate the area B, such as approximating the Lorenz curve with a quadratic function across pairs of intervals, or building an appropriately smooth approximation to the underlying distribution function that matches the known data. If the population mean and boundary values for each interval are also known, these can also often be used to improve the accuracy of the approximation.

Income Gini coefficients in the world

See complete listing in list of countries by income equality.

Gini coefficient, income distribution by country
Enlarge
Gini coefficient, income distribution by country

180px
Enlarge
180px


While most developed European nations tend to have Gini coefficients between 0.24 and 0.36, the United States Gini coefficient is above 0.4, indicating that the United States has greater inequality. Using the Gini can help quantify differences in welfare and compensation policies and philosophies. However it should be borne in mind that the Gini coefficient can be misleading when used to make political comparisons between large and small countries (see criticisms section).

Gini coefficients, income distribution over time for selected countries
Enlarge
Gini coefficients, income distribution over time for selected countries

Correlation with per-capita GDP

Poor countries (those with low per-capita GDP) have Gini coefficients that fall over the whole range from low (0.25) to high (0.71), while rich countries have generally low Gini coefficient (under 0.40).

US income gini coefficients over time

Gini coefficients for the United States at various times, according to the US Census Bureau:

Advantages as a measure of inequality

Disadvantages as a measure of inequality

As one result of this criticism, additionally to or in competition with the Gini coefficient entropy measures are frequently used (e.g. the Atkinson and Theil indices). These measures attempt to compare the distribution of resources by intelligent players in the market with a maximum entropy random distribution, which would occur if these players acted like non-intelligent particles in a closed system following the laws of statistical physics.

Notes

References

  • Dixon, PM, Weiner J., Mitchell-Olds T, Woodley R. Bootstrapping the Gini coefficient of inequality. Ecology 1987;68:1548-1551.
  • Gini C. "Variabilità e mutabilità" (1912) Reprinted in Memorie di metodologica statistica (Ed. Pizetti E, Salvemini, T). Rome: Libreria Eredi Virgilio Veschi (1955).

See also

External links

  • Software:
  • * [Free Online Calculator] computes the Gini Coefficient, plots the Lorenz curve, and computes many other measures of concentration for any dataset
  • * Free Calculator: [Online] and [downloadable scripts] (Python and Lua) for Atkinson, Gini, Hoover and Kullback-Leibler inequalities
  • * Users of the [R] data analysis software can install the "ineq" package which allows for computation of a variety of inequality indices including Gini, Atkinson, Theil.

 


From Wikipedia, the Free Encyclopedia. Original article here. Support Wikipedia by contributing or donating.
All text is available under the terms of the GNU Free Documentation License See Wikipedia Copyrights for details.

Search Titles
0123456789
ABCDEFGHIJ
KLMNOPQRST
UVWXYZ?

E-mail this article to:

Personal Message: