# Difference between revisions of "Gini"

Line 12: | Line 12: | ||

# Saudi Arabia - Odd no data given wealth | # Saudi Arabia - Odd no data given wealth | ||

− | [[ | + | The '''Gini coefficient''' is a [[Statistical_dispersion#Measures_of_statistical_dispersion|measure of statistical dispersion]] developed by the [[Italian people|Italian]] [[statistics|statistician]] [[Corrado Gini]] and published in his 1912 paper "Variability and Mutability" ({{lang-it|Variabilità e mutabilità}}). It is commonly used as a measure of inequality of [[income inequality metrics|income]] or [[Wealth condensation|wealth]]. It has, however, also found application in the study of inequalities in disciplines as diverse as [[health science]], [[ecology]], and [[chemistry]]. |

− | == | + | ==Definition== |

+ | [[Image:Economics Gini coefficient2.svg|thumb|right|280px|Graphical representation of the Gini coefficient.<br /><br />The graph shows that while the Gini is technically equal to the area marked 'A' divided by the sum of the areas marked 'A' and 'B' (that is, Gini = A/(A+B)), it is also equal to 2*A, since A+B = 0.5 since the axes scale from 0 to 1, and the total surface of the graph therefore equals 1.]] | ||

+ | The Gini coefficient is usually defined [[mathematics|mathematically]] based on the [[Lorenz curve]] (below). It can be thought of as the [[ratio]] of the [[area]] that lies between the line of equality and the Lorenz curve (marked 'A' in the diagram) over the total area under the line of equality (marked 'A' and 'B' in the diagram); i.e., G=A/(A+B). | ||

− | + | The Gini coefficient can range from 0 to 1; it is sometimes multiplied by 100 to range between 0 and 100. A low Gini coefficient indicates a more equal distribution, with 0 corresponding to perfect equality, while higher Gini coefficients indicate more unequal distribution, with 1 corresponding to perfect inequality. To be validly computed, no negative goods can be distributed. Thus, if the Gini coefficient is being used to describe [[household income]] inequality, then no [[household]] can have a negative income. When used as a measure of income inequality, the most unequal society will be one in which a single person receives 100% of the total income and the remaining people receive none (G=1); and the most equal society will be one in which every person receives the same percentage of the total income (G=0). | |

− | : | + | Some find it more intuitive (and it is mathematically equivalent) to think of the Gini coefficient as half of the [[Mean difference|Relative mean difference]]. The mean difference is the average absolute difference between two items selected randomly from a population, and the relative mean difference is the mean difference divided by the average, to normalize for scale. Worldwide, Gini coefficients for income range from approximately 0.230 in [[Sweden]] to 0.707 in [[Namibia]] although not every country has been assessed. |

+ | |||

+ | As a mathematical measure of inequality, the Gini coefficient carries no [[moral judgement]] about whether a particular level of (in)equality is good or bad. | ||

+ | |||

+ | ==Different uses== | ||

+ | Although the Gini coefficient is most popular in economics, it can in theory be applied in any field of science that studies a distribution. For example, in ecology the Gini coefficient has been used as a measure of [[biodiversity]], where the cumulative proportion of species is plotted against cumulative proportion of individuals<ref name=natureArticle>{{cite article | ||

+ | | last = Wittebolle | ||

+ | | first = Lieven | ||

+ | | coauthors = et al | ||

+ | | title = Initial community evenness favours functionality under selective stress | ||

+ | | journal = [[Nature (journal)|Nature]] | ||

+ | | year = 2009 | ||

+ | | volume = 458 | ||

+ | | pages = 623-626 | ||

+ | }}</ref>. In health, it has been used as a measure of the inequality of health related [[quality of life]] in a population<ref name=popHealthArticle>{{cite article | ||

+ | | last=Asada | ||

+ | | first=Yukiko | ||

+ | | title = Assessment of the health of Americans: the average health-related quality of life and its inequality across individuals and groups | ||

+ | | journal = Population Health Metrics | ||

+ | | year = 2005 | ||

+ | | volume = 3 | ||

+ | | pages = 7 | ||

+ | | doi = 10.1186/1478-7954-3-7 | ||

+ | }}</ref>. In chemistry it has been used to to express the selectivity of [[protein kinase inhibitors]] against a panel of kinases<ref name=JMedChemArticle>{{cite article | ||

+ | | last = Graczyk | ||

+ | | first = Piotr | ||

+ | | title = Gini Coefficient: A New Way To Express Selectivity of Kinase Inhibitors against a Family of Kinases | ||

+ | | journal = Journal of Medicinal Chemistry | ||

+ | | year = 2007 | ||

+ | | volume = 50 | ||

+ | | pages = 5773–5779 | ||

+ | | doi = 10.1021/jm070562u | ||

+ | }}</ref>. | ||

+ | |||

+ | == Calculation == | ||

+ | The Gini index is defined as a ratio of the areas on the [[Lorenz curve]] diagram. If the area between the line of perfect equality and the Lorenz curve is A, and the area under the Lorenz curve is B, then the Gini index is A/(A+B). Since A+B = 0.5, the Gini index, G = A/(0.5) = 2A = 1-2B. If the Lorenz curve is represented by the function Y = L(X), the value of B can be found with [[integral|integration]] and: | ||

+ | :<math>G = 1 - 2\,\int_0^1 L(X) dX. </math> | ||

+ | In some cases, this equation can be applied to calculate the Gini coefficient without direct reference to the Lorenz curve. For example: | ||

+ | *For a population uniform on the values ''y''<sub>''i''</sub>, ''i'' = 1 to ''n'', indexed in non-decreasing order ( ''y''<sub>''i''</sub> ≤ ''y''<sub>''i''+1</sub>): | ||

+ | :<math>G = \frac{1}{n}\left ( n+1 - 2 \left ( \frac{\Sigma_{i=1}^n \; (n+1-i)y_i}{\Sigma_{i=1}^n y_i} \right ) \right ) </math> | ||

+ | :This may be simplified to: | ||

+ | :<math>G = \frac{2 \Sigma_{i=1}^n \; i y_i}{n \Sigma_{i=1}^n y_i} -\frac{n+1}{n}</math> | ||

+ | |||

+ | *For a [[Discrete probability distribution|discrete probability function]] ''f''(''y''), where ''y''<sub>''i''</sub>, ''i'' = 1 to ''n'', are the points with nonzero probabilities and which are indexed in increasing order ( ''y''<sub>''i''</sub> < ''y''<sub>''i''+1</sub>): | ||

+ | :<math>G = 1 - \frac{\Sigma_{i=1}^n \; f(y_i)(S_{i-1}+S_i)}{S_n}</math> | ||

+ | :where | ||

+ | :<math>S_i = \Sigma_{j=1}^i \; f(y_j)\,y_j\,</math> and <math>S_0 = 0\,</math> | ||

+ | |||

+ | *For a [[cumulative distribution function]] ''F''(''y'') that is piecewise [[differentiable]], has a [[mean]] μ, and is zero for all negative values of ''y'': | ||

+ | :<math>G = 1 - \frac{1}{\mu}\int_0^\infty (1-F(y))^2dy = \frac{1}{\mu}\int_0^\infty F(y)(1-F(y))dy</math> | ||

+ | |||

+ | *Since the Gini coefficient is half the relative mean difference, it can also be calculated using formulas for the relative mean difference. For a random sample ''S'' consisting of values ''y''<sub>''i''</sub>, ''i'' = 1 to ''n'', that are indexed in non-decreasing order ( ''y''<sub>''i''</sub> ≤ ''y''<sub>''i''+1</sub>), the statistic: | ||

+ | :<math>G(S) = \frac{1}{n-1}\left (n+1 - 2 \left ( \frac{\Sigma_{i=1}^n \; (n+1-i)y_i}{\Sigma_{i=1}^n y_i}\right ) \right )</math> | ||

+ | |||

+ | :is a [[estimator#consistency|consistent]] [[estimator]] of the population Gini coefficient, but is not, in general, [[estimator#Point Estimators|unbiased]]. Like, G, G(S) has a simpler form: | ||

+ | |||

+ | :<math>G(S) = 1 - \frac{2}{n-1}\left ( n - \frac{\Sigma_{i=1}^n \; iy_i}{\Sigma_{i=1}^n y_i}\right ) </math>. | ||

+ | |||

+ | There does not exist a sample statistic that is in general an unbiased estimator of the population Gini coefficient, like the [[relative mean difference]]. | ||

+ | |||

+ | Sometimes the entire Lorenz curve is not known, and only values at certain intervals are given. In that case, the Gini coefficient can be approximated by using various techniques for [[interpolation|interpolating]] the missing values of the Lorenz curve. If ( X<sub> k</sub> , Y<sub>k</sub> ) are the known points on the Lorenz curve, with the X<sub> k</sub> indexed in increasing order ( X<sub> k - 1</sub> < X<sub> k</sub> ), so that: | ||

+ | * X<sub>k</sub> is the cumulated proportion of the population variable, for k = 0,...,n, with X<sub>0</sub> = 0, X<sub>n</sub> = 1. | ||

+ | * Y<sub>k</sub> is the cumulated proportion of the income variable, for k = 0,...,n, with Y<sub>0</sub> = 0, Y<sub>n</sub> = 1. | ||

+ | |||

+ | If the Lorenz curve is approximated on each interval as a line between consecutive points, then the area B can be approximated with [[Trapezoidal rule|trapezoids]] and: | ||

+ | :<math>G_1 = 1 - \sum_{k=1}^{n} (X_{k} - X_{k-1}) (Y_{k} + Y_{k-1})</math> | ||

+ | |||

+ | is the resulting approximation for G. More accurate results can be obtained using other methods to [[Numerical integration|approximate the area]] B, such as approximating the Lorenz curve with a [[Simpson's rule|quadratic function]] across pairs of intervals, or building an appropriately smooth approximation to the underlying distribution function that matches the known data. If the population mean and boundary values for each interval are also known, these can also often be used to improve the accuracy of the approximation. | ||

+ | |||

+ | The Gini coefficient calculated from a sample is a statistic and its standard error, or confidence intervals for the population Gini coefficient, should be reported. These can be calculated using bootstrap techniques but those proposed have been mathematically complicated and computationally onerous even in an era of fast computers. Ogwang (2000) made the process more efficient by setting up a “trick regression model” in which the incomes in the sample are ranked with the lowest income being allocated rank 1. The model then expresses the rank (dependent variable) as the sum of a constant ''A'' and a [[normal distribution|normal]] error term whose variance is inversely proportional to ''y''<sub>''k''</sub>; | ||

+ | |||

+ | :<math>k = A + \ N(0, s^{2}/y_k) </math> | ||

+ | |||

+ | Ogwang showed that ''G'' can be expressed as a function of the weighted least squares estimate of the constant ''A'' and that this can be used to speed up the calculation of the jackknife estimate for the standard error. Giles (2004) argued that the standard error of the estimate of ''A'' can be used to derive that of the estimate of ''G'' directly without using a jackknife at all. This method only requires the use of ordinary least squares regression after ordering the sample data. The results compare favorably with the estimates from the jackknife with agreement improving with increasing sample size. The paper describing this method can be found here: http://web.uvic.ca/econ/ewp0202.pdf | ||

+ | |||

+ | However it has since been argued that this is dependent on the model’s assumptions about the error distributions (Ogwang 2004) and the independence of error terms (Reza & Gastwirth 2006) and that these assumptions are often not valid for real data sets. It may therefore be better to stick with jackknife methods such as those proposed by Yitzhaki (1991) and Karagiannis and Kovacevic (2000). The debate continues. | ||

+ | |||

+ | The Gini coefficient can be calculated if you know the mean of a distribution, the number of people (or percentiles), and the income of each person (or percentile). [[Princeton University|Princeton]] [[development economics|development economist]] [[Angus Deaton]] (1997, 139) simplified the Gini calculation to one easy formula: | ||

+ | |||

+ | :<math>G = \frac{N+1}{N-1}-\frac{2}{N(N-1)u}(\Sigma_{i=1}^n \; P_iX_i)</math> | ||

+ | |||

+ | where u is mean income of the population, P<sub>i</sub> is the income rank P of person i, with income X, such that the richest person receives a rank of 1 and the poorest a rank of N. This effectively gives higher weight to poorer people in the income distribution, which allows the Gini to meet the Transfer Principle. | ||

+ | |||

+ | == Income Gini indices in the world == | ||

+ | A complete listing is in [[list of countries by income equality]]; the article [[economic inequality]] discusses the social and policy aspects of income and asset inequality. | ||

+ | |||

+ | [[Image:Gini Coefficient World Human Development Report 2007-2008.png|thumb|right|500px|Gini coefficient, income distribution by country. {| width=100% |Gini coefficient, income distribution by country. | ||

+ | ]] | ||

+ | |||

+ | While most developed European nations and [[Canada]] tend to have Gini indices between 24 and 36, the United States' and Mexico's Gini indices are both above 40, indicating that the [[United States]] and [[Economy of Mexico|Mexico]] have greater inequality. Using the Gini can help quantify differences in [[Social welfare|welfare]] and [[living wage|compensation]] policies and philosophies. However it should be borne in mind that the Gini coefficient can be misleading when used to make political comparisons between large and small countries (see [[Gini coefficient#Disadvantages of Gini coefficient as a measure of inequality|criticisms]] section). | ||

+ | |||

+ | The Gini index for the entire world has been estimated by various parties to be between 56 and 66.<ref>{{Citation | ||

+ | |url=http://siteresources.worldbank.org/INTDECINEQ/Resources/PSBSutcliffe.pdf | ||

+ | |title=Postscript to the article ‘World inequality and globalization’ (Oxford Review of Economic Policy, Spring 2004) | ||

+ | |author=Bob Sutcliffe | ||

+ | |month=April | ||

+ | |year=2007 | ||

+ | |accessdate=2007-12-13}}</ref><ref>[http://hdr.undp.org/docs/statistics/understanding/resources/HDR_2003_2_2_global_income_inequality.pdf United Nations Development Programme]</ref> | ||

+ | |||

+ | {{-}} | ||

+ | [[Image:Gini since WWII.gif|thumb|center|700px|Gini indices, income distribution over time for selected countries]] | ||

+ | |||

+ | === US income Gini indices over time === | ||

+ | Gini indices for the [[United States of America|United States]] at various times, according to the [[United States Census Bureau|US Census Bureau]]:<ref> | ||

+ | {{cite web | ||

+ | |url= http://www.census.gov/hhes/www/income/histinc/h04.html | ||

+ | |title= Gini Ratios for Households, by Race and Hispanic Origin of Householder: 1967 to 2007 | ||

+ | |work= Historical Income Tables - Households | ||

+ | |publisher= [[United States Census Bureau]] | ||

+ | }}</ref> | ||

+ | |||

+ | * 1929: 45.0 (estimated) | ||

+ | * 1947: 37.6 (estimated) | ||

+ | * 1967: 39.7 (first year reported) | ||

+ | * 1968: 38.6 (lowest index reported) | ||

+ | * 1970: 39.4 | ||

+ | * 1980: 40.3 | ||

+ | * 1990: 42.8 | ||

+ | * 2000: 46.2 | ||

+ | * 2005: 46.9 | ||

+ | * 2006: 47.0 (highest index reported) | ||

+ | * 2007: 46.3 <ref>Note that the calculation of the index for the United States was changed in 1992, resulting in an upwards shift of about 2.</ref> | ||

+ | |||

+ | === EU gini index === | ||

+ | <!-- THIS IS NOT A WEIGHTED AVERAGE, ITS FROM AN EU AGENCY --> | ||

+ | In 2005 the Gini index for the EU was estimated at 31.<ref>http://www.eurofound.europa.eu/areas/qualityoflife/eurlife/index.php?template=3&radioindic=158&idDomain=3</ref>. This is surprisingly low, since the EU has virtually no interstate income redistribution power (the EU budget is only ~1% of the total GDP, there are no EU taxes, there is no EU social policy, and no EU treasury); moreover, a number of poorer new member states joined in 2004.{{Citation needed|date=July 2009}} | ||

+ | |||

+ | ==Advantages of Gini coefficient as a measure of inequality== | ||

+ | * The Gini coefficient's main advantage is that it is a measure of inequality by means of a [[ratio analysis]], rather than a variable unrepresentative of most of the population, such as [[per capita income]] or [[gross domestic product]]. | ||

+ | |||

+ | * It can be used to compare income distributions across different population sectors as well as countries, for example the Gini coefficient for urban areas differs from that of rural areas in many countries (though the United States' urban and rural Gini coefficients are nearly identical). | ||

+ | |||

+ | * It is sufficiently simple that it can be compared across countries and be easily interpreted. GDP statistics are often criticized as they do not represent changes for the whole population; the Gini coefficient demonstrates how income has changed for poor and rich. If the Gini coefficient is rising as well as GDP, poverty may not be improving for the majority of the population. | ||

+ | |||

+ | * The Gini coefficient can be used to indicate how the distribution of income has changed within a country over a period of time, thus it is possible to see if inequality is increasing or decreasing. | ||

+ | |||

+ | * The Gini coefficient satisfies four important{{Who|date=July 2009}} principles: | ||

+ | ** ''Anonymity'': it does not matter who the high and low earners are. | ||

+ | ** ''Scale independence'': the Gini coefficient does not consider the size of the economy, the way it is measured, or whether it is a rich or poor country on average. | ||

+ | ** ''Population independence'': it does not matter how large the population of the country is. | ||

+ | ** ''Transfer principle'': if income (less than the difference), is transferred from a rich person to a poor person the resulting distribution is more equal. | ||

+ | |||

+ | ==Disadvantages of Gini coefficient as a measure of inequality== | ||

+ | * The Gini coefficient of different sets of people cannot be averaged to obtain the Gini coefficient of all the people in the sets: if a Gini coefficient were to be calculated for each person it would always be zero. For a large, economically diverse country, a much higher coefficient will be calculated for the country as a whole than will be calculated for each of its regions. (The coefficient is usually applied to measurable [[Real versus nominal value (economics)|nominal]] income rather than local [[purchasing power]], tending to increase the calculated coefficient across larger areas.) | ||

+ | |||

+ | : For this reason, the scores calculated for individual countries within the [[European Union|EU]] are difficult to compare with the score of the entire US: the overall value for the EU should be used in that case, 31.3<ref>{{Citation | ||

+ | |url=https://www.cia.gov/library/publications/the-world-factbook/geos/EE.html | ||

+ | |title=European Union | ||

+ | |publisher=CIA World Factbook | ||

+ | |accessdate=2007-12-13}}</ref>, which is still much lower than the United States', 45.<ref>{{Citation | ||

+ | |url=https://www.cia.gov/library/publications/the-world-factbook/geos/US.html | ||

+ | |title=United States | ||

+ | |publisher=CIA World Factbook | ||

+ | |accessdate=2007-12-13}}</ref> Using decomposable inequality measures (e.g. the [[Theil index]] <math>T</math> converted by <math>1-{e^{-T}}</math> into a inequality coefficient) averts such problems. | ||

+ | |||

+ | * The Lorenz curve may understate the actual amount of inequality if richer households are able to use income more efficiently than lower income households or vice versa. From another point of view, measured inequality may be the result of more or less efficient use of household incomes. | ||

+ | |||

+ | * Economies with similar incomes and Gini coefficients can still have very different income distributions. This is because the Lorenz curves can have different shapes and yet still yield the same Gini coefficient. | ||

+ | |||

+ | * It measures current income rather than lifetime income. A society in which everyone earned the same over a lifetime would appear unequal because of people at different stages in their life; a society in which students study rather than save can never have a coefficient of 0. However, Gini coefficient can also be calculated for any kind of distribution, e.g. for wealth. <ref>[http://daviddfriedman.com/Academic/Price_Theory/PThy_Chapter_14/PThy_Chapter_14.html Friedman, David D.]</ref> | ||

+ | |||

+ | ==Problems in using the Gini coefficient== | ||

+ | * Gini coefficients do include investment income; however, the Gini coefficient based on net income does not accurately reflect differences wealth - a possible source of misinterpretation. For example, [[Sweden]] has a low Gini coefficient for income distribution but a significantly higher Gini coefficient for wealth (still low by international standards, but significantly higher than for income: for instance 77% of the share value owned by households is held by just 5% of Swedish shareholding households )<ref>(Data from the [http://www.scb.se/templates/Publikation____193443.asp Statistics Sweden].)</ref>. In other words, the Gini income coefficient should not be interpreted as measuring effective [[egalitarianism]]. | ||

+ | |||

+ | * Too often only the Gini coefficient is quoted without describing the proportions of the quantiles used for measurement. As with other inequality coefficients, the Gini coefficient is influenced by the granularity of the measurements. For example, five 20% quantiles (low granularity) will usually yield a lower Gini coefficient than twenty 5% quantiles (high granularity) taken from the same distribution. This is an often encountered problem with measurements. | ||

+ | |||

+ | * Care should be taken in using the Gini coefficient as a measure of [[egalitarianism]], as it is properly a measure of income dispersion. For example, if two equally egalitarian countries pursue different immigration policies, the country accepting higher proportion of low-income or impoverished migrants will paradoxically be assessed as less equal (gain a higher Gini coefficient). | ||

+ | |||

+ | * The Gini coefficient is point-estimate of equality at a certain time, hence it ignores life-span changes in income. Typically, increases in the proportion of young or old members of a society will drive apparent changes in equality. Because of this, factors such as age distribution within a population and mobility within income classes can create the appearance of differential equality when none exist taking into account epidemiological effects. Thus a given economy may have a higher Gini coefficient at any one point in time compared to another, while the Gini coefficient calculated over individuals' lifetime income is actually lower than the apparently more equal (at a given point in time) economy's.<ref>N. Blomquist, "A comparison of distributions of annual and lifetime income: Sweden around 1970", Review of Income and Wealth, Volume 27 Issue 3, Pages 243 - 264, [http://www3.interscience.wiley.com/journal/119573773/abstract?CRETRY=1&SRETRY=0]</ref> Essentially, what matters is not just inequality in any particular year, but the composition of the distribution over time. | ||

+ | |||

+ | * Countries can have the same Gini coefficient but have completely different levels of wealth. Similarly, the Gini coefficient as measured over time does not measure growth in incomes. | ||

+ | |||

+ | ==General problems of measurement== | ||

+ | * Comparing income distributions among countries may be difficult because benefits systems may differ. For example, some countries give benefits in the form of money while others give [[food stamps]], which might not be counted by some economists and researchers as income in the Lorenz curve and therefore not taken into account in the Gini coefficient. The USA counts income before benefits, while France counts it after benefits, making the USA appear slightly more unequal vis-a-vis France than it admittedly is. In another example, USSR appeared to have relatively high income inequality: by some estimates, in the late 70's, Gini coefficient of its urban population was as high as 0.38<ref>"Politics, work, and daily life in the USSR", James R. Millar, 1987, p.193</ref>, which is higher than many Western countries today. This apparent inequality ignored the fact that many benefits received by Soviet citizens were nonmonetary and were afforded regardless of income: these benefits included, among others, free child care for children as young as 2 months, free elementary, secondary and higher education, free cradle-to-grave medical care, free or heavily subsidized housing. In this example, an accurate comparison between the 1970s USSR and Western countries would require one to assign monetary values to such benefits (a difficult task in the absence of free markets). Similar problems arise whenever a comparison between pure free-market economies and partially socialist economies is attempted. Benefits may take various and unexpected forms: for example, major oil producers such as Venezuela and Iran provide indirect benefits to its citizens by subsidizing the retail price of gasoline. | ||

+ | |||

+ | * The measure will give different results when applied to individuals instead of households. When different populations are not measured with consistent definitions, comparison is not meaningful. | ||

+ | |||

+ | * As for all statistics, there may be systematic and random errors in the data. The meaning of the Gini coefficient decreases as the data become less accurate. Also, countries may collect data differently, making it difficult to compare statistics between countries. | ||

+ | |||

+ | As one result of this criticism, in addition to or in competition with the Gini coefficient ''entropy measures'' are frequently used (e.g. the [[Theil Index]] and the [[Atkinson index]]). These measures attempt to compare the distribution of resources by intelligent agents in the market with a maximum [[information entropy|entropy]] [[random distribution]], which would occur if these agents acted like non-intelligent particles in a closed system following the laws of statistical physics. | ||

+ | |||

+ | ==Credit risk== | ||

+ | |||

+ | The Gini coefficient is also commonly used for the measurement of the discriminatory power of [[credit rating|rating]] systems in [[credit risk]] management. Since Gini coefficient addresses wealth inequality it may be important to understand what a [[transformative asset]] is. Transformative assets increase the Gini coefficient as they provide a family or individual with a wealth advantage over most persons. | ||

+ | |||

+ | The discriminatory power refers to a credit risk model's ability to differentiate between defaulting and non-defaulting clients. The above formula <math>G_1</math> may be used for the final model and also at individual model factor level, to quantify the discriminatory power of individual factors. This is as a result of too many non defaulting clients falling into the lower points scale e.g. factor has a 10 point scale and 30% of non defaulting clients are being assigned the lowest points available e.g. 0 or negative points. This indicates that the factor is behaving in a counter-intuitive manner and would require further investigation at the model development stage. | ||

+ | |||

+ | References: The Analytics of risk model validation | ||

+ | |||

+ | ==See also== | ||

+ | {{Col-begin}} | ||

+ | {{Col-break}} | ||

+ | * [[Managing for development results]] | ||

+ | * [[Globalization and Health]] | ||

+ | * [[Atkinson index]] | ||

+ | * [[Human Poverty Index]] | ||

+ | * [[Income inequality metrics]] | ||

+ | {{Col-break}} | ||

+ | * [[Pareto distribution]] | ||

+ | * [[Robin Hood index]] | ||

+ | * [[ROC analysis]] | ||

+ | * [[Social welfare provision]] | ||

+ | * [[Income_inequality_metrics#Spreadsheet computations|Spreadsheet computations]] | ||

+ | {{Col-break}} | ||

+ | * [[Suits index]] | ||

+ | * [[Theil index]] | ||

+ | * [[Wealth condensation]] | ||

+ | * [[Welfare economics]] | ||

+ | * [[List of countries by income equality]] | ||

+ | * [[List of countries by Human Development Index]] | ||

+ | |||

+ | {{Col-end}} | ||

+ | |||

+ | == References== | ||

+ | {{reflist}} | ||

+ | |||

+ | == Further reading== | ||

+ | <div class="references-small" style="-moz-column-count:2; column-count:2;"> | ||

+ | *{{cite book | author=Amiel, Y.; Cowell, F.A. | year=1999 | title=Thinking about Inequality | publisher=Cambridge}} | ||

+ | *{{cite book | first=Sudhir | last=Anand | year=1983 | title=Inequality and Poverty in Malaysia | publisher=Oxford University Press | location=New York}} | ||

+ | *{{cite journal | author=Brown, Malcolm | title=Using Gini-Style Indices to Evaluate the Spatial Patterns of Health Practitioners: Theoretical Considerations and an Application Based on Alberta Data | journal=Social Science Medicine | year=1994 | volume=38 | pages=1243–1256 | doi=10.1016/0277-9536(94)90189-9}} | ||

+ | *{{cite book | first=S. R. | last=Chakravarty | year=1990 | title=Ethical Social Index Numbers | publisher=Springer-Verlag | location=New York }} | ||

+ | *{{cite book | first=Angus | last=Deaton | year=1997 | title=Analysis of Household Surveys | publisher=Johns Hopkins University Press | location=Baltimore MD }} | ||

+ | *{{cite journal | author=Dixon, PM, Weiner J., Mitchell-Olds T, Woodley R. | title=Bootstrapping the Gini coefficient of inequality | journal=Ecology | year=1987 | volume=68 | pages=1548–1551 | doi=10.2307/1939238}} | ||

+ | *{{cite journal | author=Dorfman, Robert | title=A Formula for the Gini Coefficient | journal=The Review of Economics and Statistics | year=1979 | volume=61 | pages=146–149 | doi=10.2307/1924845}} | ||

+ | *{{cite journal | author=Gastwirth, Joseph L. | title=The Estimation of the Lorenz Curve and Gini Index | journal=The Review of Economics and Statistics | year=1972 | volume=54 | pages=306–316 | doi=10.2307/1937992}} | ||

+ | *{{cite journal | author=Giles, David | title=Calculating a Standard Error for the Gini Coefficient: Some Further Results | journal=Oxford Bulletin of Economics and Statistics | year=2004 | volume=66 | pages=425–433 | doi=10.1111/j.1468-0084.2004.00086.x }} | ||

+ | *Gini, Corrado (1912). "Variabilità e mutabilità" Reprinted in Memorie di metodologica statistica (Ed. Pizetti E, Salvemini, T). Rome: Libreria Eredi Virgilio Veschi (1955). | ||

+ | *{{cite journal | author=Gini, Corrado | title=Measurement of Inequality of Incomes | journal=The Economic Journal | year=1921 | volume=31 | pages=124–126 | doi=10.2307/2223319}} | ||

+ | *{{cite journal | author=Karagiannis, E. and Kovacevic, M. | title=A Method to Calculate the Jackknife Variance Estimator for the Gini Coefficient | journal=Oxford Bulletin of Economics and Statistics | year=2000 | volume=62 | pages=119–122 | doi=10.1111/1468-0084.00163}} | ||

+ | *{{cite journal | author=Mills, Jeffrey A.; Zandvakili, Sourushe | title=Statistical Inference via Bootstrapping for Measures of Inequality | journal=Journal of Applied Econometrics | year=1997 | volume=12 | pages=133–150 | doi=10.1002/(SICI)1099-1255(199703)12:2<133::AID-JAE433>3.0.CO;2-H}} | ||

+ | *{{cite journal | author=Modarres, Reza and Gastwirth, Joseph L. | title=A Cautionary Note on Estimating the Standard Error of the Gini Index of Inequality | journal=Oxford Bulletin of Economics and Statistics | year=2006 | volume=68 | pages=385–390 | doi=10.1111/j.1468-0084.2006.00167.x}} | ||

+ | *{{cite journal | author=Morgan, James | title=The Anatomy of Income Distribution | journal=The Review of Economics and Statistics | year=1962 | volume=44 | pages=270–283 | doi=10.2307/1926398 }} | ||

+ | *{{cite journal | author=Ogwang, Tomson | title=A Convenient Method of Computing the Gini Index and its Standard Error | journal=Oxford Bulletin of Economics and Statistics | year=2000 | volume=62 | pages=123–129 | doi=10.1111/1468-0084.00164 }} | ||

+ | *{{cite journal | author=Ogwang, Tomson | title=Calculating a Standard Error for the Gini Coefficient: Some Further Results: Reply | journal=Oxford Bulletin of Economics and Statistics | year=2004 | volume=66 | pages=435–437 | doi=10.1111/j.1468-0084.2004.00087.x}} | ||

+ | *{{Cite paper | author=Xu, Kuan | title=How Has the Literature on Gini's Index Evolved in the Past 80 Years? | publisher=Department of Economics, Dalhousie University | date=January 2004 | version= | url=http://economics.dal.ca/RePEc/dal/wparch/howgini.pdf | accessdate=2006-06-01 }} The Chinese version of this paper appears in {{cite journal | author=Xu, Kuan | title=How Has the Literature on Gini's Index Evolved in the Past 80 Years? | journal = China Economic Quarterly | year=2003 | volume=2 | pages=757–778 }} | ||

+ | *{{cite journal | author=Yitzhaki, S. | title=Calculating Jackknife Variance Estimators for Parameters of the Gini Method| journal=Journal of Business and Economic Statistics | year=1991 | volume=9 | pages=235–239 | doi=10.2307/1391792}} | ||

+ | </div> | ||

+ | |||

+ | == External links == | ||

+ | * Deutsche Bundesbank: [http://www.bundesbank.de/download/bankenaufsicht/dkp/200503dkp_b.pdf Do banks diversify loan portfolios?], 2005 (on using e.g. the Gini coefficient for risc evaluation of loan portefolios) | ||

+ | * [http://www.forbes.com/billionaires/free_forbes/2003/0317/098.html Forbes Article, In praise of inequality] | ||

+ | * [http://web.archive.org/web/20070614000931/http://hdr.undp.org/reports/global/2004/pdf/hdr04_HDI.pdf Gini index calculated for all countries] (from internet archive) | ||

+ | * [http://www.theresearchkitchen.com/blog/archives/219 Measuring Software Project Risk With The Gini Coefficient], an application of the Gini coefficient to software | ||

+ | * [http://web.worldbank.org/WBSITE/EXTERNAL/TOPICS/EXTPOVERTY/EXTPA/0,,contentMDK:20238991~menuPK:492138~pagePK:148956~piPK:216618~theSitePK:430367,00.html The World Bank: Measuring Inequality] | ||

+ | * [http://utip.gov.utexas.edu/tutorials/theo_basic_ineq_measures.doc Travis Hale, University of Texas Inequality Project:The Theoretical Basics of Popular Inequality Measures], online computation of examples: [http://www.poorcity.richcity.org/calculator/?quantiles=7,*18000|10,*22000|280,*25000|15,*35000|15,*40000|50,*60000|10,*75000|6,*80000|4,*120000|2,*200000|1,1000000 1A], [http://www.poorcity.richcity.org/calculator/?quantiles=12,*15000|25,*20000|1000,*30000|35,*35000|100,*45000|80,*50000|10,*60000|25,*80000|8,*175000|4,*250000|1,5000000 1B] | ||

+ | * [http://www.census.gov/hhes/www/income/histinc/state/state4.html United States Census Bureau List of Gini Coefficients by State for Families and Households] | ||

+ | * [http://image.guardian.co.uk/sys-files/Guardian/documents/2009/03/13/inequality.pdf Article from The Guardian analysing inequality in the UK 1974 - 2006] | ||

+ | * [http://www.wider.unu.edu/research/Database/en_GB/database/ World Income Inequality Database] | ||

+ | * [http://www.oecd.org/document/53/0,3343,en_2649_33933_41460917_1_1_1_1,00.html Income Distribution and Poverty in OECD Countries] | ||

+ | * Software: | ||

+ | ** [http://www.mathworks.com/matlabcentral/fileexchange/loadFile.do?objectId=19968 A Matlab Inequality Package], including code for computing Gini, Atkinson, Theil indexes and for plotting the Lorenz Curve. Many examples are available. | ||

+ | ** [http://www.wessa.net/co.wasp Free Online Calculator] computes the Gini Coefficient, plots the Lorenz curve, and computes many other measures of concentration for any dataset | ||

+ | ** Free Calculator: [http://www.poorcity.richcity.org/calculator.htm Online] and [http://luaforge.net/project/showfiles.php?group_id=49 downloadable scripts] ([[Python (programming language)|Python]] and [[Lua programming language|Lua]]) for Atkinson, Gini, and Hoover inequalities | ||

+ | ** Users of the [http://www.r-project.org/ R] data analysis software can install the "[http://cran.r-project.org/web/packages/ineq/index.html ineq]" package which allows for computation of a variety of inequality indices including Gini, Atkinson, Theil. | ||

+ | |||

+ | {{DEFAULTSORT:Gini Coefficient}} | ||

+ | [[Category:Welfare economics]] | ||

+ | [[Category:Index numbers]] | ||

+ | [[Category:Socioeconomics]] | ||

+ | [[Category:Economic indicators]] | ||

+ | [[Category:Economic inequality]] | ||

+ | [[Category:Summary statistics]] | ||

+ | [[Category:Income distribution]] | ||

+ | [[Category:Demographic economics]] | ||

+ | |||

+ | [[ar:معامل جيني]] | ||

+ | [[bn:জিনি সহগ]] | ||

+ | [[be-x-old:Каэфіцыент Джыні]] | ||

+ | [[ca:Coeficient de Gini]] | ||

+ | [[cs:Giniho koeficient]] | ||

+ | [[da:Gini-koefficient]] | ||

+ | [[de:Gini-Koeffizient]] | ||

+ | [[es:Coeficiente de Gini]] | ||

+ | [[eo:Koeficiento de Gini]] | ||

+ | [[fa:شاخص جینی]] | ||

+ | [[fr:Coefficient de Gini]] | ||

+ | [[ko:지니 계수]] | ||

+ | [[ig:Gini coefficient]] | ||

+ | [[it:Coefficiente di Gini]] | ||

+ | [[he:מדד ג'יני]] | ||

+ | [[lo:ສຳປະສິດຈິນີ]] | ||

+ | [[lv:Džini koeficients]] | ||

+ | [[mk:Џини коефициент]] | ||

+ | [[ms:Pekali Gini]] | ||

+ | [[nl:Gini-coëfficiënt]] | ||

+ | [[ja:ジニ係数]] | ||

+ | [[no:Gini-koeffisient]] | ||

+ | [[pl:Współczynnik Giniego]] | ||

+ | [[pt:Coeficiente de Gini]] | ||

+ | [[ro:Coeficientul lui Gini]] | ||

+ | [[ru:Коэффициент Джини]] | ||

+ | [[sah:Дьини коэффициена]] | ||

+ | [[sl:Ginijev koeficient]] | ||

+ | [[su:Koefisien Gini]] | ||

+ | [[fi:Gini-kerroin]] | ||

+ | [[sv:Ginikoefficient]] | ||

+ | [[th:ค่าสัมประสิทธิ์จีนี]] | ||

+ | [[tr:Gini katsayısı]] | ||

+ | [[vi:Hệ số Gini]] | ||

+ | [[zh:基尼系数]] |

## Revision as of 20:41, 4 August 2009

## Stub Draft

The public wiki is not an appropriate place for unofficial estimation of Gini for those countries for which it isn't currrently available but the following are of interest:

- Cuba - Should be very low
- DPRK - Low or High? This and prior one should be informed by the existing stats for China.
- Iraq - Obvious political interest
- Saudi Arabia - Odd no data given wealth

The **Gini coefficient** is a measure of statistical dispersion developed by the Italian statistician Corrado Gini and published in his 1912 paper "Variability and Mutability" (Template:Lang-it). It is commonly used as a measure of inequality of income or wealth. It has, however, also found application in the study of inequalities in disciplines as diverse as health science, ecology, and chemistry.

## Definition

The Gini coefficient is usually defined mathematically based on the Lorenz curve (below). It can be thought of as the ratio of the area that lies between the line of equality and the Lorenz curve (marked 'A' in the diagram) over the total area under the line of equality (marked 'A' and 'B' in the diagram); i.e., G=A/(A+B).

The Gini coefficient can range from 0 to 1; it is sometimes multiplied by 100 to range between 0 and 100. A low Gini coefficient indicates a more equal distribution, with 0 corresponding to perfect equality, while higher Gini coefficients indicate more unequal distribution, with 1 corresponding to perfect inequality. To be validly computed, no negative goods can be distributed. Thus, if the Gini coefficient is being used to describe household income inequality, then no household can have a negative income. When used as a measure of income inequality, the most unequal society will be one in which a single person receives 100% of the total income and the remaining people receive none (G=1); and the most equal society will be one in which every person receives the same percentage of the total income (G=0).

Some find it more intuitive (and it is mathematically equivalent) to think of the Gini coefficient as half of the Relative mean difference. The mean difference is the average absolute difference between two items selected randomly from a population, and the relative mean difference is the mean difference divided by the average, to normalize for scale. Worldwide, Gini coefficients for income range from approximately 0.230 in Sweden to 0.707 in Namibia although not every country has been assessed.

As a mathematical measure of inequality, the Gini coefficient carries no moral judgement about whether a particular level of (in)equality is good or bad.

## Different uses

Although the Gini coefficient is most popular in economics, it can in theory be applied in any field of science that studies a distribution. For example, in ecology the Gini coefficient has been used as a measure of biodiversity, where the cumulative proportion of species is plotted against cumulative proportion of individuals^{[1]}. In health, it has been used as a measure of the inequality of health related quality of life in a population^{[2]}. In chemistry it has been used to to express the selectivity of protein kinase inhibitors against a panel of kinases^{[3]}.

## Calculation

The Gini index is defined as a ratio of the areas on the Lorenz curve diagram. If the area between the line of perfect equality and the Lorenz curve is A, and the area under the Lorenz curve is B, then the Gini index is A/(A+B). Since A+B = 0.5, the Gini index, G = A/(0.5) = 2A = 1-2B. If the Lorenz curve is represented by the function Y = L(X), the value of B can be found with integration and:

- <math>G = 1 - 2\,\int_0^1 L(X) dX. </math>

In some cases, this equation can be applied to calculate the Gini coefficient without direct reference to the Lorenz curve. For example:

- For a population uniform on the values
*y*_{i},*i*= 1 to*n*, indexed in non-decreasing order (*y*_{i}≤*y*_{i+1}):

- <math>G = \frac{1}{n}\left ( n+1 - 2 \left ( \frac{\Sigma_{i=1}^n \; (n+1-i)y_i}{\Sigma_{i=1}^n y_i} \right ) \right ) </math>
- This may be simplified to:
- <math>G = \frac{2 \Sigma_{i=1}^n \; i y_i}{n \Sigma_{i=1}^n y_i} -\frac{n+1}{n}</math>

- For a discrete probability function
*f*(*y*), where*y*_{i},*i*= 1 to*n*, are the points with nonzero probabilities and which are indexed in increasing order (*y*_{i}<*y*_{i+1}):

- <math>G = 1 - \frac{\Sigma_{i=1}^n \; f(y_i)(S_{i-1}+S_i)}{S_n}</math>
- where
- <math>S_i = \Sigma_{j=1}^i \; f(y_j)\,y_j\,</math> and <math>S_0 = 0\,</math>

- For a cumulative distribution function
*F*(*y*) that is piecewise differentiable, has a mean μ, and is zero for all negative values of*y*:

- <math>G = 1 - \frac{1}{\mu}\int_0^\infty (1-F(y))^2dy = \frac{1}{\mu}\int_0^\infty F(y)(1-F(y))dy</math>

- Since the Gini coefficient is half the relative mean difference, it can also be calculated using formulas for the relative mean difference. For a random sample
*S*consisting of values*y*_{i},*i*= 1 to*n*, that are indexed in non-decreasing order (*y*_{i}≤*y*_{i+1}), the statistic:

- <math>G(S) = \frac{1}{n-1}\left (n+1 - 2 \left ( \frac{\Sigma_{i=1}^n \; (n+1-i)y_i}{\Sigma_{i=1}^n y_i}\right ) \right )</math>

- is a consistent estimator of the population Gini coefficient, but is not, in general, unbiased. Like, G, G(S) has a simpler form:

- <math>G(S) = 1 - \frac{2}{n-1}\left ( n - \frac{\Sigma_{i=1}^n \; iy_i}{\Sigma_{i=1}^n y_i}\right ) </math>.

There does not exist a sample statistic that is in general an unbiased estimator of the population Gini coefficient, like the relative mean difference.

Sometimes the entire Lorenz curve is not known, and only values at certain intervals are given. In that case, the Gini coefficient can be approximated by using various techniques for interpolating the missing values of the Lorenz curve. If ( X_{ k} , Y_{k} ) are the known points on the Lorenz curve, with the X_{ k} indexed in increasing order ( X_{ k - 1} < X_{ k} ), so that:

- X
_{k}is the cumulated proportion of the population variable, for k = 0,...,n, with X_{0}= 0, X_{n}= 1. - Y
_{k}is the cumulated proportion of the income variable, for k = 0,...,n, with Y_{0}= 0, Y_{n}= 1.

If the Lorenz curve is approximated on each interval as a line between consecutive points, then the area B can be approximated with trapezoids and:

- <math>G_1 = 1 - \sum_{k=1}^{n} (X_{k} - X_{k-1}) (Y_{k} + Y_{k-1})</math>

is the resulting approximation for G. More accurate results can be obtained using other methods to approximate the area B, such as approximating the Lorenz curve with a quadratic function across pairs of intervals, or building an appropriately smooth approximation to the underlying distribution function that matches the known data. If the population mean and boundary values for each interval are also known, these can also often be used to improve the accuracy of the approximation.

The Gini coefficient calculated from a sample is a statistic and its standard error, or confidence intervals for the population Gini coefficient, should be reported. These can be calculated using bootstrap techniques but those proposed have been mathematically complicated and computationally onerous even in an era of fast computers. Ogwang (2000) made the process more efficient by setting up a “trick regression model” in which the incomes in the sample are ranked with the lowest income being allocated rank 1. The model then expresses the rank (dependent variable) as the sum of a constant *A* and a normal error term whose variance is inversely proportional to *y*_{k};

- <math>k = A + \ N(0, s^{2}/y_k) </math>

Ogwang showed that *G* can be expressed as a function of the weighted least squares estimate of the constant *A* and that this can be used to speed up the calculation of the jackknife estimate for the standard error. Giles (2004) argued that the standard error of the estimate of *A* can be used to derive that of the estimate of *G* directly without using a jackknife at all. This method only requires the use of ordinary least squares regression after ordering the sample data. The results compare favorably with the estimates from the jackknife with agreement improving with increasing sample size. The paper describing this method can be found here: http://web.uvic.ca/econ/ewp0202.pdf

However it has since been argued that this is dependent on the model’s assumptions about the error distributions (Ogwang 2004) and the independence of error terms (Reza & Gastwirth 2006) and that these assumptions are often not valid for real data sets. It may therefore be better to stick with jackknife methods such as those proposed by Yitzhaki (1991) and Karagiannis and Kovacevic (2000). The debate continues.

The Gini coefficient can be calculated if you know the mean of a distribution, the number of people (or percentiles), and the income of each person (or percentile). Princeton development economist Angus Deaton (1997, 139) simplified the Gini calculation to one easy formula:

- <math>G = \frac{N+1}{N-1}-\frac{2}{N(N-1)u}(\Sigma_{i=1}^n \; P_iX_i)</math>

where u is mean income of the population, P_{i} is the income rank P of person i, with income X, such that the richest person receives a rank of 1 and the poorest a rank of N. This effectively gives higher weight to poorer people in the income distribution, which allows the Gini to meet the Transfer Principle.

## Income Gini indices in the world

A complete listing is in list of countries by income equality; the article economic inequality discusses the social and policy aspects of income and asset inequality.

While most developed European nations and Canada tend to have Gini indices between 24 and 36, the United States' and Mexico's Gini indices are both above 40, indicating that the United States and Mexico have greater inequality. Using the Gini can help quantify differences in welfare and compensation policies and philosophies. However it should be borne in mind that the Gini coefficient can be misleading when used to make political comparisons between large and small countries (see criticisms section).

The Gini index for the entire world has been estimated by various parties to be between 56 and 66.^{[4]}^{[5]}

### US income Gini indices over time

Gini indices for the United States at various times, according to the US Census Bureau:^{[6]}

- 1929: 45.0 (estimated)
- 1947: 37.6 (estimated)
- 1967: 39.7 (first year reported)
- 1968: 38.6 (lowest index reported)
- 1970: 39.4
- 1980: 40.3
- 1990: 42.8
- 2000: 46.2
- 2005: 46.9
- 2006: 47.0 (highest index reported)
- 2007: 46.3
^{[7]}

### EU gini index

In 2005 the Gini index for the EU was estimated at 31.^{[8]}. This is surprisingly low, since the EU has virtually no interstate income redistribution power (the EU budget is only ~1% of the total GDP, there are no EU taxes, there is no EU social policy, and no EU treasury); moreover, a number of poorer new member states joined in 2004.

## Advantages of Gini coefficient as a measure of inequality

- The Gini coefficient's main advantage is that it is a measure of inequality by means of a ratio analysis, rather than a variable unrepresentative of most of the population, such as per capita income or gross domestic product.

- It can be used to compare income distributions across different population sectors as well as countries, for example the Gini coefficient for urban areas differs from that of rural areas in many countries (though the United States' urban and rural Gini coefficients are nearly identical).

- It is sufficiently simple that it can be compared across countries and be easily interpreted. GDP statistics are often criticized as they do not represent changes for the whole population; the Gini coefficient demonstrates how income has changed for poor and rich. If the Gini coefficient is rising as well as GDP, poverty may not be improving for the majority of the population.

- The Gini coefficient can be used to indicate how the distribution of income has changed within a country over a period of time, thus it is possible to see if inequality is increasing or decreasing.

- The Gini coefficient satisfies four importantTemplate:Who principles:
*Anonymity*: it does not matter who the high and low earners are.*Scale independence*: the Gini coefficient does not consider the size of the economy, the way it is measured, or whether it is a rich or poor country on average.*Population independence*: it does not matter how large the population of the country is.*Transfer principle*: if income (less than the difference), is transferred from a rich person to a poor person the resulting distribution is more equal.

## Disadvantages of Gini coefficient as a measure of inequality

- The Gini coefficient of different sets of people cannot be averaged to obtain the Gini coefficient of all the people in the sets: if a Gini coefficient were to be calculated for each person it would always be zero. For a large, economically diverse country, a much higher coefficient will be calculated for the country as a whole than will be calculated for each of its regions. (The coefficient is usually applied to measurable nominal income rather than local purchasing power, tending to increase the calculated coefficient across larger areas.)

- For this reason, the scores calculated for individual countries within the EU are difficult to compare with the score of the entire US: the overall value for the EU should be used in that case, 31.3
^{[9]}, which is still much lower than the United States', 45.^{[10]}Using decomposable inequality measures (e.g. the Theil index <math>T</math> converted by <math>1-{e^{-T}}</math> into a inequality coefficient) averts such problems.

- The Lorenz curve may understate the actual amount of inequality if richer households are able to use income more efficiently than lower income households or vice versa. From another point of view, measured inequality may be the result of more or less efficient use of household incomes.

- Economies with similar incomes and Gini coefficients can still have very different income distributions. This is because the Lorenz curves can have different shapes and yet still yield the same Gini coefficient.

- It measures current income rather than lifetime income. A society in which everyone earned the same over a lifetime would appear unequal because of people at different stages in their life; a society in which students study rather than save can never have a coefficient of 0. However, Gini coefficient can also be calculated for any kind of distribution, e.g. for wealth.
^{[11]}

## Problems in using the Gini coefficient

- Gini coefficients do include investment income; however, the Gini coefficient based on net income does not accurately reflect differences wealth - a possible source of misinterpretation. For example, Sweden has a low Gini coefficient for income distribution but a significantly higher Gini coefficient for wealth (still low by international standards, but significantly higher than for income: for instance 77% of the share value owned by households is held by just 5% of Swedish shareholding households )
^{[12]}. In other words, the Gini income coefficient should not be interpreted as measuring effective egalitarianism.

- Too often only the Gini coefficient is quoted without describing the proportions of the quantiles used for measurement. As with other inequality coefficients, the Gini coefficient is influenced by the granularity of the measurements. For example, five 20% quantiles (low granularity) will usually yield a lower Gini coefficient than twenty 5% quantiles (high granularity) taken from the same distribution. This is an often encountered problem with measurements.

- Care should be taken in using the Gini coefficient as a measure of egalitarianism, as it is properly a measure of income dispersion. For example, if two equally egalitarian countries pursue different immigration policies, the country accepting higher proportion of low-income or impoverished migrants will paradoxically be assessed as less equal (gain a higher Gini coefficient).

- The Gini coefficient is point-estimate of equality at a certain time, hence it ignores life-span changes in income. Typically, increases in the proportion of young or old members of a society will drive apparent changes in equality. Because of this, factors such as age distribution within a population and mobility within income classes can create the appearance of differential equality when none exist taking into account epidemiological effects. Thus a given economy may have a higher Gini coefficient at any one point in time compared to another, while the Gini coefficient calculated over individuals' lifetime income is actually lower than the apparently more equal (at a given point in time) economy's.
^{[13]}Essentially, what matters is not just inequality in any particular year, but the composition of the distribution over time.

- Countries can have the same Gini coefficient but have completely different levels of wealth. Similarly, the Gini coefficient as measured over time does not measure growth in incomes.

## General problems of measurement

- Comparing income distributions among countries may be difficult because benefits systems may differ. For example, some countries give benefits in the form of money while others give food stamps, which might not be counted by some economists and researchers as income in the Lorenz curve and therefore not taken into account in the Gini coefficient. The USA counts income before benefits, while France counts it after benefits, making the USA appear slightly more unequal vis-a-vis France than it admittedly is. In another example, USSR appeared to have relatively high income inequality: by some estimates, in the late 70's, Gini coefficient of its urban population was as high as 0.38
^{[14]}, which is higher than many Western countries today. This apparent inequality ignored the fact that many benefits received by Soviet citizens were nonmonetary and were afforded regardless of income: these benefits included, among others, free child care for children as young as 2 months, free elementary, secondary and higher education, free cradle-to-grave medical care, free or heavily subsidized housing. In this example, an accurate comparison between the 1970s USSR and Western countries would require one to assign monetary values to such benefits (a difficult task in the absence of free markets). Similar problems arise whenever a comparison between pure free-market economies and partially socialist economies is attempted. Benefits may take various and unexpected forms: for example, major oil producers such as Venezuela and Iran provide indirect benefits to its citizens by subsidizing the retail price of gasoline.

- The measure will give different results when applied to individuals instead of households. When different populations are not measured with consistent definitions, comparison is not meaningful.

- As for all statistics, there may be systematic and random errors in the data. The meaning of the Gini coefficient decreases as the data become less accurate. Also, countries may collect data differently, making it difficult to compare statistics between countries.

As one result of this criticism, in addition to or in competition with the Gini coefficient *entropy measures* are frequently used (e.g. the Theil Index and the Atkinson index). These measures attempt to compare the distribution of resources by intelligent agents in the market with a maximum entropy random distribution, which would occur if these agents acted like non-intelligent particles in a closed system following the laws of statistical physics.

## Credit risk

The Gini coefficient is also commonly used for the measurement of the discriminatory power of rating systems in credit risk management. Since Gini coefficient addresses wealth inequality it may be important to understand what a transformative asset is. Transformative assets increase the Gini coefficient as they provide a family or individual with a wealth advantage over most persons.

The discriminatory power refers to a credit risk model's ability to differentiate between defaulting and non-defaulting clients. The above formula <math>G_1</math> may be used for the final model and also at individual model factor level, to quantify the discriminatory power of individual factors. This is as a result of too many non defaulting clients falling into the lower points scale e.g. factor has a 10 point scale and 30% of non defaulting clients are being assigned the lowest points available e.g. 0 or negative points. This indicates that the factor is behaving in a counter-intuitive manner and would require further investigation at the model development stage.

References: The Analytics of risk model validation

## See also

Template:Col-begin Template:Col-break

- Managing for development results
- Globalization and Health
- Atkinson index
- Human Poverty Index
- Income inequality metrics

- Suits index
- Theil index
- Wealth condensation
- Welfare economics
- List of countries by income equality
- List of countries by Human Development Index

## References

- ↑ Template:Cite article
- ↑ Template:Cite article
- ↑ Template:Cite article
- ↑ Template:Citation
- ↑ United Nations Development Programme
- ↑
"Gini Ratios for Households, by Race and Hispanic Origin of Householder: 1967 to 2007".
*Historical Income Tables - Households*. United States Census Bureau. Archived from the original. Error: You must specify the date the archive was made using the`|archivedate=`

parameter. http://www.census.gov/hhes/www/income/histinc/h04.html. - ↑ Note that the calculation of the index for the United States was changed in 1992, resulting in an upwards shift of about 2.
- ↑ http://www.eurofound.europa.eu/areas/qualityoflife/eurlife/index.php?template=3&radioindic=158&idDomain=3
- ↑ Template:Citation
- ↑ Template:Citation
- ↑ Friedman, David D.
- ↑ (Data from the Statistics Sweden.)
- ↑ N. Blomquist, "A comparison of distributions of annual and lifetime income: Sweden around 1970", Review of Income and Wealth, Volume 27 Issue 3, Pages 243 - 264, [1]
- ↑ "Politics, work, and daily life in the USSR", James R. Millar, 1987, p.193

## Further reading

- Amiel, Y.; Cowell, F.A. (1999).
*Thinking about Inequality*. Cambridge. - Anand, Sudhir (1983).
*Inequality and Poverty in Malaysia*. New York: Oxford University Press. - Template:Cite journal
- Chakravarty, S. R. (1990).
*Ethical Social Index Numbers*. New York: Springer-Verlag. - Deaton, Angus (1997).
*Analysis of Household Surveys*. Baltimore MD: Johns Hopkins University Press. - Template:Cite journal
- Template:Cite journal
- Template:Cite journal
- Template:Cite journal
- Gini, Corrado (1912). "Variabilità e mutabilità" Reprinted in Memorie di metodologica statistica (Ed. Pizetti E, Salvemini, T). Rome: Libreria Eredi Virgilio Veschi (1955).
- Template:Cite journal
- Template:Cite journal
- Template:Cite journal
- Template:Cite journal
- Template:Cite journal
- Template:Cite journal
- Template:Cite journal
- Template:Cite paper The Chinese version of this paper appears in Template:Cite journal
- Template:Cite journal

## External links

- Deutsche Bundesbank: Do banks diversify loan portfolios?, 2005 (on using e.g. the Gini coefficient for risc evaluation of loan portefolios)
- Forbes Article, In praise of inequality
- Gini index calculated for all countries (from internet archive)
- Measuring Software Project Risk With The Gini Coefficient, an application of the Gini coefficient to software
- The World Bank: Measuring Inequality
- Travis Hale, University of Texas Inequality Project:The Theoretical Basics of Popular Inequality Measures, online computation of examples: 1A, 1B
- United States Census Bureau List of Gini Coefficients by State for Families and Households
- Article from The Guardian analysing inequality in the UK 1974 - 2006
- World Income Inequality Database
- Income Distribution and Poverty in OECD Countries
- Software:
- A Matlab Inequality Package, including code for computing Gini, Atkinson, Theil indexes and for plotting the Lorenz Curve. Many examples are available.
- Free Online Calculator computes the Gini Coefficient, plots the Lorenz curve, and computes many other measures of concentration for any dataset
- Free Calculator: Online and downloadable scripts (Python and Lua) for Atkinson, Gini, and Hoover inequalities
- Users of the R data analysis software can install the "ineq" package which allows for computation of a variety of inequality indices including Gini, Atkinson, Theil.

ar:معامل جيني bn:জিনি সহগ be-x-old:Каэфіцыент Джыні ca:Coeficient de Gini cs:Giniho koeficient da:Gini-koefficient es:Coeficiente de Gini eo:Koeficiento de Gini fa:شاخص جینی fr:Coefficient de Gini ko:지니 계수 ig:Gini coefficient it:Coefficiente di Gini he:מדד ג'יני lo:ສຳປະສິດຈິນີ lv:Džini koeficients mk:Џини коефициент ms:Pekali Gini nl:Gini-coëfficiënt ja:ジニ係数 no:Gini-koeffisient pl:Współczynnik Giniego ro:Coeficientul lui Gini ru:Коэффициент Джини sah:Дьини коэффициена sl:Ginijev koeficient su:Koefisien Gini fi:Gini-kerroin th:ค่าสัมประสิทธิ์จีนี tr:Gini katsayısı vi:Hệ số Gini