ON CALCULATION OF THE EXTENDED GINI COEFFICIENT. Duangkamon Chotikapanich and William Griffiths. No May 2000

Similar documents
How To Calculate The Accountng Perod Of Nequalty

Linear Circuits Analysis. Superposition, Thevenin /Norton Equivalent circuits

The OC Curve of Attribute Acceptance Plans

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ).

8.5 UNITARY AND HERMITIAN MATRICES. The conjugate transpose of a complex matrix A, denoted by A*, is given by

This circuit than can be reduced to a planar circuit

An Empirical Investigation of Collateral and Sorting in the HELOC Market

Optical Signal-to-Noise Ratio and the Q-Factor in Fiber-Optic Communication Systems

v a 1 b 1 i, a 2 b 2 i,..., a n b n i.

THE METHOD OF LEAST SQUARES THE METHOD OF LEAST SQUARES

IT09 - Identity Management Policy

Can Auto Liability Insurance Purchases Signal Risk Attitude?

CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol

Recurrence. 1 Definitions and main statements

CHAPTER 14 MORE ABOUT REGRESSION

How To Understand The Results Of The German Meris Cloud And Water Vapour Product

Section 5.4 Annuities, Present Value, and Amortization

Calculation of Sampling Weights

SIMPLE LINEAR CORRELATION

NPAR TESTS. One-Sample Chi-Square Test. Cell Specification. Observed Frequencies 1O i 6. Expected Frequencies 1EXP i 6

Characterization of Assembly. Variation Analysis Methods. A Thesis. Presented to the. Department of Mechanical Engineering. Brigham Young University

CS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements

Answer: A). There is a flatter IS curve in the high MPC economy. Original LM LM after increase in M. IS curve for low MPC economy

What is Candidate Sampling

Quantization Effects in Digital Filters

DEFINING %COMPLETE IN MICROSOFT PROJECT

Simple Interest Loans (Section 5.1) :

Risk-based Fatigue Estimate of Deep Water Risers -- Course Project for EM388F: Fracture Mechanics, Spring 2008

Realistic Image Synthesis

THE DISTRIBUTION OF LOAN PORTFOLIO VALUE * Oldrich Alfons Vasicek

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12

Forecasting the Direction and Strength of Stock Market Movement

The Games of Cournot Sports

Rotation Kinematics, Moment of Inertia, and Torque

An Alternative Way to Measure Private Equity Performance

An Interest-Oriented Network Evolution Mechanism for Online Communities

Lecture 3: Annuity. Study annuities whose payments form a geometric progression or a arithmetic progression.

8 Algorithm for Binary Searching in Trees

where the coordinates are related to those in the old frame as follows.

Analysis of Premium Liabilities for Australian Lines of Business

BERNSTEIN POLYNOMIALS

Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting

Evaluating the Effects of FUNDEF on Wages and Test Scores in Brazil *

WORKING PAPERS. The Impact of Technological Change and Lifestyles on the Energy Demand of Households

Exhaustive Regression. An Exploration of Regression-Based Data Mining Techniques Using Super Computation

Economic Interpretation of Regression. Theory and Applications

Traffic-light a stress test for life insurance provisions

Using Series to Analyze Financial Situations: Present Value

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Accident analysis: The role of liability rules pecuniary losses.

In our example i = r/12 =.0825/12 At the end of the first month after your payment is received your amount in the account, the balance, is

Lecture 3: Force of Interest, Real Interest Rate, Annuity

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis

reduce competition increase market power cost savings economies of scale and scope cost savings Oliver Williamson: the efficiency defense

1. Fundamentals of probability theory 2. Emergence of communication traffic 3. Stochastic & Markovian Processes (SP & MP)

High Correlation between Net Promoter Score and the Development of Consumers' Willingness to Pay (Empirical Evidence from European Mobile Markets)

Application of Quasi Monte Carlo methods and Global Sensitivity Analysis in finance

Marginal Benefit Incidence Analysis Using a Single Cross-section of Data. Mohamed Ihsan Ajwad and Quentin Wodon 1. World Bank.

Stress test for measuring insurance risks in non-life insurance

PRIVATE SCHOOL CHOICE: THE EFFECTS OF RELIGIOUS AFFILIATION AND PARTICIPATION

Latent Class Regression. Statistics for Psychosocial Research II: Structural Models December 4 and 6, 2006

Understanding the Impact of Marketing Actions in Traditional Channels on the Internet: Evidence from a Large Scale Field Experiment

FREQUENCY OF OCCURRENCE OF CERTAIN CHEMICAL CLASSES OF GSR FROM VARIOUS AMMUNITION TYPES

Conversion between the vector and raster data structures using Fuzzy Geographical Entities

Effect of a spectrum of relaxation times on the capillary thinning of a filament of elastic liquid

Binomial Link Functions. Lori Murray, Phil Munz

The Application of Fractional Brownian Motion in Option Pricing

Support Vector Machines

An Evaluation of the Extended Logistic, Simple Logistic, and Gompertz Models for Forecasting Short Lifecycle Products and Services

Gender differences in revealed risk taking: evidence from mutual fund investors

HÜCKEL MOLECULAR ORBITAL THEORY

On some special nonlevel annuities and yield rates for annuities

SPEE Recommended Evaluation Practice #6 Definition of Decline Curve Parameters Background:

Ring structure of splines on triangulations

Underwriting Risk. Glenn Meyers. Insurance Services Office, Inc.

Sketching Sampled Data Streams

Institute of Informatics, Faculty of Business and Management, Brno University of Technology,Czech Republic

Peak Inverse Voltage

Luby s Alg. for Maximal Independent Sets using Pairwise Independence

Joe Pimbley, unpublished, Yield Curve Calculations

Marginal Returns to Education For Teachers

Finite Math Chapter 10: Study Guide and Solution to Problems

Returns to Experience in Mozambique: A Nonparametric Regression Approach

14.74 Lecture 5: Health (2)

! # %& ( ) +,../ # 5##&.6 7% 8 # #...

21 Vectors: The Cross Product & Torque

IDENTIFICATION AND CORRECTION OF A COMMON ERROR IN GENERAL ANNUITY CALCULATIONS

Management Quality, Financial and Investment Policies, and. Asymmetric Information

FINANCIAL MATHEMATICS. A Practical Guide for Actuaries. and other Business Professionals

Calculating the high frequency transmission line parameters of power cables

Staff Paper. Farm Savings Accounts: Examining Income Variability, Eligibility, and Benefits. Brent Gloy, Eddy LaDue, and Charles Cuykendall

Chapter XX More advanced approaches to the analysis of survey data. Gad Nathan Hebrew University Jerusalem, Israel. Abstract

The Effects of Tax Rate Changes on Tax Bases and the Marginal Cost of Public Funds for Canadian Provincial Governments

Covariate-based pricing of automobile insurance

Heterogeneous Paths Through College: Detailed Patterns and Relationships with Graduation and Earnings

Evaluation Methods for Non- Experimental Data

(6)(2) (-6)(-4) (-4)(6) + (-2)(-3) + (4)(3) + (2)(-3) = = 0

CHAPTER 5 RELATIONSHIPS BETWEEN QUANTITATIVE VARIABLES

Laddered Multilevel DC/AC Inverters used in Solar Panel Energy Systems

Fixed income risk attribution

Transcription:

Unersty of New England School of Economc Studes ON CALCULATION OF THE EXTENDED GINI COEFFICIENT by Duangkamon Chotkapanch and Wllam Grffths No. 000- ay 000 Workng Paper Seres n Econometrcs and Appled Statstcs ISSN 057-088 http://www.une.edu.au/febl/econstud/wps.htm Copyrght 000 by Duangkamon Chotkapanch and Wllam Grffths. All rghts resered. Readers may make erbatm copes of ths document for non-commercal purposes by any means, proded ths copyrght notce appears on all such copes. ISBN 86389 68 3

ON CALCULATION OF THE EXTENDED GINI COEFFICIENT Duangkamon Chotkapanch and Wllam Grffths Abstract The conentonal formula for estmatng the extended Gn coeffcent s a coarance formula proded by Lerman and Ytzhak (989). We suggest an alternate estmator obtaned by approxmatng the Lorenz cure by a seres of lnear segments. The two estmators are dentcal for the orgnal Gn coeffcent, where the nequalty aerson parameter s =. Howeer, for, they are dfferent. In a onte Carlo experment desgned to assess the relate bas and effcency of the two estmators, we fnd that, when usng grouped data, our new estmator has less bas and lower mean squared error than the coarance estmator. When sngle obseratons are used, there s lttle or no dfference n the performance of the two estmators. Key Words: Lorenz cure, ncome nequalty Duangkamon Chotkapanch s a Senor Lecturer n the School of Economcs and Fnance at Curtn Unersty of Technology. Wllam Grffths s a Professor n the School of Economc Studes at the Unersty of New England. Contact nformaton: School of Economc Studes, Unersty of New England, Armdale, NSW 35, Australa. Emal: wgrfft@metz.une.edu.au.

ON CALCULATION OF THE EXTENDED GINI COEFFICIENT. otaton Let π = F (x) represent the dstrbuton functon for ncome x and let η = F ( x ) be the correspondng frst moment dstrbuton functon. The relatonshp between η and π, defned for 0 x < s the Lorenz cure. We denote t by η = L (π). A muchused measure of ncome nequalty s the Gn coeffcent whch s equal to twce the area between a 45-degree lne and the Lorenz cure. That s, 0 G = L( π) dπ () It can also be wrtten as (see, for example, Lambert 993, p. 43) G = + xf( x) f( x) dx µ 0 x = co, ( ) () µ x { xfx} where µ E(x) s mean ncome and f ( x) = df( x) dx s the densty functon for x = ncome. Gen a sample of obseratons on ncome, whch may be aalable as sngle obseratons, or grouped nto ncome classes, seeral algebracally equalent formulas for estmatng G hae been descrbed n the lterature. See, for example, Table 8. n Nygård and Sandström (98) or Creedy (996, p.0, 0). The focus n ths paper s on dscrete ersons of the expressons n equatons () and (), and on generalsatons of them that can be used to estmate the extended Gn coeffcent ntroduced by Ytzhak (983) to accommodate dfferng aersons to nequalty. Because the dscrete ersons of () and () are algebracally equalent, any choce between them s made smply on the bass of computatonal conenence. Howeer, as we wll see, for estmatng the extended Gn coeffcent, generalsatons of the dscrete formulas lead to two dfferent estmators whch are not algebracally equalent. In these crcumstances, when choosng a formula for estmaton, computatonal conenence and estmator samplng propertes are mportant consderatons. A generalsaton of the dscrete erson of the coarance formula n 3

() was suggested by Lerman and Ytzhak (989). In ths paper we dere an alternate estmator whch s based on a generalsaton of a dscrete erson of equaton (). Ths alternate estmator s smple to calculate and has good samplng propertes. In a onte Carlo experment that we conduct, the two estmators hae smlar propertes when calculated from sngle obseratons; when calculated from grouped data, our new estmator outperforms the coarance estmator n terms of both bas and mean-squared error. Our results hae releance not just for estmaton of the extended Gn coeffcent, but also for estmaton of socal welfare measures that are dependent on the extended Gn coeffcent. See, for example, Lambert (993, p.3-30). In the remander of ths secton we ntroduce requred notaton and ge the dscrete ersons of () and () that hae been used for estmatng G. In Secton we present the extended Gn coeffcent and ts correspondng coarance estmator, and go on to dere our alternate estmator. The setups and results of the onte Carlo experment are descrbed n Secton 3 and some summary remarks are made n Secton 4. To ntroduce the notaton necessary to descrbe the arous estmators, suppose that ncome data hae been sampled and classfed nto ncome groups. The estmators that we descrbe can be used wth grouped data or wth sngle obseratons. In the case of sngle ungrouped obseratons, s the number of obseratons, and, n what follows, there s one obseraton n each group, wth the proporton of obseratons n each group beng p = /. Gen ths leel of generalty, we assume the followng nformaton s aalable for the -th group:. Aerage ncome x.. The proporton of obseratons p. 3. The cumulate proporton of obseratons π = p + p + K + p. 4. The proporton of ncome φ= px px. j j j= 5. The cumulate proporton of ncome η = φ + φ + K + φ. Also, let x = px denote the sample mean ncome. = 4

As noted by Lerman and Ytzhak (989), the dscrete erson of (), that prodes an estmator for G, s where G = p( x x)( π π ) (3) x = π = ( π + π ) and π= p π = Another way to approach the estmaton problem s a equaton (). If the Lorenz cure L (π) s approxmated by a number of lnear segments, wth the -th lnear segment beng a straght lne jonng ( π, η ) to ( π, η ), then the area defned by equaton () can be estmated by aggregatng the areas between the lnear segments and the 45-degree lne. Ths process leads to another famlar expresson for the Gn coeffcent G (4) = η π ηπ + + = = It can be shown that G = G. Howeer, when the estmaton prncples used to obtan Ĝ and Ĝ are appled to the extended Gn coeffcent ntroduced by Ytzhak (983), they yeld estmators that are, n general, not dentcal. Preous lterature has focused on a coarance formula smlar to Ĝ (Lerman and Ytzhak 989). The purpose of our note s to dere an expresson for the extended-gn counterpart of Ĝ and to compare the bas and effcency of the two alternate estmators a a onte Carlo experment.. A New Estmator for the Extended Gn Coeffcent The extended Gn coeffcent, proposed by Ytzhak (983) to accommodate the fact that dfferent ndduals can hae dfferent aersons to nequalty, can be wrtten as 5

0 G( ) = ( ) ( π) L( π) dπ (5) = x F x f xdx µ [ ( )] ( ) 0 x = co,[ ( )] µ x { x F x } (6) where s an nequalty aerson parameter. The coeffcent G ( ) s defned for > and s equal to the orgnal Gn coeffcent when =. The coarance-formula estmator, gen by the emprcal dscrete erson of equaton (6) s (Lerman and Ytzhak 989) where G p x x m (7) ( ) = ( )[( π) ] x = m = p ( π ) = In the remander of ths secton, we dere an alternate estmator obtaned by approxmatng the Lorenz cure n equaton (5) wth a seres of lnear segments. To begn, note that the equaton of a straght lne jonng ( π, η ) to ( π, η ) can be wrtten as where η = c π + (8) d c = η π η π = (9) φ p π η π η d = (0) π π Now, denote the ntegral n equaton (5) by H (). That s, 0 H ( ) = ( π) L( π) dπ () The lnear-segment approxmaton to H () s 6

( ) π ( ) ( ) π ( ) = H = π c π+ d dπ [ I() I()] () = = + where I π ( ) = c π( π) π dπ c = [ π( π) π ( π ) ] c [( π) ( π )] ( ) (3) I π ( ) = d ( π) π dπ d = [( π ) + ( π ) ] (4) Substtutng for c and d n equatons (3) and (4), and addng these two equatons, yelds, after some algebra, I() + I() = [ η( π) η ( π ) ] φ π π ( ) p [( ) ( )] (5) Thus, [ ] H () = I () + I () = φ = π π ( ) [( ) ( ) ] (6) = p and the alternate estmator for the extended Gn coeffcent s G ( ) = ( ) H ( ) 7

= + φ [( π ) ( π )] (7) = p Ths expresson s a relately smple one whch s easy to calculate, despte the tedous algebra necessary to dere t. Its samplng propertes are assessed n Secton 4. It can be shown that G ( ) = G ( ) f =. Howeer, n general, the two estmators are not dentcal. 3. The Relate Performance of the Two Estmators Gen the exstence of two ery reasonable alternate estmators for the extended Gn coeffcent, ther relate samplng performance s of nterest. To ealuate ths performance, we set up a onte Carlo experment wth 4 hypothetcal ncome dstrbutons. The detals of those dstrbutons appear n Table. Two are lognormal dstrbutons where log(x) s normally dstrbuted wth mean µ and standard deaton σ. The other two are parametersatons of the dstrbuton suggested by Sngh and addala (976); ts dstrbuton functon s π = F ( x) = x + b The parameter alues were chosen to ge one lognormal and one Sngh-addala dstrbuton wth relately hgh nequalty, and another par of dstrbutons wth relately low nequalty. Setups and hae relately hgh nequalty wth approxmate Gn coeffcents of G ( ) = 0.7. Setups 3 and 4 hae relately low nequalty wth approxmate Gn coeffcents of G ( ) = 0.38. Relate to the lognormal dstrbuton wth a smlar alue of the Gn coeffcent, the Sngh-addala dstrbuton has a thcker tal, wth extreme alues of ncome more lkely. Desgnng the experment n ths way ges nformaton on the senstty of performance to the type of ncome dstrbuton and the leel of nequalty. a The other dmensons oer whch senstty was assessed were the alue of and the number of ncome groups. The chosen alues of are = (.33,.67,, 3, 5, 0 ). Values of the extended Gn coeffcent correspondng to these alues were q 8

computed a numercal ntegraton of equaton (6), and are reported n Table. Samplng performance was ealuated by drawng 5000 samples, each of sze 000, from each of the four dstrbutons. In addton to usng the sngle obseratons ( = 000), results were obtaned for two ncome groupngs = (0, 0). The results from the onte Carlo experment appear n Tables, 3 and 4 whch contan, respectely, the bas of the two estmators, ther relate arance, and ther relate mean-squared error. Values of relate arance and mean-squared error greater than one mply the coarance estmator G ( ) s outperformng our lnearsegment estmator G ( ). From Table we can make the followng obseratons about bas:. The bas of both estmators s always negate, reflectng the fact they mplctly assume no nequalty wthn each group.. When = 000, both estmators hae neglgble and almost dentcal bas. 3. The absolute bas of the coarance estmator s neer less, and often substantally more, than the absolute bas of the lnear-segment estmator. 4. The relate performance of the lnear-segment estmator mproes the further s the departure of from, and the smaller the number of groups. From the results n Table 3, we see that the lower bas for the lnear-segment estmator usually comes at a cost of hgher arance. Exceptons occur wth the frst three setups when = 0 and = 0. Snce these exceptons look atypcal relate to the remander of the table, we carred out further experments for alues of between 5 and 0 and greater than 0. These experments reealed that, as ncreases, there s a alue for beyond whch the arance of the lnear-segment estmator s less than the arance of the coarance estmator. The alue depends on the setup; n setup t s lower than for setups, 3 and 4. Also, t s larger for = 0 than for = 0. Howeer, for = 0 or 0, and for large enough, there s always a reersal n the relate magntudes of the arances. When = 000, there s no notceable dfference. 9

Snce a comparson of bases faors the lnear-segment estmator, and a comparson of arances faors the coarance estmator (except for large ), a meansquared error comparson s useful. Usng ths crteron, the results n Table 4 show the performance of the lnear-segment estmator s seldom worse, and sometmes ery much better, than the coarance estmator. In the nstances where the lnear-segment estmator s worse (for example, =.33, = 000, setup ), t s only margnally worse, wth the least faorable comparson beng a mean-squared error whch s only 0.8% larger. 4. Summary An estmator for the extended Gn coeffcent has been dered by approxmatng the Lorenz cure by a seres of lnear segments. Ths estmator s smple to compute and has less bas and lower mean-squared error than a coarancebased estmator that has been used n the lterature. The expermental edence s suffcently strong to recommend that practtoners use our new estmator n preference to the coarance estmator n future emprcal studes. 0

References Creedy, J. (996), Fscal Polcy and Socal Welfare, Cheltenham, Edward Elgar. Lambert, P.J. (993), The Dstrbuton and Redstrbuton of Income: A athematcal Analyss, nd edton, anchester: anchester Unersty Press. Lerman, R.I. and S. Ytzhak (989), Improng the Accuracy of Estmates of Gn Coeffcents, Journal of Econometrcs 4, 43-47. Nygård, F. and A. Sandström (98), easurng Income Inequalty, Stockholm, Almqst & Wksell. Sngh, S.K. and G.S. addala (976), A Functon for Sze Dstrbuton of Incomes, Econometrca 44, 963-970. Ytzhak, S. (983), On an Extenson of the Gn Inequalty Index, Internatonal Economc Reew 4, 67-68.

Table. Setups for onte Carlo Experment Setup 3 4 Dstrbuton Lognormal Sngh addala Lognormal Sngh addala Parameters µ = 5 σ =.5 b = 400 a = 0.84 q =.4 µ = 6.4 σ = 0.7 b = 550 a =.9 q = 0.85 µ 457. 33. 768.9 773.5 x G(.33) 0.498 0.4360 0.9 0.034 G(.67) 0.660 0.69 0.3057 0.38 G() 0.7 0.743 0.3794 0.3785 G(3) 0.8360 0.845 0.5059 0.495 G(5) 0.90 0.9 0.649 0.599 G(0) 0.9543 0.9674 0.7 0.688

Table. Bas of the Estmators Groups Estmator.33.67 3 5 0 Setup = 0 G ( ) G ( ) -0.043-0.035-0.05-0.03-0.04-0.04-0.0-0.04-0.004 = 0 G ( ) G ( ) -0.04-0.033-0.03-0.0-0.0-0.0 = 000 G ( ) G ( ) -0.00-0.00 Setup = 0 G ( ) G ( ) -0.035-0.09-0.00-0.08-0.0-0.0-0.008-0.00-0.005-0.098-0.009 = 0 G ( ) G ( ) -0.033-0.07-0.07-0.05-0.008-0.008-0.004-0.00-0.00-0.00 = 000 G ( ) G ( ) -0.005-0.005-0.00-0.00 Setup 3 = 0 G ( ) G ( ) -0.00-0.0-0.0-0.086-0.030 = 0 G ( ) G ( ) -0.008-0.008-0.005-0.004-0.00-0.00-0.00-0.03-0.005 = 000 G ( ) G ( ) Setup 4 = 0 G ( ) G ( ) -0.05-0.0-0.009-0.008-0.07-0.00-0.06-0.04 = 0 G ( ) G ( ) -0.03-0.00-0.004-0.004-0.00-0.00-0.00-0.00-0.004 = 000 G ( ) G ( ) 3

Table 3. Relate Varance: ar[ G ( )] / ar[ G( )] Groups.33.67 3 5 0 Setup = 0.09.00.000.05.053 0.883 = 0.096.03.000.004.00.03 = 000.008.000.000.000.000.000 Setup = 0.066.009.000.0.3 0.9 = 0.07.03.000.007.033.33 = 000.0.00.000.000.000.000 Setup 3 = 0.03.0.000.04.095 0.75 = 0.9.07.000.000.05.53 = 000.00.000.000.000.000.000 Setup 4 = 0.086.08.000.007.04.60 = 0.09.0.000.000.006.049 = 000.006.000.000.000.000.000 4

Table 4. Relate SE: SE[ G ()] SE[ G( )] Groups.33.67 3 5 0 Setup = 0 0.685 0.863.000 0.97 0.88 0.07 = 0 0.674 0.85.000.008 0.849 0.83 = 000.006.000.000.000.000.000 Setup = 0 0.773 0.933.000 0.883 0.67 0.009 = 0 0.770 0.935.000 0.998 0.745 0.03 = 000.008.000.000.000.000.000 Setup 3 = 0 0.58 0.855.000 0.889 0.398 0. = 0 0.568 0.874.000.00 0.884 0.39 = 000.000.000.000.000.000.000 Setup 4 = 0 0.73 0.93.000 0.950 0.55 0.7 = 0 0.735 0.946.000.000 0.959 0.589 = 000.005.000.000.000.000.000 5