Concentration Curves. The concentration curve defined
|
|
|
- Gervais Pierce
- 9 years ago
- Views:
Transcription
1 7 Concentration Curves In previous chapters, we assessed health inequality through variation in mean health across quintiles of some measure of living standards. Although convenient, such a grouped analysis provides only a partial picture of how health varies across the full distribution of living standards. A complete picture can be provided using a concentration curve, which displays the share of health accounted for by cumulative proportions of individuals in the population ranked from poorest to richest (Kakwani 1977; Kakwani et al. 1997; Wagstaff et al. 1991). The concentration curve can be used to examine inequality not just in health outcomes but in any health sector variable of interest. It can also be used to assess differences in health inequality across time and countries. For example, it has been used to assess whether subsidies to the health sector are targeted toward the poor and whether the targeting is better in some countries than in others (O Donnell et al. 2007; Sahn and Younger 2000). It has also been used to assess whether child mortality is more unequally distributed to the disadvantage of poor children in one country than in another (Wagstaff 2000) and whether inequalities in adult health are more pronounced in some countries than in others (van Doorslaer et al. 1997). Many other applications are possible. In this chapter we explain how to compute a concentration curve. We also explain how to test whether a concentration curve departs significantly from an equal distribution and whether there is a statistically significant difference between two concentration curves that may represent different health services, time periods, or countries. This requires computation of standard errors of the concentration curve ordinates. The concentration curve defined The two key variables underlying the concentration curve are the health variable, the distribution of which is the subject of interest, and a variable capturing living standards against which the distribution is to be assessed. Measurement of key health sector variables and of household living standards has been considered in earlier chapters. The health variable must be measured in units that can be aggregated across individuals. This is not necessary for the living standards measure, which is used only to rank individuals from richest to poorest. The data could be at the individual level (e.g., raw household survey data), in which case values of both the health variable and the living standards variable are available for each observation. Alternatively, the data could be grouped, in which case, 83
2 84 Chapter 7 for each living-standard group (e.g., income quintile), the mean value of the health variable is observed. The ranking of the groups (which group is poorest, which group is second poorest, and so on) and the percentage of the sample falling into each group are known. In the case of grouped data, the only advantage of the concentration curve over a table of group means is that it gives a graphical representation of the data. The concentration curve plots the cumulative percentage of the health variable (yaxis) against the cumulative percentage of the population, ranked by living standards, beginning with the poorest, and ending with the richest (x-axis). In other words, it plots shares of the health variable against quantiles of the living standards variable. Examples are given in the figure in box 7.1 and in figures 7.1 and 7.2. So, for example, the concentration curve might show the cumulative percentage of health subsidies accruing to the poorest p percent of the population. If everyone, irrespective of his or her living standards, has exactly the same value of the health variable, the concentration curve will be a 45-degree line, running from the bottom left-hand corner to the top right-hand corner. This is known as the line of equality. If, by contrast, the health sector variable takes higher (lower) values among poorer people, the concentration curve will lie above (below) the line of equality. The farther the curve is above the line of equality, the more concentrated the health variable is among the poor. Concentration curves for the same variable in different countries or time periods can be plotted on the same graph. Similarly, curves for different health sector variables in the same country and time period can be plotted against each other. For example, the analyst may wish to assess whether inpatient care is more unequally distributed than primary care. If the concentration curve for one country (or time period or health service) lies everywhere above that for the other, the first curve is said to dominate the second, and the ranking by degree of inequality is unambiguous. 1 Alternatively, curves may cross, in which case neither distribution dominates the other. It is then still possible to make comparisons of degrees of inequality but only by resorting to a summary index of inequality, which inevitably involves the imposition of value judgments concerning the relative weight given to inequality arising at different points in the distribution (see chapter 8). Rankings by degree of inequality can then differ depending on the inequality index chosen. Graphing concentration curves the grouped-data case In the grouped-data case, the required data and the corresponding charts are easily produced in a spreadsheet program such as Microsoft Excel. The table in box 7.1 is pasted directly from Excel and contains all the data required to plot the concentration curve shown in the box. The curve is constructed in Excel using the XY (scatter) chart-type with the scatter with data points connected by smoothed lines option. The first series graphs the line of equality, the x-values and the y-values both being the cumulative percentage of the sample. The no-marker option is selected for the line of equality. The second series graphs the concentration curve, the x-values being the cumulative percentage of the sample, the y-values being the cumulative percentage of the health variable. It is important to include a 0 percent in both series. Both the x-axis and the y-axis need to be restricted to the range 0 to 100 percent. 1 For an introduction to the concept of dominance, its relation to inequality measurement, and the related concept of stochastic dominance, see Deaton (1997).
3 Box 7.1 Example of a Concentration Curve Derived from Grouped Data In this example, the sample comprises births, the living standards measure is the assets (wealth) index, and the health variable is deaths of children under five years of age. The data are from the demographic and health surveys of India and Mali. The table shows the number of births in each wealth index quintile during the period in India. Expressing these as percentages of the total number of births and cumulating them gives the cumulative percentage of births, ordered by wealth. That is what is plotted on the x-axis in the figure. Also shown are the under-five mortality rates (U5MR) for each of five wealth groups. Multiplying the U5MR by the number of births gives the number of deaths in each wealth group. Expressing these as a percentage of the total number of deaths and cumulating them gives the cumulative percentage of deaths. That is what is plotted on the y-axis in the figure. The concentration curve for India lies above (dominates) the line of equality, indicating that child deaths are concentrated among the poor. Also shown in the figure is the concentration curve for under-five deaths for Mali for the period The Mali curve lies everywhere below that of India (i.e., the India curve dominates the Mali curve), indicating there is less inequality in underfive mortality in Mali than in India. Under-Five Deaths in India, Wealth No. of Rel % Cumul % U5MR No. of Rel % Cumul % group births births births per 1,000 deaths deaths deaths 0 0 Poorest 29, , nd 28, , Middle 26, , th 24, , Richest 19, , Total/average 129, ,419 Concentration Curves for Under-Five Deaths in India and Mali cumulative % under-five deaths India Mali line of equality Source: Authors cumulative % births, ranked by wealth
4 86 Chapter 7 Graphing concentration curves the microdata case Graphing concentration curves in Stata can be done using the command glcurve (an ado file downloadable from the Stata Web site). However, better-looking charts can be obtained using the twoway command. In the example that follows, we plot concentration curves for child malnutrition in Vietnam in 1992/93 and 1997/98, with malnutrition measured by the negative of height-for-age, as in Wagstaff et al. (2003). The dataset contains stacked data for the two years (year being 0 for 1992/93 and 1 for 1997/98), neghaz and lnpcexp being, respectively, the negative of height-for-age and the log of per capita expenditure for the year in question. We ignore below-sample weights, but they can be incorporated in both of the approaches below, as will be clear from Stata code elsewhere in the book. The concentration curve can be produced directly using glcurve as follows: glcurve neghaz, glvar(yord) pvar(rank) sortvar(lnpcexp) replace by(year) split lorenz The rank variable here is the lnpcexp. glcurve generates three new variables: rank, which is the child s rank in the expenditure distribution in each year, and yord_0 and yord_1, which are respectively the y-ordinates for 1992/93 and 1997/98. Adding the lorenz option requests that y-ordinates be cumulative proportions of the health variable and not the cumulative means, which is the default. Alternatively, these three variables could have been obtained through the following commands: sort year lnpcexp forval i = 0/1 { sum neghaz if year==`i scalar nobs`i = r(n) } ge rank=. egen tmp = rank(lnpcexp) if year==0 replace rank=tmp/nobs0 if year==0 drop tmp egen tmp = rank(lnpcexp) if year==1 replace rank=tmp/nobs1 if year==1 forval i = 0/1 { sum neghaz if year==`i scalar s_malnut`i = r(sum) } gen yord_0 = sum(neghaz)/s_malnut0 if year==0 gen yord_1 = sum(neghaz)/s_malnut1 if year==1 Whichever way the x-ordinates and y-ordinates are obtained, the concentration curve can be graphed using the full range of the options provided by twoway as follows: ge rank2=rank lab var yord_0 1992/93 lab var yord_1 1997/98
5 Concentration Curves 87 lab var rank cumul share of children (poorest first) lab var rank2 line of equality twoway (line yord_0 rank, sort clwidth(medthin) /// clpat(solid))(line yord_1 rank, sort clwidth(medthin) /// clpat(longdash) clcolor( ))(line rank2 rank, /// sort clwidth(medthin) clcolor(gray)), /// ytitle(cumulative share of malnutrition, size(medsmall)) /// yscale(titlegap(5)) xtitle(, size(medsmall)) /// legend(rows(5)) xscale(titlegap(5)) /// legend(region(lwidth(none))) plotregion(margin(zero)) /// ysize(5.75) xsize(5) plotregion(lcolor(none)) graph export $path0\cc curves 1992 and 1997.emf, replace The first line generates a duplicate rank variable that allows the line of equality to be plotted and labeled. The colors, pattern, and thickness of the concentration curves are controlled in the twoway command using the clcolor, clwidth, and clpat options. (The /// in the code simply allows code to be continued over several lines.) The last line of code exports the graph in Windows Enhanced Meta Format (emf), which allows easy viewing from within Windows Explorer and easy insertion into a Word document or a PowerPoint presentation (using Insert, Picture, From File). Figure 7.1 shows the resultant concentration curve chart, which reveals, Figure 7.1 Concentration Curve for Child Malnutrition in Vietnam, 1992/93 and 1997/ cumulative share of malnutrition cumulative share of children (poorest first) 1.0 Source: Authors. 1992/ /98 line of equality
6 88 Chapter 7 as indicated in Wagstaff et al. (2003), that inequality in child malnutrition increased somewhat in Vietnam between 1992/93 and 1997/98. Testing concentration curve dominance Concentration curves are estimated from survey data and so are subject to sampling variability. Although visual inspection of a concentration curve in comparison with the 45-degree line or another concentration curve may give an impression of whether there is dominance, obviously this inspection is not sufficient to conclude whether or not dominance is statistically significant. To make inferences about dominance, the standard errors of the concentration curve ordinates must be computed in addition to their point estimates. If the analyst wishes to test dominance of a concentration curve against the Lorenz curve of income/consumption or against another concentration curve estimated from the same sample, then the standard errors for the differences between curve ordinates must be computed. This is complicated by the fact that, in such cases, the curves are dependent. The appropriate variance-covariance matrix, allowing for dependence between curves, has been derived by Bishop et al. (1994) and Davidson and Duclos (1997). One decision rule that has been used in Lorenz (concentration) dominance tests has been to reject the null of nondominance in favor of dominance if there is at least one significant difference between curves (or a curve and the 45-degree line) in one direction and no significant difference in the other. For example, if there is at least one quantile point at which curve A lies significantly above curve B and there is no quantile point at which curve B lies above curve A, then it is concluded that A dominates B. If conventional critical values are used with such a decision rule, then there will be overrejection of the null because there is no correction for the fact that multiple comparisons are being made (Howes 1996). One solution is to use the same decision rule but to take multiple testing into account by using critical values from the studentized maximum modulus (SMM) distribution (Beach and Richmond 1985; Bishop et al. 1992; Stoline and Ury 1979). This is referred to as the multiple comparison approach (Dardanoni and Forcina 1999). An alternative criterion requires significant difference between ordinates at all quantile points to accept dominance (Howes 1996; Sahn and Younger 2000; Sahn et al. 2000). This is consistent with the intersection union principle (Dardanoni and Forcina 1999). Dardanoni and Forcina (1999) present Monte Carlo evidence showing that although this stricter rule reduces the probability of erroneously rejecting nondominance, it has greatly reduced the power of detecting dominance when true. If there is at least one significant difference between ordinates in each direction, then it is concluded that curves cross. If there are no significant differences in either direction, then, with the multiple comparison approach, null of nondominance is not rejected. Besides the decision rule, the analyst must choose the number of quantile points at which ordinates are to be compared. If the number of comparison points is too restricted, then dominance across the full range of the distribution is not being tested. It is difficult, however, to find dominance at the extremes of distributions (Howes 1996). With reasonably large samples, a popular choice has been to test for differences at 19 evenly spaced quantiles from 0.05 to 0.95 (O Donnell et al. forthcoming; Sahn and Younger 2000; Sahn et al. 2000).
7 Concentration Curves 89 The best statistical package for dominance testing is DAD, a specialist package for poverty and inequality analysis. 2 We have written our own Stata ado file for dominance testing. 3 The command follows the conventional Stata syntax, dominance varlist [if] [in] [weight] [using filename], sortvar() [options] If one variable is included in varlist, dominance of the concentration curve for this variable is tested against both the 45-degree line and the Lorenz curve of the living standards variable specified in sortvar(), which must be included. The default uses the multiple comparison approach decision rule, with comparisons at 19 equally spaced quantile points and a 5 percent significance level. The decision criterion can be changed to that of the intersection union principle with the option rule(iup), or results using both decision rules can be requested with rule(both). The number of comparison points can be changed with the option level(#), with 20 being the maximum value. The significance level can be changed from 5 percent to 1 percent with level(1). Quintile (or decile) cumulative shares of the health variable and the living standards variable can be requested by the option shares(quintiles). This will also report the p-value for a test of significant differences between the cumulative shares of the health and living standards variables at each quintile (decile). Differences between the shares of each variable and the population shares are also tested. To illustrate, we use data from the National Sample Survey to test dominance of the concentration curve for the public health subsidy (totsub) in India against both the 45-degree and the Lorenz curve for equivalent household consumption (hhexp_eq) as follows: 4 dominance totsub [aw=weight], sortvar(hhexp_eq) shares(quintiles) Results confirm that the concentration curve is dominated by (lies below) the 45- degree line, but there is no dominance between the concentration curve of the subsidy and the Lorenz curve of household consumption. The cumulative quintile shares for the subsidy reported with the output are as follows: Cumulative shares of totsub Quantile Cum. share std. error Diff. from Diff. from pop. share income share p-value p-value q % q % q % q % The program can be downloaded free from 3 The file can be downloaded from the Web site for this book. The program calls two other ados that need to be downloaded from the Stata Web site, glcurve and locpoly. 4 For details of the application see O Donnell et al (forthcoming).
8 90 Chapter 7 The poorest 20 percent of individuals receive only 12.5 percent of the subsidy. The p-value in the fourth column confirms that this subsidy share is significantly less than the respective population share, and that is true at all quintiles. The p-values in the final column indicate that the subsidy shares are not significantly different from the consumption shares for all quintiles. Although there is a significant difference (at 5 percent) for the first quintile group, that is not inconsistent with the earlier finding of nondominance against the Lorenz curve, because in the dominance test critical values are adjusted for multiple comparisons. If two variables are included in varlist, the program tests for dominance between the concentration curves of the two variables, allowing for dependence between the curves. In this case, the shares() option is not available. To illustrate, we examine dominance between the concentration curves for the subsidy to nonhospital care (nonhsub) and the subsidy to hospital inpatient care (ipsub) in India (see O Donnell et al. [forthcoming]). From figure 7.2, it appears that the concentration curve for nonhospital care lies above that for inpatient care. Figure 7.2 was produced with the following commands: #delimit ; glcurve nonhsub [aw=weight], sortvar(hhexp_eq) lorenz pvar(rank) glvar(nonh) nograph; glcurve ipsub [aw=weight], sortvar(hhexp_eq) lorenz plot(line nonh rank, legend(label(2 non-hospital )) line rank rank, legend(label(3 45 degree ))) legend(label(1 inpatient )) l1(cumulative subsidy proportion); Figure 7.2 Concentration Curves of Public Subsidy to Inpatient Care and Subsidy to Nonhospital Care, India, cumulative subsidy proportion nonhospital 45-degree line inpatient 0 0 Source: Authors cumulative population proportion
9 Concentration Curves 91 where the first glcurve command generates the x and y coordinates of the concentration curve for the nonhospital subsidy, and the second command plots this curve along with that for inpatient care and the 45-degree line. We test for statistically significant dominance between the curves with dominance nonhsub ipsub [aw=weight], sortvar(hhexp_eq) The result confirms that the subsidy to nonhospital care dominates (is more propoor than) the inpatient subsidy. The analyst may wish to test dominance between two concentration curves of the same variable estimated from independent samples. In this case, dominance testing is more straightforward because the ordinates of the respective concentration curves are independent. An important application is for cross-country comparisons of the distribution of a particular health or health care variable. For example, is the distribution of the public subsidy to health care less pro-poor in India than it is in Vietnam? To answer this question, we use data from the Indian 1995/96 NSS and the 1998 Vietnam Living Standards Survey. 5 The appropriate test can be carried out using the dominance command. In this case, the using option must be specified, and varlist must contain only one variable. Assume that we have the Indian data file loaded in Stata, that the Vietnam data file is named Vietnam, and that both files contain the variables totsub (total public health subsidy), hhexp_ eq (equivalent household consumption), and weight. The syntax would then be as follows: #delimit ; dominance totsub [aw=weight] using Vietnam, sortvar(hhexp_eq) labels(india Vietnam); where the labels() option gives labels to the two concentration curves that are used in the output. The result is reported as follows: Test of dominance between concentration curves of totsub for India and Vietnam Data 1 Data 2 Sign. level # points Rule India Vietnam 5% 19 mca Vietnam dominates India This confirms that the distribution of the public health subsidy is more pro-poor in Vietnam than it is in India. One other thing the analyst might want to do is to check dominance across years. For example, is it really the case that the 1997/98 concentration curve in Figure 7.2 lies everywhere above that of 1992/93? dominance requires the 1992/93 and 1997/98 data to be in two separate data files. The code below first saves the data set with the newly constructed rank variable, then saves the 1997/98 cases separately in the file VN97.dta. The code then loads the original stacked data set 5 See O Donnell et al. (2007) for details and for more cross-country comparisons of the distribution of public health subsidies in Asia.
10 92 Chapter 7 containing the data for 1992/93 and 1997/98, drops the observations for 1997/98, and runs the dominance routine, requesting that results using both decision rules be reported. save $path0\ch 7.dta, replace use $path0\ch 7.dta, clear keep if year==1 save $path0\vn97.dta, replace use $path0\ch 7.dta, clear keep if year==0 dominance neghaz using VN97, sortvar(rank) labels( ) rule(both) The output from Stata in this case is Test of dominance between concentration curves of neghaz for 1993 and 1997 Data 1 Data 2 Sign. level # points Rule % 19 mca 1997 dominates % 19 iup nondominance In this case, the 1997/98 concentration curve dominates (lies above) that of 1992/93 according to the less strict mca option, but not according to the stricter iup option. This reflects the fact that the two curves overlap toward the bottom of the income distribution. References Beach, C., M., and J. Richmond Joint Confidence Intervals for Income Shares and Lorenz Curves. International Economic Review 26: Bishop, J. A., J. P. Formby, and P. D. Thistle Convergence of the South and Non-South Income Distributions, American Economic Review 82(1): Bishop, J. A., K. V. Chow, and J. P. Formby Testing for Marginal Changes in Income Distributions with Lorenz and Concentration Curves. International Economic Review 35(2): Dardanoni, V., and A. Forcina Inference for Lorenz Curve Orderings. Econometrics Journal 2: Davidson, R., and J.-Y. Duclos Statistical Inference for the Measurement of the Incidence of Taxes and Transfers. Econometrica 65(6): Deaton, A The Analysis of Household Surveys: A Microeconometric Approach to Development Policy. Baltimore, MD: Johns Hopkins University Press. Howes, S A New Test for Inferring Dominance from Sample Data. Mimeo. World Bank, Washington, DC. Kakwani, N. C Measurement of Tax Progressivity: An International Comparison. Economic Journal 87(345):
11 Concentration Curves 93 Kakwani, N. C., A. Wagstaff, and E. van Doorslaer Socioeconomic Inequalities in Health: Measurement, Computation and Statistical Inference. Journal of Econometrics 77(1): O Donnell, O., E. van Doorslaer, R. P. Rannan-Eliya, A. Somanathan, S. R. Adhikari, D. Harbianto, C. G. Garg, P. Hanvoravongchai, M. N. Huq, A. Karan, G. M. Leung, C.-W. Ng, B. R. Pande, K. Tin, L. Trisnantoro, C. Vasavid, Y. Zhang, and Y. Zhao The Incidence of Public Spending on Health Care: Comparative Evidence from Asia. World Bank Economic Review 21(1): Sahn, D., and D. Younger Expenditure Incidence in Africa: Microeconomic Evidence. Fiscal Studies 21(3): Sahn, D. E., S. D. Younger, and K. R. Simler Dominance Testing of Transfers in Romania. Review of Income and Wealth 46(3): Stoline, M. R., and H. K. Ury Tables of the Studentized Maximum Modulus Distribution and an Application to Multiple Comparisons among Means. Technometrics 21(1): van Doorslaer, E., A. Wagstaff, H. Bleichrodt, S. Calonge, U. G. Gerdtham, M. Gerfin, J. Geurts, L. Gross, U. Hakkinen, R. E. Leu, O. O Donnell, C. Propper, F. Puffer, M. Rodriguez, G. Sundberg, and O. Winkelhake Income-Related Inequalities In Health: Some International Comparisons. J Health Econ 16(1): Wagstaff, A Socioeconomic Inequalities in Child Mortality: Comparisons across Nine Developing Countries. Bulletin of the World Health Organization 78(1): Wagstaff, A., P. Paci, and E. van Doorslaer On the Measurement of Inequalities in Health. Soc Sci Med 33(5): Wagstaff, A., E. van Doorslaer, and N. Watanabe On Decomposing the Causes of Health Sector Inequalities, with an Application to Malnutrition Inequalities in Vietnam. Journal of Econometrics 112(1):
12
Health Care Payments and Poverty
19 Health Care Payments and Poverty In the previous chapter we examined the issue of catastrophic payments for health care the disruption to material living standards due to large out-of-pocket (OOP) payments
Catastrophic Payments for Health Care
18 Catastrophic Payments for Health Care Health care finance in low-income countries is still characterized by the dominance of out-of-pocket payments and the relative lack of prepayment mechanisms, such
The Concentration Index
8 The Concentration Index Concentration curves can be used to identify whether socioeconomic inequality in some health sector variable exists and whether it is more pronounced at one point in time than
Measuring and Explaining Inequity in Health Service Delivery
15 Measuring and Explaining Inequity in Health Service Delivery Equitable distribution of health care is a principle subscribed to in many countries, often explicitly in legislation or official policy
Analyzing Health Equity Using Household Survey Data
Andrew M. Jones Professor of Economics and Director of the Graduate Program in Health Economics University of York, UK This is an excellent and exciting collection of knowledge of analytical techniques
Module 5: Measuring (step 3) Inequality Measures
Module 5: Measuring (step 3) Inequality Measures Topics 1. Why measure inequality? 2. Basic dispersion measures 1. Charting inequality for basic dispersion measures 2. Basic dispersion measures (dispersion
Charting Income Inequality
Charting Inequality Resources for policy making Module 000 Charting Inequality Resources for policy making Charting Inequality by Lorenzo Giovanni Bellù, Agricultural Policy Support Service, Policy Assistance
Inequalities in Self-Reported Health: A Meta-Regression Analysis
WP 13/05 Inequalities in Self-Reported Health: A Meta-Regression Analysis Joan Costa-Font and Cristina Hernandez-Quevedo February 2013 york.ac.uk/res/herc/hedgwp Inequalities in Self-Reported Health: A
Chapter 6. Inequality Measures
Chapter 6. Inequality Measures Summary Inequality is a broader concept than poverty in that it is defined over the entire population, and does not only focus on the poor. The simplest measurement of inequality
Measuring pro-poor growth
Economics Letters 78 (2003) 93 99 www.elsevier.com/ locate/ econbase q Measuring pro-poor growth * Martin Ravallion, Shaohua Chen World Bank, MSN MC 3-306 Development Research Group, 1818 H Street NW,
Using Excel (Microsoft Office 2007 Version) for Graphical Analysis of Data
Using Excel (Microsoft Office 2007 Version) for Graphical Analysis of Data Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable
Usage and importance of DASP in Stata
Usage and importance of DASP in Stata Abdelkrim Araar, Jean-Yves Duclos and Luis Huesca Comparisons of Stata to other software or use of Stata together with other software. Mexico, May 12, 2011 Usage and
Plots, Curve-Fitting, and Data Modeling in Microsoft Excel
Plots, Curve-Fitting, and Data Modeling in Microsoft Excel This handout offers some tips on making nice plots of data collected in your lab experiments, as well as instruction on how to use the built-in
Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups
Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln Log-Rank Test for More Than Two Groups Prepared by Harlan Sayles (SRAM) Revised by Julia Soulakova (Statistics)
The Distribution of Social Services in Madagascar, 1993-99. Peter Glick, Cornell University Mamisoa Razakamanantsoa, INSTAT
The Distribution of Social Services in Madagascar, 1993-99 Peter Glick, Cornell University Mamisoa Razakamanantsoa, INSTAT December, 2002 Acknowledgments We thank Jesko Hentschel, Bart Minten, Mead Over,
Inequality Measures. Chapter. Summary
Chapter 6 Inequality Measures Summary Inequality is a broader concept than poverty in that it is defined over the entire population, and does not only focus on the poor. The simplest measurement of inequality
Directions for using SPSS
Directions for using SPSS Table of Contents Connecting and Working with Files 1. Accessing SPSS... 2 2. Transferring Files to N:\drive or your computer... 3 3. Importing Data from Another File Format...
EXCEL Tutorial: How to use EXCEL for Graphs and Calculations.
EXCEL Tutorial: How to use EXCEL for Graphs and Calculations. Excel is powerful tool and can make your life easier if you are proficient in using it. You will need to use Excel to complete most of your
Microsoft Excel. Qi Wei
Microsoft Excel Qi Wei Excel (Microsoft Office Excel) is a spreadsheet application written and distributed by Microsoft for Microsoft Windows and Mac OS X. It features calculation, graphing tools, pivot
Introduction Course in SPSS - Evening 1
ETH Zürich Seminar für Statistik Introduction Course in SPSS - Evening 1 Seminar für Statistik, ETH Zürich All data used during the course can be downloaded from the following ftp server: ftp://stat.ethz.ch/u/sfs/spsskurs/
Figure 1. An embedded chart on a worksheet.
8. Excel Charts and Analysis ToolPak Charts, also known as graphs, have been an integral part of spreadsheets since the early days of Lotus 1-2-3. Charting features have improved significantly over the
Dealing with Data in Excel 2010
Dealing with Data in Excel 2010 Excel provides the ability to do computations and graphing of data. Here we provide the basics and some advanced capabilities available in Excel that are useful for dealing
Poverty Indices: Checking for Robustness
Chapter 5. Poverty Indices: Checking for Robustness Summary There are four main reasons why measures of poverty may not be robust. Sampling error occurs because measures of poverty are based on sample
Assignment 2: Exploratory Data Analysis: Applying Visualization Tools
: Exploratory Data Analysis: Applying Visualization Tools Introduction Economic boom, though inspiring, is always connected with unsustainable development. Because of this, people tend to view economic
http://school-maths.com Gerrit Stols
For more info and downloads go to: http://school-maths.com Gerrit Stols Acknowledgements GeoGebra is dynamic mathematics open source (free) software for learning and teaching mathematics in schools. It
An introduction to using Microsoft Excel for quantitative data analysis
Contents An introduction to using Microsoft Excel for quantitative data analysis 1 Introduction... 1 2 Why use Excel?... 2 3 Quantitative data analysis tools in Excel... 3 4 Entering your data... 6 5 Preparing
Data Analysis Tools. Tools for Summarizing Data
Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool
A Guide to Using Excel in Physics Lab
A Guide to Using Excel in Physics Lab Excel has the potential to be a very useful program that will save you lots of time. Excel is especially useful for making repetitious calculations on large data sets.
Article: Main results from the Wealth and Assets Survey: July 2012 to June 2014
Article: Main results from the Wealth and Assets Survey: July 2012 to June 2014 Coverage: GB Date: 18 December 2015 Geographical Area: Region Theme: Economy Main points In July 2012 to June 2014: aggregate
Poverty and income growth: Measuring pro-poor growth in the case of Romania
Poverty and income growth: Measuring pro-poor growth in the case of EVA MILITARU, CRISTINA STROE Social Indicators and Standard of Living Department National Scientific Research Institute for Labour and
Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.
1 Commands in JMP and Statcrunch Below are a set of commands in JMP and Statcrunch which facilitate a basic statistical analysis. The first part concerns commands in JMP, the second part is for analysis
Excel Tutorial. Bio 150B Excel Tutorial 1
Bio 15B Excel Tutorial 1 Excel Tutorial As part of your laboratory write-ups and reports during this semester you will be required to collect and present data in an appropriate format. To organize and
Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:
Doing Multiple Regression with SPSS Multiple Regression for Data Already in Data Editor Next we want to specify a multiple regression analysis for these data. The menu bar for SPSS offers several options:
Forecasting in STATA: Tools and Tricks
Forecasting in STATA: Tools and Tricks Introduction This manual is intended to be a reference guide for time series forecasting in STATA. It will be updated periodically during the semester, and will be
Analyzing the Elements of Real GDP in FRED Using Stacking
Tools for Teaching with Analyzing the Elements of Real GDP in FRED Using Stacking Author Mark Bayles, Senior Economic Education Specialist Introduction This online activity shows how to use FRED, the Federal
Scatter Plots with Error Bars
Chapter 165 Scatter Plots with Error Bars Introduction The procedure extends the capability of the basic scatter plot by allowing you to plot the variability in Y and X corresponding to each point. Each
Excel Guide for Finite Mathematics and Applied Calculus
Excel Guide for Finite Mathematics and Applied Calculus Revathi Narasimhan Kean University A technology guide to accompany Mathematical Applications, 6 th Edition Applied Calculus, 2 nd Edition Calculus:
GINI-Coefficient and GOZINTO-Graph (Workshop) (Two economic applications of secondary school mathematics)
GINI-Coefficient and GOZINTO-Graph (Workshop) (Two economic applications of secondary school mathematics) Josef Böhm, ACDCA & DERIVE User Group, [email protected] Abstract: GINI-Coefficient together with
How Does the Introduction of Health Insurance Change the Equity of Health Care Provision in Bulgaria?
WP/06/285 How Does the Introduction of Health Insurance Change the Equity of Health Care Provision in Bulgaria? Nora Markova 2006 International Monetary Fund WP/06/285 IMF Working Paper IMF Institute
7 Time series analysis
7 Time series analysis In Chapters 16, 17, 33 36 in Zuur, Ieno and Smith (2007), various time series techniques are discussed. Applying these methods in Brodgar is straightforward, and most choices are
THE CONSTRUCT OF LORENZ CURVES AND OF THE GINI-COEFICIENT TO DEPICT DEGREES IN INEQUALITY IN HEALTH CARE
WWS 597 REINHARDT THE CONSTRUCT OF LORENZ CURVES AND OF THE GINI-COEFICIENT TO DEPICT DEGREES IN INEQUALITY IN HEALTH CARE There are a number of ways in which one can depict the distribution of some magnitude
Chapter 4 Creating Charts and Graphs
Calc Guide Chapter 4 OpenOffice.org Copyright This document is Copyright 2006 by its contributors as listed in the section titled Authors. You can distribute it and/or modify it under the terms of either
Statistics Revision Sheet Question 6 of Paper 2
Statistics Revision Sheet Question 6 of Paper The Statistics question is concerned mainly with the following terms. The Mean and the Median and are two ways of measuring the average. sumof values no. of
Summary of important mathematical operations and formulas (from first tutorial):
EXCEL Intermediate Tutorial Summary of important mathematical operations and formulas (from first tutorial): Operation Key Addition + Subtraction - Multiplication * Division / Exponential ^ To enter a
Advanced Microsoft Excel 2010
Advanced Microsoft Excel 2010 Table of Contents THE PASTE SPECIAL FUNCTION... 2 Paste Special Options... 2 Using the Paste Special Function... 3 ORGANIZING DATA... 4 Multiple-Level Sorting... 4 Subtotaling
Calibration and Linear Regression Analysis: A Self-Guided Tutorial
Calibration and Linear Regression Analysis: A Self-Guided Tutorial Part 1 Instrumental Analysis with Excel: The Basics CHM314 Instrumental Analysis Department of Chemistry, University of Toronto Dr. D.
4 Other useful features on the course web page. 5 Accessing SAS
1 Using SAS outside of ITCs Statistical Methods and Computing, 22S:30/105 Instructor: Cowles Lab 1 Jan 31, 2014 You can access SAS from off campus by using the ITC Virtual Desktop Go to https://virtualdesktopuiowaedu
IBM SPSS Statistics 20 Part 1: Descriptive Statistics
CALIFORNIA STATE UNIVERSITY, LOS ANGELES INFORMATION TECHNOLOGY SERVICES IBM SPSS Statistics 20 Part 1: Descriptive Statistics Summer 2013, Version 2.0 Table of Contents Introduction...2 Downloading the
Curve Fitting in Microsoft Excel By William Lee
Curve Fitting in Microsoft Excel By William Lee This document is here to guide you through the steps needed to do curve fitting in Microsoft Excel using the least-squares method. In mathematical equations
Spreadsheet software for linear regression analysis
Spreadsheet software for linear regression analysis Robert Nau Fuqua School of Business, Duke University Copies of these slides together with individual Excel files that demonstrate each program are available
Module 3: Measuring (step 2) Poverty Lines
Module 3: Measuring (step 2) Poverty Lines Topics 1. Alternative poverty lines 2. Setting an absolute poverty line 2.1. Cost of basic needs method 2.2. Food energy method 2.3. Subjective method 3. Issues
Using the American Community Survey Data
Using the American Community Survey Data The ACS website is accessible from www.census.gov/acs. In the middle of the screen choose In-depth Data Go directly to the ACS Datasets Tab The ACS Datasets Tab
Using Excel for Handling, Graphing, and Analyzing Scientific Data:
Using Excel for Handling, Graphing, and Analyzing Scientific Data: A Resource for Science and Mathematics Students Scott A. Sinex Barbara A. Gage Department of Physical Sciences and Engineering Prince
Data exploration with Microsoft Excel: univariate analysis
Data exploration with Microsoft Excel: univariate analysis Contents 1 Introduction... 1 2 Exploring a variable s frequency distribution... 2 3 Calculating measures of central tendency... 16 4 Calculating
Introduction to SAS Business Intelligence/Enterprise Guide Alex Dmitrienko, Ph.D., Eli Lilly and Company, Indianapolis, IN
Paper TS600 Introduction to SAS Business Intelligence/Enterprise Guide Alex Dmitrienko, Ph.D., Eli Lilly and Company, Indianapolis, IN ABSTRACT This paper provides an overview of new SAS Business Intelligence
Using Excel for descriptive statistics
FACT SHEET Using Excel for descriptive statistics Introduction Biologists no longer routinely plot graphs by hand or rely on calculators to carry out difficult and tedious statistical calculations. These
Data Mining: An Overview of Methods and Technologies for Increasing Profits in Direct Marketing. C. Olivia Rud, VP, Fleet Bank
Data Mining: An Overview of Methods and Technologies for Increasing Profits in Direct Marketing C. Olivia Rud, VP, Fleet Bank ABSTRACT Data Mining is a new term for the common practice of searching through
2013 MBA Jump Start Program. Statistics Module Part 3
2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just
Excel Basics By Tom Peters & Laura Spielman
Excel Basics By Tom Peters & Laura Spielman What is Excel? Microsoft Excel is a software program with spreadsheet format enabling the user to organize raw data, make tables and charts, graph and model
Introduction to SPSS 16.0
Introduction to SPSS 16.0 Edited by Emily Blumenthal Center for Social Science Computation and Research 110 Savery Hall University of Washington Seattle, WA 98195 USA (206) 543-8110 November 2010 http://julius.csscr.washington.edu/pdf/spss.pdf
2 Generalized Gini and Concentration coefficients. Keywords sgini ; Stata ; generalized Gini ; Concentration
sgini Generalized Gini and Concentration coefficients (with factor decomposition) in Stata Philippe Van Kerm CEPS/INSTEAD, Luxembourg September 2009 (revised February 2010) Abstract sgini is a user-written
OECD.Stat Web Browser User Guide
OECD.Stat Web Browser User Guide May 2013 May 2013 1 p.10 Search by keyword across themes and datasets p.31 View and save combined queries p.11 Customise dimensions: select variables, change table layout;
Using SPSS, Chapter 2: Descriptive Statistics
1 Using SPSS, Chapter 2: Descriptive Statistics Chapters 2.1 & 2.2 Descriptive Statistics 2 Mean, Standard Deviation, Variance, Range, Minimum, Maximum 2 Mean, Median, Mode, Standard Deviation, Variance,
Evaluating Trading Systems By John Ehlers and Ric Way
Evaluating Trading Systems By John Ehlers and Ric Way INTRODUCTION What is the best way to evaluate the performance of a trading system? Conventional wisdom holds that the best way is to examine the system
Updates to Graphing with Excel
Updates to Graphing with Excel NCC has recently upgraded to a new version of the Microsoft Office suite of programs. As such, many of the directions in the Biology Student Handbook for how to graph with
Child Survival and Equity: A Global Overview
Child Survival and Equity: A Global Overview Abdelmajid Tibouti, Ph.D. Senior Adviser UNICEF New York Consultation on Equity in Access to Quality Health Care For Women and Children 7 11 April 2008 Halong
Information Literacy Program
Information Literacy Program Excel (2013) Advanced Charts 2015 ANU Library anulib.anu.edu.au/training [email protected] Table of Contents Excel (2013) Advanced Charts Overview of charts... 1 Create a chart...
Universal Coverage Scheme in Thailand: Equity Outcomes and Future Agendas to Meet Challenges
Universal Coverage Scheme in Thailand: Equity Outcomes and Future Agendas to Meet Challenges Viroj Tangcharoensathien, Winai Swasdiworn, Pongpisut Jongudomsuk, Samrit Srithamrongswat, Walaiporn Patcharanarumol,
Multivariate Analysis of Health Survey Data
10 Multivariate Analysis of Health Survey Data The most basic description of health sector inequality is given by the bivariate relationship between a health variable and some indicator of socioeconomic
Tools for Excel Modeling. Introduction to Excel2007 Data Tables and Data Table Exercises
Tools for Excel Modeling Introduction to Excel2007 Data Tables and Data Table Exercises EXCEL REVIEW 2009-2010 Preface Data Tables are among the most useful of Excel s tools for analyzing data in spreadsheet
Modeling Lifetime Value in the Insurance Industry
Modeling Lifetime Value in the Insurance Industry C. Olivia Parr Rud, Executive Vice President, Data Square, LLC ABSTRACT Acquisition modeling for direct mail insurance has the unique challenge of targeting
F nest. Financial Intermediation Network of European Studies. Data and Sample Management with Stata
F nest Financial Intermediation Network of European Studies S U M M E R S C H O O L Data and Sample Management with Stata Dr. Giovanni Cerulli 6 October 2015 (half day) University of Rome III, Italy Lecturer
TIPS FOR DOING STATISTICS IN EXCEL
TIPS FOR DOING STATISTICS IN EXCEL Before you begin, make sure that you have the DATA ANALYSIS pack running on your machine. It comes with Excel. Here s how to check if you have it, and what to do if you
Generating Heat. Part 1: Breathing Earth. Part 2: The Growth of Carbon Emitters. Introduction: Materials:
Generating Heat Introduction: Carbon dioxide (CO 2 ) is the primary greenhouse gas contributing to global climate change. A greenhouse gas is a gas that absorbs the sunlight being reflected back towards
Lecture 2: Descriptive Statistics and Exploratory Data Analysis
Lecture 2: Descriptive Statistics and Exploratory Data Analysis Further Thoughts on Experimental Design 16 Individuals (8 each from two populations) with replicates Pop 1 Pop 2 Randomly sample 4 individuals
Paid and Unpaid Labor in Developing Countries: an inequalities in time use approach
Paid and Unpaid Work inequalities 1 Paid and Unpaid Labor in Developing Countries: an inequalities in time use approach Paid and Unpaid Labor in Developing Countries: an inequalities in time use approach
CHARTS AND GRAPHS INTRODUCTION USING SPSS TO DRAW GRAPHS SPSS GRAPH OPTIONS CAG08
CHARTS AND GRAPHS INTRODUCTION SPSS and Excel each contain a number of options for producing what are sometimes known as business graphics - i.e. statistical charts and diagrams. This handout explores
sample median Sample quartiles sample deciles sample quantiles sample percentiles Exercise 1 five number summary # Create and view a sorted
Sample uartiles We have seen that the sample median of a data set {x 1, x, x,, x n }, sorted in increasing order, is a value that divides it in such a way, that exactly half (i.e., 50%) of the sample observations
TIBCO Spotfire Business Author Essentials Quick Reference Guide. Table of contents:
Table of contents: Access Data for Analysis Data file types Format assumptions Data from Excel Information links Add multiple data tables Create & Interpret Visualizations Table Pie Chart Cross Table Treemap
From the help desk: Demand system estimation
The Stata Journal (2002) 2, Number 4, pp. 403 410 From the help desk: Demand system estimation Brian P. Poi Stata Corporation Abstract. This article provides an example illustrating how to use Stata to
0 Introduction to Data Analysis Using an Excel Spreadsheet
Experiment 0 Introduction to Data Analysis Using an Excel Spreadsheet I. Purpose The purpose of this introductory lab is to teach you a few basic things about how to use an EXCEL 2010 spreadsheet to do
Tutorial 5: Summarizing Tabular Data Florida Case Study
Tutorial 5: Summarizing Tabular Data Florida Case Study This tutorial will introduce you to the following: Identifying Attribute Data Sources for Health Data Converting Tabular Data into Usable Databases
Working with Excel in Origin
Working with Excel in Origin Limitations When Working with Excel in Origin To plot your workbook data in Origin, you must have Excel version 7 (Microsoft Office 95) or later installed on your computer
Scientific Graphing in Excel 2010
Scientific Graphing in Excel 2010 When you start Excel, you will see the screen below. Various parts of the display are labelled in red, with arrows, to define the terms used in the remainder of this overview.
Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition
Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition Online Learning Centre Technology Step-by-Step - Excel Microsoft Excel is a spreadsheet software application
Module 2 Basic Data Management, Graphs, and Log-Files
AGRODEP Stata Training April 2013 Module 2 Basic Data Management, Graphs, and Log-Files Manuel Barron 1 and Pia Basurto 2 1 University of California, Berkeley, Department of Agricultural and Resource Economics
Basic Statistical and Modeling Procedures Using SAS
Basic Statistical and Modeling Procedures Using SAS One-Sample Tests The statistical procedures illustrated in this handout use two datasets. The first, Pulse, has information collected in a classroom
Inquisite Reporting Plug-In for Microsoft Office. Version 7.5. Getting Started
Inquisite Reporting Plug-In for Microsoft Office Version 7.5 Getting Started 2006 Inquisite, Inc. All rights reserved. All Inquisite brand names and product names are trademarks of Inquisite, Inc. All
MARS STUDENT IMAGING PROJECT
MARS STUDENT IMAGING PROJECT Data Analysis Practice Guide Mars Education Program Arizona State University Data Analysis Practice Guide This set of activities is designed to help you organize data you collect
A Picture Really Is Worth a Thousand Words
4 A Picture Really Is Worth a Thousand Words Difficulty Scale (pretty easy, but not a cinch) What you ll learn about in this chapter Why a picture is really worth a thousand words How to create a histogram
Using Microsoft Excel to Plot and Analyze Kinetic Data
Entering and Formatting Data Using Microsoft Excel to Plot and Analyze Kinetic Data Open Excel. Set up the spreadsheet page (Sheet 1) so that anyone who reads it will understand the page (Figure 1). Type
