Ratio Estimators in Stratified Random Sampling
|
|
- Meagan Dixon
- 7 years ago
- Views:
Transcription
1 Biometrical Journal 45 (2003) 2, Ratio Estimators in Stratified Random Sampling C. Kadilar* and H. Cingi Department of Statistics, Hacettepe University, Beytepe, Ankara, Turkey Abstract This paper considers some ratio-type estimators and their properties are studied in stratified random sampling. The results are supported by an application with original data. Key Words: Ratio-type estimators; Stratified random sampling; Mean square errors. 1. Introduction A ratio estimate of the population mean Y can be made in two ways. One is to make a separate ratio estimate of the total of each stratum and add these totals. An alternative estimate is derived from a single combined ratio. From the sample data, we compute sample mean of the variates in stratified random sampling method as y st ¼ Pk w h y h ; x st ¼ Pk w h x h where k is the number of stratum, w h ¼ N h is stratum weight, N is the number of N units in population, N h is the number of units in stratum h, y h is the sample mean of variate of interest in stratum h and x h is the sample mean of auxiliary variate in stratum h. The combined ratio estimate is y RC ¼ y st X ¼ ^R c X ð1:1þ x st where X is the population mean of auxiliary variate (Cochran, 1977). The variance of combined ratio estimate is Vðy RC Þ¼ Pk w 2 h g h ðs2 yh 2RS yxh þ R 2 S 2 xh Þ ð1:2þ where g h ¼ 1 ðn h=n h Þ Y, R ¼ n h X is the population ratio, n h is the number of units * Corresponding author: kadilar@hacettepe.edu.tr # 2003 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim /03/ $ 17.50þ.50/0
2 Biometrical Journal 45 (2003) in sample stratum h, S 2 yh is the population variance of variate of interest in stratum h, S 2 xh is the population variance of auxiliary variate in stratum h and S yxh is the population covariance between auxiliary variate and variate of interest in stratum h. 2. Ratio Estimators and their Mean Square Errors When first degree approximation is used in obtaining the mean square error (MSE) of a ratio estimate, it is known that MSE is equal to the variance, so MSE of combined ratio estimate can be written as follows: MSE ðy RC Þ¼Vðy RC Þ¼ Pk w 2 h g h ðs2 yh 2RS yxh þ R 2 S 2 xh Þ : We can derive the bias of combined ratio estimator as Bðy RC Þ¼ 1 X w 2 h g h ðrs2 xh S yxhþ : ð2:1þ ð2:2þ 2.1 Sisodia-Dwivedi estimator When the population coefficient of variation C x is known, Sisodia and Dwivedi (1981) suggested a modified ratio estimator for Y as X y SD ¼ y þ C x : x þ C x In stratified random sampling, we propose this estimator as y stsd ¼ y st w h ð X h þ C xh Þ. Then equa- We can define x SD ¼ Pk tion (2.4) will be ð2:3þ w h ð X h þ C xh Þ : ð2:4þ w h ðx h þ C xh Þ y stsd ¼ y st x SD X SD ¼ ^R SD X SD w h ðx h þ C xh Þ and X SD ¼ Pk where ^R SD ¼ y st x SD. It should be noted that the difference between combined ratio estimator and Sisodia-Dwivedi estimator is only ^R SD. Thus, MSE and bias of this estimator can be given in the same way like equation (2.1) and equation (2.2),
3 220 C. Kadilar and H. Cingi: Ratio Estimators in Stratified Random Sampling respectively as MSE ðy stsd Þ¼ Pk Bðy SD Þ¼ 1 X SD where R SD ¼ Y st X SD ¼ w 2 h g h ðs2 yh 2R SDS yxh þ R 2 SD S2 xh Þ w 2 h g h ðr SDS 2 xh S yxhþ w h Y h : w h ð X h þ C xh Þ ð2:5þ ð2:6þ 2.2 Singh-Kakran estimator Motivated by Sisodia and Dwivedi (1981), Singh-Kakran ratio-type estimator for Y is developed as X y SK ¼ y þ b 2 ðxþ ð2:7þ x þ b 2 ðxþ where b 2 ðxþ is the population coefficient of kurtosis of auxiliary variate x (Upadhyaya and Singh, 1999). In stratified random sampling, we suggest this estimator as y stsk ¼ y st Again, we can define x SK ¼ Pk Then equation (2.8) will be w h ð X h þ b 2h ðxþþ : ð2:8þ w h ðx h þ b 2h ðxþþ w h ðx h þ b 2h ðxþþ and X SK ¼ Pk w h ð X h þ b 2h ðxþþ. y stsk ¼ y st X SK ¼ ^R SK X SK : x SK The only difference of MSE and bias of this estimator from MSE and bias of Sisodia-Dwivedi estimator is to replace R SD by R SK ¼ equation (2.5) and X SD by X SK in equation (2.6) as MSE ðy stsk Þ¼ Pk Bðy SK Þ¼ 1 X SK w 2 h g h ðs2 yh 2R SKS yxh þ R 2 SK S2 xh Þ w 2 h g h ðr SKS 2 xh S yxhþ Y st w h ð X h þ b 2h ðxþþ in ð2:9þ
4 Biometrical Journal 45 (2003) Upadhyaya-Singh estimator Upadhyaya and Singh (1999) considered both coefficients of variation and kurtosis in their ratio-type estimator as Xb y US1 ¼ y 2 ðxþþc x : xb 2 ðxþþc x We modify this estimator for stratified random sampling as y stus1 ¼ y st where we can define w h ð X h b 2h ðxþþc xh Þ w h ðx h b 2h ðxþþc xh Þ ð2:10þ ð2:11þ x US1 ¼ Pk w h ðx h b 2h ðxþþc xh Þ and X US1 ¼ Pk By this way, we can rewrite equation (2.11) as y stus1 ¼ y st x US1 X US1 ¼ ^R US1 X US1 MSE and bias of this estimator, will be w h ð X h b 2h ðxþþc xh Þ : MSE ðy stus1 Þ¼ Pk Bðy stus1 Þ¼ 1 X US1 w 2 h g h ðs2 yh 2R US1S yxh þ R 2 US1 S2 xh Þ w 2 h g h ðr US1S 2 xh S yxhþ ð2:12þ ð2:13þ respectively, where R US1 ¼ Y st : w h ð X h b 2h ðxþþc xh Þ Upadhyaya and Singh (1999) proposed another estimate by changing the place of coefficient of kurtosis and coefficient of variation as XC y US2 ¼ y x þ b 2 ðxþ ð2:14þ xc x þ b 2 ðxþ In stratified random sampling this estimator will definitely be y stus2 ¼ y st w h ð X h C xh þ b 2h ðxþþ : ð2:15þ w h ðx h C xh þ b 2h ðxþþ
5 222 C. Kadilar and H. Cingi: Ratio Estimators in Stratified Random Sampling Let x US2 ¼ Pk w h ðx h C xh þ b 2h ðxþþ and X US2 ¼ Pk case, w h ð X h C xh þ b 2h ðxþþ. In this y stus2 ¼ y st X x US2 ¼ ^R US2 X US2 : US2 The only difference between this estimator and Upadhyaya-Singh first estimator Y st for MSE and bias terms is to replace R US1 by R US2 ¼ in w h ð X h C xh þ b 2h ðxþþ equation (2.12) and X US1 by X US2 in equation (2.13), respectively. 3. Efficiency Comparisons We compare the combined ratio estimator with Singh-Kakran estimator. We will have the conditions as follows: Let MSE ðy stsk Þ < MSE ðy RC Þ A ¼ Pk w 2 h g h ðs2 yh 2R SKS yxh þ R 2 SK S2 xh Þ < Pk w 2 h g h ð 2R SKS yxh þ R 2 SK S2 xh Þ < Pk w 2 h g h S yxh and B ¼ Pk w 2 h g h ðs2 yh 2RS yxh þ R 2 S 2 xh Þ w 2 h g h ð 2RS yxh þ R 2 S 2 xh Þ w 2 h g h S2 xh Thus, (3.1) becomes 2R SK A þ R 2 SK B < 2RA þ R2 B 2R SK A þ 2RA þ R 2 SK B R2 B < 0 2AðR SK RÞþBðR 2 SK R2 Þ < 0 2AðR SK RÞþBðR SK RÞðR SK þ RÞ < 0 where there are two conditions as follows: (i) when ðr SK RÞðR SK þ RÞ > 0, 2A R SK þ R þ B < 0 B < 2A R SK þ R ð3:1þ
6 Biometrical Journal 45 (2003) (ii) when ðr SK RÞðR SK þ RÞ < 0, 2A R SK þ R þ B > 0 B > 2A R SK þ R : When either of these conditions is satisfied, Singh-Kakran estimator will be more efficient than combined ratio estimator. These conditions are also valid while comparing MSE of combined ratio estimator with MSE of Sisodia-Dwivedi estimator and MSE of combined ratio estimator with MSE of Upadhyaya-Singh estimators. However, it is noted that R SK in the condition (i) or (ii) should be replaced by R SD,R US1 or R US2 according to the estimator, which is compared. 4. Application We have applied the ratio estimators on the data of apple production amount (as interest of variate) and number of apple trees (as auxiliary variate) in 854 villages of Turkey in 1999 (Source: Institute of Statistics, Republic of Turkey). First, we have stratified the data by regions of Turkey and from each stratum (region); we have randomly selected the samples (villages). By using Neyman allocation (Cochran, 1977), n h ¼ n N hs h N h S h ð4:1þ we have computed sample size in stratum h. Here we take sample size as n ¼ 140 (Cingi, 1994). From the results of n h, we decide to join two regions so we take six strata (as 1: Marmara, 2: Agean, 3: Mediterranean, 4: Central Anatolia, 5: Black Sea, 6: East and Southeast Anatolia) for this data. Then by using this stratified random sampling, MSE of ratio estimators are computed as described in section 2 and these estimators are compared between each other with respect to their MSE values. In Table 1, we observe the statistics about the population, strata and sample size. Note that the correlation between the variates is 92%. In Table 2, the values of MSE are given. From Table 2, it can be concluded that the combined ratio estimator has the minimum MSE and therefore it is the best estimator for the data. In the same way, when we analyze MSE of Singh-Kakran estimator and combined ratio estimator, we see ðr SK RÞðR SK þ RÞ < 0
7 224 C. Kadilar and H. Cingi: Ratio Estimators in Stratified Random Sampling Table 1 Data Statistics N ¼ 854 N 1 ¼ 106 N 2 ¼ 106 N 3 ¼ 94 N 4 ¼ 171 N 5 ¼ 204 N 6 ¼ 173 A ¼ n ¼ 140 n 1 ¼ 9 n 2 ¼ 17 n 3 ¼ 38 n 4 ¼ 67 n 5 ¼ 7 n 6 ¼ 2 B ¼ X ¼ X 1 ¼ X 2 ¼ X 3 ¼ X 4 ¼ X 5 ¼ X 6 ¼ 9844 R ¼ 0:07793 Y ¼ 2930 Y 1 ¼ 1536 Y 2 ¼ 2212 Y 3 ¼ 9384 Y 4 ¼ 5588 Y 5 ¼ 967 Y 6 ¼ 404 R SK ¼ 0:07784 b x ¼ 312:07 b x1 ¼ 25:71 b x2 ¼ 34:57 b x3 ¼ 26:14 b x4 ¼ 97:60 b x5 ¼ 27:47 b x6 ¼ 28:10 R SD ¼ 0:07792 b y ¼ 195:84 C x1 ¼ 2:02 C x2 ¼ 2:10 C x3 ¼ 2:22 C x4 ¼ 3:84 C x5 ¼ 1:72 C x6 ¼ 1:91 R US1 ¼ 0:0014 C x ¼ 3:85 C y1 ¼ 4:18 C y2 ¼ 5:22 C y3 ¼ 3:19 C y4 ¼ 5:13 C y5 ¼ 2:47 C y6 ¼ 2:34 R US2 ¼ 0:0285 C y ¼ 5:84 S x1 ¼ S x2 ¼ S x3 ¼ S x4 ¼ S x5 ¼ S x6 ¼ X SK ¼ S x ¼ S y1 ¼ 6425 S y2 ¼ S y3 ¼ S y4 ¼ S y5 ¼ 2390 S y6 ¼ 946 X SD ¼ S y ¼ r 1 ¼ 0.82 r 2 ¼ 0.86 r 3 ¼ 0.90 r 4 ¼ 0.99 r 5 ¼ 0.71 r 6 ¼ 0.89 X US1 ¼ r ¼ 0.92 g 1 ¼ g 2 ¼ g 3 ¼ g 4 ¼ g 5 ¼ g 6 ¼ X US2 ¼ k ¼ w 2 1 ¼ w2 2 ¼ w2 3 ¼ w2 4 ¼ 0.04 w2 5 ¼ w2 6 ¼ since R ¼ 0:07793 and R SK ¼ 0:07784: Then the condition B ¼ > 2A R SK þ R ¼ is not satisfied. Thus, combined ratio estimator is more efficient than Singh-Kakran estimator. Because of the same reason, combined ratio estimator is also more efficient than Sisodia-Dwivedi estimator and Upadhyaya-Singh estimators. For example, when we compare first estimator of Upadhyaya-Singh with combined ratio estimator, again we find ðr US1 RÞðR US1 þ RÞ < 0 so we investigate the condition 2A B ¼ > R US1 þ R ¼ and we see that this condition is not satisfied. Therefore, combined ratio estimator is more efficient than ratio-type estimator proposed by Upadhyaya and Singh (1999). Table 2 MSE Values of Ratio Estimators Estimators MSE Values Combined Ratio Sisodia-Dwivedi Singh-Kakran Upadhyaya-Singh Upadhyaya-Singh
8 Biometrical Journal 45 (2003) Conclusion We have examined ratio-type estimators in stratified random sampling and obtained their MSE equations. By these equations, MSE of estimators have been compared in theory and by this comparison, the conditions which the estimators have smaller MSE with respect to each other, have been found. These theoretical conditions are also satisfied by the results of an application with original data. In this application, it is concluded that the traditional combined ratio estimator is more efficient than the ratio estimators developed in recent years. This conclusion shows that, in the forthcoming studies, new ratio-type estimators should be proposed not only in simple random sampling but also in stratified random sampling. References Cochran, W. G., 1977: Sampling Techniques, John Wiley and Sons, New-York. Cingi, H., 1994: Sampling Theory, Hacettepe University Press. Singh, H. P. and Kakran, M. S., 1993: A Modified Ratio Estimator Using Known Coefficient of Kurtosis of an Auxiliary Character. (unpublished) Sisodia, B. V. S. and Dwivedi, V. K., 1981: A Modified Ratio Estimator Using Coefficient of Variation of Auxiliary Variable. Journal of Indian Society Agricultural Statistics 33, Upadhyaya, L. N. and Singh, H. P., 1999: Use of Transformed Auxiliary Variable in Estimating the Finite Population Mean. Biometrical Journal 41, 5, Received June 2001 Resubmission February 2002 Revised September 2002 Accepted November 2002
RATIO ESTIMATORS IN POST-STRATIFICATION
Nüfusbilim Dergisi\Turkish Journal of Population Studies, 2012, 34, 3-15 3 RATIO ESTIMATORS IN POST-STRATIFICATION In this study, some ratio-type estimators are taken into consideration in literature and
More informationAn Improved Estimator of Population Mean Using Power Transformation
J. Ind. Soc. gril. Statist. 58(2). 2004 : 223-230 n Improved Estimator of Population Mean Using Power Transformation Housila. P. Singh, Rajesh Tailor, Ritesh Tailor and M.S. Kakran 1 Vikram University.
More informationGeneralized modified linear systematic sampling scheme for finite populations
Hacettepe Journal of Mathematics and Statistics Volume 43 (3) (204), 529 542 Generalized modified linear systematic sampling scheme for finite populations J Subramani and Sat N Gupta Abstract The present
More informationInt. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session STS058) p.3319 Geometric Stratification Revisited
Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session STS058) p.3319 Geometric Stratification Revisited Horgan, Jane M. Dublin City University, School of Computing Glasnevin
More informationESTIMATION OF THE EFFECTIVE DEGREES OF FREEDOM IN T-TYPE TESTS FOR COMPLEX DATA
m ESTIMATION OF THE EFFECTIVE DEGREES OF FREEDOM IN T-TYPE TESTS FOR COMPLEX DATA Jiahe Qian, Educational Testing Service Rosedale Road, MS 02-T, Princeton, NJ 08541 Key Words" Complex sampling, NAEP data,
More informationSSDA.Analysis - A Class Library for Analysis of Sample Survey Data
Vol.2, Issue.1, Jan-Feb 2012 pp-242-246 ISSN: 2249-6645 SSDA.Analysis - A Class Library for Analysis of Sample Survey Data Anu Sharma 1 and S. B. Lal 1 1 (Indian Agricultural Statistics Research Institute)
More informationWhy Sample? Why not study everyone? Debate about Census vs. sampling
Sampling Why Sample? Why not study everyone? Debate about Census vs. sampling Problems in Sampling? What problems do you know about? What issues are you aware of? What questions do you have? Key Sampling
More informationThe 3-Level HLM Model
James H. Steiger Department of Psychology and Human Development Vanderbilt University Regression Modeling, 2009 1 2 Basic Characteristics of the 3-level Model Level-1 Model Level-2 Model Level-3 Model
More informationChapter 11 Introduction to Survey Sampling and Analysis Procedures
Chapter 11 Introduction to Survey Sampling and Analysis Procedures Chapter Table of Contents OVERVIEW...149 SurveySampling...150 SurveyDataAnalysis...151 DESIGN INFORMATION FOR SURVEY PROCEDURES...152
More informationECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2
University of California, Berkeley Prof. Ken Chay Department of Economics Fall Semester, 005 ECON 14 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE # Question 1: a. Below are the scatter plots of hourly wages
More informationSupporting Information
Copyright WILEY VCH Verlag GmbH & Co. KGaA, 69469 Weinheim, Germany, 2014. Supporting Information for Adv. Mater., DOI: 10.1002/adma.201305303 Freestanding Triboelectric-Layer Based Nanogenerators for
More informationMissing Data: Part 1 What to Do? Carol B. Thompson Johns Hopkins Biostatistics Center SON Brown Bag 3/20/13
Missing Data: Part 1 What to Do? Carol B. Thompson Johns Hopkins Biostatistics Center SON Brown Bag 3/20/13 Overview Missingness and impact on statistical analysis Missing data assumptions/mechanisms Conventional
More informationSampling. COUN 695 Experimental Design
Sampling COUN 695 Experimental Design Principles of Sampling Procedures are different for quantitative and qualitative research Sampling in quantitative research focuses on representativeness Sampling
More informationPapers presented at the ICES-III, June 18-21, 2007, Montreal, Quebec, Canada
A Comparison of the Results from the Old and New Private Sector Sample Designs for the Medical Expenditure Panel Survey-Insurance Component John P. Sommers 1 Anne T. Kearney 2 1 Agency for Healthcare Research
More informationWeb-based Supplementary Materials for Bayesian Effect Estimation. Accounting for Adjustment Uncertainty by Chi Wang, Giovanni
1 Web-based Supplementary Materials for Bayesian Effect Estimation Accounting for Adjustment Uncertainty by Chi Wang, Giovanni Parmigiani, and Francesca Dominici In Web Appendix A, we provide detailed
More informationGeometric Stratification of Accounting Data
Stratification of Accounting Data Patricia Gunning * Jane Mary Horgan ** William Yancey *** Abstract: We suggest a new procedure for defining te boundaries of te strata in igly skewed populations, usual
More informationMathematical test criteria for filtering complex systems: Plentiful observations
Available online at www.sciencedirect.com Journal of Computational Physics 7 (8) 678 7 www.elsevier.com/locate/jcp Mathematical test criteria for filtering complex systems: Plentiful ervations E. Castronovo,
More informationNew SAS Procedures for Analysis of Sample Survey Data
New SAS Procedures for Analysis of Sample Survey Data Anthony An and Donna Watts, SAS Institute Inc, Cary, NC Abstract Researchers use sample surveys to obtain information on a wide variety of issues Many
More informationAnnex 6 BEST PRACTICE EXAMPLES FOCUSING ON SAMPLE SIZE AND RELIABILITY CALCULATIONS AND SAMPLING FOR VALIDATION/VERIFICATION. (Version 01.
Page 1 BEST PRACTICE EXAMPLES FOCUSING ON SAMPLE SIZE AND RELIABILITY CALCULATIONS AND SAMPLING FOR VALIDATION/VERIFICATION (Version 01.1) I. Introduction 1. The clean development mechanism (CDM) Executive
More informationSurvey Data Analysis in Stata
Survey Data Analysis in Stata Jeff Pitblado Associate Director, Statistical Software StataCorp LP Stata Conference DC 2009 J. Pitblado (StataCorp) Survey Data Analysis DC 2009 1 / 44 Outline 1 Types of
More informationMultiple Imputation for Missing Data: A Cautionary Tale
Multiple Imputation for Missing Data: A Cautionary Tale Paul D. Allison University of Pennsylvania Address correspondence to Paul D. Allison, Sociology Department, University of Pennsylvania, 3718 Locust
More informationDescriptive Methods Ch. 6 and 7
Descriptive Methods Ch. 6 and 7 Purpose of Descriptive Research Purely descriptive research describes the characteristics or behaviors of a given population in a systematic and accurate fashion. Correlational
More informationCORRELATION ANALYSIS
CORRELATION ANALYSIS Learning Objectives Understand how correlation can be used to demonstrate a relationship between two factors. Know how to perform a correlation analysis and calculate the coefficient
More informationPart 2: Analysis of Relationship Between Two Variables
Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable
More informationMore Outlier Weight Issues in REACH 2010
More Outlier Weight Issues in REACH 2010 Steven Pedlow, Yongyi Wang, Ellen Scheib, and Hee-Choon Shin National Opinion Research Center, University of Chicago Abstract Weights are often used in sample surveys
More informationConfidence Intervals for One Standard Deviation Using Standard Deviation
Chapter 640 Confidence Intervals for One Standard Deviation Using Standard Deviation Introduction This routine calculates the sample size necessary to achieve a specified interval width or distance from
More informationAP STATISTICS 2010 SCORING GUIDELINES
2010 SCORING GUIDELINES Question 4 Intent of Question The primary goals of this question were to (1) assess students ability to calculate an expected value and a standard deviation; (2) recognize the applicability
More informationProbability Calculator
Chapter 95 Introduction Most statisticians have a set of probability tables that they refer to in doing their statistical wor. This procedure provides you with a set of electronic statistical tables that
More informationSAMPLING & INFERENTIAL STATISTICS. Sampling is necessary to make inferences about a population.
SAMPLING & INFERENTIAL STATISTICS Sampling is necessary to make inferences about a population. SAMPLING The group that you observe or collect data from is the sample. The group that you make generalizations
More informationAn Error-in-Variables Method for Determining Reactivity Ratios by On-Line Monitoring of Copolymerization Reactions
162 Full Paper Summary: A new error-in-variables method was developed to estimate the reactivity ratios in copolymerization systems. It brings the power of automatic, continuous, on-line monitoring of
More informationThe point-estimate method with large numbers of variables
INTERNATIONAL JOURNAL FOR NUMERICAL AND ANALYTICAL METHODS IN GEOMECHANICS Int. J. Numer. Anal. Meth. Geomech., 00; 6:1515 159 (DOI: 10.100/nag.56) The point-estimate method with large numbers of variables
More informationFinancial predictors of real activity and the financial accelerator B
Economics Letters 82 (2004) 167 172 www.elsevier.com/locate/econbase Financial predictors of real activity and the financial accelerator B Ashoka Mody a,1, Mark P. Taylor b,c, * a Research Department,
More informationINSTITUTIONAL SUPPORT FOR AGENCY PARTNERSHIPS: EXPLORING PERSONNEL PERCEPTION AND WEBSITE CONTENT
1 INSTITUTIONAL SUPPORT FOR AGENCY PARTNERSHIPS: EXPLORING PERSONNEL PERCEPTION AND WEBSITE CONTENT Lori Barrow Department of Forestry Southern Illinois University Carbondale, IL 62901 lb463a@siu.edu Erin
More informationA Regional Demand Forecasting Study for Transportation Fuels in Turkey
A al Demand Forecasting Study for Transportation Fuels in Turkey by Özlem Atalay a, Gürkan Kumbaroğlu Bogazici University, Department of Industrial Engineering, 34342, Bebek, Istanbul, Turkey, Phone :
More informationSection 13, Part 1 ANOVA. Analysis Of Variance
Section 13, Part 1 ANOVA Analysis Of Variance Course Overview So far in this course we ve covered: Descriptive statistics Summary statistics Tables and Graphs Probability Probability Rules Probability
More informationStratified Sampling for Sales and Use Tax Highly Skewed Data Determination of the Certainty Stratum Cut-off Amount
Stratified Sampling for Sales and Use Tax Highly Skewed Data Determination of the Certainty Stratum Cut-off Amount By Eric Falk and Wendy Rotz, Ernst and Young LLP Introduction With the increasing number
More informationCOMP6053 lecture: Relationship between two variables: correlation, covariance and r-squared. jn2@ecs.soton.ac.uk
COMP6053 lecture: Relationship between two variables: correlation, covariance and r-squared jn2@ecs.soton.ac.uk Relationships between variables So far we have looked at ways of characterizing the distribution
More informationModule 3: Correlation and Covariance
Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis
More informationFinancial Risk Management Exam Sample Questions/Answers
Financial Risk Management Exam Sample Questions/Answers Prepared by Daniel HERLEMONT 1 2 3 4 5 6 Chapter 3 Fundamentals of Statistics FRM-99, Question 4 Random walk assumes that returns from one time period
More informationPhoto Size Stratification in the US Forest Survey
FOREST SERVICE U. S. DEPARTMENT OF AGRICULTURE P.O. BOX 45, BERKELEY, CALIFORNIA 94701 PACIFIC SOUTHWEST Forest and Range Experiment Station U.S.D.A. FOREST SERVICE RESEARCH PAPER PSW- 51 /1969 CONTENTS
More informationLeast Squares Estimation
Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David
More informationCS 147: Computer Systems Performance Analysis
CS 147: Computer Systems Performance Analysis One-Factor Experiments CS 147: Computer Systems Performance Analysis One-Factor Experiments 1 / 42 Overview Introduction Overview Overview Introduction Finding
More informationWhat Is an Intracluster Correlation Coefficient? Crucial Concepts for Primary Care Researchers
What Is an Intracluster Correlation Coefficient? Crucial Concepts for Primary Care Researchers Shersten Killip, MD, MPH 1 Ziyad Mahfoud, PhD 2 Kevin Pearce, MD, MPH 1 1 Department of Family Practice and
More informationST 371 (IV): Discrete Random Variables
ST 371 (IV): Discrete Random Variables 1 Random Variables A random variable (rv) is a function that is defined on the sample space of the experiment and that assigns a numerical variable to each possible
More informationModule 5: Multiple Regression Analysis
Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College
More informationPaper PO06. Randomization in Clinical Trial Studies
Paper PO06 Randomization in Clinical Trial Studies David Shen, WCI, Inc. Zaizai Lu, AstraZeneca Pharmaceuticals ABSTRACT Randomization is of central importance in clinical trials. It prevents selection
More informationQuantitative Methods for Finance
Quantitative Methods for Finance Module 1: The Time Value of Money 1 Learning how to interpret interest rates as required rates of return, discount rates, or opportunity costs. 2 Learning how to explain
More informationWHERE DOES THE 10% CONDITION COME FROM?
1 WHERE DOES THE 10% CONDITION COME FROM? The text has mentioned The 10% Condition (at least) twice so far: p. 407 Bernoulli trials must be independent. If that assumption is violated, it is still okay
More informationMeasuring Line Edge Roughness: Fluctuations in Uncertainty
Tutor6.doc: Version 5/6/08 T h e L i t h o g r a p h y E x p e r t (August 008) Measuring Line Edge Roughness: Fluctuations in Uncertainty Line edge roughness () is the deviation of a feature edge (as
More informationIntroduction to Sampling. Dr. Safaa R. Amer. Overview. for Non-Statisticians. Part II. Part I. Sample Size. Introduction.
Introduction to Sampling for Non-Statisticians Dr. Safaa R. Amer Overview Part I Part II Introduction Census or Sample Sampling Frame Probability or non-probability sample Sampling with or without replacement
More informationStatistical Methods 13 Sampling Techniques
community project encouraging academics to share statistics support resources All stcp resources are released under a Creative Commons licence Statistical Methods 13 Sampling Techniques Based on materials
More informationChapter 9. Two-Sample Tests. Effect Sizes and Power Paired t Test Calculation
Chapter 9 Two-Sample Tests Paired t Test (Correlated Groups t Test) Effect Sizes and Power Paired t Test Calculation Summary Independent t Test Chapter 9 Homework Power and Two-Sample Tests: Paired Versus
More informationThe Sample Overlap Problem for Systematic Sampling
The Sample Overlap Problem for Systematic Sampling Robert E. Fay 1 1 Westat, Inc., 1600 Research Blvd., Rockville, MD 20850 Abstract Within the context of probability-based sampling from a finite population,
More informationChapter 10. Key Ideas Correlation, Correlation Coefficient (r),
Chapter 0 Key Ideas Correlation, Correlation Coefficient (r), Section 0-: Overview We have already explored the basics of describing single variable data sets. However, when two quantitative variables
More informationSTA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance
Principles of Statistics STA-201-TE This TECEP is an introduction to descriptive and inferential statistics. Topics include: measures of central tendency, variability, correlation, regression, hypothesis
More informationHere alx is an estimate of the incompleteness of
SURVEYS WITH OVERLAPPING FRAMES - PROBLEMS IN APPLICATION Introduction Frederic A. Vogel, Statistical Reporting Service, U. S. Dept. of Agriculture A multiple frame survey may be defined as one relying
More informationDATA warehouses have gained in importance in recent
86 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 15, NO. 1, JANUARY/FEBRUARY 2003 Using Fuzzy Linguistic Representations to Provide Explanatory Semantics for Data Warehouses Ling Feng, Member,
More informationNew Work Item for ISO 3534-5 Predictive Analytics (Initial Notes and Thoughts) Introduction
Introduction New Work Item for ISO 3534-5 Predictive Analytics (Initial Notes and Thoughts) Predictive analytics encompasses the body of statistical knowledge supporting the analysis of massive data sets.
More informationImputing Missing Data using SAS
ABSTRACT Paper 3295-2015 Imputing Missing Data using SAS Christopher Yim, California Polytechnic State University, San Luis Obispo Missing data is an unfortunate reality of statistics. However, there are
More informationThe Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables. Kathleen M. Lang* Boston College.
The Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables Kathleen M. Lang* Boston College and Peter Gottschalk Boston College Abstract We derive the efficiency loss
More informationInfluence of Aggressivenessand Conservativenessin Investing and Financing Policies on Performance of Industrial Firms in Kenya
IOSR Journal of Economics and Finance (IOSR-JEF) e-issn: 2321-5933, p-issn: 2321-5925.Volume 2, Issue 5 (Jan. 2014), PP 27-32 Influence of Aggressivenessand Conservativenessin Investing and Financing Policies
More informationA SURVEY ON CONTINUOUS ELLIPTICAL VECTOR DISTRIBUTIONS
A SURVEY ON CONTINUOUS ELLIPTICAL VECTOR DISTRIBUTIONS Eusebio GÓMEZ, Miguel A. GÓMEZ-VILLEGAS and J. Miguel MARÍN Abstract In this paper it is taken up a revision and characterization of the class of
More informationChapter 5 Analysis of variance SPSS Analysis of variance
Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,
More informationOverview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written
More informationSampling and Sampling Distributions
Sampling and Sampling Distributions Random Sampling A sample is a group of objects or readings taken from a population for counting or measurement. We shall distinguish between two kinds of populations
More informationNorthumberland Knowledge
Northumberland Knowledge Know Guide How to Analyse Data - November 2012 - This page has been left blank 2 About this guide The Know Guides are a suite of documents that provide useful information about
More informationUNDERSTANDING THE DEPENDENT-SAMPLES t TEST
UNDERSTANDING THE DEPENDENT-SAMPLES t TEST A dependent-samples t test (a.k.a. matched or paired-samples, matched-pairs, samples, or subjects, simple repeated-measures or within-groups, or correlated groups)
More informationIMPACT EVALUATION: INSTRUMENTAL VARIABLE METHOD
REPUBLIC OF SOUTH AFRICA GOVERNMENT-WIDE MONITORING & IMPACT EVALUATION SEMINAR IMPACT EVALUATION: INSTRUMENTAL VARIABLE METHOD SHAHID KHANDKER World Bank June 2006 ORGANIZED BY THE WORLD BANK AFRICA IMPACT
More informationChapter 19 Statistical analysis of survey data. Abstract
Chapter 9 Statistical analysis of survey data James R. Chromy Research Triangle Institute Research Triangle Park, North Carolina, USA Savitri Abeyasekera The University of Reading Reading, UK Abstract
More informationPremaster Statistics Tutorial 4 Full solutions
Premaster Statistics Tutorial 4 Full solutions Regression analysis Q1 (based on Doane & Seward, 4/E, 12.7) a. Interpret the slope of the fitted regression = 125,000 + 150. b. What is the prediction for
More informationChapter 8: Quantitative Sampling
Chapter 8: Quantitative Sampling I. Introduction to Sampling a. The primary goal of sampling is to get a representative sample, or a small collection of units or cases from a much larger collection or
More informationMATHEMATICAL METHODS OF STATISTICS
MATHEMATICAL METHODS OF STATISTICS By HARALD CRAMER TROFESSOK IN THE UNIVERSITY OF STOCKHOLM Princeton PRINCETON UNIVERSITY PRESS 1946 TABLE OF CONTENTS. First Part. MATHEMATICAL INTRODUCTION. CHAPTERS
More informationPartial Fractions: Undetermined Coefficients
1. Introduction Partial Fractions: Undetermined Coefficients Not every F(s) we encounter is in the Laplace table. Partial fractions is a method for re-writing F(s) in a form suitable for the use of the
More informationIAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results
IAPRI Quantitative Analysis Capacity Building Series Multiple regression analysis & interpreting results How important is R-squared? R-squared Published in Agricultural Economics 0.45 Best article of the
More information1. Data and information
1. Data and information -Random sampling application -Information base and data collection 1 References Data sampling: Cochran, W.G. (1977) Sampling Techniques. John Wiley&Sons, Inc. New York and others
More informationA new cut detection algorithm with constant false-alarm ratio for video segmentation
J. Vis. Commun. Image R. 5 (004) 44 www.elsevier.com/locate/jvci A new cut detection algorithm with constant false-alarm ratio for video segmentation Tie-Yan Liu, a Kwok-Tung Lo, b, * Xu-Dong Zhang, a
More informationThe Variance of Sample Variance for a Finite Population
The Variance of Sample Variance for a Finite Population Eungchun Cho November 11, 004 Key Words: variance of variance, variance estimator, sampling variance, randomization variance, moments Abstract The
More informationIdentifying and Reducing Nonresponse Bias throughout the Survey Process
Identifying and Reducing Nonresponse Bias throughout the Survey Process Thomas Krenzke, Wendy Van de Kerckhove, and Leyla Mohadjer Westat Keywords: Weighting, Data Collection, Assessments. Introduction
More informationOne-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups
One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups In analysis of variance, the main research question is whether the sample means are from different populations. The
More informationIntroduction to General and Generalized Linear Models
Introduction to General and Generalized Linear Models General Linear Models - part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby
More informationFEGYVERNEKI SÁNDOR, PROBABILITY THEORY AND MATHEmATICAL
FEGYVERNEKI SÁNDOR, PROBABILITY THEORY AND MATHEmATICAL STATIsTICs 4 IV. RANDOm VECTORs 1. JOINTLY DIsTRIBUTED RANDOm VARIABLEs If are two rom variables defined on the same sample space we define the joint
More informationSampling Procedures Y520. Strategies for Educational Inquiry. Robert S Michael
Sampling Procedures Y520 Strategies for Educational Inquiry Robert S Michael RSMichael 2-1 Terms Population (or universe) The group to which inferences are made based on a sample drawn from the population.
More informationMultiple Regression: What Is It?
Multiple Regression Multiple Regression: What Is It? Multiple regression is a collection of techniques in which there are multiple predictors of varying kinds and a single outcome We are interested in
More informationSupporting Online Material for Achard (RE 1070656) scheduled for 8/9/02 issue of Science
Supporting Online Material for Achard (RE 1070656) scheduled for 8/9/02 issue of Science Materials and Methods Overview Forest cover change is calculated using a sample of 102 observations distributed
More informationTIME SERIES ANALYSIS OF MEAN TEMPERATURE DATA IN TURKEY
TIME SERIES ANALYSIS OF MEAN TEMPERATURE DATA IN TURKEY Ali CAN*, Aysel T. ATIMTAY** *State Institute of Statistics, Necatibey Cad. No:114, Bakanlıklar 06100 Ankara, Turkey (ali.can@die.gov.tr) **Middle
More informationAdditional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm
Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm
More informationTesting Research and Statistical Hypotheses
Testing Research and Statistical Hypotheses Introduction In the last lab we analyzed metric artifact attributes such as thickness or width/thickness ratio. Those were continuous variables, which as you
More informationAn introduction to Value-at-Risk Learning Curve September 2003
An introduction to Value-at-Risk Learning Curve September 2003 Value-at-Risk The introduction of Value-at-Risk (VaR) as an accepted methodology for quantifying market risk is part of the evolution of risk
More informationSegmentation of stock trading customers according to potential value
Expert Systems with Applications 27 (2004) 27 33 www.elsevier.com/locate/eswa Segmentation of stock trading customers according to potential value H.W. Shin a, *, S.Y. Sohn b a Samsung Economy Research
More informationUnivariate Regression
Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is
More informationAudit Sampling 101. BY: Christopher L. Mitchell, MBA, CIA, CISA, CCSA Cmitchell@KBAGroupLLP.com
Audit Sampling 101 BY: Christopher L. Mitchell, MBA, CIA, CISA, CCSA Cmitchell@KBAGroupLLP.com BIO Principal KBA s Risk Advisory Services Team 15 years of internal controls experience within the following
More informationHomework 11. Part 1. Name: Score: / null
Name: Score: / Homework 11 Part 1 null 1 For which of the following correlations would the data points be clustered most closely around a straight line? A. r = 0.50 B. r = -0.80 C. r = 0.10 D. There is
More informationExpert Systems with Applications
Expert Systems with Applications 39 (2012) 3659 3667 Contents lists available at SciVerse ScienceDirect Expert Systems with Applications journal homepage: www.elsevier.com/locate/eswa Gradient boosting
More informationINTRODUCTION TO SURVEY DATA ANALYSIS THROUGH STATISTICAL PACKAGES
INTRODUCTION TO SURVEY DATA ANALYSIS THROUGH STATISTICAL PACKAGES Hukum Chandra Indian Agricultural Statistics Research Institute, New Delhi-110012 1. INTRODUCTION A sample survey is a process for collecting
More informationData Cleaning and Missing Data Analysis
Data Cleaning and Missing Data Analysis Dan Merson vagabond@psu.edu India McHale imm120@psu.edu April 13, 2010 Overview Introduction to SACS What do we mean by Data Cleaning and why do we do it? The SACS
More informationSection 14 Simple Linear Regression: Introduction to Least Squares Regression
Slide 1 Section 14 Simple Linear Regression: Introduction to Least Squares Regression There are several different measures of statistical association used for understanding the quantitative relationship
More informationMEU. INSTITUTE OF HEALTH SCIENCES COURSE SYLLABUS. Biostatistics
MEU. INSTITUTE OF HEALTH SCIENCES COURSE SYLLABUS title- course code: Program name: Contingency Tables and Log Linear Models Level Biostatistics Hours/week Ther. Recite. Lab. Others Total Master of Sci.
More informationAP STATISTICS REVIEW (YMS Chapters 1-8)
AP STATISTICS REVIEW (YMS Chapters 1-8) Exploring Data (Chapter 1) Categorical Data nominal scale, names e.g. male/female or eye color or breeds of dogs Quantitative Data rational scale (can +,,, with
More information