An Empirical Investigation on the Prevalence of. Manipulation in Nursing Homes Rating System

Size: px
Start display at page:

Download "An Empirical Investigation on the Prevalence of. Manipulation in Nursing Homes Rating System"

Transcription

1 An Empirical Investigation on the Prevalence of Manipulation in Nursing Homes Rating System Xu Han, Niam Yaraghi & Ram Gopal Abstract: The Nursing Home Compare System supported by the Centers for Medicare & Medicaid Services (CMS) has been widely used by patients, doctors and insurance companies. We argue that potential fraud may exist in the rating procedure, which can lead to biased and misleading ratings. We use the CMS rating data over and the corresponding complaint data reported by California Department of Public Health (CDPH) and financial data reported by Office of Statewide Health Planning and Development (OSHPD) for over 1200 nursing homes in California to empirically examine the key factors affecting the star rating of a nursing home. We find a significant association between a nursing home s star rating and its profits, which points to a financial incentive for nursing homes to cheat. We then demonstrate that this association does not always lead to legitimate efforts to improve their service quality, and can induce cheating in the self-reporting step of the rating procedure. A prediction model is then developed to detect cheating in the suspect population based on which % of the nursing homes are identified as likely cheaters. Key Words: CMS, Fraud Detection, Nursing Home, Prediction, Rating System 1

2 I. Introduction Over nursing home facilities in the United States provide care to 1.5 million residents of which 72% are over 65 years old. Each resident spends an average of 835 days at a nursing home. They account for 6 percent of the Medicare population but 17 percent of total Medicare spending. The Department of Health and Human Services estimate that 4.1% of the 65+ population in 2009 lived in nursing homes. This percentage increases with age, ranging from 1.1% for persons years to 3.5% for persons years and 13.2% for persons 85+. Given the importance of nursing homes in the quality of life of patients and the billions of dollars spent on these facilities, the Centers for Medicare & Medicaid Services (CMS) has designed and implemented a system for evaluating the nursing homes. The system provides a star rating in a 1-5 scale for each nursing home based on 3 domains of on-site inspection, staffing and quality of care. Given the lack of alternative sources of information on nursing homes, the publically available CMS rating has become the gold standard in the industry since its inception, and has been widely popular among patients, physicians and insurance companies (Thomas, 2014). The CMS rating, however, may not always reflect the true service quality of the nursing homes. Cases have been reported in which patients personal experiences differ significantly from the star rating. Some high-star-rating nursing homes are sued for substandard care, even causing death of patient due to improper use of medical treatment (Thomas, 2014). As we discuss below, the current rating system may be prone to manipulation by nursing homes. CMS first assigns an initial star rating to each nursing home based on the annual on-site inspection results that are performed by licensed inspectors. Self-reported measures from 2

3 quality and staffing domains are then used to adjust the initial star rating by adding 1 star if any self-reported domain is excellent (4 or 5 stars for staffing domain and 5 stars for quality domain) and subtracting 1 star if any self-reported domain is 1 star. Figure 1 shows the trend of the ratings over the past five years. The ratings of the self-reported domains consistently shift to the higher end. In other words, the number of nursing homes that claim high performance in staffing and quality domains has continuously increased over the past five years, leading to seemingly improved overall ratings. This trend can be interpreted in two ways: On the one hand, supporters can argue that increased levels of self-reported measures are genuine and represent an honest effort by nursing homes to constantly improving their services. On the other hand, skeptics may argue that the improved ratings are not legitimate but are a result of nursing homes success in developing strategies of manipulating the system to inflate their ratings. The phenomenon leads to the following questions: Is the observed star increasing legitimate or does fraud exist in the system? If there is fraud, then how severe it is? In this paper, we seek answers to the above questions. We develop empirical models to investigate the existence and prevalence of fraud and to identify the incentives behind it. The paper proceeds as follows. In Section II, the data collection is described and the empirical model is developed. In Section III, the regression results are analyzed and a correlation analysis is conducted to show preliminary evidence of fraud in the system. A complaint-based analysis is then conducted in Section IV, which justifies the existence of fraud in the system. In Section V, we develop a fraud estimation model to evaluate the severeness of cheating in the system and conduct a variable importance analysis to show key characteristics of the cheaters. The paper is concluded in Section VI. 3

4 II. Data Collection and Empirical Model A. Data Collection Our primary panel data is derived from the star rating records for 1219 California nursing homes during the period The data consist of CMS s ratings and nursing homes basic information, such as location, size, certificate and ownership. The pooled data consist of records for all the nursing homes over the 60-month (5-year) period. The second major dataset is provided by the California Office of Statewide Health Planning and Development (OSHPD). The OSHPD data includes details on nursing homes financial situation for the years Nursing homes income is categorized into healthcare section and non-healthcare section. The corresponding revenue and expense details for each section are provided, and the profits can be calculated. The healthcare section is further categorized by the payment sources, e.g., Medicare, Medicaid or self-paying. We also collected complaint and deficiency data from California Department of Public Health (CDPH) for year The CDPH data contains basic information of the all California nursing homes as well as the detailed complaints, facility self-reported incidents and inspection results. The CDPH complaint data not only covers complaints that CMS have already considered in its rating procedure, but also includes complaints and deficiency reports that are only state-wide and are not reported to CMS. B. Nursing Homes Financial Incentive CMS star rating has significant financial implications for nursing homes and thus they have an incentive to obtain the highest possible rating. To demonstrate the financial 4

5 implications of star ratings for nursing homes, we calculate the average profit per day per patient for nursing homes in each of the five star rating groups, as shown in Table 1. These averages serve as an estimate of the profit that a nursing home can expect given its rating. The difference is significant. For example, a nursing home receiving 3 stars in inspection may only expect a profit of for treating one patient for one day. However, if it gains two additional stars after self-reporting and achieves an overall rating of 5 stars, its expected profit can be Figure 2 shows the profit trend for each of the star rating group over the 5 years. The results explain nursing homes incentives to achieve the highest possible ratings from the financial perspective, and provide a quantifiable way to measure. In our model, we define the incentive of a nursing home to be the profit difference between its inspection rating and the highest overall rating it can obtain, as shown in Table 1. C. Empirical Model We focus on the change of the star rating that happen as a result of the self-reported measures. Variable StarChange is equal to the difference between the overall rating and the inspection rating. For example, if the nursing home receives 3 stars in the Medicare on-site inspection but receives a 5-star overall rating after including its self-reported measures on staffing and quality domains, then the StarChange equals +2. By definition, StarChange can only take discrete values of +2, +1, 0, -1 and -2, thus we employ an ordinal logistic regression model. The model investigates how StarChange taking values in five levels {-2,, +2} depends on a vector of independent variables X. StarChange is 5

6 determined by a set of parameters -2, -1, 0, 1, which define the cut points of the five levels. The model can be written as exp(α j +X β) P(StarChange j) =, (1) 1+exp(α j +X β) where j = -2, -1, 0, 1, and vector X includes the following independent variables: Incentive, Competition, ResTotal, ForProfit, Medicare, Medicaid, ResCouncil, FamCouncil. Among the independent variables, the main effect we consider in our model is the nursing homes financial incentive, denoted by Incentive. The variable Competition describes the number of competing nursing homes in the 10 miles radius. The number of residential patients in each nursing home represents its size, and is denoted by ResTotal. Variable ForProfit defines a nursing home s Ownership type. ForProfit equals to one if the nursing home is for-profit type and zero otherwise. Variable Medicare and Medicaid define a nursing home s certification. Medicare is equal to one if the nursing home is Medicare certified, likewise, Medicaid is equal to one if the nursing home has Medicaid certification. By law, nursing homes are required to allow councils set up by their residents or the family members of the residents. These residential and family councils can facilitate the communication with staff, and get problem resolved more efficiently. Since nursing home residents may be more vulnerable than normal people due to their health conditions, the residential council and family council can function very differently in resolving issues and handling complaints. In our model, binary variables ResCouncil and FamCouncil are included to denote the council type. A nursing home can have both types of councils. Note that in our dataset, only StarChange and Incentive change over time and the other variables remain constant over all of the 60 months. 6

7 III. Result Analysis A. Regression Results We estimate equation (1) using both the fixed effects and the Hausman-Taylor methods, as shown in Table 2. The pooled linear regression result is also listed. In view that many variables are time-invariant, their coefficients cannot be directly estimated by the fixed effect model. In all the methods, the main effect Incentive is significant, with positive association, i.e., the bigger incentives nursing homes have, the more likely their star ratings will increase after self-reporting. The control variable Medicare is also significant in both the pooled linear regression and Hausman-Taylor method, but Medicaid is not. In view that almost all the nursing homes are Medicaid certified, the results indicate that if a nursing home is only Medicaid certified, it is more likely to obtain additional stars through self-reporting. B. Correlation Analysis The results show a positive association between the financial incentive and the rating change. However, it does not necessarily mean that nursing homes are gaming the system. It is possible that the nursing homes gain the star improvement legitimately through their true efforts. To better understand the changes in star ratings, we investigate the correlation between the on-site inspections and self-reported measures. Two correlations are expected. First, within the same year, if nursing homes are reporting the truth, a positive correlation is expected between the CMS on-site inspection and nursing homes self-reported measures. Second, if a nursing home really puts an effort in improving its care quality, these efforts should 7

8 have a lasting effect and lead to better results in the next year s on-site inspections. The results are shown in Figure 3. It can be seen that the correlation between on-site inspection and selfreported measures within the same year is around 0.2, indicating inconsistency between the inspection and self-reporting domains within the same year. The correlation between selfreported measures and the next year s CMS inspection results is only , which indicates that the self-reported improvements in quality and staffing have no lasting effect on the next year s on-site inspection. IV. Complaint-based Analysis As discussed in the previous section, inconsistency exists between the inspection and self-reported results, which shows preliminary evidence of potential fraud. In this section, we conduct an analysis based on the CDPH complaint data to justify the existence of fraud. Comparing with CMS s complaint data, the CDPH data consists of additional complaint records that are only state-wide and not reported to CMS, thus it can reflect the service quality more accurately. In the following, we first show that the number of complaints is a valid indicator for the service quality by conducting a contingency analysis, and then develop a complaint-based method to justify the existence of fraud. A. Contingency Analysis The CDPH complaint data consists of complaints that can be filed either by the patients or by the family members. Different from the CMS conducted inspection and nursing homes self-reported staffing and quality domain data, the complaints directly reflect the service quality of a nursing home from the customers point of view. We combine the CDPH complaint 8

9 data with CMS star rating data. The combined data has 4378 records in total, i.e., around 900 records for each year. We made the assumption that the complaints reflect the true performance of a nursing home. To test it, we divide the complaint status into 5 levels in which level 1 has the least complaints and level 5 has the most. A 5 by 5 contingency table can be created for each of the three domains. In each contingency table, the columns and rows are the ratings and complaint status, respectively. The chi-square test is then performed to test the independency between the column and row variables. For inspection, quality and staffing domains, the Pearson chi-square statistics are (p-value = 0.000), (p-value = 0.000) and (p-value = 0.000), respectively, indicating strong dependencies between complaints and each of the three domains. In other words, the number of complaints does reflect the performance in all three domains, which supports our assumption. As a result, we can use the complaints as a valid proxy variable when analyzing potential fraud behaviors. B. Evidence on Rating Manipulation The contingency analysis shows that the number of complaints is directly associated with the service quality. If two nursing homes provide similar quality of service, we assume their complaint distributions are similar. Thus if overall rating reflects the true service quality of nursing homes, i.e., there is no fraud, then nursing homes with the same overall rating should have similar distribution in terms of number of complaints. Table 3 shows the average number of complaints for nursing homes with different inspection and overall ratings. For each overall rating level, the nursing homes are divided into 2 categories: Nursing homes whose star ratings increase after self-reporting and nursing homes whose star ratings do not increase after self- 9

10 reporting. We denote the upper triangular to be area I and lower triangular to be area II. In the following, we test the following claims: Claim 1: The nursing homes with the same overall rating but different inspection ratings have different complaint distribution. To test this claim, we run ANOVA for each overall rating level with different inspection rating. For each overall rating level, two ANOVA results are presented in Table 4. In the first one, nursing homes are grouped by whether star rating increases after self-reporting, while in the second one, nursing homes are grouped by on-site inspection rating. It can be seen that all the comparisons are significant. Our claim that nursing homes with the same overall rating but different inspection ratings have different complaint distribution is thus supported. Claim 2: The nursing homes with the same inspection rating but different overall ratings have the same complaint distribution. To test this claim, we run ANOVA for each inspection rating level with different overall rating. For each inspection rating level, two ANOVA results are presented. In the first one, nursing homes are grouped by whether star rating increases after self-reporting, while in the second one, nursing homes are grouped by overall rating. As shown in Table 5, we do not observe significant difference in the number of complaints, although the overall rating can be quite different. The results provide strong evidence for potential cheating behaviors in self-reporting. The self-reported high ratings in the staffing and quality domains, though can gain higher overall ratings, do not lead to any observable service quality improvement in terms of the number of complaints. 10

11 V. Fraud Estimation and Variable Importance As discussed above, the final ratings of a cheating nursing home is driven by two components. The first component is the observable characteristics which are common between both cheating and common while the second is the unobservable cheating component which pertains only to the cheating nursing home. Since the cheating component is unobserved and omitted from our regression model, the estimates of the remaining observed variables will be biased due to the omitted variable of cheating. However, the final rating of an honest nursing home is only driven by one component of observed characteristics. That is, since the cheating component does not exist among the honest nursing homes, our regression estimates for the honest group will not suffer from the omitted variable bias. To develop our fraud detection model, we first divide the nursing homes into two groups: the honest nursing homes and the potential cheaters. A regression is then run for the honest nursing homes. The obtained regression coefficients for the honest group reflect the true association when there is no fraud, and are unbiased. These coefficients are then used to predict the highest possible overall rating for each nursing home in the suspected cheating group. A nursing home is identified as a likely cheater in our estimation if its actual rating is higher than the highest level of estimated rating. A. Prediction Model Since the overall star rating is the variable we want to predict, it is used as the dependent variable, and is denoted by OverallRating. Similar to the variable StarChange in the regression model in Section II, OverallRating is ordinal and takes values in five levels {1, 2,, 5}, so we employ an ordinal logistic regression model. OverallRating is determined by a set of 11

12 parameters 1, 2, 3, 4, which define the cut points of the five star levels. The model can be written as P(OverallRating k) = exp(γ k +X β P ) 1+exp(γ k +X β P ), (2) where k = 1, 2, 3, 4. The independent variables are not changed and are denoted as a vector X. The coefficients of the prediction model are denoted by β P. B. Grouping Since we use the coefficients of the honest group as the unbiased baseline, we define the members in this group strictly, i.e., we want to make sure there is no evidence that these nursing homes are cheating. An honest nursing home is then selected based on the following strict criteria: 1. Its rating does not increase after self-reporting 2. Its number of complaints is strictly lower than the median of its corresponding selfreporting level. To define a nursing home s self-reporting level, we combine the two self-reported domains. The median and average of the number of complaints are reported in Table 6. Based on the criteria stated above, the honest (H) group consists of 1345 nursing homes in the 5 consecutive years. The remaining 3033 nursing home are categorized in the potential cheating (PC) group. Note that the PC group consists of both the actual cheaters and the nursing homes who improve their service qualities through legitimate efforts. In the following, we estimate the proportion of the actual cheaters in the PC population. 12

13 C. Identification of Cheaters For the H group, we run the ordinal logistic regression in equation (2) to obtain the estimates of each coefficient, as shown in Table 7. Note that the H group is strictly defined and there is no evidence for cheating. Both the 95% and 90% confidence interval are reported. The coefficients are assumed to be unbiased. By using the unbiased coefficients and the upper bounds of the confidence intervals, we can predict the highest possible rating for the PC group. The cheaters are identified if its real overall rating is higher than the highest possible rating predicted. By using the 95% confidence interval, we can identify 184 out of the 3033 nursing home (6.07%) in the PC group which are most likely to be cheaters. Based on the 90% confidence interval, we can identify 379 out of 3033 nursing home (12.5%) in the PC group which are most likely to be cheaters. C. Variable Importance To emphasize the key difference between honest nursing homes and the likely cheaters, a subset of the data is constructed by eliminating nursing homes whose status are difficult to be identified. This includes the nursing homes that are not identified as cheaters and are also not a member of the honest group. The remaining dataset consists of 1724 nursing homes, in which 1345 are identified as honest nursing homes and 379 are identified as cheaters. The status of a nursing home is identified as 0 if it is an honest nursing home and 1 if it is a cheater. In view that status takes binary values, we develop a logit model to perform the variable importance analysis, as shown in equation 3. The probability of being an identified cheater is denoted as. 13

14 logit(λ) = β 0 + β 1 Incentive + β 2 Competition + β 3 ResTotal + β 4 ForProfit + β 5 Medicare + β 6 Medicaid + β 7 ResCouncil + β 8 FamCouncil + β 9 InspectionRating + β 10 QualityRating + β 11 StaffingRating +, (3) where InspectionRating, QualityRating and StaffingRating denote the 5-star ratings for the three domains. Other variables follow the same meanings as explained in Section II. The coefficient estimates and their importance are shown in Table 8. The two selfreported domains QualityRating and SatffingRating have the highest variable importance, i.e., they contribute the most to the probability of being a cheater if these values change from their lower bounds to upper bounds. The result is consistent with our analysis since untruthful high self-reporting is the only way for a cheater to achieve high overall rating. Besides the selfreported domain variables, we find the variable ResTotal to be the third in terms of variable importance, achieving a score of out of 100. The result indicates that when a nursing home s size grows, its probability to game the rating system increases significantly. The Incentive and ForProfit, being the 4 th and 5 th in the variable importance analysis, indicate that the financial incentive of a nursing home plays an important role in being a cheater. The forprofits nursing homes are more likely to be cheaters than the non-profits ones, and the higher their financial incentives are, the more likely they will be cheating. On the other hand, the probability of being a cheater does not change a lot with the change of competition and certification status (Medicare or Medicaid Certified). VI. Conclusion 14

15 This paper systematically analyzes CMS s nursing home rating system, and documents the existence of fraud. We develop a fraud detection model to estimate the proportion of cheating nursing homes. A variable importance analysis is then performed to identify the factors that contribute the most to being a cheater. Our research provides several contributions. First, to the best of our knowledge, it is the first study that systematically investigates the fraud in the CMS nursing home rating system, and explains incentives behind fraud. This study pinpoints the major flaws in the current rating system and contributes to the improvement of the rating mechanism. Second, the study estimates the proportion of likely cheaters, which serves as an important performance evaluation on the current rating system. These estimates can be used to strategically focus the future audits of the CMS on the nursing homes which are most likely to be cheaters. 15

16 Bibliography Berg, K. e. (2002). Identification and evaluation of existing nursing homes quality indicators. Health Care Financing Review, Cabin, W. (2014). For-profit Medicare Home Health Agencies' Costs Appear Higher and Quality Appears Lower Compared to Nonprofit Agencies. Health Affair, Della Vingna, S. (2010). Detecting illegal arms trade. American Economic Journal: Economic Policy, Duggan, M. (2000). Winning isin't everything: Corruption in sumo wrestling. National Bureau of economic research. Engelberg. (2014). Financial Conflicts of interest in Medicine. SSRN. Fennell, M. L. (2010). Elderly Hispanics more likely to reside in poor-quality nursing homes. Health Affairs, Grabowski, D. C. (2004). Medicaid payment and risk-adjusted nursing home quality measures. Health Affairs, Grabowski, D. C. (2004). Recent trends in state nursing home payment policies. Health Affairs, Harrington, C. (2001). Does investor ownership of nursin ghomes compromise the quality of care? American Journal of Public Health, Jacob, B. A. (2003). Rotten apples: An investigation of the prevalence and predictors of teacher cheating. National Bureau of Economic Research. 16

17 Kane, R. A. (2003). Definition, measurement and correlates of quality of life in nursing homes: Toward a reasonable practice, research, and policy agenda. The Gerontologist, Mayzlin, D. (2012). Promotional reviews: An empirical investigation of online review manipulation. National Bureau of Economic Research. Mor, V. e. (2003). The quality of quality measurement in U.S. nursing homes. The Gerontologist, Smith, D. B. (2007). Separate and unequal: racial segregation and disparities in quality across U.S. nursing homes. Health Affairs, Stevenson, D. G. (2003). The rise of nursing home litigation: Findings from a national survey of attorneys. Health Affairs, Stevenson, D. G. (2008). Private equity investment and nursing home care: Is it a big deal? Health Affairs, Studdert, D. M. (2011). Relationship between quality of care and negligence litigation in nursing homes. New England Journal of Medicine, Thomas, K. (2014). Medicare Star Ratings Allow Nursing Homes to Game the System. The New York Times. werner, R. E. (2010). Public reporting drove quality gains at nursing homes. HealthAffairs, Werner, R. M. (2010). Advancing nursing home quality through quality improvement itself. HealthAffairs,

18 Figure 1. The Trend of Five-year Ratings : (a) Inspection Rating (b) Quality Measurement Rating (c) Staffing Rating (d) Overall rating (a) (b) (c) (d) 18

19 Figure 2. Profit Trend over the 5 Years : (a) Plotted by Overall Ratings (b) Plotted by Years (a) (b) 19

20 Figure 3. The Correlation Analysis: a) A Positive Correlation Is Expected if Star Increase Is Resulted from Self-improvement Efforts (Red Arrow) b) A Positive Correlation Is Expected if a Nursing Home s Self-reporting is In Line with Medicare s Inspection 20

21 Table 1. Financial Incentive Definition Inspection rating Expected profit* if inspection rating unchanged Maximal possible overall rating Expected profit* if Maximal possible overall rating realized Incentive (profit difference) (Level 5 Level 5) (Level 5 Level 4) (Level 5 Level 3) (Level 4 Level 2) (Level 2 Level 1) *The expected profit is the average per patient per day profit for the corresponding star rating group 21

22 Table 2. Regression Results Variables Pooled data Fixed effect Hausman-Taylor Incentive *** *** *** Competition ResTotal *** ForProfit *** * Medicare *** *** Medicaid *** * ResCouncil FamCouncil **

23 Table 3. Average Complaints Overall 1 Overall 2 Overall 3 Overall 4 Overall 5 Inspection Inspection Inspection Inspection Inspection Note: The blank cells represent the impossible rating transaction according to CMS s rating system design. 23

24 Table 4. F test: Grouped by Inspection Rating F statistics: Area I vs Area II F statistics: Grouped by Inspection Overall 1 star ** Overall 2 star 7.43*** 6.16*** Overall 3 star 13.05*** 5.06*** Overall 4 star 14.22*** 8.35*** Overall 5 star 5.27** 5.70*** 24

25 Table 5. F test: Grouped by Overall Rating F statistics: Area I vs Area II F statistics: Grouped by overall Inspection 1 star Inspection 2 star Inspection 3 star Inspection 4 star 5.37** 2.00 Inspection 5 star

26 Table 6. Combined Self-reporting Levels Average Rating of Staffing and QM Complaint Average Complaint Median Level 1: Level 2: Level 3: Level 4: Level 5:

27 Table 7. The Unbiased Results for the H Group Overll Rating Coefficients 95% Confidence Interval 90% Confidence Interval Incentive *** Competition ResTotal *** ForProfit *** Medicare *** Medicaid * ResCouncil FamCouncil 0.483***

28 Table 8. Variable Importance Analysis Overall Rating Coefficients Variable Importance Incentive *** Competition ResTotal *** ForProfit 0.223*** Medicare Medicaid ResCouncil ** FamCouncil *** InspectionRating ** QualityRating *** StaffingRating ***

Nursing Home Compare Five-Star Quality Rating System: Year Five Report [Public Version]

Nursing Home Compare Five-Star Quality Rating System: Year Five Report [Public Version] Nursing Home Compare Five-Star Quality Rating System: Year Five Report [Public Version] Final Report June 16, 2014 Prepared for Centers for Medicare & Medicaid Services (CMS) AGG/Research Contracts & Grants

More information

ASSIGNMENT 4 PREDICTIVE MODELING AND GAINS CHARTS

ASSIGNMENT 4 PREDICTIVE MODELING AND GAINS CHARTS DATABASE MARKETING Fall 2015, max 24 credits Dead line 15.10. ASSIGNMENT 4 PREDICTIVE MODELING AND GAINS CHARTS PART A Gains chart with excel Prepare a gains chart from the data in \\work\courses\e\27\e20100\ass4b.xls.

More information

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( ) Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

More information

Data Analysis, Research Study Design and the IRB

Data Analysis, Research Study Design and the IRB Minding the p-values p and Quartiles: Data Analysis, Research Study Design and the IRB Don Allensworth-Davies, MSc Research Manager, Data Coordinating Center Boston University School of Public Health IRB

More information

Mode and Patient-mix Adjustment of the CAHPS Hospital Survey (HCAHPS)

Mode and Patient-mix Adjustment of the CAHPS Hospital Survey (HCAHPS) Mode and Patient-mix Adjustment of the CAHPS Hospital Survey (HCAHPS) April 30, 2008 Abstract A randomized Mode Experiment of 27,229 discharges from 45 hospitals was used to develop adjustments for the

More information

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Chapter Seven Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Section : An introduction to multiple regression WHAT IS MULTIPLE REGRESSION? Multiple

More information

Five Star Rating System Tip Sheet

Five Star Rating System Tip Sheet Five Star Rating System Tip Sheet In December 2008, The Centers for Medicare & Medicaid Services (CMS) enhanced its Nursing Home Compare public reporting site to include a set of quality ratings for each

More information

SAMPLE DESIGN RESEARCH FOR THE NATIONAL NURSING HOME SURVEY

SAMPLE DESIGN RESEARCH FOR THE NATIONAL NURSING HOME SURVEY SAMPLE DESIGN RESEARCH FOR THE NATIONAL NURSING HOME SURVEY Karen E. Davis National Center for Health Statistics, 6525 Belcrest Road, Room 915, Hyattsville, MD 20782 KEY WORDS: Sample survey, cost model

More information

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm

More information

Premaster Statistics Tutorial 4 Full solutions

Premaster Statistics Tutorial 4 Full solutions Premaster Statistics Tutorial 4 Full solutions Regression analysis Q1 (based on Doane & Seward, 4/E, 12.7) a. Interpret the slope of the fitted regression = 125,000 + 150. b. What is the prediction for

More information

Simple Linear Regression, Scatterplots, and Bivariate Correlation

Simple Linear Regression, Scatterplots, and Bivariate Correlation 1 Simple Linear Regression, Scatterplots, and Bivariate Correlation This section covers procedures for testing the association between two continuous variables using the SPSS Regression and Correlate analyses.

More information

Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk

Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk Structure As a starting point it is useful to consider a basic questionnaire as containing three main sections:

More information

Risk adjustment and shared savings agreements

Risk adjustment and shared savings agreements Risk adjustment and shared savings agreements Hans K. Leida, PhD, FSA, MAAA Leigh M. Wachenheim, FSA, MAAA In a typical shared savings arrangement, claim costs during the measurement or experience period

More information

Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus

Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus Tihomir Asparouhov and Bengt Muthén Mplus Web Notes: No. 15 Version 8, August 5, 2014 1 Abstract This paper discusses alternatives

More information

CHAPTER 15 NOMINAL MEASURES OF CORRELATION: PHI, THE CONTINGENCY COEFFICIENT, AND CRAMER'S V

CHAPTER 15 NOMINAL MEASURES OF CORRELATION: PHI, THE CONTINGENCY COEFFICIENT, AND CRAMER'S V CHAPTER 15 NOMINAL MEASURES OF CORRELATION: PHI, THE CONTINGENCY COEFFICIENT, AND CRAMER'S V Chapters 13 and 14 introduced and explained the use of a set of statistical tools that researchers use to measure

More information

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance

More information

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate

More information

STATISTICA Formula Guide: Logistic Regression. Table of Contents

STATISTICA Formula Guide: Logistic Regression. Table of Contents : Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

Public Health Insurance Expansions for Parents and Enhancement Effects for Child Coverage

Public Health Insurance Expansions for Parents and Enhancement Effects for Child Coverage Public Health Insurance Expansions for Parents and Enhancement Effects for Child Coverage Jason R. Davis, University of Wisconsin Stevens Point ABSTRACT In 1997, the federal government provided states

More information

Conducting an empirical analysis of economic data can be rewarding and

Conducting an empirical analysis of economic data can be rewarding and CHAPTER 10 Conducting a Regression Study Using Economic Data Conducting an empirical analysis of economic data can be rewarding and informative. If you follow some basic guidelines, it is possible to use

More information

CALCULATIONS & STATISTICS

CALCULATIONS & STATISTICS CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents

More information

Overview Classes. 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)

Overview Classes. 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7) Overview Classes 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7) 2-4 Loglinear models (8) 5-4 15-17 hrs; 5B02 Building and

More information

Simple Predictive Analytics Curtis Seare

Simple Predictive Analytics Curtis Seare Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use

More information

Introduction to Regression and Data Analysis

Introduction to Regression and Data Analysis Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it

More information

Staffing and Quality in California s Nursing Homes

Staffing and Quality in California s Nursing Homes s n a p s h o t in California s s 2006 Introduction Staffing levels in California s nursing homes are a key indicator of the quality of care. A study from the U.S. Centers for Medicare and Medicaid Services

More information

Research Methods & Experimental Design

Research Methods & Experimental Design Research Methods & Experimental Design 16.422 Human Supervisory Control April 2004 Research Methods Qualitative vs. quantitative Understanding the relationship between objectives (research question) and

More information

Nonparametric statistics and model selection

Nonparametric statistics and model selection Chapter 5 Nonparametric statistics and model selection In Chapter, we learned about the t-test and its variations. These were designed to compare sample means, and relied heavily on assumptions of normality.

More information

INTERNATIONAL STANDARD ON AUDITING 540 AUDITING ACCOUNTING ESTIMATES, INCLUDING FAIR VALUE ACCOUNTING ESTIMATES, AND RELATED DISCLOSURES CONTENTS

INTERNATIONAL STANDARD ON AUDITING 540 AUDITING ACCOUNTING ESTIMATES, INCLUDING FAIR VALUE ACCOUNTING ESTIMATES, AND RELATED DISCLOSURES CONTENTS INTERNATIONAL STANDARD ON AUDITING 540 AUDITING ACCOUNTING ESTIMATES, INCLUDING FAIR VALUE ACCOUNTING ESTIMATES, AND RELATED DISCLOSURES (Effective for audits of financial statements for periods beginning

More information

The Importance of Community College Honors Programs

The Importance of Community College Honors Programs 6 This chapter examines relationships between the presence of honors programs at community colleges and institutional, curricular, and student body characteristics. Furthermore, the author relates his

More information

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written

More information

Two Correlated Proportions (McNemar Test)

Two Correlated Proportions (McNemar Test) Chapter 50 Two Correlated Proportions (Mcemar Test) Introduction This procedure computes confidence intervals and hypothesis tests for the comparison of the marginal frequencies of two factors (each with

More information

Handling missing data in Stata a whirlwind tour

Handling missing data in Stata a whirlwind tour Handling missing data in Stata a whirlwind tour 2012 Italian Stata Users Group Meeting Jonathan Bartlett www.missingdata.org.uk 20th September 2012 1/55 Outline The problem of missing data and a principled

More information

Statistical modelling with missing data using multiple imputation. Session 4: Sensitivity Analysis after Multiple Imputation

Statistical modelling with missing data using multiple imputation. Session 4: Sensitivity Analysis after Multiple Imputation Statistical modelling with missing data using multiple imputation Session 4: Sensitivity Analysis after Multiple Imputation James Carpenter London School of Hygiene & Tropical Medicine Email: james.carpenter@lshtm.ac.uk

More information

LOGISTIC REGRESSION ANALYSIS

LOGISTIC REGRESSION ANALYSIS LOGISTIC REGRESSION ANALYSIS C. Mitchell Dayton Department of Measurement, Statistics & Evaluation Room 1230D Benjamin Building University of Maryland September 1992 1. Introduction and Model Logistic

More information

Study Design and Statistical Analysis

Study Design and Statistical Analysis Study Design and Statistical Analysis Anny H Xiang, PhD Department of Preventive Medicine University of Southern California Outline Designing Clinical Research Studies Statistical Data Analysis Designing

More information

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY 1. Introduction Besides arriving at an appropriate expression of an average or consensus value for observations of a population, it is important to

More information

Appendix G STATISTICAL METHODS INFECTIOUS METHODS STATISTICAL ROADMAP. Prepared in Support of: CDC/NCEH Cross Sectional Assessment Study.

Appendix G STATISTICAL METHODS INFECTIOUS METHODS STATISTICAL ROADMAP. Prepared in Support of: CDC/NCEH Cross Sectional Assessment Study. Appendix G STATISTICAL METHODS INFECTIOUS METHODS STATISTICAL ROADMAP Prepared in Support of: CDC/NCEH Cross Sectional Assessment Study Prepared by: Centers for Disease Control and Prevention National

More information

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number 1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression

More information

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r), Chapter 0 Key Ideas Correlation, Correlation Coefficient (r), Section 0-: Overview We have already explored the basics of describing single variable data sets. However, when two quantitative variables

More information

Logit Models for Binary Data

Logit Models for Binary Data Chapter 3 Logit Models for Binary Data We now turn our attention to regression models for dichotomous data, including logistic regression and probit analysis. These models are appropriate when the response

More information

The Wage Return to Education: What Hides Behind the Least Squares Bias?

The Wage Return to Education: What Hides Behind the Least Squares Bias? DISCUSSION PAPER SERIES IZA DP No. 8855 The Wage Return to Education: What Hides Behind the Least Squares Bias? Corrado Andini February 2015 Forschungsinstitut zur Zukunft der Arbeit Institute for the

More information

Multinomial and Ordinal Logistic Regression

Multinomial and Ordinal Logistic Regression Multinomial and Ordinal Logistic Regression ME104: Linear Regression Analysis Kenneth Benoit August 22, 2012 Regression with categorical dependent variables When the dependent variable is categorical,

More information

Introduction to Statistics Used in Nursing Research

Introduction to Statistics Used in Nursing Research Introduction to Statistics Used in Nursing Research Laura P. Kimble, PhD, RN, FNP-C, FAAN Professor and Piedmont Healthcare Endowed Chair in Nursing Georgia Baptist College of Nursing Of Mercer University

More information

Composite performance measures in the public sector Rowena Jacobs, Maria Goddard and Peter C. Smith

Composite performance measures in the public sector Rowena Jacobs, Maria Goddard and Peter C. Smith Policy Discussion Briefing January 27 Composite performance measures in the public sector Rowena Jacobs, Maria Goddard and Peter C. Smith Introduction It is rare to open a newspaper or read a government

More information

An Introduction to. Metrics. used during. Software Development

An Introduction to. Metrics. used during. Software Development An Introduction to Metrics used during Software Development Life Cycle www.softwaretestinggenius.com Page 1 of 10 Define the Metric Objectives You can t control what you can t measure. This is a quote

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ

1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ STA 3024 Practice Problems Exam 2 NOTE: These are just Practice Problems. This is NOT meant to look just like the test, and it is NOT the only thing that you should study. Make sure you know all the material

More information

Multiple logistic regression analysis of cigarette use among high school students

Multiple logistic regression analysis of cigarette use among high school students Multiple logistic regression analysis of cigarette use among high school students ABSTRACT Joseph Adwere-Boamah Alliant International University A binary logistic regression analysis was performed to predict

More information

2015 Michigan Department of Health and Human Services Adult Medicaid Health Plan CAHPS Report

2015 Michigan Department of Health and Human Services Adult Medicaid Health Plan CAHPS Report 2015 State of Michigan Department of Health and Human Services 2015 Michigan Department of Health and Human Services Adult Medicaid Health Plan CAHPS Report September 2015 Draft Draft 3133 East Camelback

More information

OREGON PROPERLY VERIFIED CORRECTION OF DEFICIENCIES IDENTIFIED DURING SURVEYS OF NURSING HOMES PARTICIPATING IN MEDICARE AND MEDICAID

OREGON PROPERLY VERIFIED CORRECTION OF DEFICIENCIES IDENTIFIED DURING SURVEYS OF NURSING HOMES PARTICIPATING IN MEDICARE AND MEDICAID Department of Health and Human Services OFFICE OF INSPECTOR GENERAL OREGON PROPERLY VERIFIED CORRECTION OF DEFICIENCIES IDENTIFIED DURING SURVEYS OF NURSING HOMES PARTICIPATING IN MEDICARE AND MEDICAID

More information

Integrating Quality Assurance into the GIS Project Life Cycle

Integrating Quality Assurance into the GIS Project Life Cycle Integrating Quality Assurance into the GIS Project Life Cycle Abstract GIS databases are an ever-evolving entity. From their humble beginnings as paper maps, through the digital conversion process, to

More information

A Basic Introduction to Missing Data

A Basic Introduction to Missing Data John Fox Sociology 740 Winter 2014 Outline Why Missing Data Arise Why Missing Data Arise Global or unit non-response. In a survey, certain respondents may be unreachable or may refuse to participate. Item

More information

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots Correlational Research Stephen E. Brock, Ph.D., NCSP California State University, Sacramento 1 Correlational Research A quantitative methodology used to determine whether, and to what degree, a relationship

More information

GAO NURSING HOMES. CMS Needs Milestones and Timelines to Ensure Goals for the Five-Star Quality Rating System Are Met

GAO NURSING HOMES. CMS Needs Milestones and Timelines to Ensure Goals for the Five-Star Quality Rating System Are Met GAO United States Government Accountability Office Report to Congressional Committees March 2012 NURSING HOMES CMS Needs Milestones and Timelines to Ensure Goals for the Five-Star Quality Rating System

More information

Factors affecting online sales

Factors affecting online sales Factors affecting online sales Table of contents Summary... 1 Research questions... 1 The dataset... 2 Descriptive statistics: The exploratory stage... 3 Confidence intervals... 4 Hypothesis tests... 4

More information

Analyst HEALTH AND HEALTH CARE IN SAN JOAQUIN COUNTY REGIONAL

Analyst HEALTH AND HEALTH CARE IN SAN JOAQUIN COUNTY REGIONAL SPRING 2016 HEALTH AND HEALTH CARE IN SAN JOAQUIN COUNTY San Joaquin County Health Care s Rapid Growth Creates Critical Shortages in Key Occupations. Health care has been changing rapidly in the United

More information

Are Lottery Players Affected by Winning History? Evidence from China s Individual Lottery Betting Panel Data

Are Lottery Players Affected by Winning History? Evidence from China s Individual Lottery Betting Panel Data Are Lottery Players Affected by Winning History? Evidence from China s Individual Lottery Betting Panel Data Jia Yuan University of Macau September 2011 Abstract I explore a unique individual level lottery

More information

Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish

Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish Statistics Statistics are quantitative methods of describing, analysing, and drawing inferences (conclusions)

More information

Module 5: Multiple Regression Analysis

Module 5: Multiple Regression Analysis Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College

More information

Technical Supplement. The cost of health insurance administration in California: Insurer, physician, and hospital estimates

Technical Supplement. The cost of health insurance administration in California: Insurer, physician, and hospital estimates Editor s Note: This online data supplement contains supplemental material that was not included with the published article by James G. Kahn and colleagues, The Cost of Health Insurance Administration in

More information

A Study to Predict No Show Probability for a Scheduled Appointment at Free Health Clinic

A Study to Predict No Show Probability for a Scheduled Appointment at Free Health Clinic A Study to Predict No Show Probability for a Scheduled Appointment at Free Health Clinic Report prepared for Brandon Slama Department of Health Management and Informatics University of Missouri, Columbia

More information

MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group

MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group MISSING DATA TECHNIQUES WITH SAS IDRE Statistical Consulting Group ROAD MAP FOR TODAY To discuss: 1. Commonly used techniques for handling missing data, focusing on multiple imputation 2. Issues that could

More information

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance 2 Making Connections: The Two-Sample t-test, Regression, and ANOVA In theory, there s no difference between theory and practice. In practice, there is. Yogi Berra 1 Statistics courses often teach the two-sample

More information

Crosstabulation & Chi Square

Crosstabulation & Chi Square Crosstabulation & Chi Square Robert S Michael Chi-square as an Index of Association After examining the distribution of each of the variables, the researcher s next task is to look for relationships among

More information

http://www.cms.hhs.gov/therapyservices/

http://www.cms.hhs.gov/therapyservices/ The Centers for Medicare & Medicaid Services (CMS) Web site address referenced on page 1 and page 4 of the following report has changed. The new address is: http://www.cms.hhs.gov/therapyservices/ ~~~~

More information

Multiple Regression: What Is It?

Multiple Regression: What Is It? Multiple Regression Multiple Regression: What Is It? Multiple regression is a collection of techniques in which there are multiple predictors of varying kinds and a single outcome We are interested in

More information

Dutch Accreditation Council (RvA) Policy rule Nonconformities. Corrective action

Dutch Accreditation Council (RvA) Policy rule Nonconformities. Corrective action Dutch Accreditation Council (RvA) Policy rule Nonconformities and Corrective action Document code: RvA-BR004-UK Version 2, 23-1-2015 RvA policy guidelines describe the RvA rules and the policy on specific

More information

Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS

Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS About Omega Statistics Private practice consultancy based in Southern California, Medical and Clinical

More information

The Effect of Housing on Portfolio Choice. July 2009

The Effect of Housing on Portfolio Choice. July 2009 The Effect of Housing on Portfolio Choice Raj Chetty Harvard Univ. Adam Szeidl UC-Berkeley July 2009 Introduction How does homeownership affect financial portfolios? Linkages between housing and financial

More information

PROPOSED CUTS TO MEDICARE HOME HEALTH PAYMENTS ARE UNSUPPORTED AND DISPROPORTIONATE AS COMPARED TO OTHER MEDICARE PROVIDERS

PROPOSED CUTS TO MEDICARE HOME HEALTH PAYMENTS ARE UNSUPPORTED AND DISPROPORTIONATE AS COMPARED TO OTHER MEDICARE PROVIDERS PROPOSED CUTS TO MEDICARE HOME HEALTH PAYMENTS ARE UNSUPPORTED AND DISPROPORTIONATE AS COMPARED TO OTHER MEDICARE PROVIDERS MEDPAC s HOME HEALTH MARGIN ESTIMATES ARE UNRELIABLE HOME HEALTH MAKES UP ONLY

More information

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1) Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the

More information

Rates for Vehicle Loans: Race and Loan Source

Rates for Vehicle Loans: Race and Loan Source Rates for Vehicle Loans: Race and Loan Source Kerwin Kofi Charles Harris School University of Chicago 1155 East 60th Street Chicago, IL 60637 Voice: (773) 834-8922 Fax: (773) 702-0926 e-mail: kcharles@gmail.com

More information

Practical. I conometrics. data collection, analysis, and application. Christiana E. Hilmer. Michael J. Hilmer San Diego State University

Practical. I conometrics. data collection, analysis, and application. Christiana E. Hilmer. Michael J. Hilmer San Diego State University Practical I conometrics data collection, analysis, and application Christiana E. Hilmer Michael J. Hilmer San Diego State University Mi Table of Contents PART ONE THE BASICS 1 Chapter 1 An Introduction

More information

How Sensitive are Low Income Families to Health Plan Prices?

How Sensitive are Low Income Families to Health Plan Prices? American Economic Review: Papers & Proceedings 100 (May 2010): 292 296 http://www.aeaweb.org/articles.php?doi=10.1257/aer.100.2.292 The Massachusetts Health Insurance Experiment: Early Experiences How

More information

Why are White Nursing Home Residents Twice as Likely as African-Americans to Have an

Why are White Nursing Home Residents Twice as Likely as African-Americans to Have an Why are White Nursing Home Residents Twice as Likely as African-Americans to Have an Advance Directive? Understanding Ethnic Differences in Advance Care Planning Jennifer L. Troyer, Ph.D. Department of

More information

Economic Impact of the Queen of Peace Hospital and Related Health Sectors of Scott County

Economic Impact of the Queen of Peace Hospital and Related Health Sectors of Scott County Economic Impact of the Queen of Peace Hospital and Related Health Sectors of Scott County March 17, 2011 Minnesota Department of Health- Office of Rural Health and Primary Care The health care sector is

More information

Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions

Chi Squared and Fisher's Exact Tests. Observed vs Expected Distributions BMS 617 Statistical Techniques for the Biomedical Sciences Lecture 11: Chi-Squared and Fisher's Exact Tests Chi Squared and Fisher's Exact Tests This lecture presents two similarly structured tests, Chi-squared

More information

Final Exam Practice Problem Answers

Final Exam Practice Problem Answers Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal

More information

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics. Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

More information

A Probabilistic Model for Measuring Stock Returns and the Returns from Playing Lottery Tickets: The Basis of an Instructional Activity

A Probabilistic Model for Measuring Stock Returns and the Returns from Playing Lottery Tickets: The Basis of an Instructional Activity JOURNAL OF ECONOMICS AND FINANCE EDUCATION Volume 7 Number 2 Winter 28 4 A Probabilistic Model for Measuring Stock Returns and the Returns from Playing Lottery Tickets: The Basis of an Instructional Activity

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Final Exam Review MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) A researcher for an airline interviews all of the passengers on five randomly

More information

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing Chapter 8 Hypothesis Testing 1 Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing 8-3 Testing a Claim About a Proportion 8-5 Testing a Claim About a Mean: s Not Known 8-6 Testing

More information

Generalized Linear Models

Generalized Linear Models Generalized Linear Models We have previously worked with regression models where the response variable is quantitative and normally distributed. Now we turn our attention to two types of models where the

More information

Gender Effects in the Alaska Juvenile Justice System

Gender Effects in the Alaska Juvenile Justice System Gender Effects in the Alaska Juvenile Justice System Report to the Justice and Statistics Research Association by André Rosay Justice Center University of Alaska Anchorage JC 0306.05 October 2003 Gender

More information

The size and structure

The size and structure The size and structure of the adult social care sector and workforce in England, 2015 September 2015 Acknowledgements We are grateful to many people who have contributed to this report. Particular thanks

More information

Gerry Hobbs, Department of Statistics, West Virginia University

Gerry Hobbs, Department of Statistics, West Virginia University Decision Trees as a Predictive Modeling Method Gerry Hobbs, Department of Statistics, West Virginia University Abstract Predictive modeling has become an important area of interest in tasks such as credit

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

Teaching Business Statistics through Problem Solving

Teaching Business Statistics through Problem Solving Teaching Business Statistics through Problem Solving David M. Levine, Baruch College, CUNY with David F. Stephan, Two Bridges Instructional Technology CONTACT: davidlevine@davidlevinestatistics.com Typical

More information

Econometrics Simple Linear Regression

Econometrics Simple Linear Regression Econometrics Simple Linear Regression Burcu Eke UC3M Linear equations with one variable Recall what a linear equation is: y = b 0 + b 1 x is a linear equation with one variable, or equivalently, a straight

More information

Chapter 15. Mixed Models. 15.1 Overview. A flexible approach to correlated data.

Chapter 15. Mixed Models. 15.1 Overview. A flexible approach to correlated data. Chapter 15 Mixed Models A flexible approach to correlated data. 15.1 Overview Correlated data arise frequently in statistical analyses. This may be due to grouping of subjects, e.g., students within classrooms,

More information

Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1

Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Calculate counts, means, and standard deviations Produce

More information

The Dummy s Guide to Data Analysis Using SPSS

The Dummy s Guide to Data Analysis Using SPSS The Dummy s Guide to Data Analysis Using SPSS Mathematics 57 Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved TABLE OF CONTENTS PAGE Helpful Hints for All Tests...1 Tests

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

The correlation coefficient

The correlation coefficient The correlation coefficient Clinical Biostatistics The correlation coefficient Martin Bland Correlation coefficients are used to measure the of the relationship or association between two quantitative

More information

Standard Deviation Estimator

Standard Deviation Estimator CSS.com Chapter 905 Standard Deviation Estimator Introduction Even though it is not of primary interest, an estimate of the standard deviation (SD) is needed when calculating the power or sample size of

More information

Malpractice paid losses and financial performance of nursing homes

Malpractice paid losses and financial performance of nursing homes Copyeditor: Mary Grace Trillana October December 2010 1 Malpractice paid losses and financial performance of nursing homes Mei Zhao D. Rob Haley Reid M. Oetjen Henry J. Carretta Background: Florida s nursing

More information

SPSS Explore procedure

SPSS Explore procedure SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stem-and-leaf plots and extensive descriptive statistics. To run the Explore procedure,

More information

Supporting Online Material for

Supporting Online Material for www.sciencemag.org/cgi/content/full/319/5862/414/dc1 Supporting Online Material for Application of Bloom s Taxonomy Debunks the MCAT Myth Alex Y. Zheng, Janessa K. Lawhorn, Thomas Lumley, Scott Freeman*

More information

Validity, Fairness, and Testing

Validity, Fairness, and Testing Validity, Fairness, and Testing Michael Kane Educational Testing Service Conference on Conversations on Validity Around the World Teachers College, New York March 2012 Unpublished Work Copyright 2010 by

More information