ABSTRACT INTRODUCTION STUDY DESCRIPTION
|
|
|
- Lambert Barker
- 10 years ago
- Views:
Transcription
1 ABSTRACT Paper Validating Self-Reported Survey Measures Using SAS Sarah A. Lyons MS, Kimberly A. Kaphingst ScD, Melody S. Goodman PhD Washington University School of Medicine Researchers often rely on self-reported responses for survey research studies. The accuracy of this self-reported data is often unknown, particularly in a safety-net medical setting that serves a patient population with varying levels of health literacy. We recruited participants from the waiting room of a St. Louis primary care safety net clinic to participate in a survey investigating the relationship between health environments, health behaviors and health outcomes. The survey included questions regarding personal and family history of chronic disease (diabetes, heart disease, and cancer) and self-reported height and weight (used to calculate Body Mass Index). We subsequently accessed the participant s electronic medical record (EMR) and collected physician-reported data on the same variables. We calculated concordance rates between participant answers and information gathered from EMRs using McNemar s test. Logistic regression was then performed to determine the demographic predictors of concordance. Six hundred sixty two patients completed surveys as part of the study; 69% female, 55% African American, 17% with less than high school level education, 74% annual household income less than $20,000, and 27% uninsured. Preliminary findings suggest a 75-95% concordance rate between self-reported and medical record data across outcomes, with the exception of family history of cancer (70%) and heart disease (61%). Determining the validity of self-reported data influences whether self-reported personal and family history of disease and BMI are appropriate proxies for medical record extraction in this patient population. INTRODUCTION Researchers often rely on self-reported participant responses for personal and family history of disease when conducting survey research. While a full medical examination is seen as the gold standard, this is rarely feasible. Pulling the desired information from the participant s electronic medical record (EMR) is viewed as the best approximation but it is often too impractical to conduct a medical record review for every study participant in large survey studies. These reviews can be a drain to resources, both with regards to time and manpower. Given the newness of EMRs in primary care settings, they are limited by the amount of information available; with varying dates of implementation, varying levels of physician and medical staff use, and varying amounts of historical archives. The degree to which self-reported participant responses are concordant with the information in their EMR is an important area of investigation. Concordance varies between populations and between health conditions 1. Health literacy can influence reporting knowledge of medical diagnoses and family disease history, as there might be lower recognition of medical terms and outcomes that are related to specific conditions 2. SAS is an excellent tool to analyze the concordance between the two methods of reporting: self-report determined through survey questions and physician-report determined through the EMR. The display of the 2x2 table allows for visualization of the degree (magnitude) and direction of the agreement. The SAS PROC FREQ and PROC LOGISTIC procedures are built in with various statistical tests for determining the significance of the agreement as well. In order for self-reported responses to be valid in this patient population, we want high agreement between selfand physician-reported data and non-significant p-values when testing for the differences between reporting groups. STUDY DESCRIPTION Participants were recruited from the waiting room of a safety net primary care clinic that serves a low-income population living in the St. Louis metro area. Data from 662 patients are used in this analysis. The majority of this population was female (69%), African American (55%) and had an annual household income of less than $20,000 (74%). The written survey contained 120 questions designed to capture each participant s health environments (using both written and visual measures) and health information-seeking behaviors as well as collecting data on health outcomes. Health literacy was assessed using the Rapid Assessment of Adult Literacy in Medicine-Revised 3,4. After survey completion, each participant s EMR was accessed by trained research assistants; data on personal and family history of disease was extracted. 1
2 METHODS We examine the degree of agreement between participants self-reports of disease diagnosis and family disease history with physician/medical staff-reported information in their EMR. In order to protect each participant s privacy, survey answers and medical records were linked using a unique numeric identifier and were de-identified of any personal identifiers. It is imperative that the survey variables and the medical record variables are coded consistently through a standardized process. In this dataset, Never received a diagnosis was coded as 0 and Have received diagnosis was coded as 1. BMI was calculated from height and weight recorded in the EMR. Self-perception of weight was determined from the survey; participants were asked if they considered themselves to be average weight, overweight or obese. Responses from the survey and the EMR were dichotomized into an outcome variable with 0 being average or underweight and 1 categorizing those who were overweight or obese. To calculate agreement between the two methods of reporting, the PROC FREQ procedure was utilized. This procedure allows for various methods of determining concordance and its significance. The PROC LOGISTIC procedure was then used to determine if there were any demographic factors that predict concordance rates. Demographic data collected from the survey included gender, age, education, marital status, race/ethnicity, urban status, health literacy, income, insurance status and familiarity with the concept of a family health history. Age was divided into four categories; <25, 26-35, 36-49, and over 50. Education was categorized into Below High School Education, High School Degree or Equivalent and Above High School. Marital status was dichotomized into Married and Not Married. Race was classified as Non-Hispanic Black, Non-Hispanic White and Other. Urban status was categorized as Urban for those living in zip codes within St. Louis City, Suburban for those living in zip codes in St. Louis County and Rural for those living in surrounding parts of Missouri. Health literacy was approximated using the participant s REALM-R score; a score of 6 or less indicates limited health literacy. Income was dichotomized as an annual household income of less than $20,000 and a household income of greater than or equal to $20,000. Insurance status was classified as being uninsured, having public insurance (Medicaid or Medicare) or having private insurance. On the survey, participants were asked if they had ever heard of a family health history before the survey and the response options were Yes or No. PROC FREQ CONCORDANCE RATES Survey Report of Heart Disease Electronic Medical Record Report of Heart Disease No Yes Total No Yes Total Agreement was calculated by adding the concordant percentages; total percent of those who had self- and physician-reported diagnosis of the disease and those who had self- and physician-reported diagnosis of not having the disease (No-No s and Yes-Yes s) as highlighted in Figure 1. For heart disease, agreement is those who have both self-reported history and EMRreported history of having heart disease (12.78%) and then add those who have no reported history of heart disease, both in the survey and on the EMR (70.79%) to get a total agreement of 83.57% Figure 1. 2x2 Frequency Table PROC FREQ Output To determine the agreement between the survey and EMR, we use McNemar s test instead of a more general Chi-square test to account for the paired responses. McNemar s test takes into account that the two measures (selfreport and medical record) are correlated; they are measuring the same participant s history of disease. The null hypothesis for McNemar s test is that there is no difference in the two correlated proportions. We want to fail to reject the null hypothesis (obtain a p-value > than 0.05) as this will give evidence that self-report of personal and family history of disease is an adequate proxy for medical record extraction in this patient population. This test can be called from both the tables statement or the exact statement with the mcnem option. In this case, either method will produce the same results. These statements will also produce the Kappa coefficient which checks for inter-rater reliability. The automatically generated plots were suppressed using the plots=none option in the tables statement. These plots would be useful to visualize the direction of agreement. 2
3 The PROC FREQ procedure was called using the following code to determine agreement between self- and physician-reported diagnoses of cancer: proc freq data = total; tables quest_34*pers_heartdisease / agree plots=none; run; The output displays the two-by-two table shown in Figure 1. From there, agreement can be calculated. The output also provides a table detailing the results of McNemar s Test (Table 1). The p-value is significant, indicating that there Statistic (S) DF 1 Pr > S Kappa Table 1. McNemar s Test and Simple Kappa Coefficient is discordance between what the participant reports on their survey and what is recorded in the EMR for personal history of heart disease. McNemar s test p-value doesn t indicate the direction of the discord, just the degree of significance. To further investigate, we must look at the two-by-two table in Figure 1. We can see that, of those that disagree, more reported heart disease on the survey than were reported in the participant s EMR (n=57; 12%). The Kappa coefficient is , indicating Moderate agreement, as defined by Landis and Koch 5. More encouraging results were found when investigating the agreement in personal history of diabetes (Figure 2). Substantial agreement (Kappa=0.8840) was found as well as an insignificant p-value (p=0.2568) from McNemar s test (Table 2). The percent concordant was 94.44% (57.54%+36.90%). In this population, the self-reported diabetes survey question is an adequate proxy for disease diagnosis as stated in the EMR. Survey Report of Diabetes Electronic Medical Record Report of Diabetes No Yes Total No Yes Total Figure 2. 2x2 Frequency Table PROC FREQ Output Statistic (S) DF 1 Pr > S Kappa Table 2. McNemar s Test and Simple Kappa Coefficient PROC LOGISTIC PREDICTION To understand what drives concordance rates, logistic regression was performed using an individual s concordance as a dichotomous outcome variable and their demographic factors as predictors. An if/else statement was used to create the dichotomous outcome variable (fdm_indic). If the survey and the EMR agreed for the outcome of interest (both 0, or both 1 ), then the indicator was coded as 1 for agreement. Otherwise, the indicator was coded as 0 for disagreement. if f_diabetes =. then fdm_indic=.; else if quest_13a =. then fdm_indic=.; else if f_diabetes = 1 and quest_13b = 1 then fdm_indic = 1; else if f_diabetes = 0 and quest_13b = 0 then fdm_indic = 1; else fdm_indic = 0; Using the PROC FREQ procedure with the chisq option in the tables statement we examine if there were differences in concordance rates by demographic factors. Results from bivariate analyses are used to determine the demographic predictors that will be tested in the development of a prediction model for concordance between EMR and self-reported responses. Predictors with a p-value <0.1 in bivariate analyses were included in logistic regression analysis. 3
4 Prediction models were created for concordance between self-report and EMR for all outcomes that had significant bivariate associations (BMI, personal history of cancer, and family history of diabetes, heart disease and cancer). The following code creates a model for concordance of family history of diabetes. proc logistic data=total; class fdm_indic (ref= 0 ) gender insured income urban / param=ref; model fdm_indic = gender insured income urban / selection = stepwise; run; The selection = stepwise option allows SAS to build the most significant model using an automated process. This model will include only significant predictors above a certain threshold; the default value is This can be changed by using the slentry or slstay options in the model statement which specify the significance for a variable to be entered into the model and the significance for it to stay in the model, respectively. Odds ratios are also produced that will allow for determining demographic subgroups that are most likely to report personal disease diagnosis and family disease history in agreement with their medical record. It will also determine groups for which there are high levels of discordance and for whom targeted interventions should be developed. RESULTS N % Self- Report % Physician- Reported % Agreement McNemar p-value Kappa Personal History Diabetes Heart Disease < Cancer < Family History Diabetes < Heart Disease < Cancer < BMI Obese < Table 3. Reported Prevalence and Agreement between Self-Report and EMR In every case (personal and familial), the prevalence of survey-reported disease is higher than that found in the EMR. These differences are significant in all cases except for personal history of diabetes (Table 3). We can conclude that participant s self-report of diabetes is a reliable way to ascertain a true personal history of the disease (when using the EMR as a gold standard) in this population. Despite the statistically significant rates of discordance there is a fair amount of agreement in all categories, with all personal diagnoses above 80% and all family history above 60%; family history of heart disease has the lowest percent agreement (61.23%). Predictor X 2 P-Value Personal History Diabetes None Heart Disease None Cancer Education BMI Obese Gender <0.01 Education Health Literacy Insurance Status Table 4a. Bivariate Associations Between Concordance Rates and Demographic Predictors 4
5 Family History Predictor X 2 P-Value Diabetes Gender Urban Status Income Status Insurance Status Heart Disease Health Literacy Cancer Urban Status Insurance Status Health Literacy Table 4b. Bivariate Associations Between Concordance Rates and Demographic Predictors No bivariate associations (p<0.1) were found for personal history of diabetes and heart disease (results not shown). Significant associations (p<0.1) were found with education, gender, urban status, income, insurance status and health literacy (Tables 4a and 4b). Three models were built, that predicted concordance of BMI and self-perception of weight, family history of cancer, and family history of diabetes. Models were not built for personal history of diabetes and cancer because there were no significant predictors. Results are not shown for personal and family history of heart disease models because no predictors met the entry threshold (slentry = 0.05) to be entered in the model. Models Predictors Odds Ratio (95% Wald Wald X 2 P-Value Confidence Limits) Model 1. Family History of Gender Diabetes Concordance (Female vs. Male) 0.43 (0.20, 0.94) Urban (Rural vs. Urban) 0.24 (0.09, 0.67) (Suburban vs. Urban) 0.61 (0.31, 1.18) Income (< $20,000 vs. $20,000) 0.27 (0.12, 0.80) Insurance (None vs. Public) 2.68 (1.21,5.95) (Private vs. Public) 2.09 (0.78, 5.56) Model 2. Family History of Cancer Insurance Concordance (None vs. Public) 2.04 (1.10, 3.74) (Private vs. Public) 1.75 (0.80, 3.8) Model 3. BMI and Self-Perception Gender of Weight Concordance (Female vs. Male) 2.09 (1.28, 3.40) 8.69 <0.001 Insurance (None vs. Public) 0.87 (0.49, 1.54) (Private vs. Public) 0.36 (0.20, 0.66) Table 5. Odds Ratios from Logistic Regression Models As shown in Table 5, Model 1 has the most predictors, with gender, urban status, income and insurance being the significant predictors (p<0.05). Being female, living in rural or suburban Missouri and having low income were associated with higher odds of discordance in family history of diabetes. Having no insurance was associated with higher odds of concordance in family history of diabetes, when compared against having some form of public insurance. Model 2 only had one significant predictor. In Model 2, having no insurance was associated with higher odds of concordance in family history of cancer, when compared to only public insurance. In Model 3, there were two predictors. Being female was associated with higher odds of concordance between BMI and self-perception of 5
6 weight. Having private insurance was associated with lower odds of concordance, when compared to public insurance. CONCLUSIONS The methods described here can be used to validate any self-reported survey response against a gold standard. This analysis should be performed on any pilot data to aide in the decision to continue with time-consuming medical record or chart reviews for clinical outcomes. It is important to note a major limitation of this study is the use of EMR as the gold standard for reporting. The results are subject to the validity of the EMR for reporting personal and family disease history. There is a high likelihood that both personal and family history is under-reported on this population s EMRs. Personal history of diabetes does not seem to be under-reported and perhaps is a focus of collection and reporting for this clinic. Further research with a larger sample size should be done to determine what is driving the discordance of the other diseases. This will allow for the identification of certain demographic groups that are more vulnerable to discordance in reporting. Identifying groups that are at high-risk for discordance will allow for targeted interventions that will aim to increase a patient s understanding of personal and family history as well as their importance in prevention and disease management. Providers will also be able to identify which groups should be focused on in terms of collecting and recording the information for their EMR. Despite the statistically significant differences in self- and physician-reported data, we see high (>80%) rates of agreement among personal disease diagnosis and fair (>60%) amount of agreement in the reporting of family disease history. REFERENCES 1. Bush TL, Miller SR, Golden AL, Hale WE. November Self-report and medical record report agreement of selected medical conditions in the elderly. Am J Public Health. 79: Smith B, Chu LK, Smith TC, Amoroso PJ, Boyko EJ, Hooper TI, Gackstetter GD, Ryan MA. June Challenges of self-reported medical conditions and electronic medical records among members of a large military cohort. BMC Med Res Methodol. 8: Davis WL, Demaio TJ Comparing The Think Aloud Interviewing Technique With Standard Interviewing In The Redesign Of A Dietary Recall Questionnaire. Proceedings of the Section on Survey Research Methods, American Statistical Association Davis TC, Crouch MA, Long SW, et al Rapid Assessment of Literacy Level of Adult Primary Care Patients. Fam Med. 23(6): Landis JR, Koch GG The measurement of observer agreement for categorical data. Biometrics. 33: ACKNOWLEDGEMENTS The project and work of Sarah Lyons is supported by funding from the Barnes Jewish Hospital Foundation, Siteman Cancer Center and Washington University School of Medicine. The work of Dr. Goodman is supported by the Barnes- Jewish Hospital Foundation, Siteman Cancer Center, National Institutes of Health, National Cancer Institute grant U54CA153460, Washington University School of Medicine (WUSM) and WUSM Faculty Diversity Scholars Program. Dr. Kaphingst is supported by funding from the Barnes Jewish Hospital Foundation, Siteman Cancer Center and Washington University School of Medicine. CONTACT INFORMATION Name: Sarah Lyons Organization: Division of Public Health Sciences, Washington University School of Medicine in St. Louis 660 S Euclid Ave, Campus Box 8100 St. Louis, MO, Work Phone: [email protected] SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. indicates USA registration. Other brand and product names are trademarks of their respective companies. 6
Abbas S. Tavakoli, DrPH, MPH, ME 1 ; Nikki R. Wooten, PhD, LISW-CP 2,3, Jordan Brittingham, MSPH 4
1 Paper 1680-2016 Using GENMOD to Analyze Correlated Data on Military System Beneficiaries Receiving Inpatient Behavioral Care in South Carolina Care Systems Abbas S. Tavakoli, DrPH, MPH, ME 1 ; Nikki
PROC LOGISTIC: Traps for the unwary Peter L. Flom, Independent statistical consultant, New York, NY
PROC LOGISTIC: Traps for the unwary Peter L. Flom, Independent statistical consultant, New York, NY ABSTRACT Keywords: Logistic. INTRODUCTION This paper covers some gotchas in SAS R PROC LOGISTIC. A gotcha
ln(p/(1-p)) = α +β*age35plus, where p is the probability or odds of drinking
Dummy Coding for Dummies Kathryn Martin, Maternal, Child and Adolescent Health Program, California Department of Public Health ABSTRACT There are a number of ways to incorporate categorical variables into
Access Provided by your local institution at 02/06/13 5:22PM GMT
Access Provided by your local institution at 02/06/13 5:22PM GMT brief communication Reducing Disparities in Access to Primary Care and Patient Satisfaction with Care: The Role of Health Centers Leiyu
11. Analysis of Case-control Studies Logistic Regression
Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:
Developing Risk Adjustment Techniques Using the SAS@ System for Assessing Health Care Quality in the lmsystem@
Developing Risk Adjustment Techniques Using the SAS@ System for Assessing Health Care Quality in the lmsystem@ Yanchun Xu, Andrius Kubilius Joint Commission on Accreditation of Healthcare Organizations,
An Example of SAS Application in Public Health Research --- Predicting Smoking Behavior in Changqiao District, Shanghai, China
An Example of SAS Application in Public Health Research --- Predicting Smoking Behavior in Changqiao District, Shanghai, China Ding Ding, San Diego State University, San Diego, CA ABSTRACT Finding predictors
Disparities Between Asthma Management and Insurance Type Among Children
o r i g i n a l c o m m u n i c a t i o n Disparities Between Asthma Management and Insurance Type Among Children Crystal N. Piper, MPH, MHA, PhD; Keith Elder, PhD; Saundra Glover, PhD; Jong-Deuk Baek,
NHS Diabetes Prevention Programme (NHS DPP) Non-diabetic hyperglycaemia. Produced by: National Cardiovascular Intelligence Network (NCVIN)
NHS Diabetes Prevention Programme (NHS DPP) Non-diabetic hyperglycaemia Produced by: National Cardiovascular Intelligence Network (NCVIN) Date: August 2015 About Public Health England Public Health England
SUGI 29 Statistics and Data Analysis
Paper 194-29 Head of the CLASS: Impress your colleagues with a superior understanding of the CLASS statement in PROC LOGISTIC Michelle L. Pritchard and David J. Pasta Ovation Research Group, San Francisco,
Using An Ordered Logistic Regression Model with SAS Vartanian: SW 541
Using An Ordered Logistic Regression Model with SAS Vartanian: SW 541 libname in1 >c:\=; Data first; Set in1.extract; A=1; PROC LOGIST OUTEST=DD MAXITER=100 ORDER=DATA; OUTPUT OUT=CC XBETA=XB P=PROB; MODEL
Diabetes Prevention in Latinos
Diabetes Prevention in Latinos Matthew O Brien, MD, MSc Assistant Professor of Medicine and Public Health Northwestern Feinberg School of Medicine Institute for Public Health and Medicine October 17, 2013
An Analysis of the Health Insurance Coverage of Young Adults
Gius, International Journal of Applied Economics, 7(1), March 2010, 1-17 1 An Analysis of the Health Insurance Coverage of Young Adults Mark P. Gius Quinnipiac University Abstract The purpose of the present
EXPANDING THE EVIDENCE BASE IN OUTCOMES RESEARCH: USING LINKED ELECTRONIC MEDICAL RECORDS (EMR) AND CLAIMS DATA
EXPANDING THE EVIDENCE BASE IN OUTCOMES RESEARCH: USING LINKED ELECTRONIC MEDICAL RECORDS (EMR) AND CLAIMS DATA A CASE STUDY EXAMINING RISK FACTORS AND COSTS OF UNCONTROLLED HYPERTENSION ISPOR 2013 WORKSHOP
Methods for Interaction Detection in Predictive Modeling Using SAS Doug Thompson, PhD, Blue Cross Blue Shield of IL, NM, OK & TX, Chicago, IL
Paper SA01-2012 Methods for Interaction Detection in Predictive Modeling Using SAS Doug Thompson, PhD, Blue Cross Blue Shield of IL, NM, OK & TX, Chicago, IL ABSTRACT Analysts typically consider combinations
Beginning Tutorials. PROC FREQ: It s More Than Counts Richard Severino, The Queen s Medical Center, Honolulu, HI OVERVIEW.
Paper 69-25 PROC FREQ: It s More Than Counts Richard Severino, The Queen s Medical Center, Honolulu, HI ABSTRACT The FREQ procedure can be used for more than just obtaining a simple frequency distribution
How to set the main menu of STATA to default factory settings standards
University of Pretoria Data analysis for evaluation studies Examples in STATA version 11 List of data sets b1.dta (To be created by students in class) fp1.xls (To be provided to students) fp1.txt (To be
Predicting Customer Churn in the Telecommunications Industry An Application of Survival Analysis Modeling Using SAS
Paper 114-27 Predicting Customer in the Telecommunications Industry An Application of Survival Analysis Modeling Using SAS Junxiang Lu, Ph.D. Sprint Communications Company Overland Park, Kansas ABSTRACT
A Population Based Risk Algorithm for the Development of Type 2 Diabetes: in the United States
A Population Based Risk Algorithm for the Development of Type 2 Diabetes: Validation of the Diabetes Population Risk Tool (DPoRT) in the United States Christopher Tait PhD Student Canadian Society for
Binary Logistic Regression
Binary Logistic Regression Main Effects Model Logistic regression will accept quantitative, binary or categorical predictors and will code the latter two in various ways. Here s a simple model including
Does Financial Sophistication Matter in Retirement Preparedness of U.S Households? Evidence from the 2010 Survey of Consumer Finances
Does Financial Sophistication Matter in Retirement Preparedness of U.S Households? Evidence from the 2010 Survey of Consumer Finances Kyoung Tae Kim, The Ohio State University 1 Sherman D. Hanna, The Ohio
Students' Opinion about Universities: The Faculty of Economics and Political Science (Case Study)
Cairo University Faculty of Economics and Political Science Statistics Department English Section Students' Opinion about Universities: The Faculty of Economics and Political Science (Case Study) Prepared
Study Design and Statistical Analysis
Study Design and Statistical Analysis Anny H Xiang, PhD Department of Preventive Medicine University of Southern California Outline Designing Clinical Research Studies Statistical Data Analysis Designing
CHILDHOOD CANCER SURVIVOR STUDY Analysis Concept Proposal
CHILDHOOD CANCER SURVIVOR STUDY Analysis Concept Proposal 1. STUDY TITLE: Longitudinal Assessment of Chronic Health Conditions: The Aging of Childhood Cancer Survivors 2. WORKING GROUP AND INVESTIGATORS:
Finding Supporters. Political Predictive Analytics Using Logistic Regression. Multivariate Solutions
Finding Supporters Political Predictive Analytics Using Logistic Regression Multivariate Solutions What is Logistic Regression? In a political application, logistic regression can describe the outcome
VI. Introduction to Logistic Regression
VI. Introduction to Logistic Regression We turn our attention now to the topic of modeling a categorical outcome as a function of (possibly) several factors. The framework of generalized linear models
Medicare Advantage National Senior Survey 600 Senior Registered Voters in the Medicare Advantage Program February 24-28, 2015
Medicare Advantage National Senior Survey 600 Senior Registered Voters in the Medicare Advantage Program February 24-28, 2015 1. In what year were you born? 1. Before 1950 (CONTINUE TO QUESTION 2) 100
Rural Disparities in posthospitalization. after traumatic brain injury.
Rural Disparities in posthospitalization rehabilitation after traumatic brain injury. Ashley D Meagher MD, Jennifer Doorey MS, Christopher Beadles MD PhD, Anthony Charles MD MPH University of North Carolina
Correlates of not receiving HIV care among HIV-infected women enrolling in a HRSA SPNS multi-site initiative
Correlates of not receiving HIV care among HIV-infected women enrolling in a HRSA SPNS multi-site initiative Oni J. Blackstock, MD, MHS Assistant Professor of Medicine Division of General Internal Medicine
Basic Statistical and Modeling Procedures Using SAS
Basic Statistical and Modeling Procedures Using SAS One-Sample Tests The statistical procedures illustrated in this handout use two datasets. The first, Pulse, has information collected in a classroom
Racial Disparities and Barrier to Statin Utilization in Patients with Diabetes in the U.S. School of Pharmacy Virginia Commonwealth University
Racial Disparities and Barrier to Statin Utilization in Patients with Diabetes in the U.S. School of Pharmacy Virginia Commonwealth University Outline Background Motivation Objectives Study design Results
Sue Flocke, PhD Eileen L. Seeholzer, MD MS Heidi Gullett, MD MPH
Sue Flocke, PhD Eileen L. Seeholzer, MD MS Heidi Gullett, MD MPH Brigid Jackson, MA Samantha Smith, MA Elizabeth Antognoli, PhD Sue Krejci, MBA Peter J. Lawson, MA MPH MBA Practice-based Research Network
Racial Disparities in US Healthcare
Racial Disparities in US Healthcare Paul H. Johnson, Jr. Ph.D. Candidate University of Wisconsin Madison School of Business Research partially funded by the National Institute of Mental Health: Ruth L.
Analysis of Survey Data Using the SAS SURVEY Procedures: A Primer
Analysis of Survey Data Using the SAS SURVEY Procedures: A Primer Patricia A. Berglund, Institute for Social Research - University of Michigan Wisconsin and Illinois SAS User s Group June 25, 2014 1 Overview
Data Mining: An Overview of Methods and Technologies for Increasing Profits in Direct Marketing. C. Olivia Rud, VP, Fleet Bank
Data Mining: An Overview of Methods and Technologies for Increasing Profits in Direct Marketing C. Olivia Rud, VP, Fleet Bank ABSTRACT Data Mining is a new term for the common practice of searching through
EARLY VS. LATE ENROLLERS: DOES ENROLLMENT PROCRASTINATION AFFECT ACADEMIC SUCCESS? 2007-08
EARLY VS. LATE ENROLLERS: DOES ENROLLMENT PROCRASTINATION AFFECT ACADEMIC SUCCESS? 2007-08 PURPOSE Matthew Wetstein, Alyssa Nguyen & Brianna Hays The purpose of the present study was to identify specific
Data Driven Approaches to Prescription Medication Outcomes Analysis Using EMR
Data Driven Approaches to Prescription Medication Outcomes Analysis Using EMR Nathan Manwaring University of Utah Masters Project Presentation April 2012 Equation Consulting Who we are Equation Consulting
Easily Identify Your Best Customers
IBM SPSS Statistics Easily Identify Your Best Customers Use IBM SPSS predictive analytics software to gain insight from your customer database Contents: 1 Introduction 2 Exploring customer data Where do
Evaluating Mode Effects in the Medicare CAHPS Fee-For-Service Survey
Evaluating Mode Effects in the Medicare Fee-For-Service Survey Norma Pugh, MS, Vincent Iannacchione, MS, Trang Lance, MPH, Linda Dimitropoulos, PhD RTI International, Research Triangle Park, NC 27709 Key
Unit 12 Logistic Regression Supplementary Chapter 14 in IPS On CD (Chap 16, 5th ed.)
Unit 12 Logistic Regression Supplementary Chapter 14 in IPS On CD (Chap 16, 5th ed.) Logistic regression generalizes methods for 2-way tables Adds capability studying several predictors, but Limited to
Segmentation For Insurance Payments Michael Sherlock, Transcontinental Direct, Warminster, PA
Segmentation For Insurance Payments Michael Sherlock, Transcontinental Direct, Warminster, PA ABSTRACT An online insurance agency has built a base of names that responded to different offers from various
The National Survey of Children s Health 2011-2012 The Child
The National Survey of Children s 11-12 The Child The National Survey of Children s measures children s health status, their health care, and their activities in and outside of school. Taken together,
Modeling Lifetime Value in the Insurance Industry
Modeling Lifetime Value in the Insurance Industry C. Olivia Parr Rud, Executive Vice President, Data Square, LLC ABSTRACT Acquisition modeling for direct mail insurance has the unique challenge of targeting
Logistic (RLOGIST) Example #3
Logistic (RLOGIST) Example #3 SUDAAN Statements and Results Illustrated PREDMARG (predicted marginal proportion) CONDMARG (conditional marginal proportion) PRED_EFF pairwise comparison COND_EFF pairwise
Facts about Diabetes in Massachusetts
Facts about Diabetes in Massachusetts Diabetes is a disease in which the body does not produce or properly use insulin (a hormone used to convert sugar, starches, and other food into the energy needed
A Property & Casualty Insurance Predictive Modeling Process in SAS
Paper AA-02-2015 A Property & Casualty Insurance Predictive Modeling Process in SAS 1.0 ABSTRACT Mei Najim, Sedgwick Claim Management Services, Chicago, Illinois Predictive analytics has been developing
Guido s Guide to PROC FREQ A Tutorial for Beginners Using the SAS System Joseph J. Guido, University of Rochester Medical Center, Rochester, NY
Guido s Guide to PROC FREQ A Tutorial for Beginners Using the SAS System Joseph J. Guido, University of Rochester Medical Center, Rochester, NY ABSTRACT PROC FREQ is an essential procedure within BASE
A LOGISTIC REGRESSION MODEL TO PREDICT FRESHMEN ENROLLMENTS Vijayalakshmi Sampath, Andrew Flagel, Carolina Figueroa
A LOGISTIC REGRESSION MODEL TO PREDICT FRESHMEN ENROLLMENTS Vijayalakshmi Sampath, Andrew Flagel, Carolina Figueroa ABSTRACT Predictive modeling is the technique of using historical information on a certain
Categorical Data Analysis
Richard L. Scheaffer University of Florida The reference material and many examples for this section are based on Chapter 8, Analyzing Association Between Categorical Variables, from Statistical Methods
Health Insurance Coverage: Estimates from the National Health Interview Survey, 2004
Health Insurance Coverage: Estimates from the National Health Interview Survey, 2004 by Robin A. Cohen, Ph.D., and Michael E. Martinez, M.P.H., Division of Health Interview Statistics, National Center
Cool Tools for PROC LOGISTIC
Cool Tools for PROC LOGISTIC Paul D. Allison Statistical Horizons LLC and the University of Pennsylvania March 2013 www.statisticalhorizons.com 1 New Features in LOGISTIC ODDSRATIO statement EFFECTPLOT
THE CORRELATION BETWEEN PHYSICAL HEALTH AND MENTAL HEALTH
HENK SWINKELS (STATISTICS NETHERLANDS) BRUCE JONAS (US NATIONAL CENTER FOR HEALTH STATISTICS) JAAP VAN DEN BERG (STATISTICS NETHERLANDS) THE CORRELATION BETWEEN PHYSICAL HEALTH AND MENTAL HEALTH IN THE
A Study to Predict No Show Probability for a Scheduled Appointment at Free Health Clinic
A Study to Predict No Show Probability for a Scheduled Appointment at Free Health Clinic Report prepared for Brandon Slama Department of Health Management and Informatics University of Missouri, Columbia
New York Study of Booster Seat Effects on Injury Reduction Compared to Safety Belts in Children Aged 4-8 in Motor Vehicle Crashes
New York Study of Booster Seat Effects on Injury Reduction Compared to Safety Belts in Children Aged 4-8 in Motor Vehicle Crashes Kainan Sun, Ph.D., Michael Bauer, M.S. Sarah M. Sperry, M.S., Susan Hardman
ABSTRACT INTRODUCTION
Paper SP03-2009 Illustrative Logistic Regression Examples using PROC LOGISTIC: New Features in SAS/STAT 9.2 Robert G. Downer, Grand Valley State University, Allendale, MI Patrick J. Richardson, Van Andel
Application of Information Systems and Secondary Data. Lynda Burton, ScD Johns Hopkins University
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
Sample Size Planning, Calculation, and Justification
Sample Size Planning, Calculation, and Justification Theresa A Scott, MS Vanderbilt University Department of Biostatistics [email protected] http://biostat.mc.vanderbilt.edu/theresascott Theresa
FACILITATOR/MENTOR GUIDE
FACILITATOR/MENTOR GUIDE Descriptive analysis variables table shells hypotheses Measures of association methods design justify analytic assess calculate analysis problem stratify confounding statistical
Task 7: Study of the Uninsured and Underinsured
75 Washington Avenue, Suite 206 Portland, Maine04101 Phone: 207-767-6440 Email: [email protected] www.marketdecisions.com Task 7: Study of the Uninsured and Underinsured Vermont Office of Health
An Electronic Health Record Alert for Hepatitis C Screening of Baby Boomers in Primary Care: A Cluster Randomized Controlled Trial
An Electronic Health Record Alert for Hepatitis C Screening of Baby Boomers in Primary Care: A Cluster Randomized Controlled Trial K Krauskopf, N Kil, A Sofianou, W Toribio, J Lyons, M Singer, J Kannry,
White Paper. Medicare Part D Improves the Economic Well-Being of Low Income Seniors
White Paper Medicare Part D Improves the Economic Well-Being of Low Income Seniors Kathleen Foley, PhD Barbara H. Johnson, MA February 2012 Table of Contents Executive Summary....................... 1
Paper D10 2009. Ranking Predictors in Logistic Regression. Doug Thompson, Assurant Health, Milwaukee, WI
Paper D10 2009 Ranking Predictors in Logistic Regression Doug Thompson, Assurant Health, Milwaukee, WI ABSTRACT There is little consensus on how best to rank predictors in logistic regression. This paper
Analyzing Research Data Using Excel
Analyzing Research Data Using Excel Fraser Health Authority, 2012 The Fraser Health Authority ( FH ) authorizes the use, reproduction and/or modification of this publication for purposes other than commercial
Improved Interaction Interpretation: Application of the EFFECTPLOT statement and other useful features in PROC LOGISTIC
Paper AA08-2013 Improved Interaction Interpretation: Application of the EFFECTPLOT statement and other useful features in PROC LOGISTIC Robert G. Downer, Grand Valley State University, Allendale, MI ABSTRACT
PharmaSUG2011 Paper HS03
PharmaSUG2011 Paper HS03 Using SAS Predictive Modeling to Investigate the Asthma s Patient Future Hospitalization Risk Yehia H. Khalil, University of Louisville, Louisville, KY, US ABSTRACT The focus of
Disparities in Realized Access: Patterns of Health Services Utilization by Insurance Status among Children with Asthma in Puerto Rico
Disparities in Realized Access: Patterns of Health Services Utilization by Insurance Status among Children with Asthma in Puerto Rico Ruth Ríos-Motta, PhD, José A. Capriles-Quirós, MD, MPH, MHSA, Mario
Depression. Definition: Respondents who were told by a doctor, nurse, or health professional that they had some form of depression.
DEPRESSION Definition: Respondents who were told by a doctor, nurse, or health professional that they had some form of depression. Prevalence of o South Dakota 15% o Nationwide median 18% Healthy People
The Role of Insurance in Providing Access to Cardiac Care in Maryland. Samuel L. Brown, Ph.D. University of Baltimore College of Public Affairs
The Role of Insurance in Providing Access to Cardiac Care in Maryland Samuel L. Brown, Ph.D. University of Baltimore College of Public Affairs Heart Disease Heart Disease is the leading cause of death
Profiles and Data Analysis. 5.1 Introduction
Profiles and Data Analysis PROFILES AND DATA ANALYSIS 5.1 Introduction The survey of consumers numbering 617, spread across the three geographical areas, of the state of Kerala, who have given information
THE IMPACT OF USING BLOOD SUGAR HOME MONITORING DEVICE TO CONTROL BLOOD SUGAR LEVEL IN DIABETIC PATIENTS
THE IMPACT OF USING BLOOD SUGAR HOME MONITORING DEVICE TO CONTROL BLOOD SUGAR LEVEL IN DIABETIC PATIENTS Alshammari S., *Al-Jameel N., Al-Johani H, Al-Qahtani A., Al-Hakbani A., Khan A. and Alfaraj S.
Health Status, Health Insurance, and Medical Services Utilization: 2010 Household Economic Studies
Health Status, Health Insurance, and Medical Services Utilization: 2010 Household Economic Studies Current Population Reports By Brett O Hara and Kyle Caswell Issued July 2013 P70-133RV INTRODUCTION The
Basic Statistics and Data Analysis for Health Researchers from Foreign Countries
Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Volkert Siersma [email protected] The Research Unit for General Practice in Copenhagen Dias 1 Content Quantifying association
United 2020: Measuring Impact
United 2020: Measuring Impact Health The Institute for Urban Policy Research At The University of Texas at Dallas Kristine Lykens, PhD United 2020: Measuring Impact Health Overview In the Dallas area,
