Influence of missing data on analysis and power calculation in bioequivalence studies
|
|
|
- Felix Ross
- 9 years ago
- Views:
Transcription
1 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 1/33 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel Ulm University in cooperation with Boehringer Ingelheim Pharma GmbH & Co. KG May 4 th, 2011
2 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 2/33 Outline 1 Motivation 2 Model 3 Simulation 4 SAS Procedures and evaluation 5 Conclusion
3 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 3/33 Outline 1 Motivation 2 Model 3 Simulation 4 SAS Procedures and evaluation 5 Conclusion
4 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 4/33 Bioequivalence studies (BE studies) Test if test- and reference formulations differ marginally with respect to pharmacokinetic (PK) characteristics Endpoints: PK-parameters Primary: AUC and C max Problem concerning analysis and power calculation: At how many missing values should further subjects be recruited?
5 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 5/33 Typical design of BE studies: 2x2-Crossover Test- (T) and reference (R) formulation are given in two periods N subjetcs are allocated randomly evenly to one of the following sequences: RT and TR Treatment according to sequence and period
6 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 6/33 Outline 1 Motivation 2 Model 3 Simulation 4 SAS Procedures and evaluation 5 Conclusion
7 Model (on log-scale) Y ijk = S ik + P j + F jk + C k + e ijk Y ijk S ik P j F jk C k e ijk logarithm of response (PK parameter) measured on subject i in sequence k in period j, i = 1,...,N, j=1, 2, k = 1, 2 random i-th subject effect in sequence k, N(0,τ 2 )i.i.d. fixed effect in period j fixed treatment effect in the k-th sequence in period j fixed effect in the k-th sequence residual (random) error associated with the i-th subject in sequence k in period j, N(0,σ 2 )i.i.d. Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 7/33
8 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 8/33 Why random subject effect? EMA: All effects should be considered fixed Subjects with one missing value excluded from analysis If applicable, further subjects have to be recruited! FDA: All avaible data should be included in the analysis Properties of REML can be used (e.g. in SAS with PROC MIXED) When should further subjects be recruited?
9 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th,2011 9/33 Model assumptions Number of subjects: N = 24 Y ijk = S ik + P j + F jk + C k + e ijk S ik : subject S ik N(0,τ 2 )withτ 2 = 0.25 P j : period P 1 =0, P 2 = 0.2 F jk : treatment F 11 = F 22 =0, F 12 = F 21 := F 2 = 0.1 C k : sequence C 1 = C 2 = 0 e ijk : residual error e ijk N(0,σ 2 )withσ 2 = 0.06 Modeling of response per sequence and period Sequence Period 1 Period 2 RT Y i11 = S i1 + e i11 Y i21 = S i e i21 TR Y i12 = S i e i12 Y i22 = S i e i22
10 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 Outline 1 Motivation 2 Model 3 Simulation 4 SAS Procedures and evaluation 5 Conclusion
11 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 Randomly missing values With N = 24 subjects: Number of missing values: 1, 2, 3, 4, 8, 12, 16, 20 Three alternatives to distribute missing values: - for both treatments and in both periods (case I) - only in period 2 (case II) - only for treatment R (case III)
12 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 Pseudo code DO 1 TO 1000 END 1. Simulate complete dataset for 24 subjects 2. Analyse complete dataset with PROC MIXED 3. FOR i = 1, 2, 3, 4, 8, 12, 16, 20 DO END PROC SURVEYSELECT to simulate incomplete datasets with i missing values for case I - III 4. Analyse incomplete dataset with PROC MIXED
13 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 One missing value for both treatments (case I)
14 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 Two missing values for both treatments (case I)
15 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 Three or more missing values for both treatments (case I)
16 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 Missing values in period 2 (case II) or for treatment R (case III)
17 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 PROC SURVEYSELECT
18 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 Outline 1 Motivation 2 Model 3 Simulation 4 SAS Procedures and evaluation 5 Conclusion
19 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 Is there a difference between the formulations? Test if: the 90% confidence interval, which covers the treatment quotient µ T µ R, lies within the equivalence domain [0.80, 1.25] the 90% confidence interval, which covers the treatment difference µ T µ R, lies within the equivalence domain [ 0.223, 0.223] (log-scale)
20 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 PROC MIXED Theory: Mixed model Y = X β + Zγ + fix random
21 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 Value of interest: gcv Among others considered: gcv := 100 exp(mse) 1 derived from CV of a log-normally distributed random variable X CV := Var(X ) E(X )
22 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 Evaluation with PROC BOXPLOT σ 2 =0.06 gcv = 25, missing values for treatment R (case III)
23 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 Summary statistics with PROC MEANS σ 2 =0.06 gcv = 25, missing values for treatment R (case III) Missing values Number of simulations Min P25 Mean P75 Max
24 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 Summary statistics graphic σ 2 =0.06 gcv = 25, missing values for treatment R (case III)
25 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 Power Goal: The probability to reject the hypothesis that the formulations are different when there is no relevent difference should lie e.g. between 80% and90% This probability is called Power gcv is a decisive factor here
26 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 PROC POWER
27 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 Power graphic: Mean of simulations σ 2 =0.06 gcv = 25, missing values for treatment R (case III)
28 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 Power graphic: P75 of simulations σ 2 =0.06 gcv = 25, missing values for treatment R (case III)
29 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 Outline 1 Motivation 2 Model 3 Simulation 4 SAS Procedures and evaluation 5 Conclusion
30 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 When should further subjects be recruited? For N = 24 subjects and 1000 simulations each (per number of missing values and case): In general, no further subjects have to be recruited Based on the observations in the 75% percentile: - at 10 to 12 missing values further subjects should be recruited - especially if the values are missing mainly for both treatments and in both periods
31 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 Outlook: Further questions cohorts and other covariates Variability of the rest variance σ 2
32 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 References [1] SAS/Stat 9.2 User s Guide: The Mixed Procedure (Book Excerpt). SAS Publishing, 2008 [2] S.C. Chow, J.P. Liu. Design and Analysis of Bioavailability and Bioequivalence Studies. Marcel Dekker, Inc., New York, 1992 [3] S. Patterson, B. Jones. Bioequivalence and Statistics in Clinical Pharmacology. Chapman & Hall / CRC, Boca Raton, 2006 [4] G.A. Milliken, D.E. Johnson. Analysis of Messy Data Volume I: Designed Experiments. Wadsworth, Inc., Belmont, 1984
33 Influence of missing data on analysis and power calculation in bioequivalence studies Henrike Häbel May 4 th, /33 Thank you for your attention! Special thanks to Julia Habeck and Michaela Mattheus from Boehringer Ingelheim Pharma GmbH & Co. KG
Bioavailability / Bioequivalence
Selection of CROs Selection of a Reference Product Metrics (AUC, C max /t max, Shape of Profile) Acceptance Ranges (0.80 1.25 and beyond) Sample Size Planning (Literature References, Pilot Studies) Steps
How to get accurate sample size and power with nquery Advisor R
How to get accurate sample size and power with nquery Advisor R Brian Sullivan Statistical Solutions Ltd. ASA Meeting, Chicago, March 2007 Sample Size Two group t-test χ 2 -test Survival Analysis 2 2 Crossover
200628 - DAIC - Advanced Experimental Design in Clinical Research
Coordinating unit: Teaching unit: Academic year: Degree: ECTS credits: 2015 200 - FME - School of Mathematics and Statistics 1004 - UB - (ENG)Universitat de Barcelona MASTER'S DEGREE IN STATISTICS AND
Statistics and Pharmacokinetics in Clinical Pharmacology Studies
Paper ST03 Statistics and Pharmacokinetics in Clinical Pharmacology Studies ABSTRACT Amy Newlands, GlaxoSmithKline, Greenford UK The aim of this presentation is to show how we use statistics and pharmacokinetics
THE LOG TRANSFORMATION IS SPECIAL
STATISTICS IN MEDICINE, VOL. 4,8-89 (995) THE LOG TRANSFORMATION IS SPECIAL OLIVER N. KEENE Department of Medical Statistics, GIaxo Research and Development Ltd., Greenford Road, Greenford, Middlesex.
BIOAVAILABILITY & BIOEQUIVALENCE TRIALS
BIOAVAILABILITY & BIOEQUIVALENCE TRIALS Shubha Rani,, Ph.D. Technical Director & Head-Biometrics and Data Management Synchron Research Services Pvt. Ltd. Ahmedabad 380 054 [email protected]
Statistics 104: Section 6!
Page 1 Statistics 104: Section 6! TF: Deirdre (say: Dear-dra) Bloome Email: [email protected] Section Times Thursday 2pm-3pm in SC 109, Thursday 5pm-6pm in SC 705 Office Hours: Thursday 6pm-7pm SC
A MULTIVARIATE TEST FOR SIMILARITY OF TWO DISSOLUTION PROFILES
Journal of Biopharmaceutical Statistics, 15: 265 278, 2005 Copyright Taylor & Francis, Inc. ISSN: 1054-3406 print/1520-5711 online DOI: 10.1081/BIP-200049832 A MULTIVARIATE TEST FOR SIMILARITY OF TWO DISSOLUTION
A LOGNORMAL MODEL FOR INSURANCE CLAIMS DATA
REVSTAT Statistical Journal Volume 4, Number 2, June 2006, 131 142 A LOGNORMAL MODEL FOR INSURANCE CLAIMS DATA Authors: Daiane Aparecida Zuanetti Departamento de Estatística, Universidade Federal de São
General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.
General Method: Difference of Means 1. Calculate x 1, x 2, SE 1, SE 2. 2. Combined SE = SE1 2 + SE2 2. ASSUMES INDEPENDENT SAMPLES. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n
Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)
Chapter 45 Two-Sample T-Tests Allowing Unequal Variance (Enter Difference) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when no assumption
Dongfeng Li. Autumn 2010
Autumn 2010 Chapter Contents Some statistics background; ; Comparing means and proportions; variance. Students should master the basic concepts, descriptive statistics measures and graphs, basic hypothesis
Randomization Based Confidence Intervals For Cross Over and Replicate Designs and for the Analysis of Covariance
Randomization Based Confidence Intervals For Cross Over and Replicate Designs and for the Analysis of Covariance Winston Richards Schering-Plough Research Institute JSM, Aug, 2002 Abstract Randomization
A Multivariate Test For Similarity of Two Dissolution Profiles
A Multivariate Test For Similarity of Two Dissolution Profiles * H. Saranadasa and K. Krishnamoorthy Ortho McNeil Pharmaceutical, Inc., Raritan, New Jersey, USA University of Louisiana at Lafayette, Lafayette,
Introduction to General and Generalized Linear Models
Introduction to General and Generalized Linear Models General Linear Models - part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby
Generating Randomization Schedules Using SAS Programming Chunqin Deng and Julia Graz, PPD, Inc., Research Triangle Park, North Carolina
Paper 267-27 Generating Randomization Schedules Using SAS Programming Chunqin Deng and Julia Graz, PPD, Inc., Research Triangle Park, North Carolina ABSTRACT Randomization as a method of experimental control
Two-Sample T-Tests Assuming Equal Variance (Enter Means)
Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of
Simple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
Lecture 8. Confidence intervals and the central limit theorem
Lecture 8. Confidence intervals and the central limit theorem Mathematical Statistics and Discrete Mathematics November 25th, 2015 1 / 15 Central limit theorem Let X 1, X 2,... X n be a random sample of
Joint Exam 1/P Sample Exam 1
Joint Exam 1/P Sample Exam 1 Take this practice exam under strict exam conditions: Set a timer for 3 hours; Do not stop the timer for restroom breaks; Do not look at your notes. If you believe a question
Clinical Study Synopsis for Public Disclosure
abcd Clinical Study for Public Disclosure This clinical study synopsis is provided in line with s Policy on Transparency and Publication of Clinical Study Data. The synopsis - which is part of the clinical
How To Check For Differences In The One Way Anova
MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way
Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)
Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume
The Variability of P-Values. Summary
The Variability of P-Values Dennis D. Boos Department of Statistics North Carolina State University Raleigh, NC 27695-8203 [email protected] August 15, 2009 NC State Statistics Departement Tech Report
Least Squares Estimation
Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David
Final Exam Practice Problem Answers
Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal
Math 151. Rumbos Spring 2014 1. Solutions to Assignment #22
Math 151. Rumbos Spring 2014 1 Solutions to Assignment #22 1. An experiment consists of rolling a die 81 times and computing the average of the numbers on the top face of the die. Estimate the probability
SAS Software to Fit the Generalized Linear Model
SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling
Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011
Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Name: Section: I pledge my honor that I have not violated the Honor Code Signature: This exam has 34 pages. You have 3 hours to complete this
COMPLETE MARKETS DO NOT ALLOW FREE CASH FLOW STREAMS
COMPLETE MARKETS DO NOT ALLOW FREE CASH FLOW STREAMS NICOLE BÄUERLE AND STEFANIE GRETHER Abstract. In this short note we prove a conjecture posed in Cui et al. 2012): Dynamic mean-variance problems in
START Selected Topics in Assurance
START Selected Topics in Assurance Related Technologies Table of Contents Introduction Some Statistical Background Fitting a Normal Using the Anderson Darling GoF Test Fitting a Weibull Using the Anderson
3.4 Statistical inference for 2 populations based on two samples
3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted
Module 2 Probability and Statistics
Module 2 Probability and Statistics BASIC CONCEPTS Multiple Choice Identify the choice that best completes the statement or answers the question. 1. The standard deviation of a standard normal distribution
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in
Overview of Dissolution for BA/BE
Biopharmaceutics Classification System based on Solubility/Permeability Biowaivers for BCS I Drugs Discussion of BCS III Drugs Models establishing in vivo-in vitro Correlations (IVIVC Levels A-C) 1 Biopharmaceutics
GLMs: Gompertz s Law. GLMs in R. Gompertz s famous graduation formula is. or log µ x is linear in age, x,
Computing: an indispensable tool or an insurmountable hurdle? Iain Currie Heriot Watt University, Scotland ATRC, University College Dublin July 2006 Plan of talk General remarks The professional syllabus
HYPOTHESIS TESTING: POWER OF THE TEST
HYPOTHESIS TESTING: POWER OF THE TEST The first 6 steps of the 9-step test of hypothesis are called "the test". These steps are not dependent on the observed data values. When planning a research project,
UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST
UNDERSTANDING The independent-samples t test evaluates the difference between the means of two independent or unrelated groups. That is, we evaluate whether the means for two independent groups are significantly
Chapter 7 Section 1 Homework Set A
Chapter 7 Section 1 Homework Set A 7.15 Finding the critical value t *. What critical value t * from Table D (use software, go to the web and type t distribution applet) should be used to calculate the
PROPERTIES OF THE SAMPLE CORRELATION OF THE BIVARIATE LOGNORMAL DISTRIBUTION
PROPERTIES OF THE SAMPLE CORRELATION OF THE BIVARIATE LOGNORMAL DISTRIBUTION Chin-Diew Lai, Department of Statistics, Massey University, New Zealand John C W Rayner, School of Mathematics and Applied Statistics,
Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:
Chapter 7 Notes - Inference for Single Samples You know already for a large sample, you can invoke the CLT so: X N(µ, ). Also for a large sample, you can replace an unknown σ by s. You know how to do a
Generating Random Numbers Variance Reduction Quasi-Monte Carlo. Simulation Methods. Leonid Kogan. MIT, Sloan. 15.450, Fall 2010
Simulation Methods Leonid Kogan MIT, Sloan 15.450, Fall 2010 c Leonid Kogan ( MIT, Sloan ) Simulation Methods 15.450, Fall 2010 1 / 35 Outline 1 Generating Random Numbers 2 Variance Reduction 3 Quasi-Monte
I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN
Beckman HLM Reading Group: Questions, Answers and Examples Carolyn J. Anderson Department of Educational Psychology I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN Linear Algebra Slide 1 of
Geostatistics Exploratory Analysis
Instituto Superior de Estatística e Gestão de Informação Universidade Nova de Lisboa Master of Science in Geospatial Technologies Geostatistics Exploratory Analysis Carlos Alberto Felgueiras [email protected]
Predict the Popularity of YouTube Videos Using Early View Data
000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts
A Model of Optimum Tariff in Vehicle Fleet Insurance
A Model of Optimum Tariff in Vehicle Fleet Insurance. Bouhetala and F.Belhia and R.Salmi Statistics and Probability Department Bp, 3, El-Alia, USTHB, Bab-Ezzouar, Alger Algeria. Summary: An approach about
1. Then f has a relative maximum at x = c if f(c) f(x) for all values of x in some
Section 3.1: First Derivative Test Definition. Let f be a function with domain D. 1. Then f has a relative maximum at x = c if f(c) f(x) for all values of x in some open interval containing c. The number
IEOR 6711: Stochastic Models I Fall 2012, Professor Whitt, Tuesday, September 11 Normal Approximations and the Central Limit Theorem
IEOR 6711: Stochastic Models I Fall 2012, Professor Whitt, Tuesday, September 11 Normal Approximations and the Central Limit Theorem Time on my hands: Coin tosses. Problem Formulation: Suppose that I have
2 Sample t-test (unequal sample sizes and unequal variances)
Variations of the t-test: Sample tail Sample t-test (unequal sample sizes and unequal variances) Like the last example, below we have ceramic sherd thickness measurements (in cm) of two samples representing
Handling missing data in Stata a whirlwind tour
Handling missing data in Stata a whirlwind tour 2012 Italian Stata Users Group Meeting Jonathan Bartlett www.missingdata.org.uk 20th September 2012 1/55 Outline The problem of missing data and a principled
Content Distribution Scheme for Efficient and Interactive Video Streaming Using Cloud
Content Distribution Scheme for Efficient and Interactive Video Streaming Using Cloud Pramod Kumar H N Post-Graduate Student (CSE), P.E.S College of Engineering, Mandya, India Abstract: Now days, more
A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution
A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 4: September
Applied Statistics. J. Blanchet and J. Wadsworth. Institute of Mathematics, Analysis, and Applications EPF Lausanne
Applied Statistics J. Blanchet and J. Wadsworth Institute of Mathematics, Analysis, and Applications EPF Lausanne An MSc Course for Applied Mathematicians, Fall 2012 Outline 1 Model Comparison 2 Model
Sensitivity Analysis in Multiple Imputation for Missing Data
Paper SAS270-2014 Sensitivity Analysis in Multiple Imputation for Missing Data Yang Yuan, SAS Institute Inc. ABSTRACT Multiple imputation, a popular strategy for dealing with missing values, usually assumes
東 海 大 學 資 訊 工 程 研 究 所 碩 士 論 文
東 海 大 學 資 訊 工 程 研 究 所 碩 士 論 文 指 導 教 授 楊 朝 棟 博 士 以 網 路 功 能 虛 擬 化 實 作 網 路 即 時 流 量 監 控 服 務 研 究 生 楊 曜 佑 中 華 民 國 一 零 四 年 五 月 摘 要 與 的 概 念 一 同 發 展 的, 是 指 利 用 虛 擬 化 的 技 術, 將 現 有 的 網 路 硬 體 設 備, 利 用 軟 體 來 取 代 其
Introduction to proc glm
Lab 7: Proc GLM and one-way ANOVA STT 422: Summer, 2004 Vince Melfi SAS has several procedures for analysis of variance models, including proc anova, proc glm, proc varcomp, and proc mixed. We mainly will
Clinical Study Synopsis
Clinical Study Synopsis This Clinical Study Synopsis is provided for patients and healthcare professionals to increase the transparency of Bayer's clinical research. This document is not intended to replace
Using PROC MIXED in Hierarchical Linear Models: Examples from two- and three-level school-effect analysis, and meta-analysis research
Using PROC MIXED in Hierarchical Linear Models: Examples from two- and three-level school-effect analysis, and meta-analysis research Sawako Suzuki, DePaul University, Chicago Ching-Fan Sheu, DePaul University,
Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab
Monte Carlo Simulation: IEOR E4703 Fall 2004 c 2004 by Martin Haugh Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab 1 Overview of Monte Carlo Simulation 1.1 Why use simulation?
Hypothesis Testing for Beginners
Hypothesis Testing for Beginners Michele Piffer LSE August, 2011 Michele Piffer (LSE) Hypothesis Testing for Beginners August, 2011 1 / 53 One year ago a friend asked me to put down some easy-to-read notes
4. Continuous Random Variables, the Pareto and Normal Distributions
4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random
1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96
1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years
Analysis of Variance. MINITAB User s Guide 2 3-1
3 Analysis of Variance Analysis of Variance Overview, 3-2 One-Way Analysis of Variance, 3-5 Two-Way Analysis of Variance, 3-11 Analysis of Means, 3-13 Overview of Balanced ANOVA and GLM, 3-18 Balanced
The Relationship Between Rodent Offspring Blood Lead Levels and Maternal Diet
The Relationship Between Rodent Offspring Blood Lead Levels and Maternal Diet Allison Crawford, Xiahong Li, Mira Shapiro 1, Ruitao Zhang Introduction A study was undertaken to understand the effect of
E(y i ) = x T i β. yield of the refined product as a percentage of crude specific gravity vapour pressure ASTM 10% point ASTM end point in degrees F
Random and Mixed Effects Models (Ch. 10) Random effects models are very useful when the observations are sampled in a highly structured way. The basic idea is that the error associated with any linear,
University of Chicago Graduate School of Business. Business 41000: Business Statistics Solution Key
Name: OUTLINE SOLUTIONS University of Chicago Graduate School of Business Business 41000: Business Statistics Solution Key Special Notes: 1. This is a closed-book exam. You may use an 8 11 piece of paper
Statistical Data analysis With Excel For HSMG.632 students
1 Statistical Data analysis With Excel For HSMG.632 students Dialog Boxes Descriptive Statistics with Excel To find a single descriptive value of a data set such as mean, median, mode or the standard deviation,
Elements of statistics (MATH0487-1)
Elements of statistics (MATH0487-1) Prof. Dr. Dr. K. Van Steen University of Liège, Belgium December 10, 2012 Introduction to Statistics Basic Probability Revisited Sampling Exploratory Data Analysis -
Sample Size Calculation for Longitudinal Studies
Sample Size Calculation for Longitudinal Studies Phil Schumm Department of Health Studies University of Chicago August 23, 2004 (Supported by National Institute on Aging grant P01 AG18911-01A1) Introduction
Normal Distribution. Definition A continuous random variable has a normal distribution if its probability density. f ( y ) = 1.
Normal Distribution Definition A continuous random variable has a normal distribution if its probability density e -(y -µ Y ) 2 2 / 2 σ function can be written as for < y < as Y f ( y ) = 1 σ Y 2 π Notation:
2013 MBA Jump Start Program. Statistics Module Part 3
2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just
Unit 26 Estimation with Confidence Intervals
Unit 26 Estimation with Confidence Intervals Objectives: To see how confidence intervals are used to estimate a population proportion, a population mean, a difference in population proportions, or a difference
HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...
HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men
The sample space for a pair of die rolls is the set. The sample space for a random number between 0 and 1 is the interval [0, 1].
Probability Theory Probability Spaces and Events Consider a random experiment with several possible outcomes. For example, we might roll a pair of dice, flip a coin three times, or choose a random real
Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C
UNDERSTANDING THE DEPENDENT-SAMPLES t TEST
UNDERSTANDING THE DEPENDENT-SAMPLES t TEST A dependent-samples t test (a.k.a. matched or paired-samples, matched-pairs, samples, or subjects, simple repeated-measures or within-groups, or correlated groups)
xtmixed & denominator degrees of freedom: myth or magic
xtmixed & denominator degrees of freedom: myth or magic 2011 Chicago Stata Conference Phil Ender UCLA Statistical Consulting Group July 2011 Phil Ender xtmixed & denominator degrees of freedom: myth or
THE NUMBER OF GRAPHS AND A RANDOM GRAPH WITH A GIVEN DEGREE SEQUENCE. Alexander Barvinok
THE NUMBER OF GRAPHS AND A RANDOM GRAPH WITH A GIVEN DEGREE SEQUENCE Alexer Barvinok Papers are available at http://www.math.lsa.umich.edu/ barvinok/papers.html This is a joint work with J.A. Hartigan
Crossover - A search algorithm and GUI for cross-over designs. Kornelius Rohmeyer
Crossover - A search algorithm and GUI for cross-over designs Kornelius Rohmeyer July 29, 2015 Contents 1 Introduction 2 1.1 Installation.............................................. 2 1.2 Overview...............................................
Practice Problems for Homework #6. Normal distribution and Central Limit Theorem.
Practice Problems for Homework #6. Normal distribution and Central Limit Theorem. 1. Read Section 3.4.6 about the Normal distribution and Section 4.7 about the Central Limit Theorem. 2. Solve the practice
Notes on Applied Linear Regression
Notes on Applied Linear Regression Jamie DeCoster Department of Social Psychology Free University Amsterdam Van der Boechorststraat 1 1081 BT Amsterdam The Netherlands phone: +31 (0)20 444-8935 email:
Prediction for Multilevel Models
Prediction for Multilevel Models Sophia Rabe-Hesketh Graduate School of Education & Graduate Group in Biostatistics University of California, Berkeley Institute of Education, University of London Joint
Hypothesis testing - Steps
Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =
Challenges of Cloud Scale Natural Language Processing
Challenges of Cloud Scale Natural Language Processing Mark Dredze Johns Hopkins University My Interests? Information Expressed in Human Language Machine Learning Natural Language Processing Intelligent
HANDOUT #2 - TYPES OF STATISTICAL STUDIES
HANDOUT #2 - TYPES OF STATISTICAL STUDIES TOPICS 1. Ovservational vs Experimental Studies 2. Retrospective vs Prospective Studies 3. Sampling Principles: (a) Probability Sampling: SRS, Systematic, Stratified,
The Probit Link Function in Generalized Linear Models for Data Mining Applications
Journal of Modern Applied Statistical Methods Copyright 2013 JMASM, Inc. May 2013, Vol. 12, No. 1, 164-169 1538 9472/13/$95.00 The Probit Link Function in Generalized Linear Models for Data Mining Applications
Non-Inferiority Tests for Two Means using Differences
Chapter 450 on-inferiority Tests for Two Means using Differences Introduction This procedure computes power and sample size for non-inferiority tests in two-sample designs in which the outcome is a continuous
SOCIETY OF ACTUARIES/CASUALTY ACTUARIAL SOCIETY EXAM C CONSTRUCTION AND EVALUATION OF ACTUARIAL MODELS EXAM C SAMPLE QUESTIONS
SOCIETY OF ACTUARIES/CASUALTY ACTUARIAL SOCIETY EXAM C CONSTRUCTION AND EVALUATION OF ACTUARIAL MODELS EXAM C SAMPLE QUESTIONS Copyright 005 by the Society of Actuaries and the Casualty Actuarial Society
Exploratory Data Analysis
Exploratory Data Analysis Johannes Schauer [email protected] Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Introduction
PATTERN MIXTURE MODELS FOR MISSING DATA. Mike Kenward. London School of Hygiene and Tropical Medicine. Talk at the University of Turku,
PATTERN MIXTURE MODELS FOR MISSING DATA Mike Kenward London School of Hygiene and Tropical Medicine Talk at the University of Turku, April 10th 2012 1 / 90 CONTENTS 1 Examples 2 Modelling Incomplete Data
MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group
MISSING DATA TECHNIQUES WITH SAS IDRE Statistical Consulting Group ROAD MAP FOR TODAY To discuss: 1. Commonly used techniques for handling missing data, focusing on multiple imputation 2. Issues that could
SAS Syntax and Output for Data Manipulation:
Psyc 944 Example 5 page 1 Practice with Fixed and Random Effects of Time in Modeling Within-Person Change The models for this example come from Hoffman (in preparation) chapter 5. We will be examining
Lecture 11: Confidence intervals and model comparison for linear regression; analysis of variance
Lecture 11: Confidence intervals and model comparison for linear regression; analysis of variance 14 November 2007 1 Confidence intervals and hypothesis testing for linear regression Just as there was
Credit Risk Models: An Overview
Credit Risk Models: An Overview Paul Embrechts, Rüdiger Frey, Alexander McNeil ETH Zürich c 2003 (Embrechts, Frey, McNeil) A. Multivariate Models for Portfolio Credit Risk 1. Modelling Dependent Defaults:
F nest. Monte Carlo and Bootstrap using Stata. Financial Intermediation Network of European Studies
F nest Financial Intermediation Network of European Studies S U M M E R S C H O O L Monte Carlo and Bootstrap using Stata Dr. Giovanni Cerulli 8-10 October 2015 University of Rome III, Italy Lecturer Dr.
Multiple Linear Regression
Multiple Linear Regression A regression with two or more explanatory variables is called a multiple regression. Rather than modeling the mean response as a straight line, as in simple regression, it is
9.07 Introduction to Statistical Methods Homework 4. Name:
1. Estimating the population standard deviation and variance. Homework #2 contained a problem (#4) on estimating the population standard deviation. In that problem, you showed that the method of estimating
