How NOT to do a data analysis

Size: px
Start display at page:

Download "How NOT to do a data analysis"

Transcription

1 How NOT to do a data analysis The Physics case: neutrinoless double beta decay The experimental apparatus The reported results A critical review of the analysis The ROOT code A. Fontana and P. Pedroni INFN Pavia

2 It all starts from here

3 and from here

4 Neutrino mass absolute mass? Tritium decay m ve <2.3 ev Cosmology (0n) m i <~1 ev <m n > < ev Mass scale? m 2 m 1 2 m 2 2 m 3 2? Normal hierarchy m 3 > m 2 ~m 1 Inverted hierarchy m 2 ~m 1 >m 3 Degenerate m 1 m 2 m 3» m i -m j F. Piquemal (CENBG) Nuppec Bordeaux, November

5 Nature of the neutrino Dirac neutrino particle antiparticle 4 states n L,n R,n L, n R conservation of global leptonic number Majorana neutrino n=n 2 states n L and n R No conservation of global leptonic number Majorana neutrinos exist in most of GUT and supersymmetric models If n Majorana and CP violation in leptonic sector -> leptogenesis Majorana neutrinos imply global leptonic number violation F. Piquemal (CENBG) CS IN2P3 2005/03/05

6 Neutrinoless Double Beta decay n n n e e - h h ne h e - p h p n=n L=2 Majorana neutrino n L n R massive neutrino Other possible process: V+A, Majoron, SUSY (A,Z) (A,Z+1) (A,Z+2) Phase space factor Nuclear matrix element -1 5 T 1/2 = F(Q,Z) M 2 <m n > 2 effective neutrino Mass: <m n >= m 1 U e1 2 + m 2 U e2 2.e i1 + m 3 U e3 2.e i2 Uei : mixing matrix elements 1 et 2: Majorana phases F. Piquemal (CENBG) Nuppec Bordeaux, November

7 Double Beta Decay Signature 2nDBD: (A,Z) (A,Z+2) + 2e nDBD: (A,Z) (A,Z+2) + 2e - n e Allowed by SM new physics beyond the SM two electrons each with a continuous spectrum and a monochromatic sum energy 2 neutrinos Double Beta Decay continuous spectrum Neutrinoless Double Beta Decay peak enlarged by the detector energy resolution sum electron energy / Q

8 Double Beta Decay sources

9 0nDBD and neutrino physics How 0nDBD is connected to neutrino mixing matrix and masses? In case of dominant mass mechanism: neutrinoless Double Beta Decay rate Phase space Nuclear matrix elements Effective Majorana mass 1/t = G(Q,Z) M nucl 2 M 2 what the experimentalists try to measure what the nuclear theorists try to calculate parameter containing the physics M = U e1 2 M 1 + e i 1 U e2 2 M 2 + e i 2 U e3 2 M 3

10 Experimental parameters How experimental parameters are connected to the Majorana mass sensitivity of experiment? sensitivity F: lifetime corresponding to the minimum detectable number of events over background at a given confidence level b 0 b = 0 b: specific background coefficient [counts/(kev kg y)] source mass live time energy resolution F (MT / be) 1/2 F MT background level importance of the nuclide choice (but large uncertainty due to nuclear physics) sensitivity to m (F/Q M nucl 2 ) 1/2 Q 1/2 1 be M nucl MT 1/4

11 Experimental techniques e - e - Source Detector Easy to approach the ton scale High energy resolution (<2%) No tracking capability Easy to reject 2nDBD background e - detector source e - Source Detector detector Easy to get tracking capability Low energy resolution (>2%) Tracking / topology capability Easy to approach zero backround (with the exception of 2n DBD component)

12 Heidelberg Moscow Exp and the 0nDBD claim December 2001, 4 authors (KDHK) of HM collaboration claim the 0nDBD of 76 Ge Source = Detector Well known Ge diodes technology 5 Ge diodes with a total statistic of 10.9 kg - ( 86%) 76 Ge location: Underground Gran Sasso Laboratory (Italy) detectors shielded with lead and N 2 fluxed Reduction of Bkg with Pulse Shape Analysis (PSA) (factor 5) Ge nuclei Multi-site events identification (gamma bkg) Spectrum with 71.7 kg y

13 Heidelberg Moscow Exp and the 0nDBD claim most probable value: 28.7 in 71.7 kg y exposition KKDC claim: m ee = ev (0.44 ev b.v.) t 0n 1/2 (y) = ( ) y ( y b.v.) (99,9973 % c.l. 4.2 σ) H.V. Klapdor-Kleingrothaus et al. NIM. A 522(2004)371 Skepticism of scientific community Aalseth CE et al., Mod. Phys. Lett. A 17 (2002) 1475 Feruglio F et al., Nucl. Phys. B 637 (2002) 345 Zdezenko Yu G et al., Phys. Lett. B546(2002)206 Klapdor-Kleingrothaus HV hep-ph/ H.L. Harney, hep-ph/ Klapdor-Kleingrothaus HV et al., NIM A510(2003)281 Klapdor-Kleingrothaus et al., NIM A 522(2004)371 Comments and analysis HD-M data Independent answers of authors Other articles Not totally accepted result unrecognized peaks dimension of analyzed energy window

14 Experiments

15 GERDA 2013

16 GERDA 2013

17 GERDA 2013

18 Our Minuit fit

19 and the ROOT code

20 A critical review of the analysis «known contributions» from decays Number of contents at the maximum: SL Linear Background

21 Neutrinoless double beta decay Authors were smart : they did not put errors bars How many gaussians are there? Authors were smart : they did not perform an hypotehsis test. They just assumed that there was «something»

22 P(H 0 ) P(H 1 ) Hypothesis test in Physics 1-1- exp value power 22

23 Hypothesis test Level of the test to be estabilished before the measurement Observed significance level 0 (or SL) (p-value) Error of the first kind: is the incorrect rejection of a true null hypothesis. The probability of this error is In general =1% - 5% «The burden of proof for a major new discovery should be as high as the importance of the claim.» Higgs boson discovery 10-6

24 Neutrinoless double beta decay Null hypothesis: histogram generated only from a pure background (according to authors background has a linear decreasing shape) =?? (at least 10-3 ) «Standard» best fit (LS) 2 =91 /59 degrees of freedom: p-value Correct remarks of the authors: LS method should not be used in this case

25 a single random variable X Likelihood function x=(x 1, x 2,., x n ) are the values observed in n indepenent measures p(x;q): probability density of X dependent on a set of parameters q =(q 1, q 2,., q p ) to be experimentally determined n = 1 ; q) p( x2; q) p( xn; q) p( x i ; ) i= 1 L( q; x) = p( x q Maximum likelihood method(ml) The maximum likelihood estimate of is the global maximum point (if existing) of the likelihood function max (R.A. Fischer, 1912) n = i= 1 L( q; x) max p( x ; q) L(ˆ; q x) qˆ i = q

26 The matematical properties of the likelihood function that are of statistocal interest (maximum position) aree th same as his logarithm Loglikelihood function L = -ln( L( q, x)) = - n i= 1 Sign is inverted ln( p( x i ; q)) A maximum of L corresponds to a minimum of L dl dq k = n i= 1 1 p( x ; q) i p( xi; q) = qk 0 ( k = 1,2,, p) Likelihood equations

27 Likelihhood and histograms Number of expected events per channel: i ( θ) = N p( xi;θ) dx = p( x0i;θ) i Npi (θ) «Event by event» likelihood i Random variable = probability for a single event to fall inside the i-th histogram bin. This event has a probability p i (q) and happens n i times L e ni = -ln( L(θ, n)) = -ln[ pi (θ)] = - n k i= 1 k i= 1 i ln( p i (θ))

28 «Globale likelihood» (big samples) Random variabile = number of events n i inside the i-th histogram bin. (multinomial probability) L g = -ln = k i= 1 k i= 1 ln( Np i 2Np (θ)) 1 i 1 ( ni - Npi ( q )) exp - ( q ) 2 Npi ( q ) k i= 1 ( n i - Npi ( q )) Np ( q ) i 2 2 L g k i= 1 ( ni - i ( q )) n i 2 Only for «big» samples: n i > n i Np i - gaussian variable 2 test can be performed after minimization.

29 2 test can not be be performed after minimization. L e L g The LS approximation is not too bad.

30 Fisher method for multiple comaprisons Sometimes a possible effect can be present only within a part of the measured interval ; this interval is split in different (disjoint) zones and a test is performed in each zone. N=20 hypotheses to test (disjoint intervals) and a significance level = 5%. What is the probability of observing at least one «significant» result just due to pure chance? P{effect} = 1- P{no effect} = 1- (1-) Bonferroni rule: 1- (1-) N 1- (1 - N) = N N = 0.64 «Look Elsewhere effect» 1- (1- / N) N New p-value H 0 is rejected if at least a single test gives 0 < /N

31 Fisher method for multiple comparisons 0 (as all variables from cumulative distribution functions) - when H 0 is true - is a uniform random variabile U in [0,1] the random variable -2 ln(u) is distributed as 2 (n=2) P{y = -2ln u < x} = P{u e - x / 2 } = e - x / 2 F( y) = 1- e -x / 2 p( y) = df dx = 1 2 e -x / 2 2 = (2) In a multiple test over k disjoint intervals, with p-values p 1, p k under H 0 F k = -2ln i= 1 p i = 2 (2n)

32 H 0 = linear background, k=12 intervals (5 bins/interval) F = 46 ( n=24)

33 Bin content + 40%

34 «known contributions» fromi Number of contents at the decays maximum: SL points 20 parameters to be fitted (!) straight line+6 gaussians Without giving strong contraints to ALL parameters it is impossible to reproduce the published results

35 n i b Assuming background b to be well known n ( i - n i b %)

Status Report of the Gerda Phase II Startup

Status Report of the Gerda Phase II Startup Status Report of the Gerda Phase II Startup Valerio D Andrea on behalf of the Gerda Collaboration Gran Sasso Science Institute (INFN), L Aquila ICNPA 2016 18th International Conference on Neutrino Physics

More information

Theory versus Experiment. Prof. Jorgen D Hondt Vrije Universiteit Brussel jodhondt@vub.ac.be

Theory versus Experiment. Prof. Jorgen D Hondt Vrije Universiteit Brussel jodhondt@vub.ac.be Theory versus Experiment Prof. Jorgen D Hondt Vrije Universiteit Brussel jodhondt@vub.ac.be Theory versus Experiment Pag. 2 Dangerous cocktail!!! Pag. 3 The basics in these lectures Part1 : Theory meets

More information

From beta- decay to neutrino- less double beta decay: The quest for Majorana neutrinos

From beta- decay to neutrino- less double beta decay: The quest for Majorana neutrinos From beta- decay to neutrino- less double beta decay: EPS Historic Site: The Neutrino Experiment at Debrecen by J. Csikai and A. Szalay Debrecen, Hungary October 25, 2013 Stefan Schönert Physik-Department,

More information

CHI-SQUARE: TESTING FOR GOODNESS OF FIT

CHI-SQUARE: TESTING FOR GOODNESS OF FIT CHI-SQUARE: TESTING FOR GOODNESS OF FIT In the previous chapter we discussed procedures for fitting a hypothesized function to a set of experimental data points. Such procedures involve minimizing a quantity

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

How To Find The Higgs Boson

How To Find The Higgs Boson Dezső Horváth: Search for Higgs bosons Balaton Summer School, Balatongyörök, 07.07.2009 p. 1/25 Search for Higgs bosons Balaton Summer School, Balatongyörök, 07.07.2009 Dezső Horváth MTA KFKI Research

More information

Practice problems for Homework 11 - Point Estimation

Practice problems for Homework 11 - Point Estimation Practice problems for Homework 11 - Point Estimation 1. (10 marks) Suppose we want to select a random sample of size 5 from the current CS 3341 students. Which of the following strategies is the best:

More information

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( ) Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

More information

Maximum Likelihood Estimation

Maximum Likelihood Estimation Math 541: Statistical Theory II Lecturer: Songfeng Zheng Maximum Likelihood Estimation 1 Maximum Likelihood Estimation Maximum likelihood is a relatively simple method of constructing an estimator for

More information

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a

More information

Introduction to General and Generalized Linear Models

Introduction to General and Generalized Linear Models Introduction to General and Generalized Linear Models General Linear Models - part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby

More information

PROSPECT: Precision Reactor Oscillation and Spectrum experiment

PROSPECT: Precision Reactor Oscillation and Spectrum experiment PROSPECT: Precision Reactor Oscillation and Spectrum experiment DAVID MARTINEZ CAICEDO on behalf of PROSPECT collaboration ILLINOIS INSTITUTE OF TECHNOLOGY NUFACT 2015 AUGUST 14th 2015 1 Outline Motivations:

More information

Section 13, Part 1 ANOVA. Analysis Of Variance

Section 13, Part 1 ANOVA. Analysis Of Variance Section 13, Part 1 ANOVA Analysis Of Variance Course Overview So far in this course we ve covered: Descriptive statistics Summary statistics Tables and Graphs Probability Probability Rules Probability

More information

Basics of Statistical Machine Learning

Basics of Statistical Machine Learning CS761 Spring 2013 Advanced Machine Learning Basics of Statistical Machine Learning Lecturer: Xiaojin Zhu jerryzhu@cs.wisc.edu Modern machine learning is rooted in statistics. You will find many familiar

More information

Chapter 3 RANDOM VARIATE GENERATION

Chapter 3 RANDOM VARIATE GENERATION Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.

More information

Some Essential Statistics The Lure of Statistics

Some Essential Statistics The Lure of Statistics Some Essential Statistics The Lure of Statistics Data Mining Techniques, by M.J.A. Berry and G.S Linoff, 2004 Statistics vs. Data Mining..lie, damn lie, and statistics mining data to support preconceived

More information

Basic Nuclear Concepts

Basic Nuclear Concepts Section 7: In this section, we present a basic description of atomic nuclei, the stored energy contained within them, their occurrence and stability Basic Nuclear Concepts EARLY DISCOVERIES [see also Section

More information

MATH4427 Notebook 2 Spring 2016. 2 MATH4427 Notebook 2 3. 2.1 Definitions and Examples... 3. 2.2 Performance Measures for Estimators...

MATH4427 Notebook 2 Spring 2016. 2 MATH4427 Notebook 2 3. 2.1 Definitions and Examples... 3. 2.2 Performance Measures for Estimators... MATH4427 Notebook 2 Spring 2016 prepared by Professor Jenny Baglivo c Copyright 2009-2016 by Jenny A. Baglivo. All Rights Reserved. Contents 2 MATH4427 Notebook 2 3 2.1 Definitions and Examples...................................

More information

Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011

Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Name: Section: I pledge my honor that I have not violated the Honor Code Signature: This exam has 34 pages. You have 3 hours to complete this

More information

Statistical Testing of Randomness Masaryk University in Brno Faculty of Informatics

Statistical Testing of Randomness Masaryk University in Brno Faculty of Informatics Statistical Testing of Randomness Masaryk University in Brno Faculty of Informatics Jan Krhovják Basic Idea Behind the Statistical Tests Generated random sequences properties as sample drawn from uniform/rectangular

More information

Hypothesis testing - Steps

Hypothesis testing - Steps Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =

More information

THE STATISTICAL TREATMENT OF EXPERIMENTAL DATA 1

THE STATISTICAL TREATMENT OF EXPERIMENTAL DATA 1 THE STATISTICAL TREATMET OF EXPERIMETAL DATA Introduction The subject of statistical data analysis is regarded as crucial by most scientists, since error-free measurement is impossible in virtually all

More information

Nuclear Physics. Nuclear Physics comprises the study of:

Nuclear Physics. Nuclear Physics comprises the study of: Nuclear Physics Nuclear Physics comprises the study of: The general properties of nuclei The particles contained in the nucleus The interaction between these particles Radioactivity and nuclear reactions

More information

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

 Y. Notation and Equations for Regression Lecture 11/4. Notation: Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through

More information

Nuclear Physics Lab I: Geiger-Müller Counter and Nuclear Counting Statistics

Nuclear Physics Lab I: Geiger-Müller Counter and Nuclear Counting Statistics Nuclear Physics Lab I: Geiger-Müller Counter and Nuclear Counting Statistics PART I Geiger Tube: Optimal Operating Voltage and Resolving Time Objective: To become acquainted with the operation and characteristics

More information

A global analysis of neutrino masses, mixings and phases: entering the new era of leptonic CP violation searches

A global analysis of neutrino masses, mixings and phases: entering the new era of leptonic CP violation searches XCVIII Congresso Nazionale Napoli 17-21 Settembre 2012 A global analysis of neutrino masses, mixings and phases: entering the new era of leptonic CP violation searches Gian Luigi Fogli Dipartimento di

More information

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test Experimental Design Power and Sample Size Determination Bret Hanlon and Bret Larget Department of Statistics University of Wisconsin Madison November 3 8, 2011 To this point in the semester, we have largely

More information

How To Teach Physics At The Lhc

How To Teach Physics At The Lhc LHC discoveries and Particle Physics Concepts for Education Farid Ould- Saada, University of Oslo On behalf of IPPOG EPS- HEP, Vienna, 25.07.2015 A successful program LHC data are successfully deployed

More information

Chapter 4: Statistical Hypothesis Testing

Chapter 4: Statistical Hypothesis Testing Chapter 4: Statistical Hypothesis Testing Christophe Hurlin November 20, 2015 Christophe Hurlin () Advanced Econometrics - Master ESA November 20, 2015 1 / 225 Section 1 Introduction Christophe Hurlin

More information

Reliability and Risk Analysis. Analysis of Model Tasks from Selected Application Areas Using a Computer (software R, Excel)

Reliability and Risk Analysis. Analysis of Model Tasks from Selected Application Areas Using a Computer (software R, Excel) Reliability and Risk Analysis Analysis of Model Tasks from Selected Application Areas Using a Computer (software R, Excel) Task The aim of this study is to assess the wastewater treatment plant (WWTP)

More information

SOCIETY OF ACTUARIES/CASUALTY ACTUARIAL SOCIETY EXAM C CONSTRUCTION AND EVALUATION OF ACTUARIAL MODELS EXAM C SAMPLE QUESTIONS

SOCIETY OF ACTUARIES/CASUALTY ACTUARIAL SOCIETY EXAM C CONSTRUCTION AND EVALUATION OF ACTUARIAL MODELS EXAM C SAMPLE QUESTIONS SOCIETY OF ACTUARIES/CASUALTY ACTUARIAL SOCIETY EXAM C CONSTRUCTION AND EVALUATION OF ACTUARIAL MODELS EXAM C SAMPLE QUESTIONS Copyright 005 by the Society of Actuaries and the Casualty Actuarial Society

More information

How to compute Random acceleration, velocity, and displacement values from a breakpoint table.

How to compute Random acceleration, velocity, and displacement values from a breakpoint table. How to compute Random acceleration, velocity, and displacement values from a breakpoint table. A random spectrum is defined as a set of frequency and amplitude breakpoints, like these: 0.050 Acceleration

More information

Tests of Hypotheses Using Statistics

Tests of Hypotheses Using Statistics Tests of Hypotheses Using Statistics Adam Massey and Steven J. Miller Mathematics Department Brown University Providence, RI 0292 Abstract We present the various methods of hypothesis testing that one

More information

Normality Testing in Excel

Normality Testing in Excel Normality Testing in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com

More information

Gamma Distribution Fitting

Gamma Distribution Fitting Chapter 552 Gamma Distribution Fitting Introduction This module fits the gamma probability distributions to a complete or censored set of individual or grouped data values. It outputs various statistics

More information

Multinomial and Ordinal Logistic Regression

Multinomial and Ordinal Logistic Regression Multinomial and Ordinal Logistic Regression ME104: Linear Regression Analysis Kenneth Benoit August 22, 2012 Regression with categorical dependent variables When the dependent variable is categorical,

More information

1.5 Oneway Analysis of Variance

1.5 Oneway Analysis of Variance Statistics: Rosie Cornish. 200. 1.5 Oneway Analysis of Variance 1 Introduction Oneway analysis of variance (ANOVA) is used to compare several means. This method is often used in scientific or medical experiments

More information

Regression Analysis: A Complete Example

Regression Analysis: A Complete Example Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty

More information

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon

t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon t-tests in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com www.excelmasterseries.com

More information

Poisson Models for Count Data

Poisson Models for Count Data Chapter 4 Poisson Models for Count Data In this chapter we study log-linear models for count data under the assumption of a Poisson error structure. These models have many applications, not only to the

More information

Variations of Statistical Models

Variations of Statistical Models 38. Statistics 1 38. STATISTICS Revised September 2013 by G. Cowan (RHUL). This chapter gives an overview of statistical methods used in high-energy physics. In statistics, we are interested in using a

More information

What is Statistics? Lecture 1. Introduction and probability review. Idea of parametric inference

What is Statistics? Lecture 1. Introduction and probability review. Idea of parametric inference 0. 1. Introduction and probability review 1.1. What is Statistics? What is Statistics? Lecture 1. Introduction and probability review There are many definitions: I will use A set of principle and procedures

More information

Dongfeng Li. Autumn 2010

Dongfeng Li. Autumn 2010 Autumn 2010 Chapter Contents Some statistics background; ; Comparing means and proportions; variance. Students should master the basic concepts, descriptive statistics measures and graphs, basic hypothesis

More information

5. Linear Regression

5. Linear Regression 5. Linear Regression Outline.................................................................... 2 Simple linear regression 3 Linear model............................................................. 4

More information

Part 2: Analysis of Relationship Between Two Variables

Part 2: Analysis of Relationship Between Two Variables Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable

More information

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools Occam s razor.......................................................... 2 A look at data I.........................................................

More information

HYPOTHESIS TESTING WITH SPSS:

HYPOTHESIS TESTING WITH SPSS: HYPOTHESIS TESTING WITH SPSS: A NON-STATISTICIAN S GUIDE & TUTORIAL by Dr. Jim Mirabella SPSS 14.0 screenshots reprinted with permission from SPSS Inc. Published June 2006 Copyright Dr. Jim Mirabella CHAPTER

More information

REALIZING EINSTEIN S DREAM Exploring Our Mysterious Universe

REALIZING EINSTEIN S DREAM Exploring Our Mysterious Universe REALIZING EINSTEIN S DREAM Exploring Our Mysterious Universe The End of Physics Albert A. Michelson, at the dedication of Ryerson Physics Lab, U. of Chicago, 1894 The Miracle Year - 1905 Relativity Quantum

More information

ABOUT THE POSSIBILITY TO MEASURE SOME STANDARD MODEL PARAMETERS AND SEARCH FOR NEW PHYSICS WITH LOW ENERGY NEUTRINOS *

ABOUT THE POSSIBILITY TO MEASURE SOME STANDARD MODEL PARAMETERS AND SEARCH FOR NEW PHYSICS WITH LOW ENERGY NEUTRINOS * Romanian Reports in Physics, Vol. 64, No. 1, P. 4 3, 01 ELEMENTARY PARTICLE PHYSICS ABOUT THE POSSIBILITY TO MEASURE SOME STANDARD MODEL PARAMETERS AND SEARCH FOR NEW PHYSICS WITH LOW ENERGY NEUTRINOS

More information

AP Physics 1 and 2 Lab Investigations

AP Physics 1 and 2 Lab Investigations AP Physics 1 and 2 Lab Investigations Student Guide to Data Analysis New York, NY. College Board, Advanced Placement, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks

More information

Errata and updates for ASM Exam C/Exam 4 Manual (Sixteenth Edition) sorted by page

Errata and updates for ASM Exam C/Exam 4 Manual (Sixteenth Edition) sorted by page Errata for ASM Exam C/4 Study Manual (Sixteenth Edition) Sorted by Page 1 Errata and updates for ASM Exam C/Exam 4 Manual (Sixteenth Edition) sorted by page Practice exam 1:9, 1:22, 1:29, 9:5, and 10:8

More information

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not. Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C

More information

Statistics in Retail Finance. Chapter 6: Behavioural models

Statistics in Retail Finance. Chapter 6: Behavioural models Statistics in Retail Finance 1 Overview > So far we have focussed mainly on application scorecards. In this chapter we shall look at behavioural models. We shall cover the following topics:- Behavioural

More information

Selected Topics in Elementary Particle Physics ( Haupt-Seminar )

Selected Topics in Elementary Particle Physics ( Haupt-Seminar ) Selected Topics in Elementary Particle Physics ( Haupt-Seminar ) Paola Avella, Veronika Chobanova, Luigi Li Gioi, Christian Kiesling, Hans-Günther Moser, Martin Ritter, Pit Vanhoefer Time: Do, 12 ct -14

More information

Least Squares Estimation

Least Squares Estimation Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David

More information

CHAPTER 6: Continuous Uniform Distribution: 6.1. Definition: The density function of the continuous random variable X on the interval [A, B] is.

CHAPTER 6: Continuous Uniform Distribution: 6.1. Definition: The density function of the continuous random variable X on the interval [A, B] is. Some Continuous Probability Distributions CHAPTER 6: Continuous Uniform Distribution: 6. Definition: The density function of the continuous random variable X on the interval [A, B] is B A A x B f(x; A,

More information

Lecture Notes Module 1

Lecture Notes Module 1 Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

More information

SAS Software to Fit the Generalized Linear Model

SAS Software to Fit the Generalized Linear Model SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling

More information

Effect of energy scale imperfections on results of neutrino mass measurements from β-decay

Effect of energy scale imperfections on results of neutrino mass measurements from β-decay Effect of energy scale imperfections on results of neutrino mass measurements from β-decay J. Kašpar 1, M. Ryšavý, A. Špalek and O. Dragoun Nuclear Physics Institute, Acad. Sci. Czech Rep., CZ 250 68 Řež

More information

How To Calculate The Power Of A Cluster In Erlang (Orchestra)

How To Calculate The Power Of A Cluster In Erlang (Orchestra) Network Traffic Distribution Derek McAvoy Wireless Technology Strategy Architect March 5, 21 Data Growth is Exponential 2.5 x 18 98% 2 95% Traffic 1.5 1 9% 75% 5%.5 Data Traffic Feb 29 25% 1% 5% 2% 5 1

More information

Measurement of the Mass of the Top Quark in the l+ Jets Channel Using the Matrix Element Method

Measurement of the Mass of the Top Quark in the l+ Jets Channel Using the Matrix Element Method Measurement of the Mass of the Top Quark in the l+ Jets Channel Using the Matrix Element Method Carlos Garcia University of Rochester For the DØ Collaboration APS Meeting 2007 Outline Introduction Top

More information

A System for Possible Early Warning of Disastrous Solar Flares

A System for Possible Early Warning of Disastrous Solar Flares A System for Possible Early Warning of Disastrous Solar Flares Jere Jenkins Ephraim Fischbach 116m In Decay 500 450 400 350 y = -0.000203x + 6.044443 R 2 = 0.980358 7 6 5 Counts/10s 300 250 200 4 3 ln(counts/10s)

More information

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables. SIMPLE LINEAR CORRELATION Simple linear correlation is a measure of the degree to which two variables vary together, or a measure of the intensity of the association between two variables. Correlation

More information

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4) Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

More information

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis

More information

Signal Detection C H A P T E R 14 14.1 SIGNAL DETECTION AS HYPOTHESIS TESTING

Signal Detection C H A P T E R 14 14.1 SIGNAL DETECTION AS HYPOTHESIS TESTING C H A P T E R 4 Signal Detection 4. SIGNAL DETECTION AS HYPOTHESIS TESTING In Chapter 3 we considered hypothesis testing in the context of random variables. The detector resulting in the minimum probability

More information

VISUALIZATION OF DENSITY FUNCTIONS WITH GEOGEBRA

VISUALIZATION OF DENSITY FUNCTIONS WITH GEOGEBRA VISUALIZATION OF DENSITY FUNCTIONS WITH GEOGEBRA Csilla Csendes University of Miskolc, Hungary Department of Applied Mathematics ICAM 2010 Probability density functions A random variable X has density

More information

Search for double beta decay to excited states with CUORICINO and data acquisition system for CUORE

Search for double beta decay to excited states with CUORICINO and data acquisition system for CUORE Università degli studi di Genova Facoltà di scienze Matematiche, Fisiche e Naturali Scuola di Dottorato in Scienze e Tecnologie per l informazione e la conoscenza XXI Ciclo, A.A. 2006-2009 A Thesis submitted

More information

MULTIPLE REGRESSION EXAMPLE

MULTIPLE REGRESSION EXAMPLE MULTIPLE REGRESSION EXAMPLE For a sample of n = 166 college students, the following variables were measured: Y = height X 1 = mother s height ( momheight ) X 2 = father s height ( dadheight ) X 3 = 1 if

More information

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing Introduction Hypothesis Testing Mark Lunt Arthritis Research UK Centre for Ecellence in Epidemiology University of Manchester 13/10/2015 We saw last week that we can never know the population parameters

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression A regression with two or more explanatory variables is called a multiple regression. Rather than modeling the mean response as a straight line, as in simple regression, it is

More information

Multivariate normal distribution and testing for means (see MKB Ch 3)

Multivariate normal distribution and testing for means (see MKB Ch 3) Multivariate normal distribution and testing for means (see MKB Ch 3) Where are we going? 2 One-sample t-test (univariate).................................................. 3 Two-sample t-test (univariate).................................................

More information

Introduction to Quantitative Methods

Introduction to Quantitative Methods Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................

More information

A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution

A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 4: September

More information

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Two-Sample T-Tests Assuming Equal Variance (Enter Means) Chapter 4 Two-Sample T-Tests Assuming Equal Variance (Enter Means) Introduction This procedure provides sample size and power calculations for one- or two-sided two-sample t-tests when the variances of

More information

Waste Management 04 Conference, February 29 - March 4, 2004, Tucson, AZ Copyright WM Symposia, Inc. All Rights Reserved. Reprinted with permission.

Waste Management 04 Conference, February 29 - March 4, 2004, Tucson, AZ Copyright WM Symposia, Inc. All Rights Reserved. Reprinted with permission. RADIOLOGICAL FALSE POSITIVES IN ENVIRONMENTAL SOIL AND GROUNDWATER DATA FROM COMMERCIAL LABORATORIES Walter Kubilius Westinghouse Savannah River Company Thomas Coffey, Paul Mark EXR, Inc. Allen Volesky

More information

Stirling s formula, n-spheres and the Gamma Function

Stirling s formula, n-spheres and the Gamma Function Stirling s formula, n-spheres and the Gamma Function We start by noticing that and hence x n e x dx lim a 1 ( 1 n n a n n! e ax dx lim a 1 ( 1 n n a n a 1 x n e x dx (1 Let us make a remark in passing.

More information

arxiv:cond-mat/9301024v1 20 Jan 1993 ABSTRACT

arxiv:cond-mat/9301024v1 20 Jan 1993 ABSTRACT Anyons as Dirac Strings, the A x = 0 Gauge LPTB 93-1 John McCabe Laboratoire de Physique Théorique, 1 Université Bordeaux I 19 rue du Solarium, 33175 Gradignan FRANCE arxiv:cond-mat/930104v1 0 Jan 1993

More information

MATH 31B: MIDTERM 1 REVIEW. 1. Inverses. yx 3y = 1. x = 1 + 3y y 4( 1) + 32 = 1

MATH 31B: MIDTERM 1 REVIEW. 1. Inverses. yx 3y = 1. x = 1 + 3y y 4( 1) + 32 = 1 MATH 3B: MIDTERM REVIEW JOE HUGHES. Inverses. Let f() = 3. Find the inverse g() for f. Solution: Setting y = ( 3) and solving for gives and g() = +3. y 3y = = + 3y y. Let f() = 4 + 3. Find a domain on

More information

SF2940: Probability theory Lecture 8: Multivariate Normal Distribution

SF2940: Probability theory Lecture 8: Multivariate Normal Distribution SF2940: Probability theory Lecture 8: Multivariate Normal Distribution Timo Koski 24.09.2015 Timo Koski Matematisk statistik 24.09.2015 1 / 1 Learning outcomes Random vectors, mean vector, covariance matrix,

More information

Concepts in Theoretical Physics

Concepts in Theoretical Physics Concepts in Theoretical Physics Lecture 6: Particle Physics David Tong e 2 The Structure of Things 4πc 1 137 e d ν u Four fundamental particles Repeated twice! va, 9608085, 9902033 Four fundamental forces

More information

INSURANCE RISK THEORY (Problems)

INSURANCE RISK THEORY (Problems) INSURANCE RISK THEORY (Problems) 1 Counting random variables 1. (Lack of memory property) Let X be a geometric distributed random variable with parameter p (, 1), (X Ge (p)). Show that for all n, m =,

More information

ON-STREAM XRF ANALYSIS OF HEAVY METALS AT PPM CONCENTRATIONS

ON-STREAM XRF ANALYSIS OF HEAVY METALS AT PPM CONCENTRATIONS Copyright JCPDS - International Centre for Diffraction Data 2004, Advances in X-ray Analysis, Volume 47. 130 ABSTRACT ON-STREAM XRF ANALYSIS OF HEAVY METALS AT PPM CONCENTRATIONS G Roach and J Tickner

More information

PHY4604 Introduction to Quantum Mechanics Fall 2004 Practice Test 3 November 22, 2004

PHY4604 Introduction to Quantum Mechanics Fall 2004 Practice Test 3 November 22, 2004 PHY464 Introduction to Quantum Mechanics Fall 4 Practice Test 3 November, 4 These problems are similar but not identical to the actual test. One or two parts will actually show up.. Short answer. (a) Recall

More information

GLM I An Introduction to Generalized Linear Models

GLM I An Introduction to Generalized Linear Models GLM I An Introduction to Generalized Linear Models CAS Ratemaking and Product Management Seminar March 2009 Presented by: Tanya D. Havlicek, Actuarial Assistant 0 ANTITRUST Notice The Casualty Actuarial

More information

The Compact Muon Solenoid Experiment. CMS Note. Mailing address: CMS CERN, CH-1211 GENEVA 23, Switzerland. D. J. Mangeol, U.

The Compact Muon Solenoid Experiment. CMS Note. Mailing address: CMS CERN, CH-1211 GENEVA 23, Switzerland. D. J. Mangeol, U. Available on CMS information server CMS NOTE 6/96 The Compact Muon Solenoid Experiment CMS Note Mailing address: CMS CERN, CH11 GENEVA 3, Switzerland July, 6 Search for χ decays to ττ and SUSY mass spectrum

More information

An Innovative Method for Dead Time Correction in Nuclear Spectroscopy

An Innovative Method for Dead Time Correction in Nuclear Spectroscopy An Innovative Method for Dead Time Correction in Nuclear Spectroscopy Upp, Daniel L.; Keyser, Ronald M.; Gedcke, Dale A.; Twomey, Timothy R.; and Bingham, Russell D. PerkinElmer Instruments, Inc. ORTEC,

More information

Chapter 4 and 5 solutions

Chapter 4 and 5 solutions Chapter 4 and 5 solutions 4.4. Three different washing solutions are being compared to study their effectiveness in retarding bacteria growth in five gallon milk containers. The analysis is done in a laboratory,

More information

5. Continuous Random Variables

5. Continuous Random Variables 5. Continuous Random Variables Continuous random variables can take any value in an interval. They are used to model physical characteristics such as time, length, position, etc. Examples (i) Let X be

More information

Logistic Regression. Jia Li. Department of Statistics The Pennsylvania State University. Logistic Regression

Logistic Regression. Jia Li. Department of Statistics The Pennsylvania State University. Logistic Regression Logistic Regression Department of Statistics The Pennsylvania State University Email: jiali@stat.psu.edu Logistic Regression Preserve linear classification boundaries. By the Bayes rule: Ĝ(x) = arg max

More information

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010 MONT 07N Understanding Randomness Solutions For Final Examination May, 00 Short Answer (a) (0) How are the EV and SE for the sum of n draws with replacement from a box computed? Solution: The EV is n times

More information

Theoretical Particle Physics FYTN04: Oral Exam Questions, version ht15

Theoretical Particle Physics FYTN04: Oral Exam Questions, version ht15 Theoretical Particle Physics FYTN04: Oral Exam Questions, version ht15 Examples of The questions are roughly ordered by chapter but are often connected across the different chapters. Ordering is as in

More information

POLYNOMIAL HISTOPOLATION, SUPERCONVERGENT DEGREES OF FREEDOM, AND PSEUDOSPECTRAL DISCRETE HODGE OPERATORS

POLYNOMIAL HISTOPOLATION, SUPERCONVERGENT DEGREES OF FREEDOM, AND PSEUDOSPECTRAL DISCRETE HODGE OPERATORS POLYNOMIAL HISTOPOLATION, SUPERCONVERGENT DEGREES OF FREEDOM, AND PSEUDOSPECTRAL DISCRETE HODGE OPERATORS N. ROBIDOUX Abstract. We show that, given a histogram with n bins possibly non-contiguous or consisting

More information

Search for supersymmetric Dark Matter with GLAST!!

Search for supersymmetric Dark Matter with GLAST!! Search for supersymmetric Dark Matter with GLAST!! Aldo Morselli INFN, Sezione di Roma 2 & Università di Roma Tor Vergata ICCRC2003 The 28th International Cosmic Ray Conference Tsukuba, Japan, July 31-

More information

Reject Inference in Credit Scoring. Jie-Men Mok

Reject Inference in Credit Scoring. Jie-Men Mok Reject Inference in Credit Scoring Jie-Men Mok BMI paper January 2009 ii Preface In the Master programme of Business Mathematics and Informatics (BMI), it is required to perform research on a business

More information

Chapter 23. Two Categorical Variables: The Chi-Square Test

Chapter 23. Two Categorical Variables: The Chi-Square Test Chapter 23. Two Categorical Variables: The Chi-Square Test 1 Chapter 23. Two Categorical Variables: The Chi-Square Test Two-Way Tables Note. We quickly review two-way tables with an example. Example. Exercise

More information

Principle of Data Reduction

Principle of Data Reduction Chapter 6 Principle of Data Reduction 6.1 Introduction An experimenter uses the information in a sample X 1,..., X n to make inferences about an unknown parameter θ. If the sample size n is large, then

More information

Module 2 Probability and Statistics

Module 2 Probability and Statistics Module 2 Probability and Statistics BASIC CONCEPTS Multiple Choice Identify the choice that best completes the statement or answers the question. 1. The standard deviation of a standard normal distribution

More information

1 Sufficient statistics

1 Sufficient statistics 1 Sufficient statistics A statistic is a function T = rx 1, X 2,, X n of the random sample X 1, X 2,, X n. Examples are X n = 1 n s 2 = = X i, 1 n 1 the sample mean X i X n 2, the sample variance T 1 =

More information