Program for statistical comparison of histograms
|
|
|
- Kerry Cole
- 10 years ago
- Views:
Transcription
1 ROOT 3, Saas-Fee, March, Program for statistical comparison of histograms S. Bityukov (IHEP, INR), N. Tsirova (NPI MSU) Scope Introduction Statistical comparison of two histograms Normalized significance Example Distance measure Internal resolution of the method Script stat analyzer.c Missing E T : large difference between histograms P T light jet: only statistical difference Output of the script Conclusion
2 ROOT 3, Saas-Fee, March, Introduction Sometimes we have two histograms and are faced with the question: Are the realizations of random variables in the form of these histograms taken from the same statistical population? There are many approaches for comparison of two histograms: Kolmogorov-Smirnov test, Cramer-von- Mises test, Anderson-Darling test, Likelihood ratio test for shape and for slope,.... The review on this subject is in the paper Testing Consistency of Two Histograms by F. Porter. Usually, a special test-statistic is used for comparing the two histograms (for example, χ, NIM, A64 () 87 ).
3 ROOT 3, Saas-Fee, March, Statistical comparison of two histograms A method, which uses the distribution of test-statistics for each bin of histograms, is proposed in note arxiv:3.65. These test-statistics obeys the distribution close to standard normal distribution if both histograms are obtained from the same statistical population by the same analyzer of data. Correspondingly, the joint distribution must be close to standard normal distribution. For example, if we consider observed values which are produced by the same Poisson flow of events during equal independent time ranges then a test statistic Ŝ c = ( ˆn i ˆn i ), where ˆn i is a number events in bin i of histogram, ˆn i is a number events in bin i of histogram, is the test statistic of such type. It is shown in paper PoS(ACAT8)8. Often test-statistics of such type are named as significances of a deviation or significances of an enhancement.
4 ROOT 3, Saas-Fee, March, Normalized significance Let us consider a model with two histograms where the random variable in each bin obeys the normal distribution ϕ(x n ik ) = e (x n ik ) σ ik. () πσik Here the expected value in the bin i is equal to n ik and the variance σ n ik is also equal to n ik. k is the histogram number (k =, ). Let integral in the histogram equals N and integral in the histogram equals N (for example, if we have different integrated luminosity in experiments). We define the normalized significance as Ŝ i (K) = ˆn i Kˆn i ˆσ ni + Kˆσ n i. () Here ˆn ik is an observed value in the bin i of the histogram k, ˆσ n ik = ˆn ik and coefficient of normalization K = N N.
5 ROOT 3, Saas-Fee, March, Example hist h Entries Mean RMS 35.8 s_i 3 u Entries Mean 499 RMS hist h Entries Mean RMS 35.6 Signif u Entries Mean RMS Figure : Triangle distributions (K =, M = ): the observed values ˆn i in the first histogram (left,up), the observed values n i in the second histogram (left, down), observed normalized significances Ŝi bin-by-bin (right, up), the distribution of observed normalized significances (right, down). The example with histograms produced from the same events flow during unequal independent time ranges shows that the standard deviation (RMS in the ROOT notation) of the distribution in the picture (right, down) can be used as estimator of the statistical difference between histograms (this distribution is close to standard normal distribution in our example).
6 ROOT 3, Saas-Fee, March, Distance measure The RMS as a distance measure in our case has a clear interpretation (in fact, we set a scale of this differmeter ): RMS = histograms are identical; RMS both histograms are obtained (by the using independent samples) from the same parent distribution; RMS >> histograms are obtained from different parent distributions. An accuracy (internal resolution) of the method depends on the number of bins M in histograms and on the normalized coefficient K. The accuracy can be estimated via Monte Carlo experiments. RMS p Entries 5 Mean.5 RMS.4395 χ / ndf.8 / Constant. ±. Mean.4 ±. Sigma.457 ± Figure : Distribution of RMS for 5 comparisons of histograms (triangle distribution, K =, M = 3).
7 ROOT 3, Saas-Fee, March, Internal resolution of the method The dependencies of the internal resolution on the bin numbers M and on the value of coefficient of normalization K are shown in Fig. 3. standard deviation of RMS & binning for histogram σ RMS The precision of the RMS estimation Monte Carlo (5 trials for each point). K =.5.5 K = M, bins Figure 3: The dependence of the standard deviation of RMS on number of bins M. This dependence is shown for two values of normalized coefficient K (K = and K =.5).
8 ROOT 3, Saas-Fee, March, Script stat analyzer.c Two input files (*.root) with a set of THD histograms to compare. User should indicate which histograms he/she wants to compare. Processing: calculate mean value, RMS and p-value for each pair of histograms; sort variables by RMS in descending order; print sorted results (variable p-value RMS mean). Output: to screen; to text file. Commands for start-up of script in ROOT root[].l stat analyzer.c++ root[] stat analyzer( signal rv, signal )
9 ROOT 3, Saas-Fee, March, Missing E T : large difference between histograms.8 MET, signal s_i 5 Significances, bin-by-bin s Entries Mean RMS MET, anomalous Wtb coupling Signif 4 Distribution of significances d Entries Mean.993 RMS p-value = e Figure 4: MET: the observed values in the first histogram (left,up), the observed values in the second histogram (left, down), observed normalized significances bin-by-bin (right, up), the distribution of observed normalized significances (right, down). Let us consider the distributions of the probability (with errors) of the missing E T to be in corresponding bin of histogram in the case of registration of Standard Model events and the events which are produced due to the presence of anomalous Wtb coupling. The comparison of these distributions is shown in Fig. 4 (RMS =.74, Mean =.99).
10 ROOT 3, Saas-Fee, March, P T light jet: only statistical difference.6.4 Pt_LJ, signal s_i.5 Significances, bin-by-bin s Entries Mean 83.3 RMS Pt_LJ, anomalous Wtb coupling Signif 8 Distribution of significances d Entries Mean.446 RMS p-value = Figure 5: Pt LJ: the observed values in the first histogram (left,up), the observed values in the second histogram (left, down), observed normalized significances bin-by-bin (right, up), the distribution of observed normalized significances (right, down). The case of the absence of the difference between distributions for the production of single top quark in frame of SM and model with anomalous Wtb coupling is shown in Fig. 5 for P t distribution of jet from light quark (RMS =.98, Mean =.4).
11 ROOT 3, Saas-Fee, March, Output of the script Figure 6: Output of the script
12 ROOT 3, Saas-Fee, March, Conclusions We discussed the possible tool for comparison of histograms in frame of the ROOT system (script stat analyzer.c). This tool very easy in use and very easy to understand the results. This tool can be used in tasks of monitoring of the equipment. In this moment the method is used for choice of the most significant variables in multivariate analysis. This tool requires the additional development (now the method works for histograms THD).
13 ROOT 3, Saas-Fee, March, η of lepton: the difference exits.9 Eta_Lep, signal s_i 3 Significances, bin-by-bin s Entries Mean.45 RMS Eta_Lep, anomalous Wtb coupling Signif 7 Distribution of significances d Entries Mean.67 RMS p-value = Figure 7: Eta Lep: the observed values in the first histogram (left,up), the observed values in the second histogram (left, down), observed normalized significances bin-by-bin (right, up), the distribution of observed normalized significances (right, down). The case of small difference between the histograms is shown in Fig. 7 (RMS =.37, Mean =.9).
Theory versus Experiment. Prof. Jorgen D Hondt Vrije Universiteit Brussel [email protected]
Theory versus Experiment Prof. Jorgen D Hondt Vrije Universiteit Brussel [email protected] Theory versus Experiment Pag. 2 Dangerous cocktail!!! Pag. 3 The basics in these lectures Part1 : Theory meets
Chapter 6: Point Estimation. Fall 2011. - Probability & Statistics
STAT355 Chapter 6: Point Estimation Fall 2011 Chapter Fall 2011 6: Point1 Estimat / 18 Chap 6 - Point Estimation 1 6.1 Some general Concepts of Point Estimation Point Estimate Unbiasedness Principle of
Predictor Coef StDev T P Constant 970667056 616256122 1.58 0.154 X 0.00293 0.06163 0.05 0.963. S = 0.5597 R-Sq = 0.0% R-Sq(adj) = 0.
Statistical analysis using Microsoft Excel Microsoft Excel spreadsheets have become somewhat of a standard for data storage, at least for smaller data sets. This, along with the program often being packaged
12: Analysis of Variance. Introduction
1: Analysis of Variance Introduction EDA Hypothesis Test Introduction In Chapter 8 and again in Chapter 11 we compared means from two independent groups. In this chapter we extend the procedure to consider
Measurement of Neutralino Mass Differences with CMS in Dilepton Final States at the Benchmark Point LM9
Measurement of Neutralino Mass Differences with CMS in Dilepton Final States at the Benchmark Point LM9, Katja Klein, Lutz Feld, Niklas Mohr 1. Physikalisches Institut B RWTH Aachen Introduction Fast discovery
NCSS Statistical Software
Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the
Data Analysis Tools. Tools for Summarizing Data
Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool
NCSS Statistical Software
Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the
Introduction. The Supplement shows results of locking using a range of smoothing parameters α, and checkerboard tests.
Auxiliary Material Submission for Paper 2014JB010945 Robert McCaffrey Portland State University Inter-seismic locking on the Hikurangi subduction zone: Uncertainties from slow-slip events Introduction
Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition
Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition Online Learning Centre Technology Step-by-Step - Excel Microsoft Excel is a spreadsheet software application
5 Analysis of Variance models, complex linear models and Random effects models
5 Analysis of Variance models, complex linear models and Random effects models In this chapter we will show any of the theoretical background of the analysis. The focus is to train the set up of ANOVA
4.1 4.2 Probability Distribution for Discrete Random Variables
4.1 4.2 Probability Distribution for Discrete Random Variables Key concepts: discrete random variable, probability distribution, expected value, variance, and standard deviation of a discrete random variable.
business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar
business statistics using Excel Glyn Davis & Branko Pecar OXFORD UNIVERSITY PRESS Detailed contents Introduction to Microsoft Excel 2003 Overview Learning Objectives 1.1 Introduction to Microsoft Excel
1.5 Oneway Analysis of Variance
Statistics: Rosie Cornish. 200. 1.5 Oneway Analysis of Variance 1 Introduction Oneway analysis of variance (ANOVA) is used to compare several means. This method is often used in scientific or medical experiments
Statistical Functions in Excel
Statistical Functions in Excel There are many statistical functions in Excel. Moreover, there are other functions that are not specified as statistical functions that are helpful in some statistical analyses.
Recall this chart that showed how most of our course would be organized:
Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical
Face Recognition in Low-resolution Images by Using Local Zernike Moments
Proceedings of the International Conference on Machine Vision and Machine Learning Prague, Czech Republic, August14-15, 014 Paper No. 15 Face Recognition in Low-resolution Images by Using Local Zernie
Java Modules for Time Series Analysis
Java Modules for Time Series Analysis Agenda Clustering Non-normal distributions Multifactor modeling Implied ratings Time series prediction 1. Clustering + Cluster 1 Synthetic Clustering + Time series
How To Test For Significance On A Data Set
Non-Parametric Univariate Tests: 1 Sample Sign Test 1 1 SAMPLE SIGN TEST A non-parametric equivalent of the 1 SAMPLE T-TEST. ASSUMPTIONS: Data is non-normally distributed, even after log transforming.
Basic Statistics and Data Analysis for Health Researchers from Foreign Countries
Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Volkert Siersma [email protected] The Research Unit for General Practice in Copenhagen Dias 1 Content Quantifying association
Web-based pre-analysis Tools
Summer Student Project Web-based pre-analysis Tools Student: Tetiana Moskalets V. N. Karazin Kharkiv National University Supervisors: Dr. Arturo Rodolfo Sanchez Pineda University of Naples Federico II
Measurement of the Mass of the Top Quark in the l+ Jets Channel Using the Matrix Element Method
Measurement of the Mass of the Top Quark in the l+ Jets Channel Using the Matrix Element Method Carlos Garcia University of Rochester For the DØ Collaboration APS Meeting 2007 Outline Introduction Top
MEASURES OF LOCATION AND SPREAD
Paper TU04 An Overview of Non-parametric Tests in SAS : When, Why, and How Paul A. Pappas and Venita DePuy Durham, North Carolina, USA ABSTRACT Most commonly used statistical procedures are based on the
Top rediscovery at ATLAS and CMS
Top rediscovery at ATLAS and CMS on behalf of ATLAS and CMS collaborations CNRS/IN2P3 & UJF/ENSPG, LPSC, Grenoble, France E-mail: [email protected] We describe the plans and strategies of the
Chapter 3 RANDOM VARIATE GENERATION
Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.
Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)
Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume
Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013
Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives
Simple Linear Regression
STAT 101 Dr. Kari Lock Morgan Simple Linear Regression SECTIONS 9.3 Confidence and prediction intervals (9.3) Conditions for inference (9.1) Want More Stats??? If you have enjoyed learning how to analyze
Determining Measurement Uncertainty for Dimensional Measurements
Determining Measurement Uncertainty for Dimensional Measurements The purpose of any measurement activity is to determine or quantify the size, location or amount of an object, substance or physical parameter
Intro Root and statistics
HASCO summerschool, July 2014 (Göttingen) muon muon Analysis walkthrough muon muon Intro Root and statistics Ivo van Vulpen (UvA/Nikhef) July 17 th 2014 MH17 plane crash 298 people died (193 Dutch) HASCO
Adequacy of Biomath. Models. Empirical Modeling Tools. Bayesian Modeling. Model Uncertainty / Selection
Directions in Statistical Methodology for Multivariable Predictive Modeling Frank E Harrell Jr University of Virginia Seattle WA 19May98 Overview of Modeling Process Model selection Regression shape Diagnostics
The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median
CONDENSED LESSON 2.1 Box Plots In this lesson you will create and interpret box plots for sets of data use the interquartile range (IQR) to identify potential outliers and graph them on a modified box
How To Run Statistical Tests in Excel
How To Run Statistical Tests in Excel Microsoft Excel is your best tool for storing and manipulating data, calculating basic descriptive statistics such as means and standard deviations, and conducting
Normality Testing in Excel
Normality Testing in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. [email protected]
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in
Northumberland Knowledge
Northumberland Knowledge Know Guide How to Analyse Data - November 2012 - This page has been left blank 2 About this guide The Know Guides are a suite of documents that provide useful information about
Lecture 25. December 19, 2007. Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
2 Binomial, Poisson, Normal Distribution
2 Binomial, Poisson, Normal Distribution Binomial Distribution ): We are interested in the number of times an event A occurs in n independent trials. In each trial the event A has the same probability
ANALYZING NETWORK TRAFFIC FOR MALICIOUS ACTIVITY
CANADIAN APPLIED MATHEMATICS QUARTERLY Volume 12, Number 4, Winter 2004 ANALYZING NETWORK TRAFFIC FOR MALICIOUS ACTIVITY SURREY KIM, 1 SONG LI, 2 HONGWEI LONG 3 AND RANDALL PYKE Based on work carried out
Nonparametric Statistics
Nonparametric Statistics References Some good references for the topics in this course are 1. Higgins, James (2004), Introduction to Nonparametric Statistics 2. Hollander and Wolfe, (1999), Nonparametric
5. Linear Regression
5. Linear Regression Outline.................................................................... 2 Simple linear regression 3 Linear model............................................................. 4
Modeling Individual Claims for Motor Third Party Liability of Insurance Companies in Albania
Modeling Individual Claims for Motor Third Party Liability of Insurance Companies in Albania Oriana Zacaj Department of Mathematics, Polytechnic University, Faculty of Mathematics and Physics Engineering
Millikan Oil Drop Experiment Matthew Norton, Jurasits Christopher, Heyduck William, Nick Chumbley. Norton 0
Millikan Oil Drop Experiment Matthew Norton, Jurasits Christopher, Heyduck William, Nick Chumbley Norton 0 Norton 1 Abstract The charge of an electron can be experimentally measured by observing an oil
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.
ON MODELING INSURANCE CLAIMS
ON MODELING INSURANCE CLAIMS USING COPULAS FILIP ERNTELL Master s thesis 213:E55 Faculty of Science Centre for Mathematical Sciences Mathematical Statistics CENTRUM SCIENTIARUM MATHEMATICARUM ON MODELING
HLM software has been one of the leading statistical packages for hierarchical
Introductory Guide to HLM With HLM 7 Software 3 G. David Garson HLM software has been one of the leading statistical packages for hierarchical linear modeling due to the pioneering work of Stephen Raudenbush
Quantitative Methods for Finance
Quantitative Methods for Finance Module 1: The Time Value of Money 1 Learning how to interpret interest rates as required rates of return, discount rates, or opportunity costs. 2 Learning how to explain
Data Preparation and Statistical Displays
Reservoir Modeling with GSLIB Data Preparation and Statistical Displays Data Cleaning / Quality Control Statistics as Parameters for Random Function Models Univariate Statistics Histograms and Probability
NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
Basic Probability and Statistics Review. Six Sigma Black Belt Primer
Basic Probability and Statistics Review Six Sigma Black Belt Primer Pat Hammett, Ph.D. January 2003 Instructor Comments: This document contains a review of basic probability and statistics. It also includes
Using R for Linear Regression
Using R for Linear Regression In the following handout words and symbols in bold are R functions and words and symbols in italics are entries supplied by the user; underlined words and symbols are optional
Prentice Hall Connected Mathematics 2, 7th Grade Units 2009
Prentice Hall Connected Mathematics 2, 7th Grade Units 2009 Grade 7 C O R R E L A T E D T O from March 2009 Grade 7 Problem Solving Build new mathematical knowledge through problem solving. Solve problems
Notes on the Negative Binomial Distribution
Notes on the Negative Binomial Distribution John D. Cook October 28, 2009 Abstract These notes give several properties of the negative binomial distribution. 1. Parameterizations 2. The connection between
Copyright 2010-2012 PEOPLECERT Int. Ltd and IASSC
PEOPLECERT - Personnel Certification Body 3 Korai st., 105 64 Athens, Greece, Tel.: +30 210 372 9100, Fax: +30 210 372 9101, e-mail: [email protected], www.peoplecert.org Copyright 2010-2012 PEOPLECERT
CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS
Examples: Regression And Path Analysis CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS Regression analysis with univariate or multivariate dependent variables is a standard procedure for modeling relationships
Bill Burton Albert Einstein College of Medicine [email protected] April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1
Bill Burton Albert Einstein College of Medicine [email protected] April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Calculate counts, means, and standard deviations Produce
Multiple Choice: 2 points each
MID TERM MSF 503 Modeling 1 Name: Answers go here! NEATNESS COUNTS!!! Multiple Choice: 2 points each 1. In Excel, the VLOOKUP function does what? Searches the first row of a range of cells, and then returns
Gerry Hobbs, Department of Statistics, West Virginia University
Decision Trees as a Predictive Modeling Method Gerry Hobbs, Department of Statistics, West Virginia University Abstract Predictive modeling has become an important area of interest in tasks such as credit
Descriptive Statistics
Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web
Statistical Distributions in Astronomy
Data Mining In Modern Astronomy Sky Surveys: Statistical Distributions in Astronomy Ching-Wa Yip [email protected]; Bloomberg 518 From Data to Information We don t just want data. We want information from
DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.
DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,
Geostatistics Exploratory Analysis
Instituto Superior de Estatística e Gestão de Informação Universidade Nova de Lisboa Master of Science in Geospatial Technologies Geostatistics Exploratory Analysis Carlos Alberto Felgueiras [email protected]
Need for Sampling. Very large populations Destructive testing Continuous production process
Chapter 4 Sampling and Estimation Need for Sampling Very large populations Destructive testing Continuous production process The objective of sampling is to draw a valid inference about a population. 4-
Two-Group Hypothesis Tests: Excel 2013 T-TEST Command
Two group hypothesis tests using Excel 2013 T-TEST command 1 Two-Group Hypothesis Tests: Excel 2013 T-TEST Command by Milo Schield Member: International Statistical Institute US Rep: International Statistical
Sampling Distributions
Sampling Distributions You have seen probability distributions of various types. The normal distribution is an example of a continuous distribution that is often used for quantitative measures such as
A Case Study in Software Enhancements as Six Sigma Process Improvements: Simulating Productivity Savings
A Case Study in Software Enhancements as Six Sigma Process Improvements: Simulating Productivity Savings Dan Houston, Ph.D. Automation and Control Solutions Honeywell, Inc. [email protected] Abstract
Real Time Tracking with ATLAS Silicon Detectors and its Applications to Beauty Hadron Physics
Real Time Tracking with ATLAS Silicon Detectors and its Applications to Beauty Hadron Physics Carlo Schiavi Dottorato in Fisica - XVII Ciclo Outline The ATLAS Experiment The SiTrack Algorithm Application
Generalized Linear Models
Generalized Linear Models We have previously worked with regression models where the response variable is quantitative and normally distributed. Now we turn our attention to two types of models where the
Simulation Exercises to Reinforce the Foundations of Statistical Thinking in Online Classes
Simulation Exercises to Reinforce the Foundations of Statistical Thinking in Online Classes Simcha Pollack, Ph.D. St. John s University Tobin College of Business Queens, NY, 11439 [email protected]
The Assumption(s) of Normality
The Assumption(s) of Normality Copyright 2000, 2011, J. Toby Mordkoff This is very complicated, so I ll provide two versions. At a minimum, you should know the short one. It would be great if you knew
Simulating Investment Portfolios
Page 5 of 9 brackets will now appear around your formula. Array formulas control multiple cells at once. When gen_resample is used as an array formula, it assures that the random sample taken from the
Performance Metrics for Graph Mining Tasks
Performance Metrics for Graph Mining Tasks 1 Outline Introduction to Performance Metrics Supervised Learning Performance Metrics Unsupervised Learning Performance Metrics Optimizing Metrics Statistical
Study of the B D* ℓ ν with the Partial Reconstruction Technique
Study of the B D* ℓ ν with the Partial Reconstruction Technique + University of Ferrara / INFN Ferrara Dottorato di Ricerca in Fisica Ciclo XVII Mirco Andreotti 4 March 25 Measurement of B(B D*ℓν) from
Some Essential Statistics The Lure of Statistics
Some Essential Statistics The Lure of Statistics Data Mining Techniques, by M.J.A. Berry and G.S Linoff, 2004 Statistics vs. Data Mining..lie, damn lie, and statistics mining data to support preconceived
STAT 360 Probability and Statistics. Fall 2012
STAT 360 Probability and Statistics Fall 2012 1) General information: Crosslisted course offered as STAT 360, MATH 360 Semester: Fall 2012, Aug 20--Dec 07 Course name: Probability and Statistics Number
Binomial Distribution n = 20, p = 0.3
This document will describe how to use R to calculate probabilities associated with common distributions as well as to graph probability distributions. R has a number of built in functions for calculations
Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:
Glo bal Leadership M BA BUSINESS STATISTICS FINAL EXAM Name: INSTRUCTIONS 1. Do not open this exam until instructed to do so. 2. Be sure to fill in your name before starting the exam. 3. You have two hours
CRLS Mathematics Department Algebra I Curriculum Map/Pacing Guide
Curriculum Map/Pacing Guide page 1 of 14 Quarter I start (CP & HN) 170 96 Unit 1: Number Sense and Operations 24 11 Totals Always Include 2 blocks for Review & Test Operating with Real Numbers: How are
The Turning of JMP Software into a Semiconductor Analysis Software Product:
The Turning of JMP Software into a Semiconductor Analysis Software Product: The Implementation and Rollout of JMP Software within Freescale Semiconductor Inc. Jim Nelson, Manager IT, Yield Management Systems
Multiple Optimization Using the JMP Statistical Software Kodak Research Conference May 9, 2005
Multiple Optimization Using the JMP Statistical Software Kodak Research Conference May 9, 2005 Philip J. Ramsey, Ph.D., Mia L. Stephens, MS, Marie Gaudard, Ph.D. North Haven Group, http://www.northhavengroup.com/
Winter 2016 MATH 631 Online University of Waterloo
Course Schedule IMPORTANT: ALL TIMES EASTERN - Please see the University Policies section of your Syllabus for details. Week Lecture Content Readings Activities and Assignments End / Due Date Weight (%)
Condensing PQ Data and Visualization Analytics. Thomas Cooke
1 Condensing PQ Data and Visualization Analytics Thomas Cooke Storing and Transmitting Data One Monitor / Site (2 bytes per sample) X (512 samples/cycle) X (60 cycles/sec) X (60 sec/min) X (60 min/hr)
STAT 830 Convergence in Distribution
STAT 830 Convergence in Distribution Richard Lockhart Simon Fraser University STAT 830 Fall 2011 Richard Lockhart (Simon Fraser University) STAT 830 Convergence in Distribution STAT 830 Fall 2011 1 / 31
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing
Analysis of Variance. MINITAB User s Guide 2 3-1
3 Analysis of Variance Analysis of Variance Overview, 3-2 One-Way Analysis of Variance, 3-5 Two-Way Analysis of Variance, 3-11 Analysis of Means, 3-13 Overview of Balanced ANOVA and GLM, 3-18 Balanced
Statistical Process Control (SPC) Training Guide
Statistical Process Control (SPC) Training Guide Rev X05, 09/2013 What is data? Data is factual information (as measurements or statistics) used as a basic for reasoning, discussion or calculation. (Merriam-Webster
Data Mining: Exploring Data. Lecture Notes for Chapter 3. Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler
Data Mining: Exploring Data Lecture Notes for Chapter 3 Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler Topics Exploratory Data Analysis Summary Statistics Visualization What is data exploration?
Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010
Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Week 1 Week 2 14.0 Students organize and describe distributions of data by using a number of different
2. Simple Linear Regression
Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according
99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, 99.42 cm
Error Analysis and the Gaussian Distribution In experimental science theory lives or dies based on the results of experimental evidence and thus the analysis of this evidence is a critical part of the
Supplementary PROCESS Documentation
Supplementary PROCESS Documentation This document is an addendum to Appendix A of Introduction to Mediation, Moderation, and Conditional Process Analysis that describes options and output added to PROCESS
STATISTICA Formula Guide: Logistic Regression. Table of Contents
: Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary
Financial Econometrics MFE MATLAB Introduction. Kevin Sheppard University of Oxford
Financial Econometrics MFE MATLAB Introduction Kevin Sheppard University of Oxford October 21, 2013 2007-2013 Kevin Sheppard 2 Contents Introduction i 1 Getting Started 1 2 Basic Input and Operators 5
How To Find The Higgs Boson
Dezső Horváth: Search for Higgs bosons Balaton Summer School, Balatongyörök, 07.07.2009 p. 1/25 Search for Higgs bosons Balaton Summer School, Balatongyörök, 07.07.2009 Dezső Horváth MTA KFKI Research
RISK MITIGATION IN FAST TRACKING PROJECTS
Voorbeeld paper CCE certificering RISK MITIGATION IN FAST TRACKING PROJECTS Author ID # 4396 June 2002 G:\DACE\certificering\AACEI\presentation 2003 page 1 of 17 Table of Contents Abstract...3 Introduction...4
