5 Aleatory Variability and Epistemic Uncertainty
|
|
|
- Berniece Joy Stanley
- 9 years ago
- Views:
Transcription
1 5 Aleatory Variability and Epistemic Uncertainty Aleatory variability and epistemic uncertainty are terms used in seismic hazard analysis that are not commonly used in other fields, but the concepts are well known. Aleatory variability is the natural randomness in a process. For discrete variables, the randomness is parameterized by the probability of each possible value. For continuous variables, the randomness is parameterized by the probability density function. Epistemic uncertainty is the scientific uncertainty in the model of the process. It is due to limited data and knowledge. The epistemic uncertainty is characterized by alternative models. For discrete random variables, the epistemic uncertainty is modelled by alternative probability distributions. For continuous random variabiles, the epstemic uncertainty is modelled by alternative probability density functions. In addition, there is epistemic uncertainty in parameters that are not random by have only a single correct (but unknown) value. The terms randomness and uncertainty have also been used for aleatory variability and epistemic uncertainty, respectively; however, these terms are commonly used in generic ways. As a result, they are often mixed up when used in hazard analysis. The terms aleatory variability and epistemic uncertainty do not roll off the tongue easily. This unfamiliarity causes people to stop and think about what they are trying to say before using them. The overall goal is to have a clear terminology that will avoid misunderstandings. 5.1 Example of the Unknown Die As a simple example, consider the problem of rolling a die. Assume that you have not seen the die, but you have seen the results of four previous rolls. Those four previous rolls came up 2, 3, 3, and 4. What is the model for this die? 5-1
2 As one approach to developing a model, you may consider that although there are only four observations from this die, you have previous experience with dice. Most dice have six sides that are equally likely. The sparse data set is consistent with a standard die. So you construct a model of the die in which the aleatory is given by a uniform distribution with values between 1 and 6 (model 1 in Table 5-1). An alternative approach to developing the model could be purely empirical. Given the four observations are all between 2 and 4, you could develop a model that assumes that this is a five-sided loaded die so that it comes up 3 most often, 2 and 4 less often, and 1 and 5 least often. This model is shown as model 2 in Table 51. Which one of these two models is correct? We don t know until additional data is collected (e.g. more rolls of the die). With time, as more rolls of the die become available, we will be able to distinguish between these two models. These two models represent epistemic uncertainty in the properties of the die. As additional rolls of the die become available, the aleatory variability does not go to zero. Rather, our estimate of the aleatory variability becomes more accurate. It may increase or decrease from our original estimate. In contrast, the epistemic uncertainty will go to zero as the number of rolls of the die becomes large. If we had a large enough number of rolls, we could develop a very accurate empirical model of the die. Table 5-1. Example of aleatory variability and epistemic uncertainty for a die with unknown properties. Probability Value Model 1 Model 2 1 1/ / / / / /6 0.0 The alternative models may not be equally credible. In this example, it may be judged to be more likely that the die is a standard six-sided die than some special loaded five-sided 5-2
3 die. The alternative models are assigned weights using logic trees as discussed in detail in section 5.3. This example demonstrates the situation that is common in developing models for seismic hazard analysis. Often we have a very small amount of data that is from the particular region under study. The two alternatives are to build a model based on the very limited, but region-specific data (e.g. model 2 above) or to use a larger set of data from regions that we consider to be analogous to the region under study (model 1 above). 5.2 Is it Aleatory Variability or Epistemic Uncertainty? The idea of distinguishing between aleatory variability and epistemic uncertainty sounds simple enough and if seismic hazard analyses were about throwing dice it would be easy. In practice, the distinction between aleatory variability and epistemic uncertainty can get confusing. In distinguishing between aleatory variability and epistemic uncertainty it can be helpful to think how you would describe, in words, the parameter under consideration. If the parameter sometimes has one value and sometimes has another values, then it has aleatory variability. That is, the variability is random. If the parameter always has either one value or another, but we are not sure which it is, then the parameter has epistemic uncertainty. As an example, consider a fault with a postulated segmentation point. One model may consider that the segmentation point is an impenetrable barrier and an alternative model may consider that the segmentation point does not exist. In this case, the existence of the segmentation point is epistemic uncertainty. There are two alternative segmentation models for the fault. The fault is either unsegmented or it is segmented and always stops the rupture. Another model of the fault segmentation may consider that the segmentation point exists, but only stops some of the ruptures. In this case, there is aleatory variability in rupture 5-3
4 mode. Sometimes the rupture is stopped at the segmentation point and sometimes the rupture breaks through the segmentation point. In this model, the epistemic uncertainty is in the probability that the rupture will stop at the segmentation point. For example, one model may have a 30% probability that the rupture is stopped at the segmentation point, whereas, another model may have a 10% probability that the rupture is stopped at the segmentation point. One recurring issue in separating aleatory variability and epistemic uncertainty concerns the limits of what could be learned in the future. There is a school of thought that there is no aleatory variability in the earthquake process. In principle, earthquakes are responding to stresses and strains in the earth. Eventually, given enough time, we will collect enough data to develop detailed models of the earthquake process that give the magnitudes and locations of future earthquakes. Since the earthquake process is in theory knowable, there is only epistemic uncertainty due to our lack of knowledge which will be reduced in time. A similar issue comes up for ground motion attenuation relations. Ground motion attenuation relations typically only use distance from the site to the source to describe the wave propagation. The detailed 3-D structure of the crust is knowable (or empirical Green s functions could be collected). In principle, with time, a wave propagation model could be determined for each specific source and site. One could argue that the variability of the ground motion attenuation relation that is due to the complexities of the wave propagation should be epistemic uncertainty since it can be determined as additional data become available. In practise, we have not used this concept of what is potentially knowable long in the future in the estimation of epistemic uncertainty and aleatory variability. Rather, the aleatory variability is determined in the context of the models and is based on the parameterization used in the model. With this approach, a model that only uses distance for the wave propagation will include the variability due to different wave propagation effects as part of the aleatory variability even though it is potentially knowable. 5-4
5 With this approach, the aleatory variability can be reduced as additional fixed parameters are added to the model. For example, consider the case of attenuation relations. In the 1980s, attenuation relations began to include a parameter for the style-of-faulting (e.g. strike-slip or reverse) as part of the model. Since there was a systematic difference in the median ground motion from strike-slip and reverse faults, the inclusion of this factor reduced the standard deviation (aleatory variability) of the attenuation relation. For faults with a single predominate style-of-faulting, then this factor is fixed for future earthquakes and there is a net reduction in the aleatory variability. The penalty for the additional parameter is that in the short term, there is additional epistemic uncertainty as to the value of the style-of-faulting factor term. What then happens to the models that did not include this additional parameter? If the new parameterization results in a significant reduction in the aleatory variability, then the previous models that did not use this improved parameterization should be downweighted, until they are finally given zero weight (e.g. they are superseeded). The addition of parameters to the model that are not fixed for future events does not lead to a reduction of the aleatory variability. For example, consider a new attenuation relation that includes stress-drop as a parameter. The addition of this parameter results in a reduction of the standard deviation determined from a regression analysis; however, since the stress-drop is not fixed for future earthquakes, it must then be randomized for future earthquakes. The result is that there is no systematic reduction in aleatory variability. (There is still a net benfit as the source of the aleatory varibaility is better understood. This is discussed further in sections 6, 7, and 8). 5-5
6 5.3 Logic Trees Epistemic uncertainty is considered by using alternative models and/or parameter values for the source characterization and ground motion attenuation relation. For each combination of alternative models, the hazard is recomputed resulting in a suite of alternative hazard curves. In seismic hazard analyses, it is common to use logic trees to handle the epistemic uncertainty. A logic tree consists of a series of branches that describe the alternative models and/or parameter values. At each branch, there is a set of branch tips that represent the alternative credible models or parameter. The weights on the branch tips represent the judgment about the credibility of the alternative models. The branch tip weights must sum to unity at each branch point. Only epistemic uncertainty should be on the logic tree. A common error in seismic hazard analyses is to put aleatory variability on some of the branches. The weights on the branches of logic trees are often called probabilities, but they are better characterized as weights that reflect the current scientific judgments in the relative merit in the alternative models. Calling these weights probabilities implies a mathematical basis that does not exist. Epistemic uncertainty is due to limited data (often very limited). In seismic hazard analyses, evaluating the alternative models involves considering alternative simplified physical models, data from analogous regions, and empirical observations. These are subjective. In some cases, uncertainties are developed from statistical evaluations, but that is not usually the case. Prior to the use of logic tress, the approach was to develop the single best model. In controversial projects, there would be disagreement between the project sponsors, regulators, and interveners as to which model was the best model. Logic trees were used to allow multiple models to be considered with weights that reflected the degree of belief of the scientific community (or at least the seismic hazard analyst) in the alternative models. In this way, all proposed models that were credible could be considered without having to select a single best model. 5-6
7 Using logic trees results in a suite of alternative of the estimates of the hazard each with an associated weight. 5.4 Underestimation of Epistemic Uncertainty In the above discussion, the logic trees are interpreted to represent the scientific uncertainty in the source characterization and ground motion attenuation; however, in practice, logic trees represent the range of available alternative models. In many cases, the range of available models will not cover the epistemic uncertainty. In developing the epistemic uncertainty, the guiding concept should be that less data means larger uncertainty. This seems like a simple concept, yet it is often not followed. Consider two faults, one that has had many studies and another that has had only a single study. In current practice, the logic tree will consider only available models (sometime this is further restricted to models published in referred journals). So for the well-studied fault, there will be several alternative models available, but for the poorly studied fault there will be only a single model available. By considering only the one available model for the poorly studied fault, 100% weight is given to that model, implying that there is no uncertainty in the model. In contrast, for the well-studied fault, there will be several alternative models available over which the weights will be spread. The result of this approach is that the computed epistemic uncertainty will be larger for the fault with more data. An additional consequence of the current practice is that as additional research is conducted, additional models will be developed, leading to more branches on the logic tree and larger uncertainty in the computed hazard. Over time this is what has happened. In many cases, our estimates of the epistemic uncertainty have increased, not decreased as additional data have been collected and models developed (ref). This reflects a tendency of scientists to underestimate epistemic uncertainty in regions with little data. But how can you develop uncertainty estimates with no data? Recall our guiding concept is that less data means larger uncertainty. Regions with more data and more models can 5-7
8 be used as a lower bound of the uncertainty for regions with little or no data. What is needed is a set of generic uncertainties that can be used with simple models to capture the possible range of behaviors of more complex models for regions with little or no data. 5-8
Descriptive Statistics and Measurement Scales
Descriptive Statistics 1 Descriptive Statistics and Measurement Scales Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample
Determination of source parameters from seismic spectra
Topic Determination of source parameters from seismic spectra Authors Michael Baumbach, and Peter Bormann (formerly GeoForschungsZentrum Potsdam, Telegrafenberg, D-14473 Potsdam, Germany); E-mail: [email protected]
99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, 99.42 cm
Error Analysis and the Gaussian Distribution In experimental science theory lives or dies based on the results of experimental evidence and thus the analysis of this evidence is a critical part of the
Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 10
CS 70 Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 10 Introduction to Discrete Probability Probability theory has its origins in gambling analyzing card games, dice,
Topic #6: Hypothesis. Usage
Topic #6: Hypothesis A hypothesis is a suggested explanation of a phenomenon or reasoned proposal suggesting a possible correlation between multiple phenomena. The term derives from the ancient Greek,
Stat 20: Intro to Probability and Statistics
Stat 20: Intro to Probability and Statistics Lecture 16: More Box Models Tessa L. Childers-Day UC Berkeley 22 July 2014 By the end of this lecture... You will be able to: Determine what we expect the sum
Background Biology and Biochemistry Notes A
Background Biology and Biochemistry Notes A Vocabulary dependent variable evidence experiment hypothesis independent variable model observation prediction science scientific investigation scientific law
2.6. Probability. In general the probability density of a random variable satisfies two conditions:
2.6. PROBABILITY 66 2.6. Probability 2.6.. Continuous Random Variables. A random variable a real-valued function defined on some set of possible outcomes of a random experiment; e.g. the number of points
AMS 5 CHANCE VARIABILITY
AMS 5 CHANCE VARIABILITY The Law of Averages When tossing a fair coin the chances of tails and heads are the same: 50% and 50%. So if the coin is tossed a large number of times, the number of heads and
Unit 4 Lesson 6 Measuring Earthquake Waves. Copyright Houghton Mifflin Harcourt Publishing Company
Shake, Rattle, and Roll What happens during an earthquake? As plates of the lithosphere move, the stress on rocks at or near the edges of the plates increases. This stress causes faults to form. A fault
Name: Date: Class: Finding Epicenters and Measuring Magnitudes Worksheet
Example Answers Name: Date: Class: Finding Epicenters and Measuring Magnitudes Worksheet Objective: To use seismic data and an interactive simulation to triangulate the location and measure the magnitude
Seismic Risk Assessment Procedures for a System consisting of Distributed Facilities -Part three- Insurance Portfolio Analysis
Seismic Risk Assessment Procedures for a System consisting of Distributed Facilities -Part three- Insurance Portfolio Analysis M. Achiwa & M. Sato Yasuda Risk Engineering Co., Ltd., Tokyo, Japan M. Mizutani
AP Physics 1 and 2 Lab Investigations
AP Physics 1 and 2 Lab Investigations Student Guide to Data Analysis New York, NY. College Board, Advanced Placement, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks
CALCULATIONS & STATISTICS
CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents
Plate Tectonics: Ridges, Transform Faults and Subduction Zones
Plate Tectonics: Ridges, Transform Faults and Subduction Zones Goals of this exercise: 1. review the major physiographic features of the ocean basins 2. investigate the creation of oceanic crust at mid-ocean
6 3 The Standard Normal Distribution
290 Chapter 6 The Normal Distribution Figure 6 5 Areas Under a Normal Distribution Curve 34.13% 34.13% 2.28% 13.59% 13.59% 2.28% 3 2 1 + 1 + 2 + 3 About 68% About 95% About 99.7% 6 3 The Distribution Since
DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.
DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,
= δx x + δy y. df ds = dx. ds y + xdy ds. Now multiply by ds to get the form of the equation in terms of differentials: df = y dx + x dy.
ERROR PROPAGATION For sums, differences, products, and quotients, propagation of errors is done as follows. (These formulas can easily be calculated using calculus, using the differential as the associated
FOURTH GRADE EARTHQUAKES 1 WEEK LESSON PLANS AND ACTIVITIES
FOURTH GRADE EARTHQUAKES 1 WEEK LESSON PLANS AND ACTIVITIES PLATE TECTONIC CYCLE OVERVIEW OF FOURTH GRADE VOLCANOES WEEK 1. PRE: Comparing different structures of volcanoes. DURING: Modeling three types
This Performance Standards include four major components. They are
Eighth Grade Science Curriculum Approved July 12, 2004 The Georgia Performance Standards are designed to provide students with the knowledge and skills for proficiency in science at the eighth grade level.
Mechanics 1: Conservation of Energy and Momentum
Mechanics : Conservation of Energy and Momentum If a certain quantity associated with a system does not change in time. We say that it is conserved, and the system possesses a conservation law. Conservation
MATH 140 Lab 4: Probability and the Standard Normal Distribution
MATH 140 Lab 4: Probability and the Standard Normal Distribution Problem 1. Flipping a Coin Problem In this problem, we want to simualte the process of flipping a fair coin 1000 times. Note that the outcomes
MEASURES OF VARIATION
NORMAL DISTRIBTIONS MEASURES OF VARIATION In statistics, it is important to measure the spread of data. A simple way to measure spread is to find the range. But statisticians want to know if the data are
1 Prior Probability and Posterior Probability
Math 541: Statistical Theory II Bayesian Approach to Parameter Estimation Lecturer: Songfeng Zheng 1 Prior Probability and Posterior Probability Consider now a problem of statistical inference in which
Significant Figures, Propagation of Error, Graphs and Graphing
Chapter Two Significant Figures, Propagation of Error, Graphs and Graphing Every measurement has an error associated with it. If you were to put an object on a balance and weight it several times you will
Current Standard: Mathematical Concepts and Applications Shape, Space, and Measurement- Primary
Shape, Space, and Measurement- Primary A student shall apply concepts of shape, space, and measurement to solve problems involving two- and three-dimensional shapes by demonstrating an understanding of:
The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median
CONDENSED LESSON 2.1 Box Plots In this lesson you will create and interpret box plots for sets of data use the interquartile range (IQR) to identify potential outliers and graph them on a modified box
Conn Valuation Services Ltd.
CAPITALIZED EARNINGS VS. DISCOUNTED CASH FLOW: Which is the more accurate business valuation tool? By Richard R. Conn CMA, MBA, CPA, ABV, ERP Is the capitalized earnings 1 method or discounted cash flow
EARTHQUAKE MAGNITUDE
EARTHQUAKE MAGNITUDE Earliest measure of earthquake size Dimensionless number measured various ways, including M L local magnitude m b body wave magnitude M s surface wave magnitude M w moment magnitude
Hypothesis Testing for Beginners
Hypothesis Testing for Beginners Michele Piffer LSE August, 2011 Michele Piffer (LSE) Hypothesis Testing for Beginners August, 2011 1 / 53 One year ago a friend asked me to put down some easy-to-read notes
MEASURES OF CENTER AND SPREAD MEASURES OF CENTER 11/20/2014. What is a measure of center? a value at the center or middle of a data set
MEASURES OF CENTER AND SPREAD Mean and Median MEASURES OF CENTER What is a measure of center? a value at the center or middle of a data set Several different ways to determine the center: Mode Median Mean
Determination of g using a spring
INTRODUCTION UNIVERSITY OF SURREY DEPARTMENT OF PHYSICS Level 1 Laboratory: Introduction Experiment Determination of g using a spring This experiment is designed to get you confident in using the quantitative
CHAPTER 14 NONPARAMETRIC TESTS
CHAPTER 14 NONPARAMETRIC TESTS Everything that we have done up until now in statistics has relied heavily on one major fact: that our data is normally distributed. We have been able to make inferences
Math/Stats 425 Introduction to Probability. 1. Uncertainty and the axioms of probability
Math/Stats 425 Introduction to Probability 1. Uncertainty and the axioms of probability Processes in the real world are random if outcomes cannot be predicted with certainty. Example: coin tossing, stock
Remodelling the Big Bang
Remodelling the Big Bang Dewey B. Larson Unquestionably, the most significant development that has taken place in cosmology in recent years is the replacement of the original Big Bang theory by a totally
Measurement with Ratios
Grade 6 Mathematics, Quarter 2, Unit 2.1 Measurement with Ratios Overview Number of instructional days: 15 (1 day = 45 minutes) Content to be learned Use ratio reasoning to solve real-world and mathematical
Mathematical goals. Starting points. Materials required. Time needed
Level S2 of challenge: B/C S2 Mathematical goals Starting points Materials required Time needed Evaluating probability statements To help learners to: discuss and clarify some common misconceptions about
Lecture 2: Discrete Distributions, Normal Distributions. Chapter 1
Lecture 2: Discrete Distributions, Normal Distributions Chapter 1 Reminders Course website: www. stat.purdue.edu/~xuanyaoh/stat350 Office Hour: Mon 3:30-4:30, Wed 4-5 Bring a calculator, and copy Tables
RANDOM VIBRATION AN OVERVIEW by Barry Controls, Hopkinton, MA
RANDOM VIBRATION AN OVERVIEW by Barry Controls, Hopkinton, MA ABSTRACT Random vibration is becoming increasingly recognized as the most realistic method of simulating the dynamic environment of military
A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution
A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 4: September
Earthquakes. Earthquakes: Big Ideas. Earthquakes
Earthquakes Earthquakes: Big Ideas Humans cannot eliminate natural hazards but can engage in activities that reduce their impacts by identifying high-risk locations, improving construction methods, and
Econometrics Simple Linear Regression
Econometrics Simple Linear Regression Burcu Eke UC3M Linear equations with one variable Recall what a linear equation is: y = b 0 + b 1 x is a linear equation with one variable, or equivalently, a straight
Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics
Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),
Reality in the Eyes of Descartes and Berkeley. By: Nada Shokry 5/21/2013 AUC - Philosophy
Reality in the Eyes of Descartes and Berkeley By: Nada Shokry 5/21/2013 AUC - Philosophy Shokry, 2 One person's craziness is another person's reality. Tim Burton This quote best describes what one finds
OUTLIER ANALYSIS. Data Mining 1
OUTLIER ANALYSIS Data Mining 1 What Are Outliers? Outlier: A data object that deviates significantly from the normal objects as if it were generated by a different mechanism Ex.: Unusual credit card purchase,
Probability Using Dice
Using Dice One Page Overview By Robert B. Brown, The Ohio State University Topics: Levels:, Statistics Grades 5 8 Problem: What are the probabilities of rolling various sums with two dice? How can you
Statistics. Measurement. Scales of Measurement 7/18/2012
Statistics Measurement Measurement is defined as a set of rules for assigning numbers to represent objects, traits, attributes, or behaviors A variableis something that varies (eye color), a constant does
Jitter Measurements in Serial Data Signals
Jitter Measurements in Serial Data Signals Michael Schnecker, Product Manager LeCroy Corporation Introduction The increasing speed of serial data transmission systems places greater importance on measuring
8. THE NORMAL DISTRIBUTION
8. THE NORMAL DISTRIBUTION The normal distribution with mean μ and variance σ 2 has the following density function: The normal distribution is sometimes called a Gaussian Distribution, after its inventor,
BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS
BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-110 012 [email protected] Genomics A genome is an organism s
Circuits 1 M H Miller
Introduction to Graph Theory Introduction These notes are primarily a digression to provide general background remarks. The subject is an efficient procedure for the determination of voltages and currents
6.3 Conditional Probability and Independence
222 CHAPTER 6. PROBABILITY 6.3 Conditional Probability and Independence Conditional Probability Two cubical dice each have a triangle painted on one side, a circle painted on two sides and a square painted
1 Sufficient statistics
1 Sufficient statistics A statistic is a function T = rx 1, X 2,, X n of the random sample X 1, X 2,, X n. Examples are X n = 1 n s 2 = = X i, 1 n 1 the sample mean X i X n 2, the sample variance T 1 =
1 Annex 11: Market failure in broadcasting
1 Annex 11: Market failure in broadcasting 1.1 This annex builds on work done by Ofcom regarding market failure in a number of previous projects. In particular, we discussed the types of market failure
Spatial Data Analysis
14 Spatial Data Analysis OVERVIEW This chapter is the first in a set of three dealing with geographic analysis and modeling methods. The chapter begins with a review of the relevant terms, and an outlines
New Work Item for ISO 3534-5 Predictive Analytics (Initial Notes and Thoughts) Introduction
Introduction New Work Item for ISO 3534-5 Predictive Analytics (Initial Notes and Thoughts) Predictive analytics encompasses the body of statistical knowledge supporting the analysis of massive data sets.
Bachelor's Degree in Business Administration and Master's Degree course description
Bachelor's Degree in Business Administration and Master's Degree course description Bachelor's Degree in Business Administration Department s Compulsory Requirements Course Description (402102) Principles
Point and Interval Estimates
Point and Interval Estimates Suppose we want to estimate a parameter, such as p or µ, based on a finite sample of data. There are two main methods: 1. Point estimate: Summarize the sample by a single number
Correlating PSI and CUP Denton Bramwell
Correlating PSI and CUP Denton Bramwell Having inherited the curiosity gene, I just can t resist fiddling with things. And one of the things I can t resist fiddling with is firearms. I think I am the only
E3: PROBABILITY AND STATISTICS lecture notes
E3: PROBABILITY AND STATISTICS lecture notes 2 Contents 1 PROBABILITY THEORY 7 1.1 Experiments and random events............................ 7 1.2 Certain event. Impossible event............................
EDUCATION AND EXAMINATION COMMITTEE SOCIETY OF ACTUARIES RISK AND INSURANCE. Copyright 2005 by the Society of Actuaries
EDUCATION AND EXAMINATION COMMITTEE OF THE SOCIET OF ACTUARIES RISK AND INSURANCE by Judy Feldman Anderson, FSA and Robert L. Brown, FSA Copyright 25 by the Society of Actuaries The Education and Examination
QUANTITATIVE RISK ASSESSMENT FOR ACCIDENTS AT WORK IN THE CHEMICAL INDUSTRY AND THE SEVESO II DIRECTIVE
QUANTITATIVE RISK ASSESSMENT FOR ACCIDENTS AT WORK IN THE CHEMICAL INDUSTRY AND THE SEVESO II DIRECTIVE I. A. PAPAZOGLOU System Reliability and Industrial Safety Laboratory National Center for Scientific
Covariance and Correlation
Covariance and Correlation ( c Robert J. Serfling Not for reproduction or distribution) We have seen how to summarize a data-based relative frequency distribution by measures of location and spread, such
Session 6 Number Theory
Key Terms in This Session Session 6 Number Theory Previously Introduced counting numbers factor factor tree prime number New in This Session composite number greatest common factor least common multiple
Interpreting Data in Normal Distributions
Interpreting Data in Normal Distributions This curve is kind of a big deal. It shows the distribution of a set of test scores, the results of rolling a die a million times, the heights of people on Earth,
Probability & Probability Distributions
Probability & Probability Distributions Carolyn J. Anderson EdPsych 580 Fall 2005 Probability & Probability Distributions p. 1/61 Probability & Probability Distributions Elementary Probability Theory Definitions
12.510 Introduction to Seismology Spring 2008
MIT OpenCourseWare http://ocw.mit.edu 12.510 Introduction to Seismology Spring 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms. 04/30/2008 Today s
Plate Tectonics. Introduction. Boundaries between crustal plates
Plate Tectonics KEY WORDS: continental drift, seafloor spreading, plate tectonics, mid ocean ridge (MOR) system, spreading center, rise, divergent plate boundary, subduction zone, convergent plate boundary,
MIDLAND ISD ADVANCED PLACEMENT CURRICULUM STANDARDS AP ENVIRONMENTAL SCIENCE
Science Practices Standard SP.1: Scientific Questions and Predictions Asking scientific questions that can be tested empirically and structuring these questions in the form of testable predictions SP.1.1
Notes on Continuous Random Variables
Notes on Continuous Random Variables Continuous random variables are random quantities that are measured on a continuous scale. They can usually take on any value over some interval, which distinguishes
ST 371 (IV): Discrete Random Variables
ST 371 (IV): Discrete Random Variables 1 Random Variables A random variable (rv) is a function that is defined on the sample space of the experiment and that assigns a numerical variable to each possible
4. Continuous Random Variables, the Pareto and Normal Distributions
4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random
Plato gives another argument for this claiming, relating to the nature of knowledge, which we will return to in the next section.
Michael Lacewing Plato s theor y of Forms FROM SENSE EXPERIENCE TO THE FORMS In Book V (476f.) of The Republic, Plato argues that all objects we experience through our senses are particular things. We
EARTHQUAKE PREDICTION
Lecture 15 Earthquake Prediction EARTHQUAKE PREDICTION To successfully predict an earthquake we would like to know:- PLACE TIME MAGNITUDE (rather like a weather forecast) 1 Evidence must be integrated
4.1 4.2 Probability Distribution for Discrete Random Variables
4.1 4.2 Probability Distribution for Discrete Random Variables Key concepts: discrete random variable, probability distribution, expected value, variance, and standard deviation of a discrete random variable.
The Normal Approximation to Probability Histograms. Dice: Throw a single die twice. The Probability Histogram: Area = Probability. Where are we going?
The Normal Approximation to Probability Histograms Where are we going? Probability histograms The normal approximation to binomial histograms The normal approximation to probability histograms of sums
HOW TO DO A SCIENCE PROJECT Step-by-Step Suggestions and Help for Elementary Students, Teachers, and Parents Brevard Public Schools
HOW TO DO A SCIENCE PROJECT Step-by-Step Suggestions and Help for Elementary Students, Teachers, and Parents Brevard Public Schools 1. Get an Idea for Your Project Find an area that interests you. You
HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS
Mathematics Revision Guides Histograms, Cumulative Frequency and Box Plots Page 1 of 25 M.K. HOME TUITION Mathematics Revision Guides Level: GCSE Higher Tier HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS
Probability: Terminology and Examples Class 2, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom
Probability: Terminology and Examples Class 2, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom 1 Learning Goals 1. Know the definitions of sample space, event and probability function. 2. Be able to
Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion
Descriptive Statistics Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Statistics as a Tool for LIS Research Importance of statistics in research
1 Error in Euler s Method
1 Error in Euler s Method Experience with Euler s 1 method raises some interesting questions about numerical approximations for the solutions of differential equations. 1. What determines the amount of
Intro to Data Analysis, Economic Statistics and Econometrics
Intro to Data Analysis, Economic Statistics and Econometrics Statistics deals with the techniques for collecting and analyzing data that arise in many different contexts. Econometrics involves the development
6.4 Normal Distribution
Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under
Problem of the Month: Fair Games
Problem of the Month: The Problems of the Month (POM) are used in a variety of ways to promote problem solving and to foster the first standard of mathematical practice from the Common Core State Standards:
Experimental Uncertainties (Errors)
Experimental Uncertainties (Errors) Sources of Experimental Uncertainties (Experimental Errors): All measurements are subject to some uncertainty as a wide range of errors and inaccuracies can and do happen.
Chapter 1 Introduction. 1.1 Introduction
Chapter 1 Introduction 1.1 Introduction 1 1.2 What Is a Monte Carlo Study? 2 1.2.1 Simulating the Rolling of Two Dice 2 1.3 Why Is Monte Carlo Simulation Often Necessary? 4 1.4 What Are Some Typical Situations
Lesson 1. Basics of Probability. Principles of Mathematics 12: Explained! www.math12.com 314
Lesson 1 Basics of Probability www.math12.com 314 Sample Spaces: Probability Lesson 1 Part I: Basic Elements of Probability Consider the following situation: A six sided die is rolled The sample space
Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),
Chapter 0 Key Ideas Correlation, Correlation Coefficient (r), Section 0-: Overview We have already explored the basics of describing single variable data sets. However, when two quantitative variables
Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini
NEW YORK UNIVERSITY ROBERT F. WAGNER GRADUATE SCHOOL OF PUBLIC SERVICE Course Syllabus Spring 2016 Statistical Methods for Public, Nonprofit, and Health Management Section Format Day Begin End Building
STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables
Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Discrete vs. continuous random variables Examples of continuous distributions o Uniform o Exponential o Normal Recall: A random
5.1 Identifying the Target Parameter
University of California, Davis Department of Statistics Summer Session II Statistics 13 August 20, 2012 Date of latest update: August 20 Lecture 5: Estimation with Confidence intervals 5.1 Identifying
Appendix A: Science Practices for AP Physics 1 and 2
Appendix A: Science Practices for AP Physics 1 and 2 Science Practice 1: The student can use representations and models to communicate scientific phenomena and solve scientific problems. The real world
Oxford Cambridge and RSA Examinations
Oxford Cambridge and RSA Examinations OCR FREE STANDING MATHEMATICS QUALIFICATION (ADVANCED): ADDITIONAL MATHEMATICS 6993 Key Features replaces and (MEI); developed jointly by OCR and MEI; designed for
Presentations. Session 1. Slide 1. Earthquake Risk Reduction. 1- Concepts & Terminology
Earthquake Risk Reduction Presentations Session 1 Slide 1 Earthquake Risk Reduction 1- Concepts & Terminology Welcome to the World Bank Institute s (WBI) Distance Learning (DL) course on Earthquake Risk
EXPERIMENTAL ERROR AND DATA ANALYSIS
EXPERIMENTAL ERROR AND DATA ANALYSIS 1. INTRODUCTION: Laboratory experiments involve taking measurements of physical quantities. No measurement of any physical quantity is ever perfectly accurate, except
Mathematics Pre-Test Sample Questions A. { 11, 7} B. { 7,0,7} C. { 7, 7} D. { 11, 11}
Mathematics Pre-Test Sample Questions 1. Which of the following sets is closed under division? I. {½, 1,, 4} II. {-1, 1} III. {-1, 0, 1} A. I only B. II only C. III only D. I and II. Which of the following
Alabama Department of Postsecondary Education
Date Adopted 1998 Dates reviewed 2007, 2011, 2013 Dates revised 2004, 2008, 2011, 2013, 2015 Alabama Department of Postsecondary Education Representing Alabama s Public Two-Year College System Jefferson
