Uncertainty Evaluation in a Nutshell. Evaluating Measurement Uncertainty. Uncertainty Evaluation Area of Circle MEASUREMENT EQUATION & GAUSS S FORMULA
|
|
|
- Amos Griffith
- 10 years ago
- Views:
Transcription
1 Uncertainty Evaluation in a Nutshell Evaluating Measurement Uncertainty Antonio Possolo October 30th, 2013 SIM METROLOGY SCHOOL Measurement uncertainty Reflects incomplete knowledge about value of measurand Expressed most completely by probability distribution Experimental data may be used alone or combined with other information to Estimate measurand Evaluate measurement uncertainty Bottom Up vs. Top Down Measurement equation Monte Carlo / Gauss NIST Uncertainty Machine Observation equation Mathematical Statistics Expert collaboration Possolo 1 / 51 Possolo 2 / 51 Uncertainty Evaluation Area of Circle MEASUREMENT EQUATION & GAUSS S FORMULA Measure diameter D, compute radius R = D/2, estimate measurand A = πr 2 GAUSS S FORMULA (A) A 2 (R) R Uncertainty Evaluation Area of Circle MEASUREMENT EQUATION & MONTE CARLO METHOD Measure diameter D, compute radius R = D/2, estimate measurand A = πr 2 MONTE CARLO METHOD Specify probability model to describe uncertainty when defining diameter Simulate process many times Assess dispersion of area estimates D = m R = m (R) = m A = m 2 (A) m 2 Diameter has Gaussian azimuth with no bias and uncertainty π/30 A = m 2 (A) = m 2 Possolo 3 / 51 Possolo 4 / 51
2 Uncertainty Evaluation Thermal Bath OBSERVATION EQUATION & STATISTICAL MODEL Temperature ( C) Measure temperature of thermal bath every minute for 100 minutes Determine if bath is thermally stable, and if so evaluate uncertainty of average temperature ARIMA STATISTICAL MODEL Time (minute) Identify best ARIMA model Fit best model to data Best model is stationary: bath thermally stable t = C (t) = C Possolo 5 / 51 Outline Uncertainty Evaluation in a Nutshell Area of Circle Gauss s Formula Area of Circle Monte Carlo Method Thermal Bath Statistical Model Probability Distributions & Random Variables Measurement Uncertainty & Measurement Error Measurement Models Measurement Equations Observation Equations Evaluation of Measurement Uncertainty NIST Uncertainty Machine Load Cell Calibration Possolo 6 / 51 Uncertainty Meaning Uncertainty Interpretation (GUM) MEANING Uncertainty is the condition of being uncertain (unsure, doubtful, not possessing complete or fully reliable knowledge) Also a qualitative or quantitative expression of the degree or extent of such condition It is a subjective condition because it pertains to the perception or understanding that you have of the object of interest INTERPRETATION (GUM) The uncertainty of the result of a measurement reflects the lack of exact knowledge of the value of the measurand GUM [3.3.1] State of knowledge described most completely by probability distribution over set of possible values of measurand Probability distribution expresses how well one believes one knows the measurand s true value Possolo 7 / 51 Possolo 8 / 51
3 Outline Probability Distributions Motivation 1 Probability Distributions & Random Variables 2 Measurement Uncertainty & Measurement Error 3 Measurement Models 4 Evaluation of Measurement Uncertainty 5 Load Cell Calibration Measurand: Number of jelly beans in the jar Frequency distribution of students estimates Frequency of Guess Depicts dispersion of measurement results Characterizes students collective uncertainty Possolo 9 / 51 Number of Jelly Beans Possolo 10 / 51 Probability Distributions & Random Variables CONCEPTS Probability distribution specifies probability of unknown value of a quantity Y being in any given subset of its range B Pr(Y B), for B a subset of B Random variable is a mathematical model for unknown value of a quantity that has associated a probability distribution All quantities about whose values there is uncertainty can be modeled as random variables Even if the quantity value is fixed (but unknown) Irrespective of whether they relate to chance events Possolo 11 / 51 Probability Distributions & Random Variables EXAMPLE EXAMPLE ATOMIC WEIGHT OF BORON Probability Density A r (B) Random variable A r (B) describes atomic weight of boron in sample known to come from one of main commercial sources in US, Turkey, Chile, Argentina, or Russia Probability density gives probability of A r (B) s value being in any given interval as area under curve Possolo 12 / 51
4 Outline Measurement Uncertainty VIM / GUM DEFINITION 1 Probability Distributions & Random Variables 2 Measurement Uncertainty & Measurement Error 3 Measurement Models 4 Evaluation of Measurement Uncertainty 5 Load Cell Calibration DEFINITION Measurement uncertainty is a non-negative parameter characterizing the dispersion of the quantity values being attributed to a measurand, based on the information used VIM 2.26 For scalar measurands, that parameter typically chosen to be standard deviation of probability distribution describing dispersion of values For the many measurands that are not scalars (spectra, maps, shapes, genomes, etc.) use suitable generalization Possolo 13 / 51 Possolo 14 / 51 Measurement Uncertainty Sources Measurement Error SOURCES / EFFECTS Principles and methods of measurement Definition of measurand Environmental conditions Instrument calibration Calibration standards and corrections Temporal drifts of relevant attributes of measurand, instruments, procedures Differences between operators and between laboratories Statistical models and methods for data reduction Difference between measured quantity value and true (or, reference) quantity value Not observable, and generally unknown If measurand has conventional value (speed of light), then that difference can be computed and measurement error becomes known Contemporary approach emphasizes uncertainty, as opposed to error Facilitates use of uniform methods for uncertainty analysis Possolo 15 / 51 Possolo 16 / 51
5 Measurement Uncertainty Evaluation Measurement Error Fixed (Systematic) Evaluated so as to be fit for purpose Sources of uncertainty may be evaluated based on: Experimental data (Type A evaluations) Other sources of information (Type B evaluations) Elicitation of expert opinion structured procedure to do Type B evaluations, for example using Sheffield Elicitation Framework (SHELF) FIXED (Systematic) MEASUREMENT ERROR Component of measurement error that, in replicate measurements, either remains constant or varies in a predictable manner Incomplete extraction when measuring mass fractions of multiple PCB congeners Wrong value of thermal expansion coefficient in length measurements Evaluated either by Type A or Type B methods Possolo 17 / 51 Possolo 18 / 51 Measurement Error Variable (Random) Accuracy & Precision Illustration VARIABLE (Random) MEASUREMENT ERROR Component of measurement error that in replicate measurements varies in an unpredictable manner Manifest in dispersion of multiple measured values obtained under repeatability conditions Same measurement procedure, same operators, same measuring system, same operating conditions and same location, made on same object over short period of time Evaluated either by Type A or Type B methods Possolo 19 / 51 Possolo 20 / 51
6 Accuracy & Precision Definition Accuracy & Precision Where s the Target? MEASUREMENT ACCURACY (VIM 2.13) Closeness of agreement between a measured quantity value and a true quantity value of a measurand How to evaluate if true quantity is unknown? MEASUREMENT PRECISION (VIM 2.15) Closeness of agreement between indications or measured quantity values obtained by replicate measurements on the same or similar objects under specified conditions Possolo 21 / 51 Possolo 22 / 51 Measurement Uncertainty Evaluation BOTTOM-UP / TOP-DOWN Types of Measurement Uncertainty Evaluations TYPE A Bottom-up assessments uncertainty budgets for individual labs or measurement methods Top-down assessments via interlaboratory and multiple method studies Often reveal unsuspected uncertainty components Dark uncertainty Thompson & Ellison (2011) TYPE B Based on statistical scatter of measured values obtained under comparable measurement conditions (repeatability, reproducibility) Based on other evidence, including information Published in compilations of quantity values Obtained from a calibration certificate, or associated with certified reference material Obtained from experts Possolo 23 / 51 Possolo 24 / 51
7 Uncertainty Components Interlab Study Outline Lab-specific Dispersion of measured values from different labs 1 Probability Distributions & Random Variables 2 Measurement Uncertainty & Measurement Error As (mg/kg) LABORATORIES 3 Measurement Models 4 Evaluation of Measurement Uncertainty 5 Load Cell Calibration Possolo 25 / 51 Possolo 26 / 51 Measurement Models & Uncertainty Evaluations ORIENTATION MEASUREMENT MODELS Measurement equations Observation equations May have to use both in the course of an uncertainty evaluation UNCERTAINTY EVALUATIONS Measurement equations: Gauss s formula or Monte Carlo Method NIST Uncertainty Machine Observation equations: Statistical methods Measurement Models MEASUREMENT EQUATION DEFINITION Measurand (output quantity) is known function of input quantities Estimates of the input quantities and characterizations of associated uncertainties are available EXAMPLE Thermal expansion coefficient α = L 1 L 0 L 0 (T 1 T 0 ) Possolo 27 / 51 Possolo 28 / 51
8 Measurement Models OBSERVATION EQUATION DEFINITION & EXAMPLE (TEMPERATURE) Measurement Models OBSERVATION EQUATION EXAMPLE (WEIBULL LIFETIME) DEFINITION Measurand is function of parameters of statistical model for experimental data EXAMPLE Replicated measurements of temperature τ: t 1 = C, t 2 = C, t 3 = C, t 4 = C Measurement model (observation equation) t = τ + ε, for = 1,..., 4 ε 1, ε 2, ε 3, ε 4 non-observable measurement errors Lifetime W of mechanical part has Weibull probability distribution with shape α and scale β log W = log β + 1 log( log U) α Non-observable error U has rectangular distribution on (0, 1) Measurand is expected lifetime η = β (1 + 1 α ) Possolo 29 / 51 Possolo 30 / 51 Model Selection MEASUREMENT EQUATION vs. OBSERVATION EQUATION Measurement Uncertainty Evaluation MEASUREMENT EQUATION & GAUSS S FORMULA MEASUREMENT EQUATION Measurand is known function of input quantities, and these do not depend on value of measurand Estimates of input quantities, and characterizations of associated uncertainties, are available OBSERVATION EQUATION Observable quantities (experimental data) depend on value of measurand but relationship between them is stochastic, not deterministic Y = ƒ (X 1,..., X n ) GAUSS S (1823) FORMULA GUM (10), (13) 2 (y) n j=1 n 1 c 2 j 2 ( j ) + 2 n j=1 k=j+1 c j c k ( j ) ( k )r( j, k ) c j Value at ( 1,..., n ) of first-order partial derivative of ƒ with respect to jth argument Replicated values of same observable quantity typically vary (i.e., they are mutually inconsistent) even though measurand remains invariant r( j, k ) Correlation between X j and X k Possolo 31 / 51 Possolo 32 / 51
9 Gauss s Formula INGREDIENTS & ASSUMPTIONS Measurement Uncertainty Evaluation MEASUREMENT EQUATION & MONTE CARLO METHOD Estimates, standard uncertainties, and correlations of input quantities Values of partial derivatives of ƒ Measurement function ƒ approximately linear in neighborhood of estimates of input quantities Uncertainty of input quantities small relative to size of that neighborhood Probabilistic interpretation of y ± k (y) involves additional assumptions Possolo 33 / 51 Y = ƒ (X 1,..., X n ) MONTE CARLO METHOD GUM-S1 Apply perturbations to values of input quantities drawn from respective probability distributions Compute value of output quantity for each set of perturbed values of input quantities Values of output quantity are sample from corresponding probability distribution: Their standard deviation is an evaluation of (y) Use them also to produce coverage intervals and to characterize Y s probability density Possolo 34 / 51 Monte Carlo Method INGREDIENTS & ASSUMPTIONS Temperature Measurement OBSERVATION EQUATION Joint probability distribution of input quantities specified fully Specialized software to simulate draws from that distribution No need for linearizations (approximations) or for partial derivatives Results automatically interpretable probabilistically Replicated measurements of temperature τ: t 1 = C, t 2 = C, t 3 = C, t 4 = C Observation equation t = τ + ε If non-observable measurement errors {ε } have same Gaussian probability distribution with mean 0 and same standard deviation, then τ = t = ( )/4 = C estimates τ with minimum mean squared error Under different assumptions, best estimate need not be the average! Possolo 35 / 51 Possolo 36 / 51
10 Temperature Measurement UNCERTAINTY EVALUATION F-100 Super Sabre Lifetime OBSERVATION EQUATION EXAMPLE In the absence of any other recognized sources of uncertainty, and under same assumptions that make t best estimate: (t) = SD(99.49, , 99.56, )/ 4 = 0.35 C ± ( ) = (98.98 C, C) is approximate 95 % coverage interval for τ 4.30 is the 97.5th percentile of Student s t distribution with 3 degrees of freedom Measurand: component lifetime Times to failure (hour): 0.22, 0.50, 0.88, 1.00, 1.32, 1.33, 1.54, 1.76, 2.50, 3.00, 3+, 3+, 3+ Possolo 37 / 51 Possolo 38 / 51 F-100 Super Sabre Lifetime VALUE ASSIGNMENT, UNCERTAINTY EVALUATION Observation equation (statistical model) Observed lifetimes are sample from Weibull distribution with shape α and scale β F-100 Super Sabre Lifetime MAXIMUM LIKELIHOOD ESTIMATE, STATISTICAL BOOTSTRAP Measurement result: η = 1.84 h, ( η) = 0.30 h Lifetime shorter than 4.5 h with 99 % probability Value assignment Maximum likelihood estimates of parameters are α and β that maximize likelihood function, and η = β (1 + 1/ α) Uncertainty evaluation Parametric statistical bootstrap Draw samples from fitted distribution, of the same size and censored as experimental data Estimate parameters from these samples and compute corresponding values of expected lifetime: their standard deviation is evaluation of (η) Probability Density (1/h) η h Possolo 39 / 51 Possolo 40 / 51
11 Outline Evaluation of Measurement Uncertainty METHODS & TOOLS 1 Probability Distributions & Random Variables 2 Measurement Uncertainty & Measurement Error 3 Measurement Models 4 Evaluation of Measurement Uncertainty 5 Load Cell Calibration METHODS (1) Gauss s formula (Measurement Equation) (2) Monte Carlo method (Measurement Equation and Observation Equation) (3) Statistical method (Observation Equation) TOOLS (1) NIST Uncertainty Machine (2) NIST Uncertainty Machine (3) Collaboration between metrologist and statistician Possolo 41 / 51 Possolo 42 / 51 NIST Uncertainty Machine EXAMPLES NIST Uncertainty Machine Example THERMAL EXPANSION COEFFICIENT Thermal expansion coefficient Falling ball viscometer End-Gauge calibration Resistance Stefan-Boltzmann constant Freezing point depression α = L 1 L 0 L 0 (T 1 T 0 ) ( ) ν T K 0.02 K 3 L m m 3 T K 0.05 K 3 L m m 3 Possolo 43 / 51 Possolo 44 / 51
12 Example FALLING BALL VISCOMETER Outline μ M = μ C ρ B ρ M ρ B ρ C t M t C 1 Probability Distributions & Random Variables Prob. density GUM GUM S1 2 Measurement Uncertainty & Measurement Error 3 Measurement Models 4 Evaluation of Measurement Uncertainty 5 Load Cell Calibration µ / mpa s 22% solution of sodium hydroxide in water at 20 C HAAKE boron silica glass ball no. 2 Possolo 45 / 51 Possolo 46 / 51 Load Cell Calibration DATA & RESIDUALS Load Cell Calibration PROBLEM & SOLUTION Force (kn) Residual Force (kn) Calibration function produces values of force corresponding to response indications: F = φ(r) PROBLEM Both variables are affected by uncertainty Dispersion of response indications exceeds undertainty associated with each one individually Indication (mv/v) Fitted Force (kn) Relative standard uncertainties for both forces and indications are % Possolo 47 / 51 SOLUTION Errors-in-variables regression, as in ISO 6143:2001(E) for calibration of gas mixtures Possolo 48 / 51
13 Load Cell Calibration ERRORS-IN-VARIABLES REGRESSION Load Cell Calibration UNCERTAINTY EVALUATION GUM SUPPLEMENT 1 Second-degree polynomial is an adequate model for calibration function: φ(r) = α + βr + γr 2 Determine α, β, γ, and ρ 1,..., ρ m that minimize m F φ(ρ ) 2 2 R ρ + 2 (F ) 2 (R ) + σ 2 =1 σ describes the dispersion of replicated response indications at a fixed force setting Repeat the following steps n times Add Gaussian perturbations to the { ρ } with mean 0 and variances { 2 (R ) + σ 2 } Add Gaussian perturbations to the { F } with mean 0 and variances { 2 (F )} Fit the errors-in-variables model to the perturbed values Possolo 49 / 51 Possolo 50 / 51 Load Cell Calibration UNCERTAINTY EVALUATION RESULTS Probability distributions describing uncertainties associated with forces Prob. Density (1/kN) Prob. Density (1/kN) Prob. Density (1/kN) Prob. Density (1/kN) Force (kn) Force (kn) Force (kn) Force (kn) Prob. Density (1/kN) Prob. Density (1/kN) Prob. Density (1/kN) Prob. Density (1/kN) Force (kn) Force (kn) Force (kn) Force (kn) Prob. Density (1/kN) Prob. Density (1/kN) Prob. Density (1/kN) Relative Uncertainty % Force (kn) Force (kn) Force (kn) Possolo 51 / 51
14 Uncertainty Machine User s Manual Thomas Lafarge Antonio Possolo Statistical Engineering Division Information Technology Laboratory National Institute of Standards and Technology Gaithersburg, Maryland, USA July 10, Purpose NIST s UncertaintyMachine is a software application to evaluate the measurement uncertainty associated with an output quantity defined by a measurement model of the form y = f (x 1,..., x n ), where the real-valued function f is specified fully and explicitly, and the input quantities are modeled as random variables whose joint probability distribution also is specified fully. The UncertaintyMachine evaluates measurement uncertainty by application of two different methods: The method introduced by Gauss [1823] and described in the Guide to the Evaluation of Uncertainty in Measurement (GUM) [Joint Committee for Guides in Metrology, 2008a] and also by Taylor and Kuyatt [1994]; The Monte Carlo method described by Morgan and Henrion [1992] and specified in the Supplement 1 to the GUM (GUM-S1) [Joint Committee for Guides in Metrology, 2008b]. 2 Gauss s Formula vs. Monte Carlo Method The method described in the GUM produces an approximation to the standard measurement uncertainty u( y) of the output quantity, and it requires: (a) Estimates x 1,..., x n of the input quantities; (b) Standard measurement uncertainties u(x 1 ),..., u(x n ); (c) Correlations {r i j } between every pair of different input quantities (by default these are all assumed to be zero); (d) Values of the partial derivatives of f evaluated at x 1,..., x n.
15 When the probability distribution of the output quantity is approximately Gaussian, then the interval y ± 2u(y) may be interpreted as a coverage interval for the measurand with approximately 95 % coverage probability. The GUM also considers the case where the distribution of the output quantity is approximately Student s t with a number of degrees of freedom that may be a function of the numbers of degrees of freedom that the {u(x j )} are based on, computed using the Welch- Satterthwaite formula [Satterthwaite, 1946, Welch, 1947]. In general, neither the Gaussian nor the Student s t distributions need model the dispersion of values of the output quantity accurately, even when all the input quantities are modeled as Gaussian random variables. The GUM suggests that the Central Limit Theorem (CLT) lends support to the Gaussian approximation for the distribution of the output quantity. However, without a detailed examination of the measurement function f, and of the probability distribution of the input quantities (examinations that the GUM does not explain how to do), it is impossible to guarantee the adequacy of the Gaussian or Student s t approximations. The CLT states that, under some conditions, a sum of independent random variables has a probability distribution that is approximately Gaussian [Billingsley, 1979, Theorem 27.2]. The CLT is a limit theorem, in the sense that it concerns an infinite sequence of sums, and provides no indication about how close to Gaussian the distribution of a sum of a finite number of summands will be. Other results in probability theory provide such indications, but they involve more than just the means and variances that are required to apply Gauss s formula. The Monte Carlo method provides an arbitrarily large sample from the probability distribution of the output quantity, and it requires that the joint probability distribution of the random variables modeling the input quantities be specified fully. This sample alone suffices to compute the standard uncertainty associated with the output quantity, and to compute and to interpret coverage intervals probabilistically. Suppose that the measurement model is y = ab/c, and that a, b, and c are modeled as independent random variables such that: a is Gaussian with mean 32 and standard deviation 0.5; b has a uniform (or, rectangular) distribution with mean 0.9 and standard deviation 0.025; c has a symmetrical triangular distribution with mean 1 and standard deviation 0.3. Figure 1 on Page 3 shows the graphical user interface of the UncertaintyMachine filled in to reflect these modeling choices, and the results that are printed on-screen. Figure 2 on Page 4 shows a probability density estimate of the distribution of the output quantity. The method described in the GUM produces y = 28.8 and u(y) = 8.7. According to the conventional interpretation, the interval (11.4, 46.2) may be a coverage interval with approximately 95 % coverage probability. A sample of size produced by application of the Monte Carlo method has average and standard deviation Since only 88 % of the sample values lie within (11.4, 46.2), the coverage probability of this coverage interval is much lower than the conventional interpretation would have led one to believe.
16 Monte Carlo Method Summary statistics for sample of size 1e+06 ave = 32.2 sd = 12.5 median = 28.8 mad = 8.9 Coverage intervals 99% ( 17.12, 85 ) k = % ( 18, 67 ) k = 2 90% ( 19.11, 58 ) k = % ( 21.8, 42 ) k = % ( 23.7, 37 ) k = Gauss s Formula (GUM s Linear Approximation) y = 28.8 u(y) = 8.69 SensitivityCoeffs Percent.u2 a b c Correlations NA 0.00 ============================================ Figure 1: ABC. Entries in the GUI correspond to the example discussed in 2. In each numerical result, only the digits that the UncertaintyMachine deems to be significant are printed.
17 Figure 2: ABC Densities. Estimate of the probability density of the output quantity (solid blue line), and probability density (dotted red line) of a Gaussian distribution with the same mean and standard deviation as the output quantity, corresponding to the results listed in Figure 1 on Page 3. In this case, the Gaussian approximation is very inaccurate. 3 Software NIST s UncertaintyMachine should run on any computer where Oracle s Java ( com) is installed, irrespective of the operating system. Since the computations are done using facilities of the R environment for statistical computing and graphics [R Development Core Team, 2012], this too, must be installed. The software is installed as described in 10. Some commercial products, including software, are identified in this manual in order to specify the means whereby the UncertaintyMachine may be employed. Such identification is not intended to imply recommendation or endorsement by the National Institute of Standards and Technology, nor is it intended to imply that the software identified is necessarily the only or best available for the purpose. 4 Usage The following instructions are for using the UncertaintyMachine under the Microsoft Windows operating system: under other operating systems, the steps are similar. (U-1) Either by double-clicking the UncertaintyMachine icon where it will have been installed, or by selecting the appropriate item in the Start menu, launch the application s graphical user interface (GUI), which is displayed in a resizable window. (U-2) If one wishes to use a saved configuration, click Load Parameters, select the file where parameters will have been saved previously, and continue.
18 (U-3) Choose the number of input quantities from the drop-down menu corresponding to the entry Number of input quantities. In response to this, the GUI will update itself and show as many boxes as there are input quantities, and assign default names to them (which may be changed as explained below). (U-4) Enter the size of the sample to be drawn from the probability distribution of the output quantity, into the box labeled Number of realizations of output quantity: the default value, , is the minimum recommended sample size. (U-5) Enter the names of the input quantities into the boxes following Names of input quantities. (U-6) Click the button labeled Update quantity names: this will update the labels of the boxes that appear farther down in the GUI that are used to assign probability distributions to the input quantities. (U-7) Enter a valid R expression into the box labeled Value of output quantity (R expression) that defines the value of the output quantity. This expression should involve only the input quantities, and functions and numerical constants that R knows how to evaluate. (Remember that R is case sensitive.) Alternatively, the definition may comprise several R expressions, possibly in different lines within this box (pressing Enter on the keyboard, with the cursor in this box, creates a new line), but the last expression must evaluate the output quantity (without assigning this value to any variable). If the measurement model is A = (L 1 L 0 )/ L 0 (T 1 T 0 ), then the R expression that should then be entered into this box is (L1-L0)/(L0*(T1-T0)). Alternatively, the box may comprise these three lines: N = L1-L0 D = L0*(T1-T0) N / D Note that the last expression produces the value f (x 1,..., x n ) that the measurement function takes at the estimates of the input quantities. (U-8) Assign a probability distribution to each of the input quantities, using the drop-down menus in front of them. Once a choice is made, one or more additional input boxes will appear, where values of parameters must be entered fully to specify the probability distribution that was selected. Table 1 on Page 7 lists the distributions implemented currently, and their parametrizations. Note that some distributions have more than one parametrization: in such cases, only one of the parametrizations needs to be specified. (U-9) If there are correlations between input quantities that need to be taken into account, then check the box marked Correlations, and enter the values of non-zero correlations into the appropriate boxes in the upper triangle of the correlation matrix that the GUI will display. (U-10) If the box marked Correlations has been checked, then besides having specified correlations in (U-9), also select a copula (currently, either Gaussian or Student s t) to manufacture a joint probability distribution for the input quantities. If the copula
19 chosen is (multivariate) Student s t, then another box will appear nearby to receive the number of degrees of freedom. The resulting joint distribution reproduces the correlation structure that has been specified, and has the distributions specified for the input quantities as margins. Possolo [2010] explains and illustrates the role that copulas play in uncertainty analysis. (U-11) Optionally, if you wish to save the results of the calculations (numerical and graphical), then click the button labeled Choose file, and use the file selection dialog that is displayed to select the location, and the prefix for the output files. The prefix will be used to define the names of the three output files that will be created: (i) a plain ASCII text file where the sample of values of the output quantity will be written to, one per line; (ii) a JPEG file with a plot; and (iii) a plain ASCII text file with summary statistics of the Monte Carlo sample drawn from the distribution of the output quantity, and with the estimate of the measurand and the standard uncertainty evaluated as specified in the GUM. If the specified prefix is ABC.txt or ABC, then the three output files that will be created and named automatically will be called ABC-values.txt, ABC-density.jpg and ABC-results.txt. (U-12) Optionally, save the parameters specified in the GUI by clicking Save Parameters and choosing a file to save the GUI s current configuration to. This configuration comprises the definition of the measurement model and the parameter settings. (U-13) Click the button labeled Run. In response to this, a window will open where numerical results will be printed, and a graphics window will also open to display graphical results. If a file name will have been specified in (U-11), then both numerical and graphical output are saved to files. The UncertaintyMachine estimates the number of significant digits in the results, and reports only these. To increase the number of significant digits, another run will have to be done with a larger sample size than what was specified in (U-4). (U-14) To quit, press the button labeled Quit on the GUI, and close the window that the UncertaintyMachine created in (U-13), which will induce the graphics window also to close. 5 Results The UncertaintyMachine produces output in two windows on the screen, and optionally writes three files to disk, described next. One of the outputs shown on the computer screen is an R graphics window that shows a kernel estimate [Silverman, 1986] of the probability density of the output quantity (drawn in a solid blue line), and the probability density of the Gaussian distribution with the same mean and standard deviation as the Monte Carlo sample of values of the output quantity (drawn as a red dotted line).
20 Bernoulli Prob. of success 0 < Prob. of success < 1 Beta Mean, StdDev 0 < Mean < 1, 0 < StdDev < ½ Shape1, Shape2 Shape1 > 0, Shape2 > 0 Chi-Squared DF DF > 0 Exponential Mean Mean > 0 Gamma Mean, StdDev Mean > 0, StdDev > 0 Shape, Scale Shape > 0, Scale > 0 Gaussian Mean, StdDev StdDev > 0 Gaussian Truncated Mean, StdDev, Left, Right StdDev > 0, Left < Right Rectangular Mean, StdDev StdDev > 0 Left, Right Left < Right Student s t Mean, StdDev, DF StdDev > 0, DF > 2 Center, Scale, DF Scale > 0, DF > 0 Triangular Symmetric Mean, StdDev StdDev > 0 Left, Right Left < Right Triangular Asymmetric Left, Right, Mode Left Mode Right; Left Right Uniform Mean, StdDev StdDev > 0 Left, Right Left < Right Weibull Mean, StdDev Mean > 0, StdDev > 0 Shape, Scale Shape > 0, Scale > 0 Table 1: Distributions. Several distributions are available with alternative parametrizations: for these, it suffices to select and specify one of them. The rectangular distribution is the same as the uniform distribution. DF stands for number of degrees of freedom. Left and Right denote the left and right endpoints of the interval to which a distribution assigns probability 1. For the truncated Gaussian distribution, Mean and StdDev denote the mean and standard deviation without truncation: the actual mean and standard deviation depend also on the truncation points, and it is the actual mean and standard deviation that the GUM and Monte Carlo methods use in their calculations. The mode of a distribution is where its probability density reaches its maximum. A Student s t distribution will have infinite standard deviation unless DF > 2, and its mean will be undefined unless DF > 1. The values assigned to the parameters must satisfy the constraints listed.
21 Numerical output is written in another window, in the form of a table with summary statistics for the sample that was drawn from the probability distribution of the output quantity: average, standard deviation, median,. denotes the median absolute deviation from the median, multiplied by a factor (1.4826) that makes the result comparable to the standard deviation when applied to samples from Gaussian distributions. Also listed are coverage intervals with coverage probabilities 99 %, 95 %, 90 %, 68 %, and 50 %. The interval with 68 % coverage probability is often called a 1-sigma interval, and the interval with 95 % coverage probability is often called a 2-sigma interval : however, these designations are appropriate only when the distribution of the output quantity is approximately Gaussian. Next to each interval is listed the value of the corresponding coverage factor k (cf. GUM 3.3.7, 6.2). Below these, and in the same window, are listed the value of the output quantity corresponding to the estimates of the input quantities, and the value of u( y) computed using formula (13) in the GUM [Joint Committee for Guides in Metrology, 2008a, Page 21]. Finally, a table shows the sensitivity coefficients that are defined in the GUM 5.1.3: the values of the partial derivatives of the measurement function f evaluated at the estimates of the input quantities. The same table also shows the percentage contributions that the different input quantities make to the squared standard uncertainty of the output quantity. If the input quantities are uncorrelated, then these contributions add up to 100 % approximately. If they are correlated, then the contributions may add up to more or less than 100 %: in this case, the line labeled Correlations will indicate the percentage of u 2 (y) that is attributable to those correlations (this percentage is positive if u 2 (y) is larger than it would have been in the absence of correlations). If the user has specified a file name prefix in (U-11), following Save results in file in the GUI, then the first output file name ends in -values.txt and is a plain ASCII text file with one value per line of the sample that was drawn from the probability distribution of the output quantity. This file may be read into R or into any other computer program for statistical analysis, to produce additional numerical and graphical summaries. The second output file has the same prefix as the file just mentioned, but its name ends in -results.txt, and contains the same summary statistics and GUM uncertainty evaluation that were already displayed on the screen. The third output file has the same prefix as the file just mentioned, but its name ends in -density.jpg, and it is a JPEG file with the same graphical output that was displayed in the graphics window on the screen. 6 Example Thermal Expansion Coefficient To measure the coefficient of linear thermal expansion of a cylindrical copper bar, the length L 0 = m of the bar was measured with the bar at temperature T 0 = K, and 8
22 then again at temperature T 1 = K, yielding L 1 = m. The measurement model is A = (L 1 L 0 )/ L 0 (T 1 T 0 ) (this A denotes uppercase Greek alpha). For the purpose of this illustration we will assume that the input quantities are like (scaled and shifted) Student s t random variables with 3 degrees of freedom, with means equal to the measured values given, and standard deviations u(l 0 ) = m, u(l 1 ) = m, u(t 0 ) = 0.02 K, and u(t 1 ) = 0.05 K. The assignment of distributions to the four input quantities would be appropriate if their estimates were averages of four replicated readings each, and these were outcomes of independent Gaussian random variables with unknown common mean and standard deviation. The GUM s approach yields α = K 1 and u(α) = K 1, and the Monte Carlo method reproduces these results. Figure 3 on Page 10 reflects these facts, and lists the numerical results. The graphical results are displayed in Figure 4 on Page Example End-Gauge Calibration In Example H.1 of the GUM (which is reconsidered by Guthrie et al. [2009]), the measurement model is l = l S + d l S (δα θ + α S δθ). The estimates and standard measurement uncertainties of the input quantities are listed in Table 2. For the Monte Carlo method, we model the input quantities as independent Gaussian random variables with means and standard deviations equal to these estimates and standard measurement uncertainties. x u(x) l S nm 25 nm d 215 nm 9.7 nm δα 0 C C 1 θ 0.1 C 0.41 C α S C C 1 δθ 0 C C Table 2: End-Gauge Calibration. Estimates and standard measurement uncertainties for the input quantities in the measurement model of Example H.1 in the GUM. The GUM s approach yields l = nm and u(l) = 32 nm, while the Monte Carlo method reproduces the value for l but evaluates u(l) = 34 nm. The GUM (Page 84) gives ( nm, nm) as an approximate 99 % coverage interval for l, and the results of the Monte Carlo method confirm this coverage probability. If one chooses a coverage interval that is probabilistically symmetric (meaning that it leaves 0.5 % of the Monte Carlo sample uncovered on both sides), then the Monte Carlo method produces ( nm, nm) as 99 % coverage interval (and this is not quite centered at the estimate of y). Figure 5 on Page12 reflects these facts and lists the numerical results. The graphical results are displayed in Figure 6 on Page 13.
23 Monte Carlo Method Summary statistics for sample of size 1e+06 ave = 1.727e-05 sd = 1.7e-06 median = 1.727e-05 mad = 1e-06 Coverage intervals 99% ( 1.15e-05, 2.3e-05 ) k = % ( 1.4e-05, 2.05e-05 ) k = % ( 1.48e-05, 1.973e-05 ) k = % ( 1.6e-05, 1.857e-05 ) k = % ( 1.643e-05, 1.811e-05 ) k = Gauss s Formula (GUM s Linear Approximation) y = 1.727e-05 u(y) = 1.8e-06 SensitivityCoeffs Percent.u2 L0-7.9e e+01 L1 7.8e e+01 T0 2.0e e-04 T1-2.0e e-03 Correlations NA 0.0e+00 ## ============================================ Figure 3: Thermal Expansion Coefficient. Entries in the GUI correspond to the example discussed in 6. In each numerical result, only the digits that the UncertaintyMachine deems to be significant are printed.
24 Figure 4: Thermal Expansion Coefficient Densities. Estimate of the probability density of the output quantity (solid blue line), and probability density (dotted red line) of a Gaussian distribution with the same mean and standard deviation as the output quantity, corresponding to the results listed in Figure 3 on Page Example Resistance In Example H.2 of the GUM, the measurement model for the resistance of an element of an electrical circuit is R = (V /I) cos(φ). The estimates and standard uncertainties of the input quantities, and the correlations between them, are listed in Table 3 on Page 11. For the Monte Carlo method, we model the input quantities as correlated Gaussian random variables with means and standard deviations equal to the estimates and standard uncertainties listed in Table 3, and with correlations identical to those given in the same table. We also adopt a Gaussian copula to manufacture a joint probability distribution consistent with the assumptions already listed. x u(x) V V V I A A φ rad rad r(v, I) = 0.36 r(v, φ) = 0.86 r(i, φ) = 0.65 Table 3: Resistance. Estimates and standard measurement uncertainties for the input quantities in the measurement model of Example H.2 in the GUM, and correlations between them, all as listed in Table H.2 of the GUM. The GUM s approach and the Monte Carlo method produce the same values of the output quantity R = Ω and of the standard uncertainty u(r) = 0.07 Ω. The Monte Carlo method yields ( Ω, Ω) as approximate 95 % coverage interval for the resistance without invoking any additional assumptions about R. Figure 7 on Page 14 reflects
25 Monte Carlo Method Summary statistics for sample of size 1e+06 ave = sd = 33.9 median = mad = 34 Coverage intervals 99% ( , ) k = % ( , ) k = % ( , ) k = % ( , ) k = 1 50% ( , ) k = Gauss s Formula (GUM s Linear Approximation) y = u(y) = 31.7 SensitivityCoeffs Percent.u2 ls d dalpha theta alphas dtheta Correlations NA 0.00 ============================================ Figure 5: End-Gauge Calibration.
26 Figure 6: End-Gauge Calibration Densities. Estimate of the probability density of the output quantity (solid blue line), and probability density (dotted red line) of a Gaussian distribution with the same mean and standard deviation as the output quantity, corresponding to the results listed in Figure 5 on Page 12. these facts, and lists the numerical results. The graphical results are displayed in Figure 8 on Page Example Stefan-Boltzmann Constant The functional relation used to define the Stefan-Boltzmann constant σ involves the Planck constant h, the molar gas constant R, Rydberg s constant R, the relative atomic mass of the electron A r (e), the molar mass constant M u, the speed of light in vacuum c, and the fine-structure constant α: σ = 32π5 hr 4 R 4 15A r (e) 4 Mu 4c6 α 8. (1) Table 4 lists the 2010 CODATA [Mohr et al., 2012] recommended values of the quantities that determine the value of the Stefan-Boltzmann constant, and the measurement uncertainties associated with them. According to the GUM, the estimate of the measurand equals the value of the measurement function evaluated at the estimates of the input quantities, as σ = W m 2 K 4. Both the GUM s approximation and the Monte Carlo method produce the same evaluation of u(σ) = W m 2 K 4. These evaluations disregard the correlations between the input quantities that result from the adjustment process used by CODATA. However, once these correlations are taken into account via Equation (13) in the GUM, the same value still obtains for u(σ) to within the single significant digit reported above.
27 Monte Carlo Method Summary statistics for sample of size 1e+06 ave = sd = 0.07 median = mad = 0.07 Coverage intervals 99% ( , ) k = % ( , ) k = 2 90% ( , ) k = % ( , ) k = 1 50% ( , ) k = Gauss s Formula (GUM s Linear Approximation) y = u(y) = 0.07 SensitivityCoeffs Percent.u2 V I phi Correlations NA -670 ============================================ Figure 7: Resistance. Entries in the GUI correspond to the example discussed in 8. Note that, in this case, the UncertaintyMachine reconfigured its graphical user automatically to accommodate the correlations that had to be specified. In each numerical result, only the digits that the UncertaintyMachine deems to be significant are printed.
28 Figure 8: Resistance Densities. Estimate of the probability density of the output quantity (solid blue line), and probability density (dotted red line) of a Gaussian distribution with the same mean and standard deviation as the output quantity, corresponding to the results listed in Figure 7 on Page 14. h J s R J mol 1 K 1 R m 1 A r (e) u M u kg/mol c m/s α Table 4: Stefan-Boltzmann CODATA recommended values and standard measurement uncertainties for the quantities used to define the value of the Stefan- Boltzmann constant.
29 Without additional assumptions, it is impossible to interpret an expression like σ ± u(σ) probabilistically. The assumptions that are needed to apply the Monte Carlo method of the GUM Supplement 1 deliver not only an evaluation of uncertainty, but also enable a probabilistic interpretation. If the measurement uncertainties associated with h, R, R, A r (e), and α are expressed by modeling these quantities as independent Gaussian random variables with means and standard deviations set equal to the values and standard measurement uncertainties listed in Table 4, then the distribution that the Monte Carlo method of the GUM Supplement 1 assigns to the measurand happens to be approximately Gaussian as gauged by the Anderson-Darling test of Gaussian shape [Anderson and Darling, 1952]. Figure 9 on Page 17 reflects these facts and lists the numerical results, which imply that the interval from W m 2 K 4 to W m 2 K 4 is a coverage interval for σ with approximate 95 % coverage probability. The probability density of σ, and the corresponding Gaussian approximation are displayed in Figure 10 on Page Software Installation If R has not been previously installed in the target machine, then it will have to be installed first. R is free and open-source, with versions available for all major operating systems: it may be downloaded from A Java Runtime Environment (JRE) is also necessary: it may be downloaded from Microsoft Windows Open the distribution ZIP archive, which contains the installer and the user s manual, and execute the installer. During installation, a dialog box will prompt the user to select whether (the default is to do them all): Rscript.exe should be added to the search path for executables; Missing R packages should be installed automatically; A desktop icon should be created Linux Assuming that R (version 2.14 or newer), a JRE, and xterm are installed on the system and are in the user s PATH for executables, then installation amounts to extracting the contents of the distribution archive, and placing them in the desired folder, then executing the command Rscript configure-r.r to install the required R packages. The software can be started by executing either the command java -jar UncertaintyMachine.jar or the shell script run.sh Apple OS X The installation under Apple OS X is similar to the installation under Linux, including the requirement that xterm be installed: it is different from the Terminal application, and it
30 Monte Carlo Method Summary statistics for sample of size 1e+06 ave sd median mad = e-08 = 2.05e-13 = e-08 = 2.05e-13 Coverage intervals 99% ( e-08, e-08 ) k = % ( e-08, e-08 ) k = 2 90% ( e-08, e-08 ) k = % ( e-08, e-08 ) k = 1 50% ( e-08, e-08 ) k = Gauss s Formula (GUM s Linear Approximation) y = e-08 u(y) = 2.05e-13 SensitivityCoeffs Percent.u2 h 8.6e e-02 R 2.7e e+02 Rinfty 2.1e e-09 Ae -4.1e e-05 Mu -2.3e e+00 c -1.1e e+00 alpha -6.2e e-05 Correlations NA 0.0e+00 ============================================ Figure 9: Stefan Boltzmann constant.
31 Figure 10: Stefan-Boltzmann Densities. Estimate of the probability density of the output quantity (solid blue line), and probability density (dotted red line) of a Gaussian distribution with the same mean and standard deviation as the output quantity, corresponding to the results listed in Figure 9 on Page 17. is part of Apple s X11 package. Under Mountain Lion, X11 installs on demand: when an application is first launched that requires X11 libraries, the user is directed to a download location for the most up-to-date version of X11 for the Mac. Acknowledgments The authors are grateful to their colleagues Will Guthrie, Alan Heckert, Narain Krishnamurthy, Adam Pintar, Jolene Splett, and Jack Wang, for the many suggestions that they offered for improvement of the UncertaintyMachine and of this user s manual. The authors will be very grateful to users of the software, and to readers of this manual, for information about errors and other deficiencies, and for suggestions for improvement, which may be sent via to [email protected]. References T. W. Anderson and D. A. Darling. Asymptotic theory of certain goodness-of-fit criteria based on stochastic processes. Annals of Mathematical Statistics, 23: , P. Billingsley. Probability and Measure. John Wiley & Sons, New York, NY, C. Gauss. Theoria combinationis observationum erroribus minimis obnoxiae. In Werke, Band IV, Wahrscheinlichkeitsrechnung und Geometrie. Könighlichen Gesellschaft der Wissenschaften, Göttingen,
32 W. F. Guthrie, H.-k. Liu, A. L. Rukhin, B. Toman, J. C.M. Wang, and N.-f. Zhang. Three statistical paradigms for the assessment and interpretation of measurement uncertainty. In F. Pavese and A. B. Forbes, editors, Data Modeling for Metrology and Testing in Measurement Science, Modeling and Simulation in Science, Engineering and Technology, pages Birkhäuser Boston, doi: / _3. Joint Committee for Guides in Metrology. Evaluation of measurement data Guide to the expression of uncertainty in measurement. International Bureau of Weights and Measures (BIPM), Sèvres, France, September 2008a. URL publications/guides/gum.html. BIPM, IEC, IFCC, ILAC, ISO, IUPAC, IUPAP and OIML, JCGM 100:2008, GUM 1995 with minor corrections. Joint Committee for Guides in Metrology. Evaluation of measurement data Supplement 1 to the Guide to the expression of uncertainty in measurement Propagation of distributions using a Monte Carlo method. International Bureau of Weights and Measures (BIPM), Sèvres, France, 2008b. URL BIPM, IEC, IFCC, ILAC, ISO, IUPAC, IUPAP and OIML, JCGM 101:2008. P. J. Mohr, B. N. Taylor, and D. B. Newell. CODATA recommended values of the fundamental physical constants: Reviews of Modern Physics, 84(4): , October- December M. G. Morgan and M. Henrion. Uncertainty A Guide to Dealing with Uncertainty in Quantitative Risk and Policy Analysis. Cambridge University Press, New York, NY, first paperback edition, th printing, A. Possolo. Copulas for uncertainty analysis. Metrologia, 47: , R Development Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria, URL R-project.org. ISBN F. E. Satterthwaite. An approximate distribution of estimates of variance components. Biometrics Bulletin, 2(6): , December B. W. Silverman. Density Estimation. Chapman and Hall, London, B. N. Taylor and C. E. Kuyatt. Guidelines for Evaluating and Expressing the Uncertainty of NIST Measurement Results. National Institute of Standards and Technology, Gaithersburg, MD, URL NIST Technical Note B. L. Welch. The generalization of Student s problem when several different population variances are involved. Biometrika, 34:28 35, January 1947.
JCGM 101:2008. First edition 2008
Evaluation of measurement data Supplement 1 to the Guide to the expression of uncertainty in measurement Propagation of distributions using a Monte Carlo method Évaluation des données de mesure Supplément
General and statistical principles for certification of RM ISO Guide 35 and Guide 34
General and statistical principles for certification of RM ISO Guide 35 and Guide 34 / REDELAC International Seminar on RM / PT 17 November 2010 Dan Tholen,, M.S. Topics Role of reference materials in
AP Physics 1 and 2 Lab Investigations
AP Physics 1 and 2 Lab Investigations Student Guide to Data Analysis New York, NY. College Board, Advanced Placement, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks
(Uncertainty) 2. How uncertain is your uncertainty budget?
(Uncertainty) 2 How uncertain is your uncertainty budget? Paper Author and Presenter: Dr. Henrik S. Nielsen HN Metrology Consulting, Inc 10219 Coral Reef Way, Indianapolis, IN 46256 Phone: (317) 849 9577,
Gamma Distribution Fitting
Chapter 552 Gamma Distribution Fitting Introduction This module fits the gamma probability distributions to a complete or censored set of individual or grouped data values. It outputs various statistics
How To Check For Differences In The One Way Anova
MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way
IAS CALIBRATION and TESTING LABORATORY ACCREDITATION PROGRAMS DEFINITIONS
REFERENCES NIST Special Publication 330 IAS CALIBRATION and TESTING LABORATORY ACCREDITATION PROGRAMS DEFINITIONS Revised October 2013 International vocabulary of metrology Basic and general concepts and
COMPARATIVE EVALUATION OF UNCERTAINTY TR- ANSFORMATION FOR MEASURING COMPLEX-VALU- ED QUANTITIES
Progress In Electromagnetics Research Letters, Vol. 39, 141 149, 2013 COMPARATIVE EVALUATION OF UNCERTAINTY TR- ANSFORMATION FOR MEASURING COMPLEX-VALU- ED QUANTITIES Yu Song Meng * and Yueyan Shan National
MBA 611 STATISTICS AND QUANTITATIVE METHODS
MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain
JCGM 106:2012. Evaluation of measurement data The role of measurement uncertainty in conformity assessment
Evaluation of measurement data The role of measurement uncertainty in conformity assessment Évaluation des données de mesure Le rôle de l incertitude de mesure dans l évaluation de la conformité October
Quantitative Inventory Uncertainty
Quantitative Inventory Uncertainty It is a requirement in the Product Standard and a recommendation in the Value Chain (Scope 3) Standard that companies perform and report qualitative uncertainty. This
NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
SAS Software to Fit the Generalized Linear Model
SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling
UNCERTAINTY AND CONFIDENCE IN MEASUREMENT
Ελληνικό Στατιστικό Ινστιτούτο Πρακτικά 8 ου Πανελληνίου Συνεδρίου Στατιστικής (2005) σελ.44-449 UNCERTAINTY AND CONFIDENCE IN MEASUREMENT Ioannis Kechagioglou Dept. of Applied Informatics, University
Confidence Intervals for Cp
Chapter 296 Confidence Intervals for Cp Introduction This routine calculates the sample size needed to obtain a specified width of a Cp confidence interval at a stated confidence level. Cp is a process
Supplement to Call Centers with Delay Information: Models and Insights
Supplement to Call Centers with Delay Information: Models and Insights Oualid Jouini 1 Zeynep Akşin 2 Yves Dallery 1 1 Laboratoire Genie Industriel, Ecole Centrale Paris, Grande Voie des Vignes, 92290
Introduction to Error Analysis
UNIVERSITÄT BASEL DEPARTEMENT CHEMIE Introduction to Error Analysis Physikalisch Chemisches Praktikum Dr. Nico Bruns, Dr. Katarzyna Kita, Dr. Corinne Vebert 2012 1. Why is error analysis important? First
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written
Chapter 3 RANDOM VARIATE GENERATION
Chapter 3 RANDOM VARIATE GENERATION In order to do a Monte Carlo simulation either by hand or by computer, techniques must be developed for generating values of random variables having known distributions.
ModelRisk for Insurance and Finance. Quick Start Guide
ModelRisk for Insurance and Finance Quick Start Guide Copyright 2008 by Vose Software BVBA, All rights reserved. Vose Software Iepenstraat 98 9000 Gent, Belgium www.vosesoftware.com ModelRisk is owned
FEGYVERNEKI SÁNDOR, PROBABILITY THEORY AND MATHEmATICAL
FEGYVERNEKI SÁNDOR, PROBABILITY THEORY AND MATHEmATICAL STATIsTICs 4 IV. RANDOm VECTORs 1. JOINTLY DIsTRIBUTED RANDOm VARIABLEs If are two rom variables defined on the same sample space we define the joint
Normality Testing in Excel
Normality Testing in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. [email protected]
G104 - Guide for Estimation of Measurement Uncertainty In Testing. December 2014
Page 1 of 31 G104 - Guide for Estimation of Measurement Uncertainty In Testing December 2014 2014 by A2LA All rights reserved. No part of this document may be reproduced in any form or by any means without
Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.
Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing
Using Excel (Microsoft Office 2007 Version) for Graphical Analysis of Data
Using Excel (Microsoft Office 2007 Version) for Graphical Analysis of Data Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts
MULTIPLE LINEAR REGRESSION ANALYSIS USING MICROSOFT EXCEL. by Michael L. Orlov Chemistry Department, Oregon State University (1996)
MULTIPLE LINEAR REGRESSION ANALYSIS USING MICROSOFT EXCEL by Michael L. Orlov Chemistry Department, Oregon State University (1996) INTRODUCTION In modern science, regression analysis is a necessary part
A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution
A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 4: September
Operating Systems. and Windows
Operating Systems and Windows What is an Operating System? The most important program that runs on your computer. It manages all other programs on the machine. Every PC has to have one to run other applications
JCGM 104:2009. Evaluation of measurement data An introduction to the Guide to the expression of uncertainty in measurement and related documents
Evaluation of measurement data An introduction to the Guide to the expression of uncertainty in measurement and related documents Évaluation des données de mesure Une introduction au Guide pour l expression
Assessing Measurement System Variation
Assessing Measurement System Variation Example 1: Fuel Injector Nozzle Diameters Problem A manufacturer of fuel injector nozzles installs a new digital measuring system. Investigators want to determine
Introduction to Regression and Data Analysis
Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it
The KaleidaGraph Guide to Curve Fitting
The KaleidaGraph Guide to Curve Fitting Contents Chapter 1 Curve Fitting Overview 1.1 Purpose of Curve Fitting... 5 1.2 Types of Curve Fits... 5 Least Squares Curve Fits... 5 Nonlinear Curve Fits... 6
Exploratory Data Analysis
Exploratory Data Analysis Johannes Schauer [email protected] Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Introduction
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in
GelAnalyzer 2010 User s manual. Contents
GelAnalyzer 2010 User s manual Contents 1. Starting GelAnalyzer... 2 2. The main window... 2 3. Create a new analysis... 2 4. The image window... 3 5. Lanes... 3 5.1 Detect lanes automatically... 3 5.2
Exercise 1.12 (Pg. 22-23)
Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.
Package SHELF. February 5, 2016
Type Package Package SHELF February 5, 2016 Title Tools to Support the Sheffield Elicitation Framework (SHELF) Version 1.1.0 Date 2016-01-29 Author Jeremy Oakley Maintainer Jeremy Oakley
An Interactive Tool for Residual Diagnostics for Fitting Spatial Dependencies (with Implementation in R)
DSC 2003 Working Papers (Draft Versions) http://www.ci.tuwien.ac.at/conferences/dsc-2003/ An Interactive Tool for Residual Diagnostics for Fitting Spatial Dependencies (with Implementation in R) Ernst
Getting Started using the SQuirreL SQL Client
Getting Started using the SQuirreL SQL Client The SQuirreL SQL Client is a graphical program written in the Java programming language that will allow you to view the structure of a JDBC-compliant database,
Point Biserial Correlation Tests
Chapter 807 Point Biserial Correlation Tests Introduction The point biserial correlation coefficient (ρ in this chapter) is the product-moment correlation calculated between a continuous random variable
5 Cumulative Frequency Distributions, Area under the Curve & Probability Basics 5.1 Relative Frequencies, Cumulative Frequencies, and Ogives
5 Cumulative Frequency Distributions, Area under the Curve & Probability Basics 5.1 Relative Frequencies, Cumulative Frequencies, and Ogives We often have to compute the total of the observations in a
2DI36 Statistics. 2DI36 Part II (Chapter 7 of MR)
2DI36 Statistics 2DI36 Part II (Chapter 7 of MR) What Have we Done so Far? Last time we introduced the concept of a dataset and seen how we can represent it in various ways But, how did this dataset came
Univariate Regression
Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is
Distributed computing of failure probabilities for structures in civil engineering
Distributed computing of failure probabilities for structures in civil engineering Andrés Wellmann Jelic, University of Bochum ([email protected]) Matthias Baitsch, University of Bochum ([email protected])
Tail-Dependence an Essential Factor for Correctly Measuring the Benefits of Diversification
Tail-Dependence an Essential Factor for Correctly Measuring the Benefits of Diversification Presented by Work done with Roland Bürgi and Roger Iles New Views on Extreme Events: Coupled Networks, Dragon
BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I
BNG 202 Biomechanics Lab Descriptive statistics and probability distributions I Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential
Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics
Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This
Chapter 4 and 5 solutions
Chapter 4 and 5 solutions 4.4. Three different washing solutions are being compared to study their effectiveness in retarding bacteria growth in five gallon milk containers. The analysis is done in a laboratory,
THE STATISTICAL TREATMENT OF EXPERIMENTAL DATA 1
THE STATISTICAL TREATMET OF EXPERIMETAL DATA Introduction The subject of statistical data analysis is regarded as crucial by most scientists, since error-free measurement is impossible in virtually all
Life Table Analysis using Weighted Survey Data
Life Table Analysis using Weighted Survey Data James G. Booth and Thomas A. Hirschl June 2005 Abstract Formulas for constructing valid pointwise confidence bands for survival distributions, estimated using
http://school-maths.com Gerrit Stols
For more info and downloads go to: http://school-maths.com Gerrit Stols Acknowledgements GeoGebra is dynamic mathematics open source (free) software for learning and teaching mathematics in schools. It
January 26, 2009 The Faculty Center for Teaching and Learning
THE BASICS OF DATA MANAGEMENT AND ANALYSIS A USER GUIDE January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS Table of Contents Table of Contents... i
Directions for using SPSS
Directions for using SPSS Table of Contents Connecting and Working with Files 1. Accessing SPSS... 2 2. Transferring Files to N:\drive or your computer... 3 3. Importing Data from Another File Format...
Software Support for Metrology Best Practice Guide No. 6. Uncertainty and Statistical Modelling M G Cox, M P Dainton and P M Harris
Software Support for Metrology Best Practice Guide No. 6 Uncertainty and Statistical Modelling M G Cox, M P Dainton and P M Harris March 2001 Software Support for Metrology Best Practice Guide No. 6 Uncertainty
IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA
CALIFORNIA STATE UNIVERSITY, LOS ANGELES INFORMATION TECHNOLOGY SERVICES IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA Summer 2013, Version 2.0 Table of Contents Introduction...2 Downloading the
DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.
DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,
Fitting Subject-specific Curves to Grouped Longitudinal Data
Fitting Subject-specific Curves to Grouped Longitudinal Data Djeundje, Viani Heriot-Watt University, Department of Actuarial Mathematics & Statistics Edinburgh, EH14 4AS, UK E-mail: [email protected] Currie,
t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon
t-tests in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. [email protected] www.excelmasterseries.com
MATH4427 Notebook 2 Spring 2016. 2 MATH4427 Notebook 2 3. 2.1 Definitions and Examples... 3. 2.2 Performance Measures for Estimators...
MATH4427 Notebook 2 Spring 2016 prepared by Professor Jenny Baglivo c Copyright 2009-2016 by Jenny A. Baglivo. All Rights Reserved. Contents 2 MATH4427 Notebook 2 3 2.1 Definitions and Examples...................................
1 Sufficient statistics
1 Sufficient statistics A statistic is a function T = rx 1, X 2,, X n of the random sample X 1, X 2,, X n. Examples are X n = 1 n s 2 = = X i, 1 n 1 the sample mean X i X n 2, the sample variance T 1 =
DATA INTERPRETATION AND STATISTICS
PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE
Tutorial: Get Running with Amos Graphics
Tutorial: Get Running with Amos Graphics Purpose Remember your first statistics class when you sweated through memorizing formulas and laboriously calculating answers with pencil and paper? The professor
Statistics 104: Section 6!
Page 1 Statistics 104: Section 6! TF: Deirdre (say: Dear-dra) Bloome Email: [email protected] Section Times Thursday 2pm-3pm in SC 109, Thursday 5pm-6pm in SC 705 Office Hours: Thursday 6pm-7pm SC
Survival Analysis of Left Truncated Income Protection Insurance Data. [March 29, 2012]
Survival Analysis of Left Truncated Income Protection Insurance Data [March 29, 2012] 1 Qing Liu 2 David Pitt 3 Yan Wang 4 Xueyuan Wu Abstract One of the main characteristics of Income Protection Insurance
Data Analysis Tools. Tools for Summarizing Data
Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool
How To Run Statistical Tests in Excel
How To Run Statistical Tests in Excel Microsoft Excel is your best tool for storing and manipulating data, calculating basic descriptive statistics such as means and standard deviations, and conducting
Basics of Statistical Machine Learning
CS761 Spring 2013 Advanced Machine Learning Basics of Statistical Machine Learning Lecturer: Xiaojin Zhu [email protected] Modern machine learning is rooted in statistics. You will find many familiar
Engineering Problem Solving and Excel. EGN 1006 Introduction to Engineering
Engineering Problem Solving and Excel EGN 1006 Introduction to Engineering Mathematical Solution Procedures Commonly Used in Engineering Analysis Data Analysis Techniques (Statistics) Curve Fitting techniques
Non-Inferiority Tests for One Mean
Chapter 45 Non-Inferiority ests for One Mean Introduction his module computes power and sample size for non-inferiority tests in one-sample designs in which the outcome is distributed as a normal random
Getting started manual
Getting started manual XLSTAT Getting started manual Addinsoft 1 Table of Contents Install XLSTAT and register a license key... 4 Install XLSTAT on Windows... 4 Verify that your Microsoft Excel is up-to-date...
Data Analysis on the ABI PRISM 7700 Sequence Detection System: Setting Baselines and Thresholds. Overview. Data Analysis Tutorial
Data Analysis on the ABI PRISM 7700 Sequence Detection System: Setting Baselines and Thresholds Overview In order for accuracy and precision to be optimal, the assay must be properly evaluated and a few
Tutorial: Get Running with Amos Graphics
Tutorial: Get Running with Amos Graphics Purpose Remember your first statistics class when you sweated through memorizing formulas and laboriously calculating answers with pencil and paper? The professor
This unit will lay the groundwork for later units where the students will extend this knowledge to quadratic and exponential functions.
Algebra I Overview View unit yearlong overview here Many of the concepts presented in Algebra I are progressions of concepts that were introduced in grades 6 through 8. The content presented in this course
Nonparametric adaptive age replacement with a one-cycle criterion
Nonparametric adaptive age replacement with a one-cycle criterion P. Coolen-Schrijner, F.P.A. Coolen Department of Mathematical Sciences University of Durham, Durham, DH1 3LE, UK e-mail: [email protected]
Nonlinear Regression:
Zurich University of Applied Sciences School of Engineering IDP Institute of Data Analysis and Process Design Nonlinear Regression: A Powerful Tool With Considerable Complexity Half-Day : Improved Inference
Spreadsheets and Laboratory Data Analysis: Excel 2003 Version (Excel 2007 is only slightly different)
Spreadsheets and Laboratory Data Analysis: Excel 2003 Version (Excel 2007 is only slightly different) Spreadsheets are computer programs that allow the user to enter and manipulate numbers. They are capable
Finite Mathematics Using Microsoft Excel
Overview and examples from Finite Mathematics Using Microsoft Excel Revathi Narasimhan Saint Peter's College An electronic supplement to Finite Mathematics and Its Applications, 6th Ed., by Goldstein,
Validation and Calibration. Definitions and Terminology
Validation and Calibration Definitions and Terminology ACCEPTANCE CRITERIA: The specifications and acceptance/rejection criteria, such as acceptable quality level and unacceptable quality level, with an
Dirichlet forms methods for error calculus and sensitivity analysis
Dirichlet forms methods for error calculus and sensitivity analysis Nicolas BOULEAU, Osaka university, november 2004 These lectures propose tools for studying sensitivity of models to scalar or functional
Multiple Optimization Using the JMP Statistical Software Kodak Research Conference May 9, 2005
Multiple Optimization Using the JMP Statistical Software Kodak Research Conference May 9, 2005 Philip J. Ramsey, Ph.D., Mia L. Stephens, MS, Marie Gaudard, Ph.D. North Haven Group, http://www.northhavengroup.com/
ESTIMATING THE DISTRIBUTION OF DEMAND USING BOUNDED SALES DATA
ESTIMATING THE DISTRIBUTION OF DEMAND USING BOUNDED SALES DATA Michael R. Middleton, McLaren School of Business, University of San Francisco 0 Fulton Street, San Francisco, CA -00 -- [email protected]
YASAIw.xla A modified version of an open source add in for Excel to provide additional functions for Monte Carlo simulation.
YASAIw.xla A modified version of an open source add in for Excel to provide additional functions for Monte Carlo simulation. By Greg Pelletier, Department of Ecology, P.O. Box 47710, Olympia, WA 98504
An Introduction to Point Pattern Analysis using CrimeStat
Introduction An Introduction to Point Pattern Analysis using CrimeStat Luc Anselin Spatial Analysis Laboratory Department of Agricultural and Consumer Economics University of Illinois, Urbana-Champaign
Bill Burton Albert Einstein College of Medicine [email protected] April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1
Bill Burton Albert Einstein College of Medicine [email protected] April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Calculate counts, means, and standard deviations Produce
Simple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
Standard Deviation Estimator
CSS.com Chapter 905 Standard Deviation Estimator Introduction Even though it is not of primary interest, an estimate of the standard deviation (SD) is needed when calculating the power or sample size of
ADD-INS: ENHANCING EXCEL
CHAPTER 9 ADD-INS: ENHANCING EXCEL This chapter discusses the following topics: WHAT CAN AN ADD-IN DO? WHY USE AN ADD-IN (AND NOT JUST EXCEL MACROS/PROGRAMS)? ADD INS INSTALLED WITH EXCEL OTHER ADD-INS
STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables
Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Discrete vs. continuous random variables Examples of continuous distributions o Uniform o Exponential o Normal Recall: A random
JCGM 102:2011. October 2011
Evaluation of measurement data Supplement 2 to the Guide to the expression of uncertainty in measurement Extension to any number of output quantities Évaluation des données de mesure Supplément 2 du Guide
The Determination of Uncertainties in Charpy Impact Testing
Manual of Codes of Practice for the Determination of Uncertainties in Mechanical Tests on Metallic Materials Code of Practice No. 06 The Determination of Uncertainties in Charpy Impact Testing M.A. Lont
Lecture 3: Linear methods for classification
Lecture 3: Linear methods for classification Rafael A. Irizarry and Hector Corrada Bravo February, 2010 Today we describe four specific algorithms useful for classification problems: linear regression,
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.
Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C
NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS
NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS TEST DESIGN AND FRAMEWORK September 2014 Authorized for Distribution by the New York State Education Department This test design and framework document
Master s Theory Exam Spring 2006
Spring 2006 This exam contains 7 questions. You should attempt them all. Each question is divided into parts to help lead you through the material. You should attempt to complete as much of each problem
