SPC Data Visualization of Seasonal and Financial Data Using JMP WHITE PAPER
|
|
|
- Magdalen Mills
- 9 years ago
- Views:
Transcription
1 SPC Data Visualization of Seasonal and Financial Data Using JMP WHITE PAPER
2 SAS White Paper Table of Contents Abstract Background Example 1: Telescope Company Monitors Revenue Example 2: Water Quality Data... 7 Statistical Properties of Residual Charts Conclusion References
3 SPC Data Visualization of Seasonal and Financial Data Using JMP Abstract JMP software offers many types of statistical process control (SPC) charts, including Shewhart, CUSUM and moving average charts. SPC chart features in JMP can be accessed through the menus and dialogs, scripting, and now in version 10.0, through a drag and drop interface. The periodic nature of some financial data makes it unsuitable in its original form for detecting anomalies using an SPC chart. One viable method for presenting this data on an SPC chart is to apply time series techniques first, then chart the output. This paper investigates two case studies applying these techniques. SPC charts work very well under the ideal conditions of data independence and normality. SPC has traditionally been used in manufacturing, where these conditions are often satisfied. However, SPC is beginning to be used more outside of manufacturing, in areas like insurance claims processing, banking, health care and survey research. In many of these environments, the desired mean may be shifting up or down, or the responses may be cyclic in nature. In this paper, we examine some of the problems with plotting time series data on control charts and suggest remedies. Background Statistical process control (SPC) is a methodology for monitoring a time series process to detect shifts in either the location or scale of the process. SPC has been used for almost 100 years in manufacturing. Recently, it has become more prevalent in other industries, including finance, health care, insurance and survey research. (See Roberts, 2006.) Data for control charts is collected in subgroups. When the subgroup size is one, an individual-moving range (X-MR) chart is used to control the process. An X-MR control scheme consists of two related run charts. An individual (or X) chart plots the data in time order, along with control limits. Changes in the mean are detected when any point is outside of the control limits. The absolute value of successive differences the moving ranges are plotted on an MR chart, along with control limits to detect changes in the process spread. Figure 1 is an example of a typical X-MR chart commonly seen in manufacturing. When the subgroup size is greater than one, Xbar-S control schemes are used. The Xbar chart plots the mean of the subgroup; the S chart plots the standard deviation of the subgroup. The upper control limit and lower control limit are drawn at a distance of three standard deviations ( ) of the estimate of the parameter from the centerline. That is, we assume the process has been stable for long enough to estimate without error, and draw control limits at. Control limits are calculated based on historic subgroup mean and standard deviation values. (See Wheeler, 2010.) 1
4 SAS White Paper Individual & Moving Range Chart of DIAMETER Figure 1. Typical X-MR chart. One of the assumptions in using control charts is that the data come from an independent process. That is, knowing the value of the time series at any point gives no information about future trends of the process, except for the location and scale. In manufacturing, this assumption can often be met by appropriate choice of subgrouping. For example, if data is highly autocorrelated, sampling can be done less frequently. Financial or other business data is often collected daily, weekly or monthly, and the sampling plan cannot be changed. Using traditional control charts on positively autocorrelated data causes false signals. (See Michelson, 1994.) One remedy is to model the autocorrelation using autoregressive-moving average (ARMA) time series models. The residuals from the model should be uncorrelated and thus are suitable for displaying with a control chart. If a shift occurs in the process mean at time t 0, the same shift will occur in the first residual value after t 0. The residuals will not see the full shift after time t 0, but they will see a dampened shift, leading to longer times until signal for positively correlated data. 2
5 SPC Data Visualization of Seasonal and Financial Data Using JMP In this paper, we give an introduction to ARMA models through two examples. In the first, an equipment company who had successfully used SPC charts in their manufacturing department in the past was interested in monitoring monthly revenue. Often revenue figures are cyclic in nature, dropping off at the beginning of each year and peaking in December. This company, however, produced telescopes, and the cycles corresponded directly to astronomy events like eclipses, meteors and planet sightings. While they could visually see the trend, weeks with unusual patterns were masked by all the weeks triggering alarms as traditional control charting methods failed to work. In another company, control charts were being used on water quality data in a water purification scheme. City water was pumped through a series of filters. At the end of the filtering process, the quality of the water was measured twice per second. The frequency of measurement was important to catching dirty water before it arrived at processing equipment, but the resulting positive serial correlation led to an increase in the false alarm rate of the control chart. Example 1: Telescope Company Monitors Revenue Background A telescope company who had successfully used SPC charts in their manufacturing department in the past was interested in monitoring revenue. Their revenue figures were cyclic in nature, peaking several weeks before astronomical activities (total eclipses, meteor showers and planetary sightings), then dropping during periods of lower activity. The cyclical nature meant that not only did the data have large swings rather than spikes, but the swings gradually ascended and descended. First Attempt at Control Charts To graph this in JMP, Select Analyze g Quality and Process g Control Chart Builder Drag Revenue to the Y drop zone Drag Date to the X region 3
6 SAS White Paper Individual & Moving Range Chart of Revenue Figure 2. Individual and moving range chart of weekly revenues. Figure 2 shows 10 years of weekly total revenue for the company. The swings were so large that nearly half the data on the individual chart fell outside the limits. Turning on the tests for points out of control would be meaningless for this chart, as so many of the points would signal. The limits, on the other hand, were narrow because the relative change from week to week was typically small, so the moving range chart didn t widen or detect many odd spikes. While they could visually see the trend, weeks with unusual patterns were masked by all the weeks triggering alarms as traditional control charting methods failed to work. Look for Autocorrelation Using Lag and Bivariate To evaluate the possibility of autocorrelation, look at the relationship between each consecutive point. To do this in JMP: Create a new column with the formula, Lag ( : Revenue, 1). Select Analyze 0 Fit Y by X, using Revenue as the Y and Lag(Revenue) as the X. In the Bivariate report, select Density Ellipse g
7 SPC Data Visualization of Seasonal and Financial Data Using JMP Table 1. Fit Y by X correlation output In this output, we see that the correlation is high at 0.97 and significantly different from zero, with a p-value < From this test, we can reject the hypothesis that the lag of Revenue is uncorrelated with Revenue and conclude there is an autocorrelation effect between consecutive weeks. Applying a Time Series Model JMP offers many options for modeling autocorrelation through the Time Series platform. To try out some modeling options: Select Analyze g Modeling g Time Series. Select Revenue as the Y, Time Series and Date as the X, Time ID. Click OK. In the output window, select ARIMA model group with the inputs shown in dialog 1: Dialog 1. Model group dialog for specifying the ARIMA model We see that there are a few models that fit very well. The AR(1) model has the lowest value of Schwarz s Bayesian criterion, as well as a low value of Akaike s information criterion. Next, select output from the AR(1) model and select Save Columns to save the residuals. Output 1. Time series model comparison output 5
8 SAS White Paper Second attempt at control charts Because the significant autocorrelation has been removed from the residuals of the time series model, we can use these residuals to identify unusual dates on the control chart. To run a control chart using the residuals saved in a new data table from the AR(1) model: Select Analyze g Quality and Process g Control Chart Builder. Drag Residual Revenue to the Y-axis drop region. In Figure 3, we see that most of the points are now in control, and we can investigate the few that went out of control more carefully to find the special causes influencing these dates. Individual & Moving Range Chart of Residual Revenue Figure 3. Individual and moving range chart of revenue residuals by week In fact, we can now see that the value on March 20, 2010 flags as an out-of-control point on the graph of residuals. In preparation for the July 2010 total solar eclipse, the marketing department began running advertisements on March 1 in popular astronomy magazines. In Figure 3, the positive change in revenue is much more clearly identified than in the control chart in Figure 1. The high residual on Dec. 10, 2005, has a similar explanation. A marketing campaign targeted toward the total solar eclipse in March 2006 caused a spike in sales three months prior. 6
9 SPC Data Visualization of Seasonal and Financial Data Using JMP Example 2: Water Quality Data Background Many industries, including semiconductors, pharmaceuticals and power generation, rely on water use for many of their steps. City water is fed into a facility through pipes. While the water is safe for human consumption, the particulates and organic compounds in the water can cause all sorts of problems in the manufacturing process. Water is fed through filters and is subjected to chemical and other processing to reduce levels of contaminants to parts-per-billion or parts-per-trillion levels. The resulting water is often called ultrapure water. At a certain semiconductor manufacturer, the count of particulates and percent total oxygenated compounds (TOC) were measured before the water was sent from the central utility building (after filtration) to the unit process step. An automated system was set up to signal the engineers if the counts ever went over a specified limit. The process had improved to the point that engineers were interested in moving to a data-based decision system, relying on control limits to tell them when the process had changed rather than using a specification limit for process control. Data was collected twice per second, due to the need to quickly signal when the particulates or TOC were above the specification limit, so that the contaminated water would not reach the processing equipment. Due to the frequency of data collection, both the particulate count data and TOC data time series were highly positively autocorrelated. The engineers knew this positive correlation would cause an increase in false alarms on the Individual chart. Time series modeling One hour s worth of data was collected for modeling purposes. There were 7,200 observations of TOC. Due to the proprietary nature of the data, the TOC values in this paper were simulated from the same model used in the application. 7
10 SAS White Paper Individual & Moving Range Chart of TOC Figure 4. Individual and moving range chart of total oxygenated compounds (percent). Figure 4 contains an individual and moving range chart of the first 100 observations taken in 50 seconds. The positive serial correlation has caused two signals on the Individual chart. Control limits were calculated on the first 1,000 observations of the TOC process, using the Control Chart Builder. The limits were copied to a column property on the control chart for all 7,200 observations. A control chart was then built on all observations. Select Analyze g Quality and Process g Control Chart Builder. Drag TOC to the Y zone. Drag Time to the X zone. 8
11 SPC Data Visualization of Seasonal and Financial Data Using JMP Individual & Moving Range Chart of TOC Figure 5. Individual and moving range chart on 7,200 data points from TOC process. Figure 5 shows that the limits look too tight, and the data wanders around the mean value of just over 6 percent. The first 1,000 data points were used to fit time series models, and the best model was chosen based on Akaike s information criterion, Schwarz s Bayesian criterion, and model simplicity. An AR(5) process was determined to be the best fit. The prediction formula from this model was saved to the full table, and residuals were calculated for each point. These residuals were plotted on a control chart, shown in Figure 6. Individual & Moving Range Chart of Residual TOC Figure 6. Individual and moving range chart on residuals from AR(5) model on TOC process. 9
12 SAS White Paper The reduction in false alarms is seen on the residual chart. In practice, the engineers decided to widen the limits of the residual chart to avoid even these false alarms, due to the frequent sampling of the process. The final system consisted of specification limits on the raw data, and widened limits on a residuals chart. It has been successfully monitoring water quality, signaling quality shifts, for over 10 years. Statistical Properties of Residual Charts The use of residual charts for processes with positive autocorrelation has a theoretical justification. Control charts are measured by their run length, a random variable that represents the time until a signal. When the process is stable, and consists of just one distribution, the run length should be long. When the process mean has shifted, the run length should be short. It is easily shown that the average run length (arl) is 1/p, where p is the probability of a signal on the chart. We denote the average run length as arl(δ), where Δ represents the size of a mean shift in standard deviation units. For an Individuals chart with 3σ limits, assuming the process follows a normal distribution, p= and arl(0)=370. That is, it takes on average 370 runs before this chart falsely signals a mean shift. By contrast, for this chart, arl(1)=43.9. That is, it takes on average 43.9 runs before this chart signals a mean shift of 1σ. Suppose now that { X t } is an autoregressive process of order 1 (AR(1)) with a mean shift of size Δσ at time t = t 0. Then, where and. The residuals from the model are {e t }, where for all t. Properties of the residuals are given as follows. The entire shift is seen by the residuals at time t = t 0. For t > t 0, only a fraction of the shift is seen, for ø>0. Smaller shifts are harder to detect, resulting in higher run lengths for positively correlated data. For example, arl(1)=101.9 for an Individual chart on an AR(1) process with ø=0.4, and arl(1)=223.3 for an Individual chart on an AR(1) process with ø=0.9. For extremely high correlation the average run length is low, for example, for an AR(1) process with ø=0.98, arl(1)=
13 SPC Data Visualization of Seasonal and Financial Data Using JMP The probability of a signal on the first observation after the shift is much higher for highly positively correlated data than it is for independent data. Thus, even though the average run length is large, the probability of a signal on the first observation is greater for positively correlated data than for independent data. For example, the probability of signal on the first observation after a 1σ mean shift on an Individuals chart on independent data is about 2 percent. The same probability for a residual chart on an AR(1) process with ø=0.9 is 24 percent and for ø=0.95 is 58 percent. Conclusion At first glance, statistical process control charts appear wholly unsuitable for seasonal or financial data. As illustrated in the two examples given, modeling techniques applied to the data can sufficiently smooth out the cyclical behavior through the residuals. This smoothed data allows for both applying statistical control charts to the data and further identification and investigation of unusual dates or points. Instead of approximating the expected shape and reacting to changes, by applying time series techniques together with statistical process control on the model residuals, the analyst can gain considerable knowledge by both better understanding the financial state itself and identifying which outside indicators or events might affect the state both positively and negatively. While this is a very effective method, it is computationally intensive and is best suited for a system that is controlled by computers. It is not suitable for paper charts filled out by human operators. References Roberts, L. (2006), SPC for Right-Brain Thinkers: Process Control for Non-Statisticians: Quality Press, Milwaukee. Wheeler, D. W., and Chambers, D. S. (2010), Understanding Statistical Process Control:, 3rd Ed. SPC Press, Knoxville. Michelson, Diane K. (1994) Statistical Process Control for Correlated Data: unpublished Ph.D. dissertation, Texas A&M University, College Station. Your comments and questions are valued and encouraged. Contact the authors at: Diane K. Michelson, Senior Analytical Training Consultant, Education and Training, SAS [email protected] Annie Dudley Zangi, Senior Research Statistician Developer, JMP [email protected] 11
14 About SAS and JMP JMP is a software solution from SAS that was first launched in John Sall, SAS co-founder and Executive Vice President, is the chief architect of JMP. SAS is the leader in analytics. Through innovative analytics, business intelligence and data management software and services, SAS helps customers at more than 80,000 sites make better decisions faster. Since 1976, SAS has been giving customers around the world THE POWER TO KNOW. SAS Institute Inc. World Headquarters JMP is a software solution from SAS. To learn more about SAS, visit sas.com For JMP sales in the US and Canada, call or go to jmp.com... SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. indicates USA registration. Other brand and product names are trademarks of their respective companies _S
Variables Control Charts
MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. Variables
Step-by-Step Guide to Bi-Parental Linkage Mapping WHITE PAPER
Step-by-Step Guide to Bi-Parental Linkage Mapping WHITE PAPER JMP Genomics Step-by-Step Guide to Bi-Parental Linkage Mapping Introduction JMP Genomics offers several tools for the creation of linkage maps
Using JMP Version 4 for Time Series Analysis Bill Gjertsen, SAS, Cary, NC
Using JMP Version 4 for Time Series Analysis Bill Gjertsen, SAS, Cary, NC Abstract Three examples of time series will be illustrated. One is the classical airline passenger demand data with definite seasonal
Control Charts for Variables. Control Chart for X and R
Control Charts for Variables X-R, X-S charts, non-random patterns, process capability estimation. 1 Control Chart for X and R Often, there are two things that might go wrong in a process; its mean or its
The Forgotten JMP Visualizations (Plus Some New Views in JMP 9) Sam Gardner, SAS Institute, Lafayette, IN, USA
Paper 156-2010 The Forgotten JMP Visualizations (Plus Some New Views in JMP 9) Sam Gardner, SAS Institute, Lafayette, IN, USA Abstract JMP has a rich set of visual displays that can help you see the information
10 CONTROL CHART CONTROL CHART
Module 10 CONTOL CHT CONTOL CHT 1 What is a Control Chart? control chart is a statistical tool used to distinguish between variation in a process resulting from common causes and variation resulting from
Advanced Forecasting Techniques and Models: ARIMA
Advanced Forecasting Techniques and Models: ARIMA Short Examples Series using Risk Simulator For more information please visit: www.realoptionsvaluation.com or contact us at: [email protected]
USE OF ARIMA TIME SERIES AND REGRESSORS TO FORECAST THE SALE OF ELECTRICITY
Paper PO10 USE OF ARIMA TIME SERIES AND REGRESSORS TO FORECAST THE SALE OF ELECTRICITY Beatrice Ugiliweneza, University of Louisville, Louisville, KY ABSTRACT Objectives: To forecast the sales made by
Promotional Forecast Demonstration
Exhibit 2: Promotional Forecast Demonstration Consider the problem of forecasting for a proposed promotion that will start in December 1997 and continues beyond the forecast horizon. Assume that the promotion
Change-Point Analysis: A Powerful New Tool For Detecting Changes
Change-Point Analysis: A Powerful New Tool For Detecting Changes WAYNE A. TAYLOR Baxter Healthcare Corporation, Round Lake, IL 60073 Change-point analysis is a powerful new tool for determining whether
Directions for using SPSS
Directions for using SPSS Table of Contents Connecting and Working with Files 1. Accessing SPSS... 2 2. Transferring Files to N:\drive or your computer... 3 3. Importing Data from Another File Format...
HYPOTHESIS TESTING WITH SPSS:
HYPOTHESIS TESTING WITH SPSS: A NON-STATISTICIAN S GUIDE & TUTORIAL by Dr. Jim Mirabella SPSS 14.0 screenshots reprinted with permission from SPSS Inc. Published June 2006 Copyright Dr. Jim Mirabella CHAPTER
THE BASICS OF STATISTICAL PROCESS CONTROL & PROCESS BEHAVIOUR CHARTING
THE BASICS OF STATISTICAL PROCESS CONTROL & PROCESS BEHAVIOUR CHARTING A User s Guide to SPC By David Howard Management-NewStyle "...Shewhart perceived that control limits must serve industry in action.
Process Capability Analysis Using MINITAB (I)
Process Capability Analysis Using MINITAB (I) By Keith M. Bower, M.S. Abstract The use of capability indices such as C p, C pk, and Sigma values is widespread in industry. It is important to emphasize
Introduction Course in SPSS - Evening 1
ETH Zürich Seminar für Statistik Introduction Course in SPSS - Evening 1 Seminar für Statistik, ETH Zürich All data used during the course can be downloaded from the following ftp server: ftp://stat.ethz.ch/u/sfs/spsskurs/
Comparative study of the performance of the CuSum and EWMA control charts
Computers & Industrial Engineering 46 (2004) 707 724 www.elsevier.com/locate/dsw Comparative study of the performance of the CuSum and EWMA control charts Vera do Carmo C. de Vargas*, Luis Felipe Dias
II. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
THE UNIVERSITY OF CHICAGO, Booth School of Business Business 41202, Spring Quarter 2014, Mr. Ruey S. Tsay. Solutions to Homework Assignment #2
THE UNIVERSITY OF CHICAGO, Booth School of Business Business 41202, Spring Quarter 2014, Mr. Ruey S. Tsay Solutions to Homework Assignment #2 Assignment: 1. Consumer Sentiment of the University of Michigan.
Time Series Analysis: Basic Forecasting.
Time Series Analysis: Basic Forecasting. As published in Benchmarks RSS Matters, April 2015 http://web3.unt.edu/benchmarks/issues/2015/04/rss-matters Jon Starkweather, PhD 1 Jon Starkweather, PhD [email protected]
Selecting SPC Software for Batch and Specialty Chemicals Processing
WHITE PAPER Selecting SPC Software for Batch and Specialty Chemicals Processing Statistical Process Control (SPC) is a necessary part of modern chemical processing. The software chosen to collect quality
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING
LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.
Geostatistics Exploratory Analysis
Instituto Superior de Estatística e Gestão de Informação Universidade Nova de Lisboa Master of Science in Geospatial Technologies Geostatistics Exploratory Analysis Carlos Alberto Felgueiras [email protected]
Pearson's Correlation Tests
Chapter 800 Pearson's Correlation Tests Introduction The correlation coefficient, ρ (rho), is a popular statistic for describing the strength of the relationship between two variables. The correlation
16 : Demand Forecasting
16 : Demand Forecasting 1 Session Outline Demand Forecasting Subjective methods can be used only when past data is not available. When past data is available, it is advisable that firms should use statistical
SPC Response Variable
SPC Response Variable This procedure creates control charts for data in the form of continuous variables. Such charts are widely used to monitor manufacturing processes, where the data often represent
Projects Involving Statistics (& SPSS)
Projects Involving Statistics (& SPSS) Academic Skills Advice Starting a project which involves using statistics can feel confusing as there seems to be many different things you can do (charts, graphs,
Use and interpretation of statistical quality control charts
International Journal for Quality in Health Care 1998; Volume 10, Number I: pp. 69-73 Methodology matters VIII 'Methodology Matters' is a series of intermittently appearing articles on methodology. Suggestions
Interaction between quantitative predictors
Interaction between quantitative predictors In a first-order model like the ones we have discussed, the association between E(y) and a predictor x j does not depend on the value of the other predictors
Data Analysis Tools. Tools for Summarizing Data
Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool
Spreadsheet software for linear regression analysis
Spreadsheet software for linear regression analysis Robert Nau Fuqua School of Business, Duke University Copies of these slides together with individual Excel files that demonstrate each program are available
PharmaSUG 2013 - Paper DG06
PharmaSUG 2013 - Paper DG06 JMP versus JMP Clinical for Interactive Visualization of Clinical Trials Data Doug Robinson, SAS Institute, Cary, NC Jordan Hiller, SAS Institute, Cary, NC ABSTRACT JMP software
How To Model A Series With Sas
Chapter 7 Chapter Table of Contents OVERVIEW...193 GETTING STARTED...194 TheThreeStagesofARIMAModeling...194 IdentificationStage...194 Estimation and Diagnostic Checking Stage...... 200 Forecasting Stage...205
TIME SERIES ANALYSIS
TIME SERIES ANALYSIS L.M. BHAR AND V.K.SHARMA Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-0 02 [email protected]. Introduction Time series (TS) data refers to observations
Chapter 27 Using Predictor Variables. Chapter Table of Contents
Chapter 27 Using Predictor Variables Chapter Table of Contents LINEAR TREND...1329 TIME TREND CURVES...1330 REGRESSORS...1332 ADJUSTMENTS...1334 DYNAMIC REGRESSOR...1335 INTERVENTIONS...1339 TheInterventionSpecificationWindow...1339
Simple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
Advanced Topics in Statistical Process Control
Advanced Topics in Statistical Process Control The Power of Shewhart s Charts Second Edition Donald J. Wheeler SPC Press Knoxville, Tennessee Contents Preface to the Second Edition Preface The Shewhart
5. Multiple regression
5. Multiple regression QBUS6840 Predictive Analytics https://www.otexts.org/fpp/5 QBUS6840 Predictive Analytics 5. Multiple regression 2/39 Outline Introduction to multiple linear regression Some useful
t Tests in Excel The Excel Statistical Master By Mark Harmon Copyright 2011 Mark Harmon
t-tests in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. [email protected] www.excelmasterseries.com
!"!!"#$$%&'()*+$(,%!"#$%$&'()*""%(+,'-*&./#-$&'(-&(0*".$#-$1"(2&."3$'45"
!"!!"#$$%&'()*+$(,%!"#$%$&'()*""%(+,'-*&./#-$&'(-&(0*".$#-$1"(2&."3$'45"!"#"$%&#'()*+',$$-.&#',/"-0%.12'32./4'5,5'6/%&)$).2&'7./&)8'5,5'9/2%.%3%&8':")08';:
Paper DV-06-2015. KEYWORDS: SAS, R, Statistics, Data visualization, Monte Carlo simulation, Pseudo- random numbers
Paper DV-06-2015 Intuitive Demonstration of Statistics through Data Visualization of Pseudo- Randomly Generated Numbers in R and SAS Jack Sawilowsky, Ph.D., Union Pacific Railroad, Omaha, NE ABSTRACT Statistics
SPSS Tutorial, Feb. 7, 2003 Prof. Scott Allard
p. 1 SPSS Tutorial, Feb. 7, 2003 Prof. Scott Allard The following tutorial is a guide to some basic procedures in SPSS that will be useful as you complete your data assignments for PPA 722. The purpose
START Selected Topics in Assurance
START Selected Topics in Assurance Related Technologies Table of Contents Introduction Some Essential Concepts Some QC Charts Summary For Further Study About the Author Other START Sheets Available Introduction
Introduction to Regression and Data Analysis
Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it
COMP6053 lecture: Time series analysis, autocorrelation. [email protected]
COMP6053 lecture: Time series analysis, autocorrelation [email protected] Time series analysis The basic idea of time series analysis is simple: given an observed sequence, how can we build a model that
The G Chart in Minitab Statistical Software
The G Chart in Minitab Statistical Software Background Developed by James Benneyan in 1991, the g chart (or G chart in Minitab) is a control chart that is based on the geometric distribution. Benneyan
STATISTICAL REASON FOR THE 1.5σ SHIFT Davis R. Bothe
STATISTICAL REASON FOR THE 1.5σ SHIFT Davis R. Bothe INTRODUCTION Motorola Inc. introduced its 6σ quality initiative to the world in the 1980s. Almost since that time quality practitioners have questioned
Introduction to STATISTICAL PROCESS CONTROL TECHNIQUES
Introduction to STATISTICAL PROCESS CONTROL TECHNIQUES Preface 1 Quality Control Today 1 New Demands On Systems Require Action 1 Socratic SPC -- Overview Q&A 2 Steps Involved In Using Statistical Process
Real vs. Synthetic Web Performance Measurements, a Comparative Study
Real vs. Synthetic Web Performance Measurements, a Comparative Study By John Bartlett and Peter Sevcik December 2004 Enterprises use today s Internet to find customers, provide them information, engage
Query Builder. The New JMP 12 Tool for Getting Your SQL Data Into JMP WHITE PAPER. By Eric Hill, Software Developer, JMP
Query Builder The New JMP 12 Tool for Getting Your SQL Data Into JMP By Eric Hill, Software Developer, JMP WHITE PAPER SAS White Paper Table of Contents Introduction.... 1 Section 1: Making the Connection....
Alex Vidras, David Tysinger. Merkle Inc.
Using PROC LOGISTIC, SAS MACROS and ODS Output to evaluate the consistency of independent variables during the development of logistic regression models. An example from the retail banking industry ABSTRACT
Anomaly Detection and Predictive Maintenance
Anomaly Detection and Predictive Maintenance Rosaria Silipo Iris Adae Christian Dietz Phil Winters [email protected] [email protected] [email protected] [email protected]
Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C
DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS
DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi - 110 012 [email protected] 1. Descriptive Statistics Statistics
Getting Correct Results from PROC REG
Getting Correct Results from PROC REG Nathaniel Derby, Statis Pro Data Analytics, Seattle, WA ABSTRACT PROC REG, SAS s implementation of linear regression, is often used to fit a line without checking
STATISTICA Formula Guide: Logistic Regression. Table of Contents
: Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary
KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management
KSTAT MINI-MANUAL Decision Sciences 434 Kellogg Graduate School of Management Kstat is a set of macros added to Excel and it will enable you to do the statistics required for this course very easily. To
Studying Achievement
Journal of Business and Economics, ISSN 2155-7950, USA November 2014, Volume 5, No. 11, pp. 2052-2056 DOI: 10.15341/jbe(2155-7950)/11.05.2014/009 Academic Star Publishing Company, 2014 http://www.academicstar.us
Minitab Tutorials for Design and Analysis of Experiments. Table of Contents
Table of Contents Introduction to Minitab...2 Example 1 One-Way ANOVA...3 Determining Sample Size in One-way ANOVA...8 Example 2 Two-factor Factorial Design...9 Example 3: Randomized Complete Block Design...14
GE Fanuc Automation CIMPLICITY. Statistical Process Control. CIMPLICITY Monitoring and Control Products. Operation Manual
GE Fanuc Automation CIMPLICITY Monitoring and Control Products CIMPLICITY Statistical Process Control Operation Manual GFK-1413E December 2000 Following is a list of documentation icons: GFL-005 Warning
9 RUN CHART RUN CHART
Module 9 RUN CHART RUN CHART 1 What is a Run Chart? A Run Chart is the most basic tool used to display how a process performs over time. It is a line graph of data points plotted in chronological order
SAS Enterprise Guide in Pharmaceutical Applications: Automated Analysis and Reporting Alex Dmitrienko, Ph.D., Eli Lilly and Company, Indianapolis, IN
Paper PH200 SAS Enterprise Guide in Pharmaceutical Applications: Automated Analysis and Reporting Alex Dmitrienko, Ph.D., Eli Lilly and Company, Indianapolis, IN ABSTRACT SAS Enterprise Guide is a member
IBM SPSS Forecasting 22
IBM SPSS Forecasting 22 Note Before using this information and the product it supports, read the information in Notices on page 33. Product Information This edition applies to version 22, release 0, modification
Using SPC Chart Techniques in Production Planning and Scheduling : Two Case Studies. Operations Planning, Scheduling and Control
Using SPC Chart Techniques in Production Planning and Scheduling : Two Case Studies Operations Planning, Scheduling and Control Thananya Wasusri* and Bart MacCarthy Operations Management and Human Factors
DISCRETE MODEL DATA IN STATISTICAL PROCESS CONTROL. Ester Gutiérrez Moya 1. Keywords: Quality control, Statistical process control, Geometric chart.
VI Congreso de Ingeniería de Organización Gijón, 8 y 9 de septiembre 005 DISCRETE MODEL DATA IN STATISTICAL PROCESS CONTROL Ester Gutiérrez Moya Dpto. Organización Industrial y Gestión de Empresas. Escuela
MBA 611 STATISTICS AND QUANTITATIVE METHODS
MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain
GEORGETOWN UNIVERSITY SCHOOL OF BUSINESS ADMINISTRATION STATISTICAL QUALITY CONTROL. Professor José-Luis Guerrero-Cusumano
GEORGETOWN UNIVERSITY SCHOOL OF BUSINESS ADMINISTRATION STATISTICAL QUALITY CONTROL Professor José-Luis Guerrero-Cusumano [email protected] Office: Telephone; 202-687.4338 Meeting Times: At
Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:
Doing Multiple Regression with SPSS Multiple Regression for Data Already in Data Editor Next we want to specify a multiple regression analysis for these data. The menu bar for SPSS offers several options:
LEGENDplex Data Analysis Software
LEGENDplex Data Analysis Software Version 7.0 User Guide Copyright 2013-2014 VigeneTech. All rights reserved. Contents Introduction... 1 Lesson 1 - The Workspace... 2 Lesson 2 Quantitative Wizard... 3
Microsoft Excel. Qi Wei
Microsoft Excel Qi Wei Excel (Microsoft Office Excel) is a spreadsheet application written and distributed by Microsoft for Microsoft Windows and Mac OS X. It features calculation, graphing tools, pivot
Integrating SAS with JMP to Build an Interactive Application
Paper JMP50 Integrating SAS with JMP to Build an Interactive Application ABSTRACT This presentation will demonstrate how to bring various JMP visuals into one platform to build an appealing, informative,
Instruction Manual for SPC for MS Excel V3.0
Frequency Business Process Improvement 281-304-9504 20314 Lakeland Falls www.spcforexcel.com Cypress, TX 77433 Instruction Manual for SPC for MS Excel V3.0 35 30 25 LSL=60 Nominal=70 Capability Analysis
12: Analysis of Variance. Introduction
1: Analysis of Variance Introduction EDA Hypothesis Test Introduction In Chapter 8 and again in Chapter 11 we compared means from two independent groups. In this chapter we extend the procedure to consider
Using SPSS, Chapter 2: Descriptive Statistics
1 Using SPSS, Chapter 2: Descriptive Statistics Chapters 2.1 & 2.2 Descriptive Statistics 2 Mean, Standard Deviation, Variance, Range, Minimum, Maximum 2 Mean, Median, Mode, Standard Deviation, Variance,
Anomaly Detection in Predictive Maintenance
Anomaly Detection in Predictive Maintenance Anomaly Detection with Time Series Analysis Phil Winters Iris Adae Rosaria Silipo [email protected] [email protected] [email protected] Copyright
Benchmarking Student Learning Outcomes using Shewhart Control Charts
Benchmarking Student Learning Outcomes using Shewhart Control Charts Steven J. Peterson, MBA, PE Weber State University Ogden, Utah This paper looks at how Shewhart control charts a statistical tool used
I/A Series Information Suite AIM*SPC Statistical Process Control
I/A Series Information Suite AIM*SPC Statistical Process Control PSS 21S-6C3 B3 QUALITY PRODUCTIVITY SQC SPC TQC y y y y y y y y yy y y y yy s y yy s sss s ss s s ssss ss sssss $ QIP JIT INTRODUCTION AIM*SPC
Data Exploration Data Visualization
Data Exploration Data Visualization What is data exploration? A preliminary exploration of the data to better understand its characteristics. Key motivations of data exploration include Helping to select
Ads Analytics. User Guide
Ads Analytics User Guide For many marketers, measuring ad performance is either too complicated or time-consuming. Too much time is wasted wading through meaningless metrics, struggling to get an aggregated
Please follow these guidelines when preparing your answers:
PR- ASSIGNMNT 3000500 Quantitative mpirical Research The objective of the pre- assignment is to review the course prerequisites and get familiar with SPSS software. The assignment consists of three parts:
Time Series Analysis and Forecasting
Time Series Analysis and Forecasting Math 667 Al Nosedal Department of Mathematics Indiana University of Pennsylvania Time Series Analysis and Forecasting p. 1/11 Introduction Many decision-making applications
SAS VISUAL ANALYTICS AN OVERVIEW OF POWERFUL DISCOVERY, ANALYSIS AND REPORTING
SAS VISUAL ANALYTICS AN OVERVIEW OF POWERFUL DISCOVERY, ANALYSIS AND REPORTING WELCOME TO SAS VISUAL ANALYTICS SAS Visual Analytics is a high-performance, in-memory solution for exploring massive amounts
CALCULATIONS & STATISTICS
CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents
Using Statistical Process Control (SPC) for improved Utility Management
Using Statistical Process Control (SPC) for improved Utility Management Scott Dorner Hach Company Manage and Transform data into information to gain efficiencies 1 Data, Data Everywhere We track enormous
Getting started in Excel
Getting started in Excel Disclaimer: This guide is not complete. It is rather a chronicle of my attempts to start using Excel for data analysis. As I use a Mac with OS X, these directions may need to be
2013 MBA Jump Start Program. Statistics Module Part 3
2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just
The Turning of JMP Software into a Semiconductor Analysis Software Product:
The Turning of JMP Software into a Semiconductor Analysis Software Product: The Implementation and Rollout of JMP Software within Freescale Semiconductor Inc. Jim Nelson, Manager IT, Yield Management Systems
Forecasting of Paddy Production in Sri Lanka: A Time Series Analysis using ARIMA Model
Tropical Agricultural Research Vol. 24 (): 2-3 (22) Forecasting of Paddy Production in Sri Lanka: A Time Series Analysis using ARIMA Model V. Sivapathasundaram * and C. Bogahawatte Postgraduate Institute
Lecture 2: ARMA(p,q) models (part 3)
Lecture 2: ARMA(p,q) models (part 3) Florian Pelgrin University of Lausanne, École des HEC Department of mathematics (IMEA-Nice) Sept. 2011 - Jan. 2012 Florian Pelgrin (HEC) Univariate time series Sept.
Fairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
Multiple Optimization Using the JMP Statistical Software Kodak Research Conference May 9, 2005
Multiple Optimization Using the JMP Statistical Software Kodak Research Conference May 9, 2005 Philip J. Ramsey, Ph.D., Mia L. Stephens, MS, Marie Gaudard, Ph.D. North Haven Group, http://www.northhavengroup.com/
Gage Studies for Continuous Data
1 Gage Studies for Continuous Data Objectives Determine the adequacy of measurement systems. Calculate statistics to assess the linearity and bias of a measurement system. 1-1 Contents Contents Examples
Journal of Statistical Software
JSS Journal of Statistical Software June 2009, Volume 30, Issue 13. http://www.jstatsoft.org/ An Excel Add-In for Statistical Process Control Charts Samuel E. Buttrey Naval Postgraduate School Abstract
Practical Time Series Analysis Using SAS
Practical Time Series Analysis Using SAS Anders Milhøj Contents Preface... vii Part 1: Time Series as a Subject for Analysis... 1 Chapter 1 Time Series Data... 3 1.1 Time Series Questions... 3 1.2 Types
