Trend Analysis and Presentation
|
|
|
- Henry Jenkins
- 9 years ago
- Views:
Transcription
1 Percent Total Concentration Trend Monitoring What is it and why do we do it? Trend monitoring looks for changes in environmental parameters over time periods (E.g. last 10 years) or in space (e.g. as you move downstream) 100 Box Plot of Workshop Participants Concentration VS Time of Day *** :00 09:00 10:00 11:00 12:00 13:00 14:00 15:00 16:00 17:00 Time of Day *** Created with StatPlus Software
2 Be vewy, vewy quiet I m hunting fow a twend How to find a trend Visual- Good News...Graphing or mapping data for people to see is the easiest way to communicate trends, especially to a non-technical crowd. Bad News No way of measuring that there is a trend or how big it is. Statistical- Good News Can identify hard to see trends and gives a number that is defensible and repeatable. Bad News Easy to do wrong and hard to figure out.
3 Selecting How to Analyze and Present a Trend Did your data meet your data quality objectives? Trend monitoring requires strict monitoring protocols. Who is your audience and what type of analysis or presentation is appropriate for them? If a statistical test is required, does the data satisfy all the statistic s assumptions? What is the trend telling you? Determining the cause of the trend is more difficult than determining the trend. Have you considered all the exogenous parameters that could influence your trends (flow, time of day, etc.)?
4 Avgerage 7 Day Max for July 2000 Visual methods for displaying trends Average 7 day max by River Mile Summer 2000 E.coli Concentrations Concentration (MPN/100mL) October Sept River Mile Time/area series scatter plot 0 1 Bar Graphs Site Number August July June May Month Time series smoothed scatter plot* Time/area series box plot of statistics From Helsel D.R. and R.M. Hirsch Statistical Methods in Water Resources. Techniques of Water-Resources Investigations of the USGS Book 4. Chapter A3 p. 287
5 The Trouble with Graphs People tend to focus on the outliers and not on more subtle changes Gradual trends are hard to detect by eye Seasonal variation and exogenous variables can mask trends in a parameter. Viewers can see what they want to see sometimes
6 Statistical Trend Analysis Methods Picking a statistic can be tricky- Review your data for the following characteristics: Normal Distribution Abrupt Changes Cycles Outliers Missing Values Censored Data Serial Correlation Select a Statistic that is appropriate for the type of data set you have.
7 Statistical Trend Analysis Methods Type of statistic depends on data characteristics- To test if an existing data set is normally distributed you may (a) plot a histogram of the results; or (b) conduct tests for normality like the Shapiro-Wilk W test, the Filliben s statistic, or the studentized rage test (EPA, 2000, p. 4-6). Parametric- Statistics for normally distributed data. Includes regression of parameter against time or spatial measure, like river mile. Parametric statistics are rarely appropriate for environmental samples without massaging data. See the USGS guide by Helsel and Hirsch for descriptions of using regression. Nonparametric- Statistics that are not as dependent on assumptions about data distribution. Generally the safer statistics to use, but are not as readily available. Test include the Kendall test for presence of consistent trend, Sen slope test for measure of magnitude of slope, and Wilcoxon-Mann-Whitney step trend analysis.
8 Things to watch out for with statistical analysis Verify and document that all assumptions have been tested and met. Failure to do so, may lead to misleading results The easiest test to do, OLS regression, is usually not appropriate. More robust, non-parametric tests like the seasonal Kendall test are not readily available and can be computationally demanding. Finding no trend may only mean your data was insufficient to find the trend. Infrequent use of statistical methods often requires relearning fairly complicated procedures every year. If you do use a statistical package, keep excellent notes of what you do and why. Four OLS regression charts with the same R 2, slope, and intercept* * From Helsel D.R. and R.M. Hirsch Statistical Methods in Water Resources. Techniques of Water-Resources Investigations of the USGS Book 4. Chapter A3 p.245 *
9 Statistical Methods for Determining Trends Non-Parametric, distribution independent methods Seasonal Kendall Test: Compares the relationship between points at separate time periods or seasons and determines if there is a trend. Highly robust and relatively powerful, recommended method for most water quality trend monitoring (Aroner, 2001). * Sen Slope or Kendall Theil: Compares the change in value vs time (slope) for each point on the chart and takes the median slope as the a summary statistic describing the magnitude of the trend. ** ** Wilcoxon-Mann-Whitney Step Trend: Seasonal or non-seasonal test to identify changes that take place at a distinct time. Combine with Hodges- Lehmann estimator to determine magnitude of step (Aroner, 2001) * From Helsel D.R. and R.M. Hirsch Statistical Methods in Water Resources. Techniques of Water-Resources Investigations of the USGS Book 4. Chapter A3. p. 266 ** Created with WQHydro Water Quality/Hydrology Graphics/Analysis System Software
10 Some Units How to Calculate Non-Parametric Statistics Kendall Test to determine if a significant trend exists see EPA guide QA/G-9 pages 4-16 to XY Plot of data vs time Year Units Yearly Value Sen Slope or Kendall-Theil line to measure the magnitude of the trend see USGS Statistical Methods for Water Resources pages
11 What type of trend analysis is right for you to use? What would you need for temperature trend monitoring? Is there an exogenous variable? What about changes in other water quality parameters? How could changes in monitoring schedule impact trends? Example Bob has been monitoring DO in the morning at a site for 3 years. He adds 4 new sites to his route and now visits his old site in the afternoon every visit. Now samples are collected in the pm. What impact will this have on your ability to track trends? What could you do to correct the problem? DO Saturation in Willamette R. at Portland S.P. & S. Railroad Bridge
12 Statistical Methods for Determining Trends Regressions Regression methods draw a line as close to all the data as possible. Use regressions only if you can satisfy the data requirements. Helsel and Hirsch (1991) give a good description of the steps needed to satisfy assumptions and data transformation tools to bring your data within the limits of the assumptions. * * From Helsel D.R. and R.M. Hirsch Statistical Methods in Water Resources. Techniques of Water-Resources Investigations of the USGS Book 4. Chapter A3. p. 279
13 OWQI.month OWQI.anual Trend Analysis and Presentation Trend Monitoring... The devil is in the details Issues to consider when doing trend monitoring 1 Consistent data quality- Trend monitoring assumes that the same or equivalent methods and protocols are used for all the monitoring. Time frame & number of samples- 5 years of monthly data for WQ monotonic trend analysis (Lettenmaier et.al., 1982); for step trends, at least 2 years of monthly data before and after management change (Hirsch, 1982). (Statistic should be predefined in QAPP to make sure you have everything you need.) Seasonality- Parameters that vary naturally in different seasons of the year require special statistics or de-seasonalization. Type of data distribution (Is your data Normal?)*- If your data is not normally distributed, you need to use a non-parametric statistical test. Comparisonof Trends: Anual vs. MonthlyData WilameteRiverat NewbergBridge OWQI.month OWQI.anual /12/1906/27/1911/8/1923/23/1948/5/195 Date 1. Based on Aroner, E.R WQHYDRO Environmental Data Analysis Technical App. p. 146 * Issue most applicable to statistical analysis
14 Issues to consider when doing trend monitoring 1 Similar variance across data set*- Statistical methods require consistent sampling frequency (i.e., constant number of samples per period, equally spaced across the period) to minimize variance. Minimal missing observations*- Missing observations may invalidate some tests by changing variance. Outliers- Statistical tests designed for normal distributions are very sensitive to outliers Serial correlation*- Each result must be independent of other samples (across time and space). Exogenous variables- Parameters like conductivity and ph can be greatly impacted by exogenous parameters, like stream discharge or time of day, respectively. Changes in these exogenous variables may impact trends you measure in the parameter of interest. To remove these impacts develop a regression between the two variables. Calculate the residuals (the difference between the measured value and value predicted by the regression) and then run the trend analysis on the residuals. Differences in detection limits*- Changing detection limits might fool the statistic into thinking concentrations are decreasing (make sure you have a way to statistically handle changing detection limits}. 1. Based on Aroner, E.R WQHYDRO Environmental Data Analysis Technical Appendix p. 146 * Issue most applicable to statistical analysis
15 References Aroner, E.R., WQHYDRO Environmental Data Analysis Technical Appendix. Berk K.N., and P. Carey, Data Analysis with Microsoft Excel. Duxbury Press, Pacific Grove, CA. pp EPA, 2000 (EPA/600/R-96/084). Guidance for Data Quality Assessment: Practical methods for Data Analysis, EPA QA/G-9, QA00 Update Helsel D.R. and R.M. Hirsh Statistical Methods in Water Resources. Techniques of Water-Resources Investigations of the USGS Book 4. Chapter A3 Hirsch R.M., J.R. Slack and R.A. Smith, Techniques of Trend Analysis for Monthly Water Quality Data. Water Resources Research, Vol. 18 (1) pp February. Hirsch R.M., R.B. Alexander, and R.A. Smith Selection of Methods for the Detection and Estimation of Trends in Water Quality. Water Resources Research, Vol. 27(5), pp May. Lettenmaier, D.P., L.L. Conquest and J.P. Hughes, Routine Streams and Rivers Water Quality Trend Monitoring Review. C.W. Harris Hydraulics Laboratory, Technical Report No. 75, Seattle, WA.
1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number
1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression
How Far is too Far? Statistical Outlier Detection
How Far is too Far? Statistical Outlier Detection Steven Walfish President, Statistical Outsourcing Services [email protected] 30-325-329 Outline What is an Outlier, and Why are
Introduction to Minitab and basic commands. Manipulating data in Minitab Describing data; calculating statistics; transformation.
Computer Workshop 1 Part I Introduction to Minitab and basic commands. Manipulating data in Minitab Describing data; calculating statistics; transformation. Outlier testing Problem: 1. Five months of nickel
Projects Involving Statistics (& SPSS)
Projects Involving Statistics (& SPSS) Academic Skills Advice Starting a project which involves using statistics can feel confusing as there seems to be many different things you can do (charts, graphs,
Temperature Data Analysis
Continuous Temperature Data Analysis and Presentation Representing multiple sites on a single chart creates a better understanding of the watershed Using river mile is an easy way to set the geographic
Statistical Analysis for Monotonic Trends
November 2011 Donald W. Meals, Jean Spooner, Steven A. Dressing, and Jon B. Harcum. 2011. Statistical analysis for monotonic trends, Tech Notes 6, November 2011. Developed for U.S. Environmental Protection
Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),
Chapter 0 Key Ideas Correlation, Correlation Coefficient (r), Section 0-: Overview We have already explored the basics of describing single variable data sets. However, when two quantitative variables
Simple Regression Theory II 2010 Samuel L. Baker
SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the
Module 5: Statistical Analysis
Module 5: Statistical Analysis To answer more complex questions using your data, or in statistical terms, to test your hypothesis, you need to use more advanced statistical tests. This module reviews the
T O P I C 1 2 Techniques and tools for data analysis Preview Introduction In chapter 3 of Statistics In A Day different combinations of numbers and types of variables are presented. We go through these
Module 3: Correlation and Covariance
Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis
Part II Chapter 9 Chapter 10 Chapter 11 Chapter 12 Chapter 13 Chapter 14 Chapter 15 Part II
Part II covers diagnostic evaluations of historical facility data for checking key assumptions implicit in the recommended statistical tests and for making appropriate adjustments to the data (e.g., consideration
business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar
business statistics using Excel Glyn Davis & Branko Pecar OXFORD UNIVERSITY PRESS Detailed contents Introduction to Microsoft Excel 2003 Overview Learning Objectives 1.1 Introduction to Microsoft Excel
Scatter Plot, Correlation, and Regression on the TI-83/84
Scatter Plot, Correlation, and Regression on the TI-83/84 Summary: When you have a set of (x,y) data points and want to find the best equation to describe them, you are performing a regression. This page
SPSS Explore procedure
SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stem-and-leaf plots and extensive descriptive statistics. To run the Explore procedure,
3.2 Statistical Analysis Procedures
3.2 Statistical Analysis Procedures There are many different types of statistical analysis that can be performed on water quality data sets for reporting and interpretation purposes. Many inferences can
Formula for linear models. Prediction, extrapolation, significance test against zero slope.
Formula for linear models. Prediction, extrapolation, significance test against zero slope. Last time, we looked the linear regression formula. It s the line that fits the data best. The Pearson correlation
Simple linear regression
Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between
Fairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
Homework 8 Solutions
Math 17, Section 2 Spring 2011 Homework 8 Solutions Assignment Chapter 7: 7.36, 7.40 Chapter 8: 8.14, 8.16, 8.28, 8.36 (a-d), 8.38, 8.62 Chapter 9: 9.4, 9.14 Chapter 7 7.36] a) A scatterplot is given below.
Data Visualization Techniques
Data Visualization Techniques From Basics to Big Data with SAS Visual Analytics WHITE PAPER SAS White Paper Table of Contents Introduction.... 1 Generating the Best Visualizations for Your Data... 2 The
9 RUN CHART RUN CHART
Module 9 RUN CHART RUN CHART 1 What is a Run Chart? A Run Chart is the most basic tool used to display how a process performs over time. It is a line graph of data points plotted in chronological order
MTH 140 Statistics Videos
MTH 140 Statistics Videos Chapter 1 Picturing Distributions with Graphs Individuals and Variables Categorical Variables: Pie Charts and Bar Graphs Categorical Variables: Pie Charts and Bar Graphs Quantitative
Regression and Correlation
Regression and Correlation Topics Covered: Dependent and independent variables. Scatter diagram. Correlation coefficient. Linear Regression line. by Dr.I.Namestnikova 1 Introduction Regression analysis
Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test
The t-test Outline Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test - Dependent (related) groups t-test - Independent (unrelated) groups t-test Comparing means Correlation
FORECASTING. Operations Management
2013 FORECASTING Brad Fink CIT 492 Operations Management Executive Summary Woodlawn hospital needs to forecast type A blood so there is no shortage for the week of 12 October, to correctly forecast, a
Data Visualization Techniques
Data Visualization Techniques From Basics to Big Data with SAS Visual Analytics WHITE PAPER SAS White Paper Table of Contents Introduction.... 1 Generating the Best Visualizations for Your Data... 2 The
ESTIMATING THE DISTRIBUTION OF DEMAND USING BOUNDED SALES DATA
ESTIMATING THE DISTRIBUTION OF DEMAND USING BOUNDED SALES DATA Michael R. Middleton, McLaren School of Business, University of San Francisco 0 Fulton Street, San Francisco, CA -00 -- [email protected]
A spreadsheet Approach to Business Quantitative Methods
A spreadsheet Approach to Business Quantitative Methods by John Flaherty Ric Lombardo Paul Morgan Basil desilva David Wilson with contributions by: William McCluskey Richard Borst Lloyd Williams Hugh Williams
X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)
CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.
Simple Predictive Analytics Curtis Seare
Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use
Building Credibility: Quality Assurance & Quality Control for Volunteer Monitoring Programs
Building Credibility: Quality Assurance & Quality Control for Volunteer Monitoring Programs Elizabeth Herron URI Watershed Watch/ National Facilitation of CSREES Volunteer Monitoring Ingrid Harrald Cook
The Effect of Dropping a Ball from Different Heights on the Number of Times the Ball Bounces
The Effect of Dropping a Ball from Different Heights on the Number of Times the Ball Bounces Or: How I Learned to Stop Worrying and Love the Ball Comment [DP1]: Titles, headings, and figure/table captions
Module 5: Multiple Regression Analysis
Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College
Exercise 1.12 (Pg. 22-23)
Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.
Diagrams and Graphs of Statistical Data
Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in
Quality. Guidance for Data Quality Assessment. Practical Methods for Data Analysis EPA QA/G-9 QA00 UPDATE. EPA/600/R-96/084 July, 2000
United States Environmental Protection Agency Office of Environmental Information Washington, DC 060 EPA/600/R-96/08 July, 000 Guidance for Data Quality Assessment Quality Practical Methods for Data Analysis
PS 271B: Quantitative Methods II. Lecture Notes
PS 271B: Quantitative Methods II Lecture Notes Langche Zeng [email protected] The Empirical Research Process; Fundamental Methodological Issues 2 Theory; Data; Models/model selection; Estimation; Inference.
Teaching Business Statistics through Problem Solving
Teaching Business Statistics through Problem Solving David M. Levine, Baruch College, CUNY with David F. Stephan, Two Bridges Instructional Technology CONTACT: [email protected] Typical
Basic Statistics and Data Analysis for Health Researchers from Foreign Countries
Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Volkert Siersma [email protected] The Research Unit for General Practice in Copenhagen Dias 1 Content Quantifying association
A Review of Statistical Outlier Methods
Page 1 of 5 A Review of Statistical Outlier Methods Nov 2, 2006 By: Steven Walfish Pharmaceutical Technology Statistical outlier detection has become a popular topic as a result of the US Food and Drug
Statistics 2014 Scoring Guidelines
AP Statistics 2014 Scoring Guidelines College Board, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks of the College Board. AP Central is the official online home
Module 4: Data Exploration
Module 4: Data Exploration Now that you have your data downloaded from the Streams Project database, the detective work can begin! Before computing any advanced statistics, we will first use descriptive
Using Statistical Process Control (SPC) for improved Utility Management
Using Statistical Process Control (SPC) for improved Utility Management Scott Dorner Hach Company Manage and Transform data into information to gain efficiencies 1 Data, Data Everywhere We track enormous
Evaluation of Open Channel Flow Equations. Introduction :
Evaluation of Open Channel Flow Equations Introduction : Most common hydraulic equations for open channels relate the section averaged mean velocity (V) to hydraulic radius (R) and hydraulic gradient (S).
Financial Risk Management Exam Sample Questions/Answers
Financial Risk Management Exam Sample Questions/Answers Prepared by Daniel HERLEMONT 1 2 3 4 5 6 Chapter 3 Fundamentals of Statistics FRM-99, Question 4 Random walk assumes that returns from one time period
This unit will lay the groundwork for later units where the students will extend this knowledge to quadratic and exponential functions.
Algebra I Overview View unit yearlong overview here Many of the concepts presented in Algebra I are progressions of concepts that were introduced in grades 6 through 8. The content presented in this course
The Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175)
Describing Data: Categorical and Quantitative Variables Population The Big Picture Sampling Statistical Inference Sample Exploratory Data Analysis Descriptive Statistics In order to make sense of data,
CORRELATIONAL ANALYSIS: PEARSON S r Purpose of correlational analysis The purpose of performing a correlational analysis: To discover whether there
CORRELATIONAL ANALYSIS: PEARSON S r Purpose of correlational analysis The purpose of performing a correlational analysis: To discover whether there is a relationship between variables, To find out the
OUTLIER ANALYSIS. Data Mining 1
OUTLIER ANALYSIS Data Mining 1 What Are Outliers? Outlier: A data object that deviates significantly from the normal objects as if it were generated by a different mechanism Ex.: Unusual credit card purchase,
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts
. 58 58 60 62 64 66 68 70 72 74 76 78 Father s height (inches)
PEARSON S FATHER-SON DATA The following scatter diagram shows the heights of 1,0 fathers and their full-grown sons, in England, circa 1900 There is one dot for each father-son pair Heights of fathers and
the Median-Medi Graphing bivariate data in a scatter plot
the Median-Medi Students use movie sales data to estimate and draw lines of best fit, bridging technology and mathematical understanding. david c. Wilson Graphing bivariate data in a scatter plot and drawing
Statistical And Trend Analysis Of Rainfall And River Discharge: Yala River Basin, Kenya
Statistical And Trend Analysis Of Rainfall And River Discharge: Yala River Basin, Kenya Githui F. W.* +, A. Opere* and W. Bauwens + *Department of Meteorology, University of Nairobi, P O Box 30197 Nairobi,
Data Quality Assessment: A Reviewer s Guide EPA QA/G-9R
United States Office of Environmental EPA/240/B-06/002 Environmental Protection Information Agency Washington, DC 20460 Data Quality Assessment: A Reviewer s Guide EPA QA/G-9R FOREWORD This document is
6. MEASURING EFFECTS OVERVIEW CHOOSE APPROPRIATE METRICS
45 6. MEASURING EFFECTS OVERVIEW In Section 4, we provided an overview of how to select metrics for monitoring implementation progress. This section provides additional detail on metric selection and offers
Using SPSS, Chapter 2: Descriptive Statistics
1 Using SPSS, Chapter 2: Descriptive Statistics Chapters 2.1 & 2.2 Descriptive Statistics 2 Mean, Standard Deviation, Variance, Range, Minimum, Maximum 2 Mean, Median, Mode, Standard Deviation, Variance,
APPENDIX N. Data Validation Using Data Descriptors
APPENDIX N Data Validation Using Data Descriptors Data validation is often defined by six data descriptors: 1) reports to decision maker 2) documentation 3) data sources 4) analytical method and detection
Normality Testing in Excel
Normality Testing in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. [email protected]
Time Series Laboratory
Time Series Laboratory Computing in Weber Classrooms 205-206: To log in, make sure that the DOMAIN NAME is set to MATHSTAT. Use the workshop username: primesw The password will be distributed during the
Integrating the Internet into Your Measurement System. DataSocket Technical Overview
Integrating the Internet into Your Measurement System DataSocket Technical Overview Introduction The Internet continues to become more integrated into our daily lives. This is particularly true for scientists
II. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
Using kernel methods to visualise crime data
Submission for the 2013 IAOS Prize for Young Statisticians Using kernel methods to visualise crime data Dr. Kieran Martin and Dr. Martin Ralphs [email protected] [email protected] Office
Lean Six Sigma Black Belt-EngineRoom
Lean Six Sigma Black Belt-EngineRoom Course Content and Outline Total Estimated Hours: 140.65 *Course includes choice of software: EngineRoom (included for free), Minitab (must purchase separately) or
The importance of graphing the data: Anscombe s regression examples
The importance of graphing the data: Anscombe s regression examples Bruce Weaver Northern Health Research Conference Nipissing University, North Bay May 30-31, 2008 B. Weaver, NHRC 2008 1 The Objective
Charts, Tables, and Graphs
Charts, Tables, and Graphs The Mathematics sections of the SAT also include some questions about charts, tables, and graphs. You should know how to (1) read and understand information that is given; (2)
430 Statistics and Financial Mathematics for Business
Prescription: 430 Statistics and Financial Mathematics for Business Elective prescription Level 4 Credit 20 Version 2 Aim Students will be able to summarise, analyse, interpret and present data, make predictions
2.2 Elimination of Trend and Seasonality
26 CHAPTER 2. TREND AND SEASONAL COMPONENTS 2.2 Elimination of Trend and Seasonality Here we assume that the TS model is additive and there exist both trend and seasonal components, that is X t = m t +
2013 MBA Jump Start Program. Statistics Module Part 3
2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just
EXPLORING SPATIAL PATTERNS IN YOUR DATA
EXPLORING SPATIAL PATTERNS IN YOUR DATA OBJECTIVES Learn how to examine your data using the Geostatistical Analysis tools in ArcMap. Learn how to use descriptive statistics in ArcMap and Geoda to analyze
Getting Correct Results from PROC REG
Getting Correct Results from PROC REG Nathaniel Derby, Statis Pro Data Analytics, Seattle, WA ABSTRACT PROC REG, SAS s implementation of linear regression, is often used to fit a line without checking
DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS
DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi - 110 012 [email protected] 1. Descriptive Statistics Statistics
Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS.
SPSS Regressions Social Science Research Lab American University, Washington, D.C. Web. www.american.edu/provost/ctrl/pclabs.cfm Tel. x3862 Email. [email protected] Course Objective This course is designed
Using Excel for Statistical Analysis
Using Excel for Statistical Analysis You don t have to have a fancy pants statistics package to do many statistical functions. Excel can perform several statistical tests and analyses. First, make sure
PROPERTIES OF THE SAMPLE CORRELATION OF THE BIVARIATE LOGNORMAL DISTRIBUTION
PROPERTIES OF THE SAMPLE CORRELATION OF THE BIVARIATE LOGNORMAL DISTRIBUTION Chin-Diew Lai, Department of Statistics, Massey University, New Zealand John C W Rayner, School of Mathematics and Applied Statistics,
We are often interested in the relationship between two variables. Do people with more years of full-time education earn higher salaries?
Statistics: Correlation Richard Buxton. 2008. 1 Introduction We are often interested in the relationship between two variables. Do people with more years of full-time education earn higher salaries? Do
MATH 103/GRACEY PRACTICE EXAM/CHAPTERS 2-3. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
MATH 3/GRACEY PRACTICE EXAM/CHAPTERS 2-3 Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 1) The frequency distribution
Mean, Median, and Mode
DELTA MATH SCIENCE PARTNERSHIP INITIATIVE M 3 Summer Institutes (Math, Middle School, MS Common Core) Mean, Median, and Mode Hook Problem: To compare two shipments, five packages from each shipment were
Assumptions. Assumptions of linear models. Boxplot. Data exploration. Apply to response variable. Apply to error terms from linear model
Assumptions Assumptions of linear models Apply to response variable within each group if predictor categorical Apply to error terms from linear model check by analysing residuals Normality Homogeneity
Regression III: Advanced Methods
Lecture 16: Generalized Additive Models Regression III: Advanced Methods Bill Jacoby Michigan State University http://polisci.msu.edu/jacoby/icpsr/regress3 Goals of the Lecture Introduce Additive Models
Introduction to Regression and Data Analysis
Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it
DEALING WITH THE DATA An important assumption underlying statistical quality control is that their interpretation is based on normal distribution of t
APPLICATION OF UNIVARIATE AND MULTIVARIATE PROCESS CONTROL PROCEDURES IN INDUSTRY Mali Abdollahian * H. Abachi + and S. Nahavandi ++ * Department of Statistics and Operations Research RMIT University,
The Seven Basic Tools. QUALITY CONTROL TOOLS (The Seven Basic Tools) What are check sheets? CHECK SHEET. Illustration (Painting defects)
QUALITY CONTROL TOOLS (The Seven Basic Tools) Dr. Ömer Yağız Department of Business Administration Eastern Mediterranean University TRNC The Seven Basic Tools The seven basic tools are: Check sheet Flow
Introduction to nonparametric regression: Least squares vs. Nearest neighbors
Introduction to nonparametric regression: Least squares vs. Nearest neighbors Patrick Breheny October 30 Patrick Breheny STA 621: Nonparametric Statistics 1/16 Introduction For the remainder of the course,
WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat
Information Builders enables agile information solutions with business intelligence (BI) and integration technologies. WebFOCUS the most widely utilized business intelligence platform connects to any enterprise
APPENDIX E THE ASSESSMENT PHASE OF THE DATA LIFE CYCLE
APPENDIX E THE ASSESSMENT PHASE OF THE DATA LIFE CYCLE The assessment phase of the Data Life Cycle includes verification and validation of the survey data and assessment of quality of the data. Data verification
Session 7 Bivariate Data and Analysis
Session 7 Bivariate Data and Analysis Key Terms for This Session Previously Introduced mean standard deviation New in This Session association bivariate analysis contingency table co-variation least squares
A Sales Strategy to Increase Function Bookings
A Sales Strategy to Increase Function Bookings It s Time to Start Selling Again! It s time to take on a sales oriented focus for the bowling business. Why? Most bowling centres have lost the art and the
Examples of Data Representation using Tables, Graphs and Charts
Examples of Data Representation using Tables, Graphs and Charts This document discusses how to properly display numerical data. It discusses the differences between tables and graphs and it discusses various
Lesson 2: Constructing Line Graphs and Bar Graphs
Lesson 2: Constructing Line Graphs and Bar Graphs Selected Content Standards Benchmarks Assessed: D.1 Designing and conducting statistical experiments that involve the collection, representation, and analysis
DATA COLLECTION AND ANALYSIS
DATA COLLECTION AND ANALYSIS Quality Education for Minorities (QEM) Network HBCU-UP Fundamentals of Education Research Workshop Gerunda B. Hughes, Ph.D. August 23, 2013 Objectives of the Discussion 2 Discuss
1997-98 UPPER DESCHUTES R-EMAP TEMPERATURE SUMMARY
1997-98 UPPER DESCHUTES R-EMAP TEMPERATURE SUMMARY Daria G. Mochan Oregon Department of Environmental Quality Laboratory Division Biomonitoring Section 1712 S.W. Eleventh Avenue Portland, Oregon 97201
SP500 September 2011 Outlook
SP500 September 2011 Outlook This document is designed to provide the trader and investor of the Standard and Poor s 500 with an overview of the seasonal tendency as well as the current cyclic pattern
