EXPLORING SPATIAL PATTERNS IN YOUR DATA


 Angela Boyd
 3 years ago
 Views:
Transcription
1 EXPLORING SPATIAL PATTERNS IN YOUR DATA
2 OBJECTIVES Learn how to examine your data using the Geostatistical Analysis tools in ArcMap. Learn how to use descriptive statistics in ArcMap and Geoda to analyze data. Be able to identify Geostatistical Analysis tools that can be used for further analysis.
3 WHY EXPLORE YOUR DATA? It allows you to better select an appropriate tool to analyze your data. If you skip exploring your data, you may miss key information about it that may lead to incorrect conclusions and decisions.
4 GEODA VS. ARCMAP Geoda free, opensource, simple, software specifically for statistical analysis ArcMap proprietary, GIS software that can perform statistical analysis along with hundreds of other analyses
5 GEODA VS. ARCMAP With ArcMap you can view several data layers at once. In Geoda, you view only one data layer. Some tools are found in both programs, while some are found in only one.
6 EXPLORE THE LOCATION OF YOUR DATA
7 EXPLORE THE LOCATION OF YOUR DATA Explore: size of the study area mean median direction data are oriented You will see where data are clustered relative to the rest of the data.
8 MEAN CENTER The geographic center for a set of features. Constructed from the average x and y values for the input feature centroids (middle points, if input features are polygons).
9 MEDIAN CENTER Median Center is robust to outliers. Uses an algorithm to find the point that minimizes travel from it to all other features in the dataset. At each step (t) in the algorithm, a candidate Median Center is found (X t, Y t ) and refined until it represents the location that minimizes Euclidian Distance d to all features (i) in the dataset.
10 DIRECTION DISTRIBUTION (STANDARD DEVIATIONAL ELLIPSE) Standard deviational ellipses summarize the spatial characteristics of geographic features: central tendency, dispersion, and directional trends. The ellipse allows you to see if the distribution of features is elongated and hence has a particular orientation. When the underlying spatial pattern of features is concentrated in the center with fewer features toward the periphery (a spatial normal distribution), a one standard deviation ellipse polygon will cover approximately 68 percent of the features two standard deviations will contain approximately 95 percent of the features three standard deviations will cover approximately 99 percent of the features
11
12 EXPLORE THE VALUES OF YOUR DATA
13 NORMAL DISTRIBUTION Some analysis tools assume a normal distribution: Mean and median are similar Data are symmetrical
14 DATA FREQUENCY USING HISTOGRAMS
15 DATA DISTRIBUTION USING A QQ PLOT Many A characteristics normally Not distributed normal of a normal dataset dataset A normal QQ plot shows the relationship of your data to a normal distribution line.
16 BOX PLOT Displays the median and interquartile range (IQ) (25%75%) Hinge = multiple of interquartile range
17 MAPS For examining data values and frequencies: Quantile Map Natural breaks Equal intervals For finding outliers: Percentile Map Box Map Standard Deviation Map
18 QUANTILE MAP Displays the distribution of values in categories with an equal number of observations in each category.
19 EQUAL INTERVAL MAP Sets the value ranges in each category equal in size. The entire range of data values is divided equally into however many categories have been chosen.
20 NATURAL BREAKS MAP Seeks to reduce the variance within classes and maximize the variance between classes
21 OTHER EXPLORATORY METHODS Scatter Plot (2 variables) Parallel coordinate plot (A pattern of lines is drawn that connects the coordinates of each observation across the variables on parallel xaxes.)
22 DETECT OUTLIERS
23 OUTLIERS Outliers can reveal mistakes, unusual occurrences, and shift points in data patterns (a valley in a mountain range). You should use more than one method to find outliers because some techniques will only highlight data values near the two ends of your range.
24 PERCENTILE MAP Groups ranked data into 6 categories Lowest and highest 1% are potential outliers
25 BOX MAP Groups data into 4 categories, plus 2 outlier categories at both ends Data are outliers if they are 1.5 or 3 times the IQ. Detects outliers with more certainty than a percentile map
26 STANDARD DEVIATION MAP Displays data 3 standard deviations above and below the mean. As a parametric map, it is sensitive to outliers.
27 SEMIVARIOGRAM CLOUD When points closer together have greater differences in their values, this may indicate an outlier in the data. The selected points may be outliers.
28 VORONOI MAP The gray polygons may be outliers. Cluster Voronoi maps show spatial outliers in your data; simple Voronoi maps can pinpoint data values that are many class breaks removed from surrounding polygons.
29 HISTOGRAM Values in the last bars to the left or right, if far removed from the adjacent values, may indicate outliers.
30 NORMAL QQ PLOT Values at the tails of a normal QQ plot can also be outliers. This can happen when the tail values do not fall along the reference line.
31 BOXPLOT Points outside the hinges (represented by the black, horizontal lines), maybe outliers.
32 EXPLORE SPATIAL RELATIONSHIPS IN YOUR DATA
33 SPATIAL AUTOCORRELATION Everything is related, but objects closer together are more related than objects farther apart. Explore using a semivariogram graph or cloud Can also be explored using Moran s I and GetisOrd G statistics
34 Height (sill) = variation between data values. Range = distance between points at which the semivariogram flattens out. As the range increase, height should increase, since points further away from each other are not as related, so there should be more variation. If a semivariogram is a horizontal line, there is no spatial autocorrelation.
35 VARIATION IN YOUR DATA Many spatial statistics analysis techniques assume your data are stationary, meaning the relationship between two points and their values depends on the distance between them, not their exact location. Explore variation using a Voronoi map. A Voronoi map is created by defining Thiessen polygons around each point in your dataset. Any location inside a polygon represents the area closer to that data point than to any other data point. This allows you to explore the variation of each sample point based on its relationship to surrounding sample points.
36 A SIMPLE VORONOI MAP Green = little local variation Orange and Red = greater local variation A simple Voronoi map shows the data value at each location. The map is symbolized using a geometrical interval classification. This will show the variation in data values across your entire dataset.
37 TYPES OF VORONOI MAPS Simple: The value assigned to a polygon is the value recorded at the sample point within that polygon. Mean: The value assigned to a polygon is the mean value that is calculated from the polygon and its neighbors. Mode: All polygons are categorized using five class intervals. The value assigned to a polygon is the mode (most frequently occurring class) of the polygon and its neighbors. Cluster: All polygons are categorized using five class intervals. If the class interval of a polygon is different from each of its neighbors, the polygon is colored gray and put into a sixth class to distinguish it from its neighbors. Entropy: All polygons are categorized using five classes based on a natural grouping of data values (smart quantiles). The value assigned to a polygon is the entropy that is calculated from the polygon and its neighbors. Entropy =  Σ (p i * Log p i ),
38 EXPLORE TRENDS IN YOUR DATA
39 TREND ANALYSIS You can use the trend analysis tool in Arcmap to visually compare the trend lines with any patterns in your data. When exploring trends, your data locations are mapped along the x and yaxes. The values of each data location are mapped as height (zaxis). Trends are analyzed based on direction and on the order of the line that fits the trend. The trend line is a mathematical function, or polynomial, that describes the variation in the data.
40 You can determine whether the order of the polynomial fits your data based on the shape created by the line. A secondorder polynomial will appear as an upward or a downward curve (known as a parabola). These polynomials show a clear curve, indicating a secondorder trend in the data.
41 SELECTING AN ANALYSIS TECHNIQUE
42 Each of the following techniques are types of interpolation. Interpolation creates surfaces based on spatially continuous data. Each surface uses the values and locations of your points to create (or interpolate) the values for the remaining points in the surface.
43 GEOSTATISTICAL INTERPOLATION Creates surfaces using the relationships between your data locations and their values. Predicts values based on your existing data. Assumptions: Data is not clustered. (Simple kriging technique has a declustering option.) Data is normally distributed. (Transformation options are available.) Data is stationary (no local variation). Data is autocorrelated. Data has no local trends. (You can remove trends from data as part of the interpolation process. )
44 GLOBAL DETERMINISTIC INTERPOLATION Creates surfaces using the existing values at each location. Uses your entire dataset to create your surface. Assumptions: Outliers have been removed from the data. Global trends exist in the data.
45 LOCAL DETERMINISTIC INTERPOLATION Uses several subsets, or neighborhoods, within an entire dataset to create the different components of the surface. Assumption: Data is normally distributed.
46 INVERSE DISTANCE WEIGHTED INTERPOLATION (IDW) A type of local deterministic interpolation. Assumptions: Data is not clustered. Data is autocorrelated.
47 OTHER SPATIAL STATISTICAL TESTS Tests for spatial autocorrelation GetisOrd General G and Global Moran s I (to determine overall clustering and dispersion of values) Hot Spot Analysis (GetisOrd Gi*) and Anselin s Local Moran s I (to determine specific clusters of high and low values) Regression Used to evaluate relationships between two or more feature attributes. Are location, crime rates, racial makeup, and income related to housing values in a census tract?
Data Exploration Data Visualization
Data Exploration Data Visualization What is data exploration? A preliminary exploration of the data to better understand its characteristics. Key motivations of data exploration include Helping to select
More informationGeostatistics Exploratory Analysis
Instituto Superior de Estatística e Gestão de Informação Universidade Nova de Lisboa Master of Science in Geospatial Technologies Geostatistics Exploratory Analysis Carlos Alberto Felgueiras cfelgueiras@isegi.unl.pt
More informationLecture 2: Descriptive Statistics and Exploratory Data Analysis
Lecture 2: Descriptive Statistics and Exploratory Data Analysis Further Thoughts on Experimental Design 16 Individuals (8 each from two populations) with replicates Pop 1 Pop 2 Randomly sample 4 individuals
More informationBNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I
BNG 202 Biomechanics Lab Descriptive statistics and probability distributions I Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential
More informationDiagrams and Graphs of Statistical Data
Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in
More informationWhy Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts
More informationSTATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI
STATS8: Introduction to Biostatistics Data Exploration Babak Shahbaba Department of Statistics, UCI Introduction After clearly defining the scientific problem, selecting a set of representative members
More informationExercise 1.12 (Pg. 2223)
Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.
More informationExploratory Data Analysis
Exploratory Data Analysis Johannes Schauer johannes.schauer@tugraz.at Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Introduction
More informationIntroduction to Exploratory Data Analysis
Introduction to Exploratory Data Analysis A SpaceStat Software Tutorial Copyright 2013, BioMedware, Inc. (www.biomedware.com). All rights reserved. SpaceStat and BioMedware are trademarks of BioMedware,
More informationSpatial Analysis with GeoDa Spatial Autocorrelation
Spatial Analysis with GeoDa Spatial Autocorrelation 1. Background GeoDa is a trademark of Luc Anselin. GeoDa is a collection of software tools designed for exploratory spatial data analysis (ESDA) based
More informationIntroduction to Modeling Spatial Processes Using Geostatistical Analyst
Introduction to Modeling Spatial Processes Using Geostatistical Analyst Konstantin Krivoruchko, Ph.D. Software Development Lead, Geostatistics kkrivoruchko@esri.com Geostatistics is a set of models and
More informationLecture 1: Review and Exploratory Data Analysis (EDA)
Lecture 1: Review and Exploratory Data Analysis (EDA) Sandy Eckel seckel@jhsph.edu Department of Biostatistics, The Johns Hopkins University, Baltimore USA 21 April 2008 1 / 40 Course Information I Course
More informationArcGIS 9. Geostatistical Analyst
ArcGIS 9 Using ArcGIS Geostatistical Analyst Copyright 2001, 2003 ESRI All Rights Reserved. Printed in the United States of America. The information contained in this document is the exclusive property
More informationMathematics. Probability and Statistics Curriculum Guide. Revised 2010
Mathematics Probability and Statistics Curriculum Guide Revised 2010 This page is intentionally left blank. Introduction The Mathematics Curriculum Guide serves as a guide for teachers when planning instruction
More information3: Summary Statistics
3: Summary Statistics Notation Let s start by introducing some notation. Consider the following small data set: 4 5 30 50 8 7 4 5 The symbol n represents the sample size (n = 0). The capital letter X denotes
More informationGeostatistical Analyst Tutorial
Copyright 19952012 Esri All rights reserved. Table of Contents Introduction to the ArcGIS Geostatistical Analyst Tutorial................... 0 Exercise 1: Creating a surface using default parameters...................
More informationLab 7. Exploratory Data Analysis
Lab 7. Exploratory Data Analysis SOC 261, Spring 2005 Spatial Thinking in Social Science 1. Background GeoDa is a trademark of Luc Anselin. GeoDa is a collection of software tools designed for exploratory
More informationScatter Plots with Error Bars
Chapter 165 Scatter Plots with Error Bars Introduction The procedure extends the capability of the basic scatter plot by allowing you to plot the variability in Y and X corresponding to each point. Each
More informationModule 4: Data Exploration
Module 4: Data Exploration Now that you have your data downloaded from the Streams Project database, the detective work can begin! Before computing any advanced statistics, we will first use descriptive
More informationGIS Tutorial 1. Lecture 2 Map design
GIS Tutorial 1 Lecture 2 Map design Outline Choropleth maps Colors Vector GIS display GIS queries Map layers and scale thresholds Hyperlinks and map tips 2 Lecture 2 CHOROPLETH MAPS Choropleth maps Colorcoded
More informationSummarizing and Displaying Categorical Data
Summarizing and Displaying Categorical Data Categorical data can be summarized in a frequency distribution which counts the number of cases, or frequency, that fall into each category, or a relative frequency
More informationNorthumberland Knowledge
Northumberland Knowledge Know Guide How to Analyse Data  November 2012  This page has been left blank 2 About this guide The Know Guides are a suite of documents that provide useful information about
More information4.1 Exploratory Analysis: Once the data is collected and entered, the first question is: "What do the data look like?"
Data Analysis Plan The appropriate methods of data analysis are determined by your data types and variables of interest, the actual distribution of the variables, and the number of cases. Different analyses
More informationExploratory data analysis (Chapter 2) Fall 2011
Exploratory data analysis (Chapter 2) Fall 2011 Data Examples Example 1: Survey Data 1 Data collected from a Stat 371 class in Fall 2005 2 They answered questions about their: gender, major, year in school,
More informationArcGIS Geostatistical Analyst: Statistical Tools for Data Exploration, Modeling, and Advanced Surface Generation
ArcGIS Geostatistical Analyst: Statistical Tools for Data Exploration, Modeling, and Advanced Surface Generation An ESRI White Paper August 2001 ESRI 380 New York St., Redlands, CA 923738100, USA TEL
More informationLesson 4 Measures of Central Tendency
Outline Measures of a distribution s shape modality and skewness the normal distribution Measures of central tendency mean, median, and mode Skewness and Central Tendency Lesson 4 Measures of Central
More informationGeography 4203 / 5203. GIS Modeling. Class (Block) 9: Variogram & Kriging
Geography 4203 / 5203 GIS Modeling Class (Block) 9: Variogram & Kriging Some Updates Today class + one proposal presentation Feb 22 Proposal Presentations Feb 25 Readings discussion (Interpolation) Last
More informationBASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS
BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi110 012 seema@iasri.res.in Genomics A genome is an organism s
More informationSPSS for Exploratory Data Analysis Data used in this guide: studentp.sav (http://people.ysu.edu/~gchang/stat/studentp.sav)
Data used in this guide: studentp.sav (http://people.ysu.edu/~gchang/stat/studentp.sav) Organize and Display One Quantitative Variable (Descriptive Statistics, Boxplot & Histogram) 1. Move the mouse pointer
More informationIris Sample Data Set. Basic Visualization Techniques: Charts, Graphs and Maps. Summary Statistics. Frequency and Mode
Iris Sample Data Set Basic Visualization Techniques: Charts, Graphs and Maps CS598 Information Visualization Spring 2010 Many of the exploratory data techniques are illustrated with the Iris Plant data
More informationSimple Predictive Analytics Curtis Seare
Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use
More informationDescriptive Statistics
Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web
More informationLecture 2. Summarizing the Sample
Lecture 2 Summarizing the Sample WARNING: Today s lecture may bore some of you It s (sort of) not my fault I m required to teach you about what we re going to cover today. I ll try to make it as exciting
More informationData Visualization Techniques and Practices Introduction to GIS Technology
Data Visualization Techniques and Practices Introduction to GIS Technology Michael Greene Advanced Analytics & Modeling, Deloitte Consulting LLP March 16 th, 2010 Antitrust Notice The Casualty Actuarial
More informationAn Introduction to Point Pattern Analysis using CrimeStat
Introduction An Introduction to Point Pattern Analysis using CrimeStat Luc Anselin Spatial Analysis Laboratory Department of Agricultural and Consumer Economics University of Illinois, UrbanaChampaign
More informationbusiness statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar
business statistics using Excel Glyn Davis & Branko Pecar OXFORD UNIVERSITY PRESS Detailed contents Introduction to Microsoft Excel 2003 Overview Learning Objectives 1.1 Introduction to Microsoft Excel
More informationSpatial Data Analysis
14 Spatial Data Analysis OVERVIEW This chapter is the first in a set of three dealing with geographic analysis and modeling methods. The chapter begins with a review of the relevant terms, and an outlines
More informationconsider the number of math classes taken by math 150 students. how can we represent the results in one number?
ch 3: numerically summarizing data  center, spread, shape 3.1 measure of central tendency or, give me one number that represents all the data consider the number of math classes taken by math 150 students.
More informationAMS 7L LAB #2 Spring, 2009. Exploratory Data Analysis
AMS 7L LAB #2 Spring, 2009 Exploratory Data Analysis Name: Lab Section: Instructions: The TAs/lab assistants are available to help you if you have any questions about this lab exercise. If you have any
More informationA Correlation of. to the. South Carolina Data Analysis and Probability Standards
A Correlation of to the South Carolina Data Analysis and Probability Standards INTRODUCTION This document demonstrates how Stats in Your World 2012 meets the indicators of the South Carolina Academic Standards
More informationDATA INTERPRETATION AND STATISTICS
PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE
More informationFoundation of Quantitative Data Analysis
Foundation of Quantitative Data Analysis Part 1: Data manipulation and descriptive statistics with SPSS/Excel HSRS #10  October 17, 2013 Reference : A. Aczel, Complete Business Statistics. Chapters 1
More information2. Simple Linear Regression
Research methods  II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according
More informationThe right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median
CONDENSED LESSON 2.1 Box Plots In this lesson you will create and interpret box plots for sets of data use the interquartile range (IQR) to identify potential outliers and graph them on a modified box
More informationDescriptive Statistics and Measurement Scales
Descriptive Statistics 1 Descriptive Statistics and Measurement Scales Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample
More informationWhat is GIS? Geographic Information Systems. Introduction to ArcGIS. GIS Maps Contain Layers. What Can You Do With GIS? Layers Can Contain Features
What is GIS? Geographic Information Systems Introduction to ArcGIS A database system in which the organizing principle is explicitly SPATIAL For CPSC 178 Visualization: Data, Pixels, and Ideas. What Can
More informationDemographics of Atlanta, Georgia:
Demographics of Atlanta, Georgia: A Visual Analysis of the 2000 and 2010 Census Data 36315 Final Project Rachel Cohen, Kathryn McKeough, Minnar Xie & David Zimmerman Ethnicities of Atlanta Figure 1: From
More informationData Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining
Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan, Steinbach, Kumar What is data exploration? A preliminary exploration of the data to better understand its characteristics.
More informationDESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.
DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,
More informationData Preparation and Statistical Displays
Reservoir Modeling with GSLIB Data Preparation and Statistical Displays Data Cleaning / Quality Control Statistics as Parameters for Random Function Models Univariate Statistics Histograms and Probability
More informationChapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs
Types of Variables Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Quantitative (numerical)variables: take numerical values for which arithmetic operations make sense (addition/averaging)
More informationCurriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 20092010
Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 20092010 Week 1 Week 2 14.0 Students organize and describe distributions of data by using a number of different
More informationNew Tools for Spatial Data Analysis in the Social Sciences
New Tools for Spatial Data Analysis in the Social Sciences Luc Anselin University of Illinois, UrbanaChampaign anselin@uiuc.edu edu Outline! Background! Visualizing Spatial and SpaceTime Association!
More informationData Mining: Exploring Data. Lecture Notes for Chapter 3. Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler
Data Mining: Exploring Data Lecture Notes for Chapter 3 Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler Topics Exploratory Data Analysis Summary Statistics Visualization What is data exploration?
More informationSpatial Data Analysis Using GeoDa. Workshop Goals
Spatial Data Analysis Using GeoDa 9 Jan 2014 Frank Witmer Computing and Research Services Institute of Behavioral Science Workshop Goals Enable participants to find and retrieve geographic data pertinent
More informationExploratory Spatial Data Analysis
Exploratory Spatial Data Analysis Part II Dynamically Linked Views 1 Contents Introduction: why to use noncartographic data displays Display linking by object highlighting Dynamic Query Object classification
More informationMINITAB ASSISTANT WHITE PAPER
MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. OneWay
More informationVariables. Exploratory Data Analysis
Exploratory Data Analysis Exploratory Data Analysis involves both graphical displays of data and numerical summaries of data. A common situation is for a data set to be represented as a matrix. There is
More informationProbability and Statistics Vocabulary List (Definitions for Middle School Teachers)
Probability and Statistics Vocabulary List (Definitions for Middle School Teachers) B Bar graph a diagram representing the frequency distribution for nominal or discrete data. It consists of a sequence
More informationTutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller
Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller Getting to know the data An important first step before performing any kind of statistical analysis is to familiarize
More informationDescriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics
Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),
More informationDESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS
DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi  110 012 seema@iasri.res.in 1. Descriptive Statistics Statistics
More informationIntro to GIS Winter 2011. Data Visualization Part I
Intro to GIS Winter 2011 Data Visualization Part I Cartographer Code of Ethics Always have a straightforward agenda and have a defining purpose or goal for each map Always strive to know your audience
More informationCommon Tools for Displaying and Communicating Data for Process Improvement
Common Tools for Displaying and Communicating Data for Process Improvement Packet includes: Tool Use Page # Box and Whisker Plot Check Sheet Control Chart Histogram Pareto Diagram Run Chart Scatter Plot
More informationAMARILLO BY MORNING: DATA VISUALIZATION IN GEOSTATISTICS
AMARILLO BY MORNING: DATA VISUALIZATION IN GEOSTATISTICS William V. Harper 1 and Isobel Clark 2 1 Otterbein College, United States of America 2 Alloa Business Centre, United Kingdom wharper@otterbein.edu
More informationWorkshop: Using Spatial Analysis and Maps to Understand Patterns of Health Services Utilization
Enhancing Information and Methods for Health System Planning and Research, Institute for Clinical Evaluative Sciences (ICES), January 1920, 2004, Toronto, Canada Workshop: Using Spatial Analysis and Maps
More informationCALCULATIONS & STATISTICS
CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 15 scale to 0100 scores When you look at your report, you will notice that the scores are reported on a 0100 scale, even though respondents
More informationDensity Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties:
Density Curve A density curve is the graph of a continuous probability distribution. It must satisfy the following properties: 1. The total area under the curve must equal 1. 2. Every point on the curve
More informationData Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining
Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan, Steinbach, Kumar Tan,Steinbach, Kumar Introduction to Data Mining 8/05/2005 1 What is data exploration? A preliminary
More informationWeek 1. Exploratory Data Analysis
Week 1 Exploratory Data Analysis Practicalities This course ST903 has students from both the MSc in Financial Mathematics and the MSc in Statistics. Two lectures and one seminar/tutorial per week. Exam
More informationPart II Chapter 9 Chapter 10 Chapter 11 Chapter 12 Chapter 13 Chapter 14 Chapter 15 Part II
Part II covers diagnostic evaluations of historical facility data for checking key assumptions implicit in the recommended statistical tests and for making appropriate adjustments to the data (e.g., consideration
More informationPractice#1(chapter1,2) Name
Practice#1(chapter1,2) Name Solve the problem. 1) The average age of the students in a statistics class is 22 years. Does this statement describe descriptive or inferential statistics? A) inferential statistics
More informationIntro to Statistics 8 Curriculum
Intro to Statistics 8 Curriculum Unit 1 Bar, Line and Circle Graphs Estimated time frame for unit Big Ideas 8 Days... Essential Question Concepts Competencies Lesson Plans and Suggested Resources Bar graphs
More informationIntroduction to Statistics for Psychology. Quantitative Methods for Human Sciences
Introduction to Statistics for Psychology and Quantitative Methods for Human Sciences Jonathan Marchini Course Information There is website devoted to the course at http://www.stats.ox.ac.uk/ marchini/phs.html
More informationDongfeng Li. Autumn 2010
Autumn 2010 Chapter Contents Some statistics background; ; Comparing means and proportions; variance. Students should master the basic concepts, descriptive statistics measures and graphs, basic hypothesis
More information1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number
1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x  x) B. x 3 x C. 3x  x D. x  3x 2) Write the following as an algebraic expression
More informationThe Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175)
Describing Data: Categorical and Quantitative Variables Population The Big Picture Sampling Statistical Inference Sample Exploratory Data Analysis Descriptive Statistics In order to make sense of data,
More informationFairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
More informationInterpreting Data in Normal Distributions
Interpreting Data in Normal Distributions This curve is kind of a big deal. It shows the distribution of a set of test scores, the results of rolling a die a million times, the heights of people on Earth,
More informationCOMMON CORE STATE STANDARDS FOR
COMMON CORE STATE STANDARDS FOR Mathematics (CCSSM) High School Statistics and Probability Mathematics High School Statistics and Probability Decisions or predictions are often based on data numbers in
More informationStatistics. Measurement. Scales of Measurement 7/18/2012
Statistics Measurement Measurement is defined as a set of rules for assigning numbers to represent objects, traits, attributes, or behaviors A variableis something that varies (eye color), a constant does
More informationCOM CO P 5318 Da t Da a t Explora Explor t a ion and Analysis y Chapte Chapt r e 3
COMP 5318 Data Exploration and Analysis Chapter 3 What is data exploration? A preliminary exploration of the data to better understand its characteristics. Key motivations of data exploration include Helping
More informationAlgebra I Vocabulary Cards
Algebra I Vocabulary Cards Table of Contents Expressions and Operations Natural Numbers Whole Numbers Integers Rational Numbers Irrational Numbers Real Numbers Absolute Value Order of Operations Expression
More informationII. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
More informationGeoDa 0.9 User s Guide
GeoDa 0.9 User s Guide Luc Anselin Spatial Analysis Laboratory Department of Agricultural and Consumer Economics University of Illinois, UrbanaChampaign Urbana, IL 61801 http://sal.agecon.uiuc.edu/ and
More informationDescriptive Statistics and Exploratory Data Analysis
Descriptive Statistics and Exploratory Data Analysis Dean s s Faculty and Resident Development Series UT College of Medicine Chattanooga Probasco Auditorium at Erlanger January 14, 2008 Marc Loizeaux,
More informationNormality Testing in Excel
Normality Testing in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com
More informationExploratory Data Analysis. Psychology 3256
Exploratory Data Analysis Psychology 3256 1 Introduction If you are going to find out anything about a data set you must first understand the data Basically getting a feel for you numbers Easier to find
More informationData Visualization in R
Data Visualization in R L. Torgo ltorgo@fc.up.pt Faculdade de Ciências / LIAADINESC TEC, LA Universidade do Porto Oct, 2014 Introduction Motivation for Data Visualization Humans are outstanding at detecting
More informationMEASURES OF VARIATION
NORMAL DISTRIBTIONS MEASURES OF VARIATION In statistics, it is important to measure the spread of data. A simple way to measure spread is to find the range. But statisticians want to know if the data are
More informationCenter: Finding the Median. Median. Spread: Home on the Range. Center: Finding the Median (cont.)
Center: Finding the Median When we think of a typical value, we usually look for the center of the distribution. For a unimodal, symmetric distribution, it s easy to find the center it s just the center
More informationAP STATISTICS REVIEW (YMS Chapters 18)
AP STATISTICS REVIEW (YMS Chapters 18) Exploring Data (Chapter 1) Categorical Data nominal scale, names e.g. male/female or eye color or breeds of dogs Quantitative Data rational scale (can +,,, with
More informationSPSS Manual for Introductory Applied Statistics: A Variable Approach
SPSS Manual for Introductory Applied Statistics: A Variable Approach John Gabrosek Department of Statistics Grand Valley State University Allendale, MI USA August 2013 2 Copyright 2013 John Gabrosek. All
More informationdetermining relationships among the explanatory variables, and
Chapter 4 Exploratory Data Analysis A first look at the data. As mentioned in Chapter 1, exploratory data analysis or EDA is a critical first step in analyzing the data from an experiment. Here are the
More information430 Statistics and Financial Mathematics for Business
Prescription: 430 Statistics and Financial Mathematics for Business Elective prescription Level 4 Credit 20 Version 2 Aim Students will be able to summarise, analyse, interpret and present data, make predictions
More informationGamma Distribution Fitting
Chapter 552 Gamma Distribution Fitting Introduction This module fits the gamma probability distributions to a complete or censored set of individual or grouped data values. It outputs various statistics
More informationDescriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion
Descriptive Statistics Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Statistics as a Tool for LIS Research Importance of statistics in research
More informationOrganizing Your Approach to a Data Analysis
Biost/Stat 578 B: Data Analysis Emerson, September 29, 2003 Handout #1 Organizing Your Approach to a Data Analysis The general theme should be to maximize thinking about the data analysis and to minimize
More informationEXAM #1 (Example) Instructor: Ela Jackiewicz. Relax and good luck!
STP 231 EXAM #1 (Example) Instructor: Ela Jackiewicz Honor Statement: I have neither given nor received information regarding this exam, and I will not do so until all exams have been graded and returned.
More information