Module 2: Introduction to Quantitative Data Analysis
|
|
|
- Daisy Marilyn Newton
- 1 years ago
- Views:
Transcription
1 Module 2: Introduction to Quantitative Data Analysis Contents Antony Fielding 1 University of Birmingham & Centre for Multilevel Modelling Rebecca Pillinger Centre for Multilevel Modelling Introduction C2.1.1 Frequency distributions... 3 C2.1.2 Summary statistics of key features of distributions...10 C2.2 Comparisons and relationships: the role of variability C2.2.1 Comparing subgroups as a form of relationship...15 C2.2.2 Relationships and their direction...16 C2.2.3 The role of variability...18 C2.2.4 Homoscedasticity and heteroscedasticity...20 C2.3 Some other examples of relationships and the role of explained variability Introduction Some of the sections within this module have online quizzes for you to test your understanding. To find the quizzes: EXAMPLE From within the LEMMA learning environment Go down to the Lesson for Module 2: Introduction to Quantitative Data Analysis Click"2.1 Univariate Data Summary" to open Lesson 2.1 Click to open the first question Q1(i) Introduction The aim of this module is to give an overall view of some the principles of effective data analysis. The focus is on how we summarise data to uncover patterns and relationships between variables, and how these relationships can begin to explain the values of the variables that we observe. Some key statistical vocabulary is introduced and concepts are illustrated by example. You will learn about the following: C2.3.1 Extending to two categorical explanators for a continuous response: the possibility of interactions...21 C2.3.2 Variability of categorical responses...26 C2.3.3 Where both response and explanatory variables are continuous...28 C2.3.4 Patterns other than straight lines...31 C2.3.5 Relationship between a continuous response and combinations of categorical and continuous explanators: Progress of students in schools...33 C2.4 Working towards the idea of a formal statistical model C2.5 Comments on statistical inference; uncertainty, estimation and hypothesis testing 44 C2.5.1 Estimation...44 C2.5.2 Confidence Intervals...47 C2.5.3 Testing hypotheses...47 Ways of summarising the shape and pattern of the values of a single variable, at different levels of measurement. Understanding the role of variability in comparative analysis and the study of relationships. The elaboration of relationships, the importance of control variables, and the concepts of confounding, suppression and interaction. The essential parts of a statistical model: pattern and residual variation. Key concepts in inference from samples. 1 With contributed material from Kelvyn Jones and Fiona Steele and extensive comment by Harvey Goldstein. Centre for Multilevel Modelling, Centre for Multilevel Modelling,
2 Before we decide on appropriate methods of analysis to address our research question we will usually want to get to know our data. At the very minimum we will want to establish that the variables we plan to use do indeed vary. We will also want to establish an initial picture of the nature of that variation, since it is the study of such variation that will form the basis of further analysis. Thus the first step in any quantitative investigation is to separately summarise each particular variable in various informative ways. Since these initial summaries deal with only one variable they will be called univariate analyses. For instance in our research we might be contemplating using the variable Years of Education, from the European Social Survey data introduced in Module 1. It will be initially useful to know something about the distribution of respondents over the values of this variable. What is the general level of education? How widely dispersed are the values? Is there a concentration of a large number of units (individuals) at particular points? Are there units with extreme values in our data set? Are there curious or inexplicable values which may lead us to suspect errors? In the case of nominal variables, such as Marital Status or Ethnic Group, or of ordinal variables, such as Income measured by twelve ranges, how many units fall into each category? The answers to these questions will affect the analyses we perform. We should also re-emphasise the major point discussed in Module 1 that the level of measurement of a variable may restrict what summaries are appropriate. C2.1.1 Frequency distributions Frequency distributions are perhaps the most commonly used initial summaries. They count how many in the set of units under consideration have different values, or groups of values, of the variable. Frequency distributions can usually be employed whatever the level of measurement. However, there are a number of conventions which influence different ways of grouping values. C Categorical data Table 2.1 and Table 2.2 represent typical summaries for the categories of a nominal variable, Marital Status, and an ordered variable, Education Level (defined as the highest level of education attained). Table 2.1. Frequency distribution of Marital Status Number % of all cases % of valid cases Married Separated Divorced Widowed Never married Refusal Don't know No answer Total Table 2.2 Frequency distribution of Education Level Number % of all cases % of valid cases Cumulative % of valid cases Not completed 1, primary education Primary or first 5, stage of basic Lower secondary 9, or second stage of basic Upper secondary 13, Post secondary, 3, non-tertiary First stage of 5, tertiary Second stage of 2, tertiary Refusal Don't know No answer Total Centre for Multilevel Modelling, Centre for Multilevel Modelling,
3 These tables show the number of respondents in a subsample from the European Social Survey (ESS) data that fall into each of the categories of the variable. 2 There are also a number of categories relating to different sorts of missing data. Handling such missing data is a complex problem beyond the scope of this module, but it is important to recognise their existence. When they are removed we have what are usually termed valid cases for further analysis. For the rest of this section we will focus only on summarising valid cases. The tables also show percentage distributions. Percentages are usually easier to interpret because of their familiar feel, and because they show us the relative frequency of the values of the variable in the data. This enables us to better judge the relative importance of the values. In Table 2.1 we can see, for instance, that over half of our subsample is married. In Table 2.2 we can see that there are very few cases in the first and last valid categories. Since the ordering of the categories has meaning here, we note that the extremes of Education Level contain relatively small numbers. The most frequently occurring categories, married in the first case and upper secondary in the second, are called the modes of the distributions. There is an extra column called the cumulative frequency in Table 2.2 that is appropriate for ordinal (or grouped ratio or interval level) variables, and which gives the percentage of units with a value of the variable less than or equal to the value of the row in question. So, for example, 72.8% achieve secondary education or lower or, to look at it another way, 27.2% go beyond secondary education. A cumulative summary in the opposite direction would produce figures like the latter 3, showing the percentage of units with a value of the variable greater than or equal to the value of the row in question. It is often easier to pick out key features of the data by visualising them. Figure 2.1 is a pie-chart for the distribution of marital status where the angles (and thus the sizes) of the slices of the pie are in proportion to the relative frequencies in the last column of Table 2.1. The pie chart is perhaps the most common way of visualising distributions for categorical data but bar charts are sometimes also used. Figure 2.1: Pie chart of distribution of Marital Status Married Separated Divorced Widowed Never married Refusal Don't know No answer C Continuous data There are a few new issues connected with frequency distributions for continuous variables since to display patterns we must often group values in a meaningful way. In the ESS there is a variable defined by answers to the question How long have you lived in this area?, which we will call Years of Residence. Data are recorded to the nearest whole year and the set of possible values are integers between zero and ninety eight, so only a small percentage of units will take each particular value. A full frequency distribution with ninety nine groups would be too detailed, patterns would be difficult to discern and the table would not be very informative. To avoid these problems the values must be grouped into classes or intervals. In determining the appropriate number of intervals and their widths we need to strike a balance between too many intervals (which leads to groups with relatively few cases in each) or too few intervals (which results in too great a loss of information). There are no hard and fast rules for this. One possible set of intervals for Years of Residence is shown in Table 2.3. Here the intervals are of an equal width of ten years, and are perhaps easier to handle when this is the case, but as will be seen in another example later in this section there is no necessity that this should be so. 2 This is the subsample of cases made available on the ESS website for on-line analysis at 3 but shifted down a row. Centre for Multilevel Modelling, Centre for Multilevel Modelling,
4 Table 2.3 Grouped frequency distribution of approximate Years of Residence in area at interview (valid cases only) Whole year range Number of valid cases Percentage Cumulative Percentage Total variables graphically. The most common are histograms although other devices such as stem and leaf plots (for small datasets) or box and whisker plots (also called boxplots) may be employed. Figure 2.2 depicts a histogram for the distribution of Years of Residence in Table 2.3. Each rectangle here corresponds to one of the intervals, with the horizontal width of the rectangle proportional to the width of that interval. The areas of the rectangles are proportional to the numbers (or percentages) in each range. It is important to note that the construction of a histogram is governed by the areas of the rectangles, not their heights. Only if the intervals are of equal width, as they are here, are the two equivalent. The areas give a proper visual interpretation of the relative sizes of these ranges. In the diagram an approximate smooth curve has also been drawn to connect the tops of the rectangles. This is called a frequency curve and gives us an impression of the shape of the distribution. Here we have an example of an asymmetric distribution with a long tail, called a skewed distribution. Almost 50% of lengths of residence are under 20 years and after that point the number of units in each range decreases gradually as the length of residence increases. 4 The long tail is therefore to the right and so this is an example of positive skew. If the long tail had been to the left, i.e. towards smaller values along the continuous range, we would refer to the distribution as negatively skewed. Sometimes the raw data provided to the analyst is already grouped into intervals of values, so that there is less flexibility and the only decision then might be whether to summarise further by aggregating intervals. One such example is the Income variable, recorded from questionnaires in twelve ranges of values and coded 1-12 in our example dataset. The frequency distribution of valid cases is displayed in Table 2.4. The widths of the ranges here are not equal but are wider for ranges at the higher end of the income scale. Figure 2.2 Histogram of approximate Years of Residence in area at interview (valid cases only) There are also useful ways of presenting such frequency distributions of continuous 4 More detailed examination of the ungrouped data would reveal that the cases in the first interval are almost uniformly spread over each of the separate single values in that group. Centre for Multilevel Modelling, Centre for Multilevel Modelling,
5 This document is only the first few pages of the full version. To see the complete document please go to learning materials and register: The course is completely free. We ask for a few details about yourself for our research purposes only. We will not give any details to any other organisation unless it is with your express permission.
Summarizing and Displaying Categorical Data
Summarizing and Displaying Categorical Data Categorical data can be summarized in a frequency distribution which counts the number of cases, or frequency, that fall into each category, or a relative frequency
Diagrams and Graphs of Statistical Data
Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in
Exploratory data analysis (Chapter 2) Fall 2011
Exploratory data analysis (Chapter 2) Fall 2011 Data Examples Example 1: Survey Data 1 Data collected from a Stat 371 class in Fall 2005 2 They answered questions about their: gender, major, year in school,
Graphical and Tabular. Summarization of Data OPRE 6301
Graphical and Tabular Summarization of Data OPRE 6301 Introduction and Re-cap... Descriptive statistics involves arranging, summarizing, and presenting a set of data in such a way that useful information
DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS
DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi - 110 012 seema@iasri.res.in 1. Descriptive Statistics Statistics
Chapter 2 - Graphical Summaries of Data
Chapter 2 - Graphical Summaries of Data Data recorded in the sequence in which they are collected and before they are processed or ranked are called raw data. Raw data is often difficult to make sense
Data exploration with Microsoft Excel: univariate analysis
Data exploration with Microsoft Excel: univariate analysis Contents 1 Introduction... 1 2 Exploring a variable s frequency distribution... 2 3 Calculating measures of central tendency... 16 4 Calculating
Variables. Exploratory Data Analysis
Exploratory Data Analysis Exploratory Data Analysis involves both graphical displays of data and numerical summaries of data. A common situation is for a data set to be represented as a matrix. There is
Ungrouped data. A list of all the values of a variable in a data set is referred to as ungrouped data.
1 Social Studies 201 September 21, 2006 Presenting data See text, chapter 4, pp. 87-160. Data sets When data are initially obtained from questionnaires, interviews, experiments, administrative sources,
Visualizing Data. Contents. 1 Visualizing Data. Anthony Tanbakuchi Department of Mathematics Pima Community College. Introductory Statistics Lectures
Introductory Statistics Lectures Visualizing Data Descriptive Statistics I Department of Mathematics Pima Community College Redistribution of this material is prohibited without written permission of the
Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.
Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing
Foundation of Quantitative Data Analysis
Foundation of Quantitative Data Analysis Part 1: Data manipulation and descriptive statistics with SPSS/Excel HSRS #10 - October 17, 2013 Reference : A. Aczel, Complete Business Statistics. Chapters 1
Graphical methods for presenting data
Chapter 2 Graphical methods for presenting data 2.1 Introduction We have looked at ways of collecting data and then collating them into tables. Frequency tables are useful methods of presenting data; they
There are some general common sense recommendations to follow when presenting
Presentation of Data The presentation of data in the form of tables, graphs and charts is an important part of the process of data analysis and report writing. Although results can be expressed within
Exercise 1.12 (Pg. 22-23)
Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.
Using SPSS, Chapter 2: Descriptive Statistics
1 Using SPSS, Chapter 2: Descriptive Statistics Chapters 2.1 & 2.2 Descriptive Statistics 2 Mean, Standard Deviation, Variance, Range, Minimum, Maximum 2 Mean, Median, Mode, Standard Deviation, Variance,
Analyzing Experimental Data
Analyzing Experimental Data The information in this chapter is a short summary of some topics that are covered in depth in the book Students and Research written by Cothron, Giese, and Rezba. See the end
STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI
STATS8: Introduction to Biostatistics Data Exploration Babak Shahbaba Department of Statistics, UCI Introduction After clearly defining the scientific problem, selecting a set of representative members
Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics
Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This
Sta 309 (Statistics And Probability for Engineers)
Instructor: Prof. Mike Nasab Sta 309 (Statistics And Probability for Engineers) Chapter 2 Organizing and Summarizing Data Raw Data: When data are collected in original form, they are called raw data. The
Presenting numerical data
Student Learning Development Presenting numerical data This guide offers practical advice on how to incorporate numerical information into essays, reports, dissertations, posters and presentations. The
MTH 140 Statistics Videos
MTH 140 Statistics Videos Chapter 1 Picturing Distributions with Graphs Individuals and Variables Categorical Variables: Pie Charts and Bar Graphs Categorical Variables: Pie Charts and Bar Graphs Quantitative
Describing, Exploring, and Comparing Data
24 Chapter 2. Describing, Exploring, and Comparing Data Chapter 2. Describing, Exploring, and Comparing Data There are many tools used in Statistics to visualize, summarize, and describe data. This chapter
Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics
Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),
Common Tools for Displaying and Communicating Data for Process Improvement
Common Tools for Displaying and Communicating Data for Process Improvement Packet includes: Tool Use Page # Box and Whisker Plot Check Sheet Control Chart Histogram Pareto Diagram Run Chart Scatter Plot
AMS 7L LAB #2 Spring, 2009. Exploratory Data Analysis
AMS 7L LAB #2 Spring, 2009 Exploratory Data Analysis Name: Lab Section: Instructions: The TAs/lab assistants are available to help you if you have any questions about this lab exercise. If you have any
BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I
BNG 202 Biomechanics Lab Descriptive statistics and probability distributions I Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential
Module 1: Using Quantitative Data in Research: Concepts and Definitions
Module 1: Using Quantitative Data in Research: Concepts and Definitions Antony Fielding 1 University of Birmingham & Centre for Multilevel Modelling Contents Rebecca Pillinger Centre for Multilevel Modelling
Northumberland Knowledge
Northumberland Knowledge Know Guide How to Analyse Data - November 2012 - This page has been left blank 2 About this guide The Know Guides are a suite of documents that provide useful information about
MBA 611 STATISTICS AND QUANTITATIVE METHODS
MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain
Demographics of Atlanta, Georgia:
Demographics of Atlanta, Georgia: A Visual Analysis of the 2000 and 2010 Census Data 36-315 Final Project Rachel Cohen, Kathryn McKeough, Minnar Xie & David Zimmerman Ethnicities of Atlanta Figure 1: From
Mathematics. Probability and Statistics Curriculum Guide. Revised 2010
Mathematics Probability and Statistics Curriculum Guide Revised 2010 This page is intentionally left blank. Introduction The Mathematics Curriculum Guide serves as a guide for teachers when planning instruction
Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs
Types of Variables Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Quantitative (numerical)variables: take numerical values for which arithmetic operations make sense (addition/averaging)
Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013
Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives
Consultation paper Presenting data in graphs, charts and tables science subjects
Consultation paper Presenting data in graphs, charts and tables science subjects This edition: November 2015 (version 1.0) Scottish Qualifications Authority 2015 Contents Presenting data in graphs, charts
The Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175)
Describing Data: Categorical and Quantitative Variables Population The Big Picture Sampling Statistical Inference Sample Exploratory Data Analysis Descriptive Statistics In order to make sense of data,
Data Exploration Data Visualization
Data Exploration Data Visualization What is data exploration? A preliminary exploration of the data to better understand its characteristics. Key motivations of data exploration include Helping to select
SPSS for Exploratory Data Analysis Data used in this guide: studentp.sav (http://people.ysu.edu/~gchang/stat/studentp.sav)
Data used in this guide: studentp.sav (http://people.ysu.edu/~gchang/stat/studentp.sav) Organize and Display One Quantitative Variable (Descriptive Statistics, Boxplot & Histogram) 1. Move the mouse pointer
Lecture 2: Descriptive Statistics and Exploratory Data Analysis
Lecture 2: Descriptive Statistics and Exploratory Data Analysis Further Thoughts on Experimental Design 16 Individuals (8 each from two populations) with replicates Pop 1 Pop 2 Randomly sample 4 individuals
Get The Picture: Visualizing Financial Data part 1
Get The Picture: Visualizing Financial Data part 1 by Jeremy Walton Turning numbers into pictures is usually the easiest way of finding out what they mean. We're all familiar with the display of for example
Chapter 2: Frequency Distributions and Graphs
Chapter 2: Frequency Distributions and Graphs Learning Objectives Upon completion of Chapter 2, you will be able to: Organize the data into a table or chart (called a frequency distribution) Construct
Excel Charts & Graphs
MAX 201 Spring 2008 Assignment #6: Charts & Graphs; Modifying Data Due at the beginning of class on March 18 th Introduction This assignment introduces the charting and graphing capabilities of SPSS and
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in
Graphing Data Presentation of Data in Visual Forms
Graphing Data Presentation of Data in Visual Forms Purpose of Graphing Data Audience Appeal Provides a visually appealing and succinct representation of data and summary statistics Provides a visually
HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS
Mathematics Revision Guides Histograms, Cumulative Frequency and Box Plots Page 1 of 25 M.K. HOME TUITION Mathematics Revision Guides Level: GCSE Higher Tier HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS
Appendix G STATISTICAL METHODS INFECTIOUS METHODS STATISTICAL ROADMAP. Prepared in Support of: CDC/NCEH Cross Sectional Assessment Study.
Appendix G STATISTICAL METHODS INFECTIOUS METHODS STATISTICAL ROADMAP Prepared in Support of: CDC/NCEH Cross Sectional Assessment Study Prepared by: Centers for Disease Control and Prevention National
Numerical Summarization of Data OPRE 6301
Numerical Summarization of Data OPRE 6301 Motivation... In the previous session, we used graphical techniques to describe data. For example: While this histogram provides useful insight, other interesting
Biostatistics: A QUICK GUIDE TO THE USE AND CHOICE OF GRAPHS AND CHARTS
Biostatistics: A QUICK GUIDE TO THE USE AND CHOICE OF GRAPHS AND CHARTS 1. Introduction, and choosing a graph or chart Graphs and charts provide a powerful way of summarising data and presenting them in
Data Visualization Techniques
Data Visualization Techniques From Basics to Big Data with SAS Visual Analytics WHITE PAPER SAS White Paper Table of Contents Introduction.... 1 Generating the Best Visualizations for Your Data... 2 The
Knowing and Using Number Facts
Knowing and Using Number Facts Use knowledge of place value and Use knowledge of place value and addition and subtraction of two-digit multiplication facts to 10 10 to numbers to derive sums and derive
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts
Basic Tools for Process Improvement
What is a Histogram? A Histogram is a vertical bar chart that depicts the distribution of a set of data. Unlike Run Charts or Control Charts, which are discussed in other modules, a Histogram does not
Descriptive Analysis
Research Methods William G. Zikmund Basic Data Analysis: Descriptive Statistics Descriptive Analysis The transformation of raw data into a form that will make them easy to understand and interpret; rearranging,
Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller
Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller Getting to know the data An important first step before performing any kind of statistical analysis is to familiarize
2. Simple Linear Regression
Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according
Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences
Introduction to Statistics for Psychology and Quantitative Methods for Human Sciences Jonathan Marchini Course Information There is website devoted to the course at http://www.stats.ox.ac.uk/ marchini/phs.html
2: Frequency Distributions
2: Frequency Distributions Stem-and-Leaf Plots (Stemplots) The stem-and-leaf plot (stemplot) is an excellent way to begin an analysis. Consider this small data set: 218 426 53 116 309 504 281 270 246 523
Data Visualization Techniques
Data Visualization Techniques From Basics to Big Data with SAS Visual Analytics WHITE PAPER SAS White Paper Table of Contents Introduction.... 1 Generating the Best Visualizations for Your Data... 2 The
All Visualizations Documentation
All Visualizations Documentation All Visualizations Documentation 2 Copyright and Trademarks Licensed Materials - Property of IBM. Copyright IBM Corp. 2013 IBM, the IBM logo, and Cognos are trademarks
2013 MBA Jump Start Program. Statistics Module Part 3
2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just
Lean Six Sigma Training/Certification Book: Volume 1
Lean Six Sigma Training/Certification Book: Volume 1 Six Sigma Quality: Concepts & Cases Volume I (Statistical Tools in Six Sigma DMAIC process with MINITAB Applications Chapter 1 Introduction to Six Sigma,
Descriptive Statistics
Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web
Lecture 1: Review and Exploratory Data Analysis (EDA)
Lecture 1: Review and Exploratory Data Analysis (EDA) Sandy Eckel seckel@jhsph.edu Department of Biostatistics, The Johns Hopkins University, Baltimore USA 21 April 2008 1 / 40 Course Information I Course
Exploratory Data Analysis. Psychology 3256
Exploratory Data Analysis Psychology 3256 1 Introduction If you are going to find out anything about a data set you must first understand the data Basically getting a feel for you numbers Easier to find
A Picture Really Is Worth a Thousand Words
4 A Picture Really Is Worth a Thousand Words Difficulty Scale (pretty easy, but not a cinch) What you ll learn about in this chapter Why a picture is really worth a thousand words How to create a histogram
Data Analysis Tools. Tools for Summarizing Data
Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool
Intro to Statistics 8 Curriculum
Intro to Statistics 8 Curriculum Unit 1 Bar, Line and Circle Graphs Estimated time frame for unit Big Ideas 8 Days... Essential Question Concepts Competencies Lesson Plans and Suggested Resources Bar graphs
1 Lesson 3: Presenting Data Graphically
1 Lesson 3: Presenting Data Graphically 1.1 Types of graphs Once data is organized and arranged, it can be presented. Graphic representations of data are called graphs, plots or charts. There are an untold
business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar
business statistics using Excel Glyn Davis & Branko Pecar OXFORD UNIVERSITY PRESS Detailed contents Introduction to Microsoft Excel 2003 Overview Learning Objectives 1.1 Introduction to Microsoft Excel
IBM SPSS Direct Marketing 23
IBM SPSS Direct Marketing 23 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 23, release
1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number
1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression
Module 4: Data Exploration
Module 4: Data Exploration Now that you have your data downloaded from the Streams Project database, the detective work can begin! Before computing any advanced statistics, we will first use descriptive
Vertical Alignment Colorado Academic Standards 6 th - 7 th - 8 th
Vertical Alignment Colorado Academic Standards 6 th - 7 th - 8 th Standard 3: Data Analysis, Statistics, and Probability 6 th Prepared Graduates: 1. Solve problems and make decisions that depend on un
DATA INTERPRETATION AND STATISTICS
PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE
Statistics Chapter 2
Statistics Chapter 2 Frequency Tables A frequency table organizes quantitative data. partitions data into classes (intervals). shows how many data values are in each class. Test Score Number of Students
Getting started manual
Getting started manual XLSTAT Getting started manual Addinsoft 1 Table of Contents Install XLSTAT and register a license key... 4 Install XLSTAT on Windows... 4 Verify that your Microsoft Excel is up-to-date...
Fairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools
Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools Occam s razor.......................................................... 2 A look at data I.........................................................
Exploratory Data Analysis
Exploratory Data Analysis Learning Objectives: 1. After completion of this module, the student will be able to explore data graphically in Excel using histogram boxplot bar chart scatter plot 2. After
T O P I C 1 2 Techniques and tools for data analysis Preview Introduction In chapter 3 of Statistics In A Day different combinations of numbers and types of variables are presented. We go through these
Statistical Analysis of. Manual Therapists Funded by ACC:
Statistical Analysis of Manual Therapists Funded by ACC: Market Analysis by Provider and Injury Type with Trend Analysis Prepared for Osteopathic Council of NZ by: Dr Carl Scarrott FINAL REPORT statconsultancy.com
LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE
LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE MAT 119 STATISTICS AND ELEMENTARY ALGEBRA 5 Lecture Hours, 2 Lab Hours, 3 Credits Pre-
SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES
SCHOOL OF HEALTH AND HUMAN SCIENCES Using SPSS Topics addressed today: 1. Differences between groups 2. Graphing Use the s4data.sav file for the first part of this session. DON T FORGET TO RECODE YOUR
Probability and Statistics Vocabulary List (Definitions for Middle School Teachers)
Probability and Statistics Vocabulary List (Definitions for Middle School Teachers) B Bar graph a diagram representing the frequency distribution for nominal or discrete data. It consists of a sequence
Exploratory Spatial Data Analysis
Exploratory Spatial Data Analysis Part II Dynamically Linked Views 1 Contents Introduction: why to use non-cartographic data displays Display linking by object highlighting Dynamic Query Object classification
Exploratory Data Analysis
Exploratory Data Analysis Johannes Schauer johannes.schauer@tugraz.at Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Introduction
Descriptive Statistics and Exploratory Data Analysis
Descriptive Statistics and Exploratory Data Analysis Dean s s Faculty and Resident Development Series UT College of Medicine Chattanooga Probasco Auditorium at Erlanger January 14, 2008 Marc Loizeaux,
STATGRAPHICS Online. Statistical Analysis and Data Visualization System. Revised 6/21/2012. Copyright 2012 by StatPoint Technologies, Inc.
STATGRAPHICS Online Statistical Analysis and Data Visualization System Revised 6/21/2012 Copyright 2012 by StatPoint Technologies, Inc. All rights reserved. Table of Contents Introduction... 1 Chapter
Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple.
Graphical Representations of Data, Mean, Median and Standard Deviation In this class we will consider graphical representations of the distribution of a set of data. The goal is to identify the range of
What Does the Normal Distribution Sound Like?
What Does the Normal Distribution Sound Like? Ananda Jayawardhana Pittsburg State University ananda@pittstate.edu Published: June 2013 Overview of Lesson In this activity, students conduct an investigation
THE BINOMIAL DISTRIBUTION & PROBABILITY
REVISION SHEET STATISTICS 1 (MEI) THE BINOMIAL DISTRIBUTION & PROBABILITY The main ideas in this chapter are Probabilities based on selecting or arranging objects Probabilities based on the binomial distribution
STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance
Principles of Statistics STA-201-TE This TECEP is an introduction to descriptive and inferential statistics. Topics include: measures of central tendency, variability, correlation, regression, hypothesis
Part 2: Data Visualization How to communicate complex ideas with simple, efficient and accurate data graphics
Part 2: Data Visualization How to communicate complex ideas with simple, efficient and accurate data graphics Why visualize data? The human eye is extremely sensitive to differences in: Pattern Colors
What is Statistics? Statistics is about Collecting data Organizing data Analyzing data Presenting data
Introduction What is Statistics? Statistics is about Collecting data Organizing data Analyzing data Presenting data What is Statistics? Statistics is divided into two areas: descriptive statistics and
STAB22 section 1.1. total = 88(200/100) + 85(200/100) + 77(300/100) + 90(200/100) + 80(100/100) = 176 + 170 + 231 + 180 + 80 = 837,
STAB22 section 1.1 1.1 Find the student with ID 104, who is in row 5. For this student, Exam1 is 95, Exam2 is 98, and Final is 96, reading along the row. 1.2 This one involves a careful reading of the
Plotting Data with Microsoft Excel
Plotting Data with Microsoft Excel Here is an example of an attempt to plot parametric data in a scientifically meaningful way, using Microsoft Excel. This example describes an experience using the Office
Using Excel for descriptive statistics
FACT SHEET Using Excel for descriptive statistics Introduction Biologists no longer routinely plot graphs by hand or rely on calculators to carry out difficult and tedious statistical calculations. These
Data exploration with Microsoft Excel: analysing more than one variable
Data exploration with Microsoft Excel: analysing more than one variable Contents 1 Introduction... 1 2 Comparing different groups or different variables... 2 3 Exploring the association between categorical
T O P I C 1 1 Introduction to statistics Preview Introduction In previous topics we have looked at ways of gathering data for research purposes and ways of organising and presenting it. In topics 11 and
An introduction to using Microsoft Excel for quantitative data analysis
Contents An introduction to using Microsoft Excel for quantitative data analysis 1 Introduction... 1 2 Why use Excel?... 2 3 Quantitative data analysis tools in Excel... 3 4 Entering your data... 6 5 Preparing
