Descriptive Statistics


 Austin Turner
 2 years ago
 Views:
Transcription
1 Descriptive Statistics Suppose following data have been collected (heights of 99 fiveyearold boys) Initial impression is of indigestibility 1
2 Put data into ascending order Sorted data Much more can now be seen but hardly succinct: need simple summaries 2
3 all values are between 96.1 cm and cm middle value is 18.7 cm  indication of location values quarter of way up and down data are 15.6 and cm  indication of spread these comprise the five number summary for these data Middle value is known as the Median Also upper and lower quartiles Definitions need some care 3
4 Median is middle value. Definition of Median For five values there is a definite middle one But not for four values? 4
5 Definition of Median Median is the middle value of a sample when the sample size is odd Median is the average of the two middle values when sample size is even Definitions for quartiles are similar and given more precisely in appendix 1 of handout (see web page) 5
6 Graphical displays of the data Display the five figure summary  Box and Whisker plots 12 Top of box is upper quartile Height (cm) 11 Line across box is at level of median 1 Bottom of box is lower quartile Whiskers can extend to maximum and minimum but 6
7 Alternative Box and Whisker plots 12 Whiskers extend to only a given multiple of the box height Height (cm) 11 This is intended to allows outliers to be seen 1 7
8 Histograms A useful way to display all the data in a sample: Divide range into bins and plot numbers/fractions in bins Choose bin width judiciously Proportion.4 Proportion Height (cm) Height (cm) Proportion.4 Proportion Height (cm) (2, 6 15 and 5 bins: clockwise) Height (cm) 8
9 Histograms for increasing sample size Samples of sizes 99, 3, 1, 1 99 heights 3 heights Proportion.1 Proportion Height (cm) 1 heights Height (cm) 1 heights Proportion.4 Proportion Height (cm) Height (cm) Form gets more regular as sample size increases  tends to a limiting form  notion of a population and inferential statistics 9
10 Populations, Parameters and Inference Sample rarely of interest in its own right Value lies in its representativeness of a greater whole, about which we wish to learn Inferential Statistics seeks to draw conclusions about a population from a sample Sample must be representative of population  random selection Population often defined in terms of unknown parameters µ, σ Estimated by sample analogues or statistics, such as m or s 1
11 . 9. Describing Populations Distributional form: we consider only the Normal distribution: symmetric bellshaped curve Associated parameters: measure of location µ (mean) measure of spread σ (SD) σ µ Height (cm) (e.g.) 11
12 Sample Estimates of µ and σ Estimate µ by sample mean (arithmetic mean  simple average ) m x 1 + = x x n where x 1, x 2,,x n are the values in the sample (n.b. size n) Estimate of σ is sample standard deviation (SD): n s = ( x1 m) + ( x2 m) + + ( xn m) n 1 For sample of 99 boys m = cm s = 5.21 cm (note units) {cf median 18.7 cm quartiles, 15.6 cm, cm IQR = 5.5 cm} 12
13 little need these days to work directly with these formulae indeed, using a computer is preferable as well as laboursaving appendix 2 describes some peculiarities of SD formula for those interested mean and SD appropriate summaries for a Normal distribution what about nonnormal data? 13
14 Skewed data Medians and quartiles  always legitimate tools means and SDs  less easy to give general answers.4.3 Fraction Cumulative dose of phototherapy, J/cm2 NonNormal data, especially skewed unduly mean influenced by large values in sample 14
15 Quantitative use of the Normal curve Interpretable feature of curve is area under curve.9 Shaded area = Probability a child's height < X (= 112 cm here) Height (cm) Area under whole curve is 1: area under curve up to X is P x Probability randomly chosen member of population has value < X 15
16 Cumulative probabilities Given X, and µ, σ the value of P can be found but you need a computer P is the cumulative probability at X In example µ = 18 cm σ = 4.7 cm and X = 112 cm Gives P =.826 I.e. about 8% of five year old boy have height < 112 cm 16
17 Some obvious results: Other probabilities probability value is larger than X is 1P I.e or about 2% of boys have height above 112cm Symmetry means that it follows that proportion.1974 of boys have height below 14 cm (14 cm is 4 cm below mean, and 112 cm was 4 cm above mean) Allows virtually all other probabilities to be found 17
18 Q = probability height < 11cm = mean  7 cm =.68 Q = probability height > 115cm = mean + 7 cm = Height (cm) µ = 18 cm Can also ask inverse question: given P what is X? For example, 3% of boys have height less than what? I.e. P =.3 and X = cm X is the inverse cumulative probability of P. 18
19 Zscores P depends on X but there is a simpler structure underlying the relationship. X can be expressed as being so many SDs from the mean I.e. X = µ + Zσ and P depends on X only through Z Allows quicker apprehension of the Normal distribution (e.g. 95% within ± 2 SDs of mean, 68% within ± 1 SD of mean) Z P Proportion within Z SDs of mean (=2P1)
20 Why is the Normal Distribution Important? Empirically important  many variables are Normal There are reasons why some variables are Normal e.g. polygenic control  see appendix 3 Means are more Normal 2
21 Distribution of sample means Population Sample size Fraction Fraction X Sample size Sample means Sample size Fraction.5 Fraction Sample means Sample means Figure 8: histograms of 1 observations form a population and of 5 sample means for samples of size 1, 5 and 1. 21
22 Assessing Normality Quite a preoccupation  not as necessary as might be thought cf. Distributionfree or nonparametric methods Many methods that assume Normality are robust (or other types of assumption are more important) Assessing Normality is not straightforward and its importance depends on purpose of analysis 22
23 Assessing Normality  a Quick and Dirty method About 2½% of population lies below µ  2σ About 16% of population lies below µ  σ If variable is positive and µ < σ then Normal distribution highly questionable If variable positive and µ 2σ then Normal distribution might be just about OK If value lies inbetween then appropriateness should be judged accordingly Handy method when reading papers  not at all foolproof 23
24 Plot data: I histogram Histogram is obvious way to see if shape of distribution of sample is close to that of Normal curve In small samples this is not easy: Expt==1 Expt==2 Expt==3.2 Samples of size 1 from distribution known to be Normal Fraction Expt==4 Expt==7 Expt==5 Expt==8 Expt==6 Expt== X Histograms 24
25 Plot data: II Normal Probability Plot Normal Probability Plot for C Mean: StDev: Percent Data Plot ordered sample values against appropriate yvalue If data are Normal then straight line will result 25
26 How does a Normal probability plot work? Depends on expected positions for ordered values in a Normal sample Sample size µ 2σ µ σ µ µ+σ Ordered variable µ+2σ 26
27 Normal plot for nonnormal data Normal Probability Plot for C Mean: StDev: Percent Fraction Data Variable 27
Descriptive Statistics. Understanding Data: Categorical Variables. Descriptive Statistics. Dataset: Shellfish Contamination
Descriptive Statistics Understanding Data: Dataset: Shellfish Contamination Location Year Species Species2 Method Metals Cadmium (mg kg  ) Chromium (mg kg  ) Copper (mg kg  ) Lead (mg kg  ) Mercury
More informationDescriptive Statistics
Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web
More information1.5 NUMERICAL REPRESENTATION OF DATA (Sample Statistics)
1.5 NUMERICAL REPRESENTATION OF DATA (Sample Statistics) As well as displaying data graphically we will often wish to summarise it numerically particularly if we wish to compare two or more data sets.
More informationSTAT 155 Introductory Statistics. Lecture 5: Density Curves and Normal Distributions (I)
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL STAT 155 Introductory Statistics Lecture 5: Density Curves and Normal Distributions (I) 9/12/06 Lecture 5 1 A problem about Standard Deviation A variable
More informationNumerical Measures of Central Tendency
Numerical Measures of Central Tendency Often, it is useful to have special numbers which summarize characteristics of a data set These numbers are called descriptive statistics or summary statistics. A
More informationCenter: Finding the Median. Median. Spread: Home on the Range. Center: Finding the Median (cont.)
Center: Finding the Median When we think of a typical value, we usually look for the center of the distribution. For a unimodal, symmetric distribution, it s easy to find the center it s just the center
More informationChapter 6. The Standard Deviation as a Ruler and the Normal Model. Copyright 2012, 2008, 2005 Pearson Education, Inc.
Chapter 6 The Standard Deviation as a Ruler and the Normal Model Copyright 2012, 2008, 2005 Pearson Education, Inc. The Standard Deviation as a Ruler The trick in comparing very differentlooking values
More informationMCQ S OF MEASURES OF CENTRAL TENDENCY
MCQ S OF MEASURES OF CENTRAL TENDENCY MCQ No 3.1 Any measure indicating the centre of a set of data, arranged in an increasing or decreasing order of magnitude, is called a measure of: (a) Skewness (b)
More informationDescriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion
Descriptive Statistics Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Statistics as a Tool for LIS Research Importance of statistics in research
More informationSummary of Formulas and Concepts. Descriptive Statistics (Ch. 14)
Summary of Formulas and Concepts Descriptive Statistics (Ch. 14) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume
More informationA frequency distribution is a table used to describe a data set. A frequency table lists intervals or ranges of data values called data classes
A frequency distribution is a table used to describe a data set. A frequency table lists intervals or ranges of data values called data classes together with the number of data values from the set that
More informationUnit 21 Student s t Distribution in Hypotheses Testing
Unit 21 Student s t Distribution in Hypotheses Testing Objectives: To understand the difference between the standard normal distribution and the Student's t distributions To understand the difference between
More informationVariables. Exploratory Data Analysis
Exploratory Data Analysis Exploratory Data Analysis involves both graphical displays of data and numerical summaries of data. A common situation is for a data set to be represented as a matrix. There is
More information103 Measures of Central Tendency and Variation
103 Measures of Central Tendency and Variation So far, we have discussed some graphical methods of data description. Now, we will investigate how statements of central tendency and variation can be used.
More informationUnivariate Descriptive Statistics
Univariate Descriptive Statistics Displays: pie charts, bar graphs, box plots, histograms, density estimates, dot plots, stemleaf plots, tables, lists. Example: sea urchin sizes Boxplot Histogram Urchin
More information6.2 Normal distribution. Standard Normal Distribution:
6.2 Normal distribution Slide Heights of Adult Men and Women Slide 2 Area= Mean = µ Standard Deviation = σ Donation: X ~ N(µ,σ 2 ) Standard Normal Distribution: Slide 3 Slide 4 a normal probability distribution
More informationDensity Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties:
Density Curve A density curve is the graph of a continuous probability distribution. It must satisfy the following properties: 1. The total area under the curve must equal 1. 2. Every point on the curve
More information3: Summary Statistics
3: Summary Statistics Notation Let s start by introducing some notation. Consider the following small data set: 4 5 30 50 8 7 4 5 The symbol n represents the sample size (n = 0). The capital letter X denotes
More informationGCSE HIGHER Statistics Key Facts
GCSE HIGHER Statistics Key Facts Collecting Data When writing questions for questionnaires, always ensure that: 1. the question is worded so that it will allow the recipient to give you the information
More informationExploratory data analysis (Chapter 2) Fall 2011
Exploratory data analysis (Chapter 2) Fall 2011 Data Examples Example 1: Survey Data 1 Data collected from a Stat 371 class in Fall 2005 2 They answered questions about their: gender, major, year in school,
More information2.0 Lesson Plan. Answer Questions. Summary Statistics. Histograms. The Normal Distribution. Using the Standard Normal Table
2.0 Lesson Plan Answer Questions 1 Summary Statistics Histograms The Normal Distribution Using the Standard Normal Table 2. Summary Statistics Given a collection of data, one needs to find representations
More informationData Mining Part 2. Data Understanding and Preparation 2.1 Data Understanding Spring 2010
Data Mining Part 2. and Preparation 2.1 Spring 2010 Instructor: Dr. Masoud Yaghini Introduction Outline Introduction Measuring the Central Tendency Measuring the Dispersion of Data Graphic Displays References
More informationIntroduction to Environmental Statistics. The Big Picture. Populations and Samples. Sample Data. Examples of sample data
A Few Sources for Data Examples Used Introduction to Environmental Statistics Professor Jessica Utts University of California, Irvine jutts@uci.edu 1. Statistical Methods in Water Resources by D.R. Helsel
More informationExercise 1.12 (Pg. 2223)
Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.
More information2. Describing Data. We consider 1. Graphical methods 2. Numerical methods 1 / 56
2. Describing Data We consider 1. Graphical methods 2. Numerical methods 1 / 56 General Use of Graphical and Numerical Methods Graphical methods can be used to visually and qualitatively present data and
More informationGeostatistics Exploratory Analysis
Instituto Superior de Estatística e Gestão de Informação Universidade Nova de Lisboa Master of Science in Geospatial Technologies Geostatistics Exploratory Analysis Carlos Alberto Felgueiras cfelgueiras@isegi.unl.pt
More informationContent DESCRIPTIVE STATISTICS. Data & Statistic. Statistics. Example: DATA VS. STATISTIC VS. STATISTICS
Content DESCRIPTIVE STATISTICS Dr Najib Majdi bin Yaacob MD, MPH, DrPH (Epidemiology) USM Unit of Biostatistics & Research Methodology School of Medical Sciences Universiti Sains Malaysia. Introduction
More informationAP Statistics Solutions to Packet 2
AP Statistics Solutions to Packet 2 The Normal Distributions Density Curves and the Normal Distribution Standard Normal Calculations HW #9 1, 2, 4, 68 2.1 DENSITY CURVES (a) Sketch a density curve that
More informationNonParametric Tests (I)
Lecture 5: NonParametric Tests (I) KimHuat LIM lim@stats.ox.ac.uk http://www.stats.ox.ac.uk/~lim/teaching.html Slide 1 5.1 Outline (i) Overview of DistributionFree Tests (ii) Median Test for Two Independent
More informationPROPERTIES OF MEAN, MEDIAN
PROPERTIES OF MEAN, MEDIAN In the last class quantitative and numerical variables bar charts, histograms(in recitation) Mean, Median Suppose the data set is {30, 40, 60, 80, 90, 120} X = 70, median = 70
More informationWe will use the following data sets to illustrate measures of center. DATA SET 1 The following are test scores from a class of 20 students:
MODE The mode of the sample is the value of the variable having the greatest frequency. Example: Obtain the mode for Data Set 1 77 For a grouped frequency distribution, the modal class is the class having
More informationIII. GRAPHICAL METHODS
Pie Charts and Bar Charts: III. GRAPHICAL METHODS Pie charts and bar charts are used for depicting frequencies or relative frequencies. We compare examples of each using the same data. Sources: AT&T (1961)
More informationMBA 611 STATISTICS AND QUANTITATIVE METHODS
MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 111) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain
More information4. Introduction to Statistics
Statistics for Engineers 41 4. Introduction to Statistics Descriptive Statistics Types of data A variate or random variable is a quantity or attribute whose value may vary from one unit of investigation
More informationHistograms and density curves
Histograms and density curves What s in our toolkit so far? Plot the data: histogram (or stemplot) Look for the overall pattern and identify deviations and outliers Numerical summary to briefly describe
More informationWeek 1. Exploratory Data Analysis
Week 1 Exploratory Data Analysis Practicalities This course ST903 has students from both the MSc in Financial Mathematics and the MSc in Statistics. Two lectures and one seminar/tutorial per week. Exam
More informationMULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. C) (a) 2 (b) 1
Unit 2 Review Name Use the given frequency distribution to find the (a) class width. (b) class midpoints of the first class. (c) class boundaries of the first class. 1) Miles (per day) 12 9 34 22 56
More informationWeek 3&4: Z tables and the Sampling Distribution of X
Week 3&4: Z tables and the Sampling Distribution of X 2 / 36 The Standard Normal Distribution, or Z Distribution, is the distribution of a random variable, Z N(0, 1 2 ). The distribution of any other normal
More informationNumerical Summarization of Data OPRE 6301
Numerical Summarization of Data OPRE 6301 Motivation... In the previous session, we used graphical techniques to describe data. For example: While this histogram provides useful insight, other interesting
More informationDescriptive Statistics. Frequency Distributions and Their Graphs 2.1. Frequency Distributions. Chapter 2
Chapter Descriptive Statistics.1 Frequency Distributions and Their Graphs Frequency Distributions A frequency distribution is a table that shows classes or intervals of data with a count of the number
More informationWhat are Data? The Research Question (Randomised Controlled Trials (RCTs)) The Research Question (Non RCTs)
What are Data? Quantitative Data o Sets of measurements of objective descriptions of physical and behavioural events; susceptible to statistical analysis Qualitative data o Descriptive, views, actions
More informationTreatment and analysis of data Applied statistics Lecture 3: Sampling and descriptive statistics
Treatment and analysis of data Applied statistics Lecture 3: Sampling and descriptive statistics Topics covered: Parameters and statistics Sample mean and sample standard deviation Order statistics and
More informationChapter 3 Descriptive Statistics: Numerical Measures. Learning objectives
Chapter 3 Descriptive Statistics: Numerical Measures Slide 1 Learning objectives 1. Single variable Part I (Basic) 1.1. How to calculate and use the measures of location 1.. How to calculate and use the
More informationconsider the number of math classes taken by math 150 students. how can we represent the results in one number?
ch 3: numerically summarizing data  center, spread, shape 3.1 measure of central tendency or, give me one number that represents all the data consider the number of math classes taken by math 150 students.
More informationDescriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics
Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),
More informationCH.6 Random Sampling and Descriptive Statistics
CH.6 Random Sampling and Descriptive Statistics Population vs Sample Random sampling Numerical summaries : sample mean, sample variance, sample range StemandLeaf Diagrams Median, quartiles, percentiles,
More informationYou flip a fair coin four times, what is the probability that you obtain three heads.
Handout 4: Binomial Distribution Reading Assignment: Chapter 5 In the previous handout, we looked at continuous random variables and calculating probabilities and percentiles for those type of variables.
More informationStatistics Summary (prepared by Xuan (Tappy) He)
Statistics Summary (prepared by Xuan (Tappy) He) Statistics is the practice of collecting and analyzing data. The analysis of statistics is important for decision making in events where there are uncertainties.
More information13.2 Measures of Central Tendency
13.2 Measures of Central Tendency Measures of Central Tendency For a given set of numbers, it may be desirable to have a single number to serve as a kind of representative value around which all the numbers
More informationExploratory Data Analysis
Exploratory Data Analysis Johannes Schauer johannes.schauer@tugraz.at Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Introduction
More informationThe Normal Distribution
Chapter 6 The Normal Distribution 6.1 The Normal Distribution 1 6.1.1 Student Learning Objectives By the end of this chapter, the student should be able to: Recognize the normal probability distribution
More informationChiSquare Test. Contingency Tables. Contingency Tables. ChiSquare Test for Independence. ChiSquare Tests for GoodnessofFit
ChiSquare Tests 15 Chapter ChiSquare Test for Independence ChiSquare Tests for Goodness Uniform Goodness Poisson Goodness Goodness Test ECDF Tests (Optional) McGrawHill/Irwin Copyright 2009 by The
More informationLecture 1: Review and Exploratory Data Analysis (EDA)
Lecture 1: Review and Exploratory Data Analysis (EDA) Sandy Eckel seckel@jhsph.edu Department of Biostatistics, The Johns Hopkins University, Baltimore USA 21 April 2008 1 / 40 Course Information I Course
More informationChapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs
Types of Variables Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Quantitative (numerical)variables: take numerical values for which arithmetic operations make sense (addition/averaging)
More informationMinitab Guide. This packet contains: A Friendly Guide to Minitab. Minitab StepByStep
Minitab Guide This packet contains: A Friendly Guide to Minitab An introduction to Minitab; including basic Minitab functions, how to create sets of data, and how to create and edit graphs of different
More informationF. Farrokhyar, MPhil, PhD, PDoc
Learning objectives Descriptive Statistics F. Farrokhyar, MPhil, PhD, PDoc To recognize different types of variables To learn how to appropriately explore your data How to display data using graphs How
More information1 Measures for location and dispersion of a sample
Statistical Geophysics WS 2008/09 7..2008 Christian Heumann und Helmut Küchenhoff Measures for location and dispersion of a sample Measures for location and dispersion of a sample In the following: Variable
More informationCHINHOYI UNIVERSITY OF TECHNOLOGY
CHINHOYI UNIVERSITY OF TECHNOLOGY SCHOOL OF NATURAL SCIENCES AND MATHEMATICS DEPARTMENT OF MATHEMATICS MEASURES OF CENTRAL TENDENCY AND DISPERSION INTRODUCTION From the previous unit, the Graphical displays
More informationChapter 2. Objectives. Tabulate Qualitative Data. Frequency Table. Descriptive Statistics: Organizing, Displaying and Summarizing Data.
Objectives Chapter Descriptive Statistics: Organizing, Displaying and Summarizing Data Student should be able to Organize data Tabulate data into frequency/relative frequency tables Display data graphically
More informationThe right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median
CONDENSED LESSON 2.1 Box Plots In this lesson you will create and interpret box plots for sets of data use the interquartile range (IQR) to identify potential outliers and graph them on a modified box
More information4.3 Areas under a Normal Curve
4.3 Areas under a Normal Curve Like the density curve in Section 3.4, we can use the normal curve to approximate areas (probabilities) between different values of Y that follow a normal distribution Y
More informationBoxplots. The Boxplot as an Indicator of Centrality
Boxplots In its simplest form, the boxplot presents five sample statistics  the minimum, the lower quartile, the median, the upper quartile and the maximum  in a visual display. The box of the plot is
More informationthe number of organisms in the squares of a haemocytometer? the number of goals scored by a football team in a match?
Poisson Random Variables (Rees: 6.8 6.14) Examples: What is the distribution of: the number of organisms in the squares of a haemocytometer? the number of hits on a web site in one hour? the number of
More information1. 2. 3. 4. Find the mean and median. 5. 1, 2, 87 6. 3, 2, 1, 10. Bellwork 32315 Simplify each expression.
Bellwork 32315 Simplify each expression. 1. 2. 3. 4. Find the mean and median. 5. 1, 2, 87 6. 3, 2, 1, 10 1 Objectives Find measures of central tendency and measures of variation for statistical data.
More informationSTATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI
STATS8: Introduction to Biostatistics Data Exploration Babak Shahbaba Department of Statistics, UCI Introduction After clearly defining the scientific problem, selecting a set of representative members
More informationProbability and Statistics Vocabulary List (Definitions for Middle School Teachers)
Probability and Statistics Vocabulary List (Definitions for Middle School Teachers) B Bar graph a diagram representing the frequency distribution for nominal or discrete data. It consists of a sequence
More informationHistogram. Graphs, and measures of central tendency and spread. Alternative: density (or relative frequency ) plot /13/2004
Graphs, and measures of central tendency and spread 9.07 9/13/004 Histogram If discrete or categorical, bars don t touch. If continuous, can touch, should if there are lots of bins. Sum of bin heights
More information5/31/2013. 6.1 Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives.
The Normal Distribution C H 6A P T E R The Normal Distribution Outline 6 1 6 2 Applications of the Normal Distribution 6 3 The Central Limit Theorem 6 4 The Normal Approximation to the Binomial Distribution
More informationPopulation and sample; parameter and statistic. Sociology 360 Statistics for Sociologists I Chapter 11 Sampling Distributions. Question about Notation
Population and sample; parameter and statistic Sociology 360 Statistics for Sociologists I Chapter 11 Sampling Distributions The Population is the entire group we are interested in A parameter is a number
More informationChapter 2: Exploring Data with Graphs and Numerical Summaries. Graphical Measures Graphs are used to describe the shape of a data set.
Page 1 of 16 Chapter 2: Exploring Data with Graphs and Numerical Summaries Graphical Measures Graphs are used to describe the shape of a data set. Section 1: Types of Variables In general, variable can
More informationMeasures of Central Tendency. There are different types of averages, each has its own advantages and disadvantages.
Measures of Central Tendency According to Prof Bowley Measures of central tendency (averages) are statistical constants which enable us to comprehend in a single effort the significance of the whole. The
More informationLecture 2: Descriptive Statistics and Exploratory Data Analysis
Lecture 2: Descriptive Statistics and Exploratory Data Analysis Further Thoughts on Experimental Design 16 Individuals (8 each from two populations) with replicates Pop 1 Pop 2 Randomly sample 4 individuals
More information6.4 Normal Distribution
Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under
More informationDesciptive Statistics Qualitative data Quantitative data Graphical methods Numerical methods
Desciptive Statistics Qualitative data Quantitative data Graphical methods Numerical methods Qualitative data Data are classified in categories Non numerical (although may be numerically codified) Elements
More informationNorthumberland Knowledge
Northumberland Knowledge Know Guide How to Analyse Data  November 2012  This page has been left blank 2 About this guide The Know Guides are a suite of documents that provide useful information about
More informationThe Normal distribution
The Normal distribution The normal probability distribution is the most common model for relative frequencies of a quantitative variable. Bellshaped and described by the function f(y) = 1 2σ π e{ 1 2σ
More informationChapter 7 What to do when you have the data
Chapter 7 What to do when you have the data We saw in the previous chapters how to collect data. We will spend the rest of this course looking at how to analyse the data that we have collected. Stem and
More informationChapter 4. iclicker Question 4.4 Prelecture. Part 2. Binomial Distribution. J.C. Wang. iclicker Question 4.4 Prelecture
Chapter 4 Part 2. Binomial Distribution J.C. Wang iclicker Question 4.4 Prelecture iclicker Question 4.4 Prelecture Outline Computing Binomial Probabilities Properties of a Binomial Distribution Computing
More information1 SAMPLE SIGN TEST. NonParametric Univariate Tests: 1 Sample Sign Test 1. A nonparametric equivalent of the 1 SAMPLE TTEST.
NonParametric Univariate Tests: 1 Sample Sign Test 1 1 SAMPLE SIGN TEST A nonparametric equivalent of the 1 SAMPLE TTEST. ASSUMPTIONS: Data is nonnormally distributed, even after log transforming.
More informationNormal distribution. ) 2 /2σ. 2π σ
Normal distribution The normal distribution is the most widely known and used of all distributions. Because the normal distribution approximates many natural phenomena so well, it has developed into a
More informationChapter 3. The Normal Distribution
Chapter 3. The Normal Distribution Topics covered in this chapter: Zscores Normal Probabilities Normal Percentiles Zscores Example 3.6: The standard normal table The Problem: What proportion of observations
More informationThe Big 50 Revision Guidelines for S1
The Big 50 Revision Guidelines for S1 If you can understand all of these you ll do very well 1. Know what is meant by a statistical model and the Modelling cycle of continuous refinement 2. Understand
More informationVisual Display of Data in Stata
Lab 2 Visual Display of Data in Stata In this lab we will try to understand data not only through numerical summaries, but also through graphical summaries. The data set consists of a number of variables
More informationWhen σ Is Known: Recall the Mystery Mean Activity where x bar = 240.79 and we have an SRS of size 16
8.3 ESTIMATING A POPULATION MEAN When σ Is Known: Recall the Mystery Mean Activity where x bar = 240.79 and we have an SRS of size 16 Task was to estimate the mean when we know that the situation is Normal
More informationIntroduction to Statistics for Psychology. Quantitative Methods for Human Sciences
Introduction to Statistics for Psychology and Quantitative Methods for Human Sciences Jonathan Marchini Course Information There is website devoted to the course at http://www.stats.ox.ac.uk/ marchini/phs.html
More informationNormality Testing in Excel
Normality Testing in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com
More informationConfidence Intervals for the Difference Between Two Means
Chapter 47 Confidence Intervals for the Difference Between Two Means Introduction This procedure calculates the sample size necessary to achieve a specified distance from the difference in sample means
More informationChapter 3: Data Description Numerical Methods
Chapter 3: Data Description Numerical Methods Learning Objectives Upon successful completion of Chapter 3, you will be able to: Summarize data using measures of central tendency, such as the mean, median,
More informationLecture 2. Summarizing the Sample
Lecture 2 Summarizing the Sample WARNING: Today s lecture may bore some of you It s (sort of) not my fault I m required to teach you about what we re going to cover today. I ll try to make it as exciting
More informationTHE BINOMIAL DISTRIBUTION & PROBABILITY
REVISION SHEET STATISTICS 1 (MEI) THE BINOMIAL DISTRIBUTION & PROBABILITY The main ideas in this chapter are Probabilities based on selecting or arranging objects Probabilities based on the binomial distribution
More information16. THE NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION
6. THE NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION It is sometimes difficult to directly compute probabilities for a binomial (n, p) random variable, X. We need a different table for each value of
More informationBNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I
BNG 202 Biomechanics Lab Descriptive statistics and probability distributions I Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential
More informationAnalyzing DataDescriptive Statistics
Analyzing DataDescriptive Statistics Statistical Methods Math 2620  B Alexus Nelson, Miriam Watkins, Kaitlyn Smith Nelson, Watkins, Smith 2 Analyzing Data Descriptive Statistics Data set 1 which is
More information1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number
1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x  x) B. x 3 x C. 3x  x D. x  3x 2) Write the following as an algebraic expression
More informationGlossary of numeracy terms
Glossary of numeracy terms These terms are used in numeracy. You can use them as part of your preparation for the numeracy professional skills test. You will not be assessed on definitions of terms during
More information2. Filling Data Gaps, Data validation & Descriptive Statistics
2. Filling Data Gaps, Data validation & Descriptive Statistics Dr. Prasad Modak Background Data collected from field may suffer from these problems Data may contain gaps ( = no readings during this period)
More informationStatistics Chapter 3 Averages and Variations
Statistics Chapter 3 Averages and Variations Measures of Central Tendency Average a measure of the center value or central tendency of a distribution of values. Three types of average: Mode Median Mean
More informationProbability Models for Continuous Random Variables
Density Probability Models for Continuous Random Variables At right you see a histogram of female length of life. (Births and deaths are recorded to the nearest minute. The data are essentially continuous.)
More informationNormal Distribution. Definition A continuous random variable has a normal distribution if its probability density. f ( y ) = 1.
Normal Distribution Definition A continuous random variable has a normal distribution if its probability density e (y µ Y ) 2 2 / 2 σ function can be written as for < y < as Y f ( y ) = 1 σ Y 2 π Notation:
More informationSTT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables
Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Discrete vs. continuous random variables Examples of continuous distributions o Uniform o Exponential o Normal Recall: A random
More information