2-2 Frequency Distributions


When working with large data sets, it is often helpful to organize and summarize the data by constructing a table that lists the different possible data values (either individually or by groups) along with the corresponding frequencies, which represent the number of times those values occur.

Definition

A frequency distribution lists data values (either individually or by groups of intervals), along with their corresponding frequencies (or counts).

Table 2-2 is a frequency distribution summarizing the measured cotinine levels of the 40 smokers listed in Table 2-1. The frequency for a particular class is the number of original values that fall into that class. For example, the first class in Table 2-2 has a frequency of 11, indicating that 11 of the original data values are between 0 and 99 inclusive. We first present some standard terms used in discussing frequency distributions, and then we describe how to construct and interpret them.

Table 2-2 Frequency Distribution of Cotinine Levels of Smokers

Cotinine    Frequency
0-99        11
100-199     12
200-299     14
300-399      1
400-499      2

Definitions

Lower class limits are the smallest numbers that can belong to the different classes. (Table 2-2 has lower class limits of 0, 100, 200, 300, and 400.)

Upper class limits are the largest numbers that can belong to the different classes. (Table 2-2 has upper class limits of 99, 199, 299, 399, and 499.)

Class boundaries are the numbers used to separate classes, but without the gaps created by class limits. They are obtained as follows: Find the size of the gap between the upper class limit of one class and the lower class limit of the next class. Add half of that amount to each upper class limit to find the upper class boundaries; subtract half of that amount from each lower class limit to find the lower class boundaries. (Table 2-2 has gaps of exactly 1 unit, so 0.5 is added to the upper class limits and subtracted from the lower class limits. The first class has boundaries of -0.5 and 99.5, the second class has boundaries of 99.5 and 199.5, and so on. The complete list of boundaries used for all classes is -0.5, 99.5, 199.5, 299.5, 399.5, and 499.5.)

Class midpoints are the midpoints of the classes. (Table 2-2 has class midpoints of 49.5, 149.5, 249.5, 349.5, and 449.5.) Each class midpoint can be found by adding the lower class limit to the upper class limit and dividing the sum by 2.

Class width is the difference between two consecutive lower class limits or two consecutive lower class boundaries. (Table 2-2 uses a class width of 100.)

Growth Charts Updated

Pediatricians typically use standardized growth charts to compare their patient's weight and height to a sample of other children. Children are considered to be in the normal range if their weight and height fall between the 5th and 95th percentiles. If they fall outside of that range, they are often given tests to ensure that there are no serious medical problems. Pediatricians became increasingly aware of a major problem with the charts: Because they were based on children living between 1929 and 1975, the growth charts were found to be inaccurate. To rectify this problem, the charts were updated in 2000 to reflect the current measurements of millions of children. The weights and heights of children are good examples of populations that change over time. This is the reason for including changing characteristics of data over time as an important consideration for a population.

The definitions of class width and class boundaries are tricky. Be careful to avoid the easy mistake of making the class width the difference between the lower class limit and the upper class limit. See Table 2-2 and note that the class width is 100, not 99.
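The definitions above reduce to a few lines of arithmetic. The following Python sketch is only an illustration: the function name class_summary is ours, not the text's, and it assumes the classes are given as parallel lists of lower and upper class limits with the same gap between every pair of consecutive classes.

```python
def class_summary(lower_limits, upper_limits):
    """Return (boundaries, midpoints, width) for classes given by their limits."""
    # Gap between the upper limit of one class and the lower limit of the next.
    gap = lower_limits[1] - upper_limits[0]
    half = gap / 2
    # Subtract half the gap from the first lower limit, add it to every upper limit.
    boundaries = [lower_limits[0] - half] + [u + half for u in upper_limits]
    # Midpoint = (lower class limit + upper class limit) / 2.
    midpoints = [(lo + hi) / 2 for lo, hi in zip(lower_limits, upper_limits)]
    # Width = difference between two consecutive LOWER class limits.
    width = lower_limits[1] - lower_limits[0]
    return boundaries, midpoints, width


# Class limits of Table 2-2.
lower = [0, 100, 200, 300, 400]
upper = [99, 199, 299, 399, 499]
boundaries, midpoints, width = class_summary(lower, upper)
print(boundaries)  # [-0.5, 99.5, 199.5, 299.5, 399.5, 499.5]
print(midpoints)   # [49.5, 149.5, 249.5, 349.5, 449.5]
print(width)       # 100
```

Because the width is computed from two consecutive lower limits, the sketch returns 100 rather than the tempting but incorrect 99.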
You can simplify the process of finding class boundaries by understanding that they basically fill the gaps between classes by splitting the difference between the end of one class and the beginning of the next class.

Procedure for Constructing a Frequency Distribution

Frequency distributions are constructed for these reasons: (1) Large data sets can be summarized, (2) we can gain some insight into the nature of data, and (3) we have a basis for constructing important graphs (such as histograms, introduced in the next section). Many uses of technology allow us to automatically obtain frequency distributions without manually constructing them, but here is the basic procedure:

1. Decide on the number of classes you want. The number of classes should be between 5 and 20, and the number you select might be affected by the convenience of using round numbers.

2. Calculate

\[ \text{class width} \approx \frac{(\text{highest value}) - (\text{lowest value})}{\text{number of classes}} \]

Round this result to get a convenient number. (Usually round up.) You might need to change the number of classes, but the priority should be to use values that are easy to understand.

3. Starting point: Begin by choosing a number for the lower limit of the first class. Choose either the lowest data value or a convenient value that is a little smaller.

4. Using the lower limit of the first class and the class width, proceed to list the other lower class limits. (Add the class width to the starting point to get the second lower class limit. Add the class width to the second lower class limit to get the third, and so on.)

5. List the lower class limits in a vertical column and proceed to enter the upper class limits, which can be easily identified.

6. Go through the data set putting a tally in the appropriate class for each data value. Use the tally marks to find the total frequency for each class.

When constructing a frequency distribution, be sure that classes do not overlap so that each of the original values must belong to exactly one class. Include all classes, even those with a frequency of zero. Try to use the same width for all classes, although it is sometimes impossible to avoid open-ended intervals, such as "65 years or older."

EXAMPLE Cotinine Levels of Smokers

Using the 40 cotinine levels for the smokers in Table 2-1, follow the above procedure to construct the frequency distribution shown in Table 2-2. Assume that you want 5 classes.

SOLUTION

Step 1: Begin by selecting 5 as the number of desired classes.

Step 2: Calculate the class width. In the following calculation, 98.2 is rounded up to 100, which is a more convenient number.

\[ \text{class width} \approx \frac{(\text{highest value}) - (\text{lowest value})}{\text{number of classes}} = \frac{491 - 0}{5} = 98.2 \approx 100 \]

Step 3: We choose a starting point of 0, which is the lowest value in the list and is also a convenient number.

Step 4: Add the class width of 100 to the starting point of 0 to determine that the second lower class limit is 100. Continue to add the class width of 100 to get the remaining lower class limits of 200, 300, and 400.

Step 5: List the lower class limits (0, 100, 200, 300, and 400) vertically. From this list, we can easily identify the corresponding upper class limits as 99, 199, 299, 399, and 499.

Step 6: After identifying the lower and upper limits of each class, proceed to work through the data set by entering a tally mark for each value. When the tally marks are completed, add them to find the frequencies shown in Table 2-2.

Relative Frequency Distribution

An important variation of the basic frequency distribution uses relative frequencies, which are easily found by dividing each class frequency by the total of all frequencies.
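Looking back at the construction procedure, steps 2 through 6 can be sketched in Python as follows. This is a minimal illustration, not a standard routine: the function names class_limits and tally, the parameters round_to and start, and the sample values are all our own assumptions. The sample is made up (it is not the cotinine data of Table 2-1) and was chosen only so that the raw class width comes out to 98.2, as in the example.

```python
import math


def class_limits(values, num_classes, round_to=1, start=None):
    """Steps 2-5: choose a class width, then list lower and upper class limits.

    round_to and start are illustrative parameters: the raw width is rounded
    up to a multiple of round_to, and start defaults to the lowest value.
    Assumes integer data, so consecutive classes are separated by a gap of 1.
    """
    raw_width = (max(values) - min(values)) / num_classes   # step 2
    width = math.ceil(raw_width / round_to) * round_to      # "usually round up"
    if start is None:                                       # step 3
        start = min(values)
    lowers = [start + i * width for i in range(num_classes)]  # step 4
    uppers = [lo + width - 1 for lo in lowers]                 # step 5
    return width, lowers, uppers


def tally(values, lowers, uppers):
    """Step 6: count how many values fall into each class."""
    counts = [0] * len(lowers)
    for v in values:
        for i, (lo, hi) in enumerate(zip(lowers, uppers)):
            if lo <= v <= hi:
                counts[i] += 1
                break  # classes do not overlap, so stop at the first match
    return counts


# Hypothetical sample: lowest value 0, highest value 491.
sample = [0, 35, 87, 112, 150, 198, 210, 245, 277, 290, 313, 491]
width, lowers, uppers = class_limits(sample, num_classes=5, round_to=100, start=0)
freqs = tally(sample, lowers, uppers)
print(width)   # 100
print(lowers)  # [0, 100, 200, 300, 400]
print(uppers)  # [99, 199, 299, 399, 499]
print(freqs)   # [3, 3, 4, 1, 1]
```

Every value lands in exactly one class, so the frequencies add up to the number of values in the sample.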
A relative frequency distribution includes the same class limits as a frequency distribution, but relative frequencies are used instead of actual frequencies. The relative frequencies are sometimes expressed as percents.

\[ \text{relative frequency} = \frac{\text{class frequency}}{\text{sum of all frequencies}} \]

In Table 2-3 the actual frequencies from Table 2-2 are replaced by the corresponding relative frequencies expressed as percents. The first class has a relative frequency of 11/40 = 0.275, or 27.5%, which is often rounded to 28%. The second class has a relative frequency of 12/40 = 0.300, or 30.0%, and so on. If constructed correctly, the sum of the relative frequencies should total 1 (or 100%), with some small discrepancies allowed for rounding errors. Because 27.5% was rounded to 28% and 2.5% was rounded to 3%, the sum of the relative frequencies in Table 2-3 is 101% instead of 100%.

Table 2-3 Relative Frequency Distribution of Cotinine Levels in Smokers

Cotinine    Relative Frequency
0-99        28%
100-199     30%
200-299     35%
300-399      3%
400-499      5%

Because they use simple proportions or percentages, relative frequency distributions make it easier for us to understand the distribution of the data and to compare different sets of data.

Authors Identified

Alexander Hamilton, John Jay, and James Madison anonymously published the famous Federalist Papers in an attempt to convince New Yorkers that they should ratify the Constitution. The identity of most of the papers' authors became known, but the authorship of 12 of the papers was contested. Through statistical analysis of the frequencies of various words, we can now conclude that James Madison is the likely author of these 12 papers. For many of the disputed papers, the evidence in favor of Madison's authorship is overwhelming to the degree that we can be almost certain of being correct.

Cumulative Frequency Distribution

Another variation of the standard frequency distribution is used when cumulative totals are desired. The cumulative frequency for a class is the sum of the frequencies for that class and all previous classes. Table 2-4 is the cumulative frequency distribution based on the frequency distribution of Table 2-2. Using the original frequencies of 11, 12, 14, 1, and 2, we add 11 + 12 to get the second cumulative frequency of 23, then we add 11 + 12 + 14 = 37 to get the third, and so on. See Table 2-4 and note that in addition to using cumulative frequencies, the class limits are replaced by "less than" expressions that describe the new range of values.
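Both variations can be computed directly from the frequencies 11, 12, 14, 1, and 2 quoted above. The sketch below is only an illustration, with variable names of our own choosing. It rounds halves upward to match the text (27.5% to 28%, 2.5% to 3%). Python's built-in round is avoided here because it rounds halves to the nearest even integer, which would turn 2.5 into 2.

```python
import math
from itertools import accumulate

frequencies = [11, 12, 14, 1, 2]   # frequencies quoted in the text for Table 2-2
total = sum(frequencies)           # 40

# Relative frequencies as percents: class frequency / sum of all frequencies.
percents = [100 * f / total for f in frequencies]

# Round halves upward, as the text does.
rounded = [math.floor(p + 0.5) for p in percents]

# Cumulative frequency: this class plus all previous classes.
cumulative = list(accumulate(frequencies))

print(percents)      # [27.5, 30.0, 35.0, 2.5, 5.0]
print(rounded)       # [28, 30, 35, 3, 5]
print(sum(rounded))  # 101
print(cumulative)    # [11, 23, 37, 38, 40]
```

The rounded percents total 101, which is the small rounding discrepancy the text describes, and the last cumulative frequency equals the total number of values, 40.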
Critical Thinking: Interpreting Frequency Distributions

The transformation of raw data to a frequency distribution is typically a means to some greater end. The following examples illustrate how frequency distributions can be used to describe, explore, and compare data sets. (The following section shows how the construction of a frequency distribution is often the first step in the creation of a graph that visually depicts the nature of the distribution.)

EXAMPLE Describing Data

Refer to Data Set 1 in Appendix B for the pulse rates of 40 randomly selected adult males. Table 2-5 summarizes the last digits of those pulse rates. If the pulse rates are measured by counting the number of heartbeats in 1 minute, we expect that those last digits should occur with frequencies that are roughly the same. But note that the frequency distribution shows that the last digits are all even numbers; there are no odd numbers present. This suggests that the pulse rates were not counted for 1 minute. Perhaps they were counted for 30 seconds and the values were then doubled. (Upon further examination of the original pulse rates, we can see that every original value is a multiple of four, suggesting that the number of heartbeats was counted for 15 seconds, then that count was multiplied by four.) It's fascinating to learn something about the method of data collection by simply describing some characteristics of the data.
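The last-digit check in this example is easy to reproduce. The sketch below uses made-up values, not the pulse rates of Data Set 1: we assume ten hypothetical 15-second heartbeat counts and multiply each by four, then tally the last digits. The function name last_digit_counts is ours.

```python
from collections import Counter


def last_digit_counts(values):
    """Return a list whose entry d is the number of values ending in digit d."""
    counts = Counter(v % 10 for v in values)
    return [counts.get(d, 0) for d in range(10)]


# Hypothetical 15-second heartbeat counts (NOT the data from Appendix B).
counts_15s = [14, 15, 16, 17, 18, 19, 20, 21, 22, 23]
pulse_rates = [4 * c for c in counts_15s]   # each rate is a multiple of four

digits = last_digit_counts(pulse_rates)
odd_total = sum(digits[1::2])               # values ending in 1, 3, 5, 7, or 9

print(pulse_rates)  # [56, 60, 64, 68, 72, 76, 80, 84, 88, 92]
print(digits)       # [2, 0, 2, 0, 2, 0, 2, 0, 2, 0]
print(odd_total)    # 0
```

Any value obtained by multiplying a whole-number count by four is even, so odd last digits cannot occur. That is the pattern the frequency distribution of last digits exposes.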

Table 2-4 Cumulative Frequency Distribution of Cotinine Levels in Smokers

Cotinine         Cumulative Frequency
Less than 100    11
Less than 200    23
Less than 300    37
Less than 400    38
Less than 500    40

Table 2-5 Last Digits of Male Pulse Rates (columns: Last Digit, Frequency)

EXAMPLE Exploring Data

In studying the behavior of the Old Faithful geyser in Yellowstone National Park, geologists collect data for the times (in minutes) between eruptions. Table 2-6 summarizes actual data that were obtained. Examination of the frequency distribution reveals unexpected behavior: The distribution of times has two different peaks. This distribution led geologists to consider possible explanations.

Table 2-6 Times (in minutes) Between Old Faithful Eruptions (columns: Time, Frequency)

EXAMPLE Comparing Data Sets

The Chapter Problem given at the beginning of this chapter includes data sets consisting of measured cotinine levels from smokers, nonsmokers exposed to tobacco smoke, and nonsmokers not exposed to tobacco smoke. Table 2-7 shows the relative frequencies for the three groups. By comparing those relative frequencies, it should be obvious that the frequency distribution for smokers is very different from the frequency distributions for the other two groups. Because the two groups of nonsmokers (exposed and not exposed) have such high frequency amounts for the first class, it might be helpful to further compare those data sets with a closer examination of those values.

Table 2-7 Cotinine Levels for Three Groups

Cotinine    Smokers    Nonsmokers Exposed to Smoke    Nonsmokers Not Exposed to Smoke
0-99        28%        85%                            95%
100-199     30%         5%                             0%
200-299     35%         3%                             3%
300-399      3%         3%                             3%
400-499      5%         0%                             0%
500-599      0%         5%                             0%

2-2 Basic Skills and Concepts

In Exercises 1-4, identify the class width, class midpoints, and class boundaries for the given frequency distribution based on Data Set 1 in Appendix B.

1. Systolic Blood Pressure of Men
2. Systolic Blood Pressure of Women
3. Cholesterol of Men
4. Body Mass Index of Women

In Exercises 5-8, construct the relative frequency distribution that corresponds to the frequency distribution in the exercise indicated.

5. Exercise 1
6. Exercise 2
7. Exercise 3
8. Exercise 4

In Exercises 9-12, construct the cumulative frequency distribution that corresponds to the frequency distribution in the exercise indicated.

9. Exercise 1
10. Exercise 2
11. Exercise 3
12. Exercise 4

13. Loaded Die The author drilled a hole in a die and filled it with a lead weight, then proceeded to roll it 200 times. (Yes, the author has too much free time.) The results are given in the frequency distribution in the margin. Construct the corresponding relative frequency distribution and determine whether the die is significantly different from a fair die that has not been loaded.

Table for Exercise 13 (columns: Outcome, Frequency)

14. Lottery The frequency distribution in the margin is based on the Win Four numbers from the New York State Lottery, as listed in Data Set 26 in Appendix B. Construct the corresponding relative frequency distribution and determine whether the results appear to be selected in such a way that all of the digits are equally likely.

Table for Exercise 14 (columns: Digit, Frequency)

15. Bears Refer to Data Set 9 in Appendix B and construct a frequency distribution of the weights of bears. Use 11 classes beginning with a lower class limit of 0 and use a class width of 50 lb.

16. Body Temperatures Refer to Data Set 4 in Appendix B and construct a frequency distribution of the body temperatures for midnight on the second day. Use 8 classes beginning with a lower class limit of 96.5 and use a class width of 0.4°F. Describe two different notable features of the result.

17. Head Circumferences Refer to Data Set 3 in Appendix B. Construct a frequency distribution for the head circumferences of baby boys and construct a separate frequency distribution for the head circumferences of baby girls. In both cases, use the classes of , , and so on. Then compare the results and determine whether there appears to be a significant difference between the two genders.

18. Animated Movies for Children Refer to Data Set 7 in Appendix B. Construct a frequency distribution for the lengths of time that animated movies for children contain tobacco use and construct a separate frequency distribution for the lengths of time for alcohol use. In both cases, use the classes of 0-99, 100-199, and so on. Compare the results and determine whether there appears to be a significant difference.

19. Marathon Runners Refer to Data Set 8 in Appendix B. Construct a relative frequency distribution for the ages of the sample of males who finished the New York City marathon, then construct a separate relative frequency distribution for the ages of the females. In both cases, start the first class with a lower class limit of 19 and use a class width of 10. Compare the results and determine whether there appears to be any notable difference between the two groups.

20. Regular Coke/Diet Coke Refer to Data Set 17 in Appendix B. Construct a relative frequency distribution for the weights of regular Coke by starting the first class at lb and use a class width of lb. Then construct another relative frequency distribution for the weights of diet Coke by starting the first class at lb and use a class width of lb. Then compare the results and determine whether there appears to be a significant difference. If so, provide a possible explanation for the difference.

2-2 Beyond the Basics

21. Interpreting Effects of Outliers Refer to Data Set 20 in Appendix B for the axial loads of aluminum cans that are in. thick. The load of 504 lb is called an outlier because it is very far away from all of the other values. Construct a frequency distribution that includes the value of 504 lb, then construct another frequency distribution with the value of 504 lb excluded. In both cases, start the first class at 200 lb and use a class width of 20 lb. Interpret the results by stating a generalization about how much of an effect an outlier might have on a frequency distribution.

22. Number of Classes In constructing a frequency distribution, Sturges' guideline suggests that the ideal number of classes can be approximated by \(1 + (\log n)/(\log 2)\), where n is the number of data values. Use this guideline to complete the table for determining the ideal number of classes.

Table for Exercise 22 (columns: Number of Values, Ideal Number of Classes)
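As a small aid for working with Sturges' guideline, the expression 1 + (log n)/(log 2) can be evaluated in Python. This is only a sketch: the function name sturges_classes is ours, math.log2(n) is used because it equals (log n)/(log 2), and rounding the result to the nearest whole number is our assumption, since the exercise itself does not say how to round.

```python
import math


def sturges_classes(n):
    """Sturges' guideline: approximately 1 + (log n)/(log 2) classes for n values."""
    return 1 + math.log2(n)   # log2(n) is the same as (log n)/(log 2)


# n = 16 gives a whole number; n = 40 is the size of the smoker sample in Table 2-1.
print(sturges_classes(16))         # 5.0
print(round(sturges_classes(40)))  # 6
```

For the 40 cotinine values the guideline suggests about 6 classes, which is close to the 5 classes used in Table 2-2.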


Chapter 2. Objectives. Tabulate Qualitative Data. Frequency Table. Descriptive Statistics: Organizing, Displaying and Summarizing Data.

Objectives Chapter Descriptive Statistics: Organizing, Displaying and Summarizing Data Student should be able to Organize data Tabulate data into frequency/relative frequency tables Display data graphically

Appendix 2.1 Tabular and Graphical Methods Using Excel

Appendix 2.1 Tabular and Graphical Methods Using Excel 1 Appendix 2.1 Tabular and Graphical Methods Using Excel The instructions in this section begin by describing the entry of data into an Excel spreadsheet.

Visual Display of Data in Stata

Lab 2 Visual Display of Data in Stata In this lab we will try to understand data not only through numerical summaries, but also through graphical summaries. The data set consists of a number of variables

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

CONDENSED LESSON 2.1 Box Plots In this lesson you will create and interpret box plots for sets of data use the interquartile range (IQR) to identify potential outliers and graph them on a modified box

Chapter 7. Estimates and Sample Size

Chapter 7. Estimates and Sample Size Chapter Problem: How do we interpret a poll about global warming? Pew Research Center Poll: From what you ve read and heard, is there a solid evidence that the average

Statistics Revision Sheet Question 6 of Paper 2

Statistics Revision Sheet Question 6 of Paper The Statistics question is concerned mainly with the following terms. The Mean and the Median and are two ways of measuring the average. sumof values no. of

Chapter 6 Random Variables

Chapter 6 Random Variables Day 1: 6.1 Discrete Random Variables Read 340-344 What is a random variable? Give some examples. A numerical variable that describes the outcomes of a chance process. Examples:

Report of for Chapter 2 pretest

Report of for Chapter 2 pretest Exam: Chapter 2 pretest Category: Organizing and Graphing Data 1. "For our study of driving habits, we recorded the speed of every fifth vehicle on Drury Lane. Nearly every

SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

Ch. 2 Organizing and Summarizing Data 2.1 Organizing Qualitative Data 1 Organize qualitative data in tables. SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

Math 240 Practice Exam 1 Note: Show all work whenever possible for full credit. You may leave your answers in terms of fractions.

Name: (Solution on last page)... Math 240 Practice Exam 1 Note: Show all work whenever possible for full credit. You may leave your answers in terms of fractions. Formulas: 2 2 x x n x x s x x n f x f

Chapter 3: Data Description Numerical Methods

Chapter 3: Data Description Numerical Methods Learning Objectives Upon successful completion of Chapter 3, you will be able to: Summarize data using measures of central tendency, such as the mean, median,

2. Describing Data. We consider 1. Graphical methods 2. Numerical methods 1 / 56

2. Describing Data We consider 1. Graphical methods 2. Numerical methods 1 / 56 General Use of Graphical and Numerical Methods Graphical methods can be used to visually and qualitatively present data and

c. Construct a boxplot for the data. Write a one sentence interpretation of your graph.

MBA/MIB 5315 Sample Test Problems Page 1 of 1 1. An English survey of 3000 medical records showed that smokers are more inclined to get depressed than non-smokers. Does this imply that smoking causes depression?

DESCRIPTIVE STATISTICS - CHAPTERS 1 & 2 1

DESCRIPTIVE STATISTICS - CHAPTERS 1 & 2 1 OVERVIEW STATISTICS PANIK...THE THEORY AND METHODS OF COLLECTING, ORGANIZING, PRESENTING, ANALYZING, AND INTERPRETING DATA SETS SO AS TO DETERMINE THEIR ESSENTIAL

SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

Ch. 10 Chi SquareTests and the F-Distribution 10.1 Goodness of Fit 1 Find Expected Frequencies Provide an appropriate response. 1) The frequency distribution shows the ages for a sample of 100 employees.

Statistics 100 Midterm Practice Problems

STATISTICS 100 MIDTERM PRACTICE PROBLEMS PAGE 1 OF 8 Statistics 100 Midterm Practice Problems 1. (26 points total) Suppose that in 2004, the verbal portion of the Scholastic Aptitude Test (SAT) had a mean

If the Shoe Fits! Overview of Lesson GAISE Components Common Core State Standards for Mathematical Practice

If the Shoe Fits! Overview of Lesson In this activity, students explore and use hypothetical data collected on student shoe print lengths, height, and gender in order to help develop a tentative description

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs

Types of Variables Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Quantitative (numerical)variables: take numerical values for which arithmetic operations make sense (addition/averaging)

Cars R and S travel the same distance, so the answer is D.

1 In each graph the distance travelled by the car is represented by the area under the line. Some people can spot which areas are roughly equal. But you can work it out: This gives distances (in metres)

Central Tendency and Variation

Contents 5 Central Tendency and Variation 161 5.1 Introduction............................ 161 5.2 The Mode............................. 163 5.2.1 Mode for Ungrouped Data................ 163 5.2.2 Mode

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),

Unit 16 Normal Distributions

Unit 16 Normal Distributions Objectives: To obtain relative frequencies (probabilities) and percentiles with a population having a normal distribution While there are many different types of distributions

Valor Christian High School Mrs. Bogar Biology Graphing Fun with a Paper Towel Lab

1 Valor Christian High School Mrs. Bogar Biology Graphing Fun with a Paper Towel Lab I m sure you ve wondered about the absorbency of paper towel brands as you ve quickly tried to mop up spilled soda from

CHAPTER 3 CENTRAL TENDENCY ANALYSES

CHAPTER 3 CENTRAL TENDENCY ANALYSES The next concept in the sequential statistical steps approach is calculating measures of central tendency. Measures of central tendency represent some of the most simple

Chapter 1: Looking at Data Distributions. Dr. Nahid Sultana

Chapter 1: Looking at Data Distributions Dr. Nahid Sultana Chapter 1: Looking at Data Distributions 1.1 Displaying Distributions with Graphs 1.2 Describing Distributions with Numbers 1.3 Density Curves

MCQ S OF MEASURES OF CENTRAL TENDENCY

MCQ S OF MEASURES OF CENTRAL TENDENCY MCQ No 3.1 Any measure indicating the centre of a set of data, arranged in an increasing or decreasing order of magnitude, is called a measure of: (a) Skewness (b)

2-7 Exploratory Data Analysis (EDA)

102 C HAPTER 2 Describing, Exploring, and Comparing Data 2-7 Exploratory Data Analysis (EDA) This chapter presents the basic tools for describing, exploring, and comparing data, and the focus of this section

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple.

Graphical Representations of Data, Mean, Median and Standard Deviation In this class we will consider graphical representations of the distribution of a set of data. The goal is to identify the range of

Introduction to Descriptive Statistics

Mathematics Learning Centre Introduction to Descriptive Statistics Jackie Nicholas c 1999 University of Sydney Acknowledgements Parts of this booklet were previously published in a booklet of the same

SECTION 2-1: OVERVIEW SECTION 2-2: FREQUENCY DISTRIBUTIONS

SECTION 2-1: OVERVIEW Chapter 2 Describing, Exploring and Comparing Data 19 In this chapter, we will use the capabilities of Excel to help us look more carefully at sets of data. We can do this by re-organizing

AP Statistics 2008 Scoring Guidelines Form B

AP Statistics 2008 Scoring Guidelines Form B The College Board: Connecting Students to College Success The College Board is a not-for-profit membership association whose mission is to connect students

Chapter 2 - Practice Problems 2

Chapter 2 - Practice Problems 2 SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Construct a grouped-data table for the given data. 1) Kevin asked some

Common Tools for Displaying and Communicating Data for Process Improvement

Common Tools for Displaying and Communicating Data for Process Improvement Packet includes: Tool Use Page # Box and Whisker Plot Check Sheet Control Chart Histogram Pareto Diagram Run Chart Scatter Plot

Chapter 8 Hypothesis Testing

Chapter 8 Hypothesis Testing Chapter problem: Does the MicroSort method of gender selection increase the likelihood that a baby will be girl? MicroSort: a gender-selection method developed by Genetics

Unit 1 Number Sense. In this unit, students will study repeating decimals, percents, fractions, decimals, and proportions.

Unit 1 Number Sense In this unit, students will study repeating decimals, percents, fractions, decimals, and proportions. BLM Three Types of Percent Problems (p L-34) is a summary BLM for the material

Statistics Summary (prepared by Xuan (Tappy) He)

Statistics Summary (prepared by Xuan (Tappy) He) Statistics is the practice of collecting and analyzing data. The analysis of statistics is important for decision making in events where there are uncertainties.

MEI Statistics 1. Exploring data. Section 1: Introduction. Looking at data

MEI Statistics Exploring data Section : Introduction Notes and Examples These notes have sub-sections on: Looking at data Stem-and-leaf diagrams Types of data Measures of central tendency Comparison of

Desciptive Statistics Qualitative data Quantitative data Graphical methods Numerical methods

Desciptive Statistics Qualitative data Quantitative data Graphical methods Numerical methods Qualitative data Data are classified in categories Non numerical (although may be numerically codified) Elements