NUMB3RS Activity: We re Number 1! Episode: The Running Man

Similar documents
Benford s Law and Digital Frequency Analysis

Benford s Law: Tables of Logarithms, Tax Cheats, and The Leading Digit Phenomenon

Microeconomics Sept. 16, 2010 NOTES ON CALCULUS AND UTILITY FUNCTIONS

THE FIRST-DIGIT PHENOMENON

The Point-Slope Form

DIVISION OF DECIMALS We then we multiply by the

Fairfield Public Schools

The Quadratic Formula

Representing Decimals (pages )

A Guide to Benford s Law. A CaseWare IDEA Research Department Document

PAYCHEX, INC. BASIC BUSINESS MATH TRAINING MODULE

Chapter 3 Review Math 1030

Mathematics Content: Pie Charts; Area as Probability; Probabilities as Percents, Decimals & Fractions

AP Statistics Solutions to Packet 2

Chapter 13: Fibonacci Numbers and the Golden Ratio

The time value of money: Part II

CHAPTER 7 INTRODUCTION TO SAMPLING DISTRIBUTIONS

STAT 360 Probability and Statistics. Fall 2012

1. The volume of the object below is 186 cm 3. Calculate the Length of x. (a) 3.1 cm (b) 2.5 cm (c) 1.75 cm (d) 1.25 cm

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Example 1: Suppose the demand function is p = 50 2q, and the supply function is p = q. a) Find the equilibrium point b) Sketch a graph

Growth Models. Linear (Algebraic) Growth. Growth Models 95

ANALYSIS OF TREND CHAPTER 5

HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS

Detecting Fraud in Health Insurance Data: Learning to Model Incomplete Benford s Law Distributions

The Utah Basic Skills Competency Test Framework Mathematics Content and Sample Questions

Interaction at a Distance

How to Win the Stock Market Game

Algebra Bridge Project Cell Phone Plans

Def: The standard normal distribution is a normal probability distribution that has a mean of 0 and a standard deviation of 1.

7. Normal Distributions

HOW MUCH WILL I SPEND ON GAS?

Key Concept. Density Curve

1.7 Graphs of Functions

Multiplying and Dividing Signed Numbers. Finding the Product of Two Signed Numbers. (a) (3)( 4) ( 4) ( 4) ( 4) 12 (b) (4)( 5) ( 5) ( 5) ( 5) ( 5) 20

Comparing Simple and Compound Interest

Finance CHAPTER OUTLINE. 5.1 Interest 5.2 Compound Interest 5.3 Annuities; Sinking Funds 5.4 Present Value of an Annuity; Amortization

High School Functions Interpreting Functions Understand the concept of a function and use function notation.

3 Some Integer Functions

PENNSYLVANIA DEPARTMENT OF EDUCATION About the Mathematics Assessment Anchors

Mathematics as Problem Solving The students will demonstrate the ability to gather information from a graphical representation of an equation.

UNIT AUTHOR: Elizabeth Hume, Colonial Heights High School, Colonial Heights City Schools

Math 728 Lesson Plan

The Difficulty of Faking Data Theodore P. Hill

Density Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties:

Solving Compound Interest Problems

Objectives. Day Day 1 Day 2 Day 3 Day 4 Day 5 Day 6 Day 7. Inventory 2,782 2,525 2,303 2,109 1,955 1,788 1,570

1 (1 + i) 12 = ( 1 + r i = ( 1 + r 2 i = ( 1 + r 2

Interactive Computer Software in College Algebra

Tom wants to find two real numbers, a and b, that have a sum of 10 and have a product of 10. He makes this table.

What You ll Learn. And Why. Key Words. interest simple interest principal amount compound interest compounding period present value future value

Please start the slide show from the beginning to use links. Click here for active links to various courses

Objectives. Materials

LOGNORMAL MODEL FOR STOCK PRICES

6.4 Normal Distribution

Standard 5: The student will analyze the costs and benefits of saving and investing.

Probability. Distribution. Outline

ASSESSMENT DIRECTIONS: Please use the data sets provided below to complete Steps 1-3 of the worksheet. Day 3 Assessment

Characteristics of Binomial Distributions

The Science of Golf. Test Lab Toolkit The Score: Handicap. Grades 6-8

CHAPTER 4 MORE INTEREST FORMULAS

SQUARES AND SQUARE ROOTS

CHAPTER 4 Consumer Choice

Lies My Calculator and Computer Told Me

Napa Valley College Fall 2015 Math : College Algebra (Prerequisite: Math 94/Intermediate Alg.)

Welcome! 8 th Grade Honors Algebra Informational Meeting

Cheating behavior and the Benford s Law Professor Dominique Geyer, ( dgeyer@audencia.com) Nantes Graduate School of Management, France

Lesson 1. Key Financial Concepts INTRODUCTION

Mathematics Higher Level

Solutions of Equations in Two Variables

When 95% Accurate Isn t Written by: Todd CadwalladerOlsker California State University, Fullerton tcadwall@fullerton.edu

The activities in Math Behind the Market are organized into four main mathematic strands:

It s the Last Straw!

Math-in-CTE Lesson Plan Template

Student Activity: To investigate an ESB bill

The Circumference Function

Solving Equations With Fractional Coefficients

Multiplying Fractions

9. Time Value of Money 1: Present and Future Value

What is a Stock? What is a Stock? demonstrates to students how they can own parts of public companies by purchasing shares of their stock.

Normal Probability Distribution

Algebra I. In this technological age, mathematics is more important than ever. When students

Average rate of change of y = f(x) with respect to x as x changes from a to a + h:

Business and Economic Applications

Complete a table of values. Graph the values given in a table. Create an equation representing the information in a table or graph.

Assignment 2: Option Pricing and the Black-Scholes formula The University of British Columbia Science One CS Instructor: Michael Gelbart

3 Introduction to Assessing Risk

Microeconomic Theory: Basic Math Concepts

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

High School Algebra Reasoning with Equations and Inequalities Solve systems of equations.

Finite Mathematics Using Microsoft Excel

Session 7 Bivariate Data and Analysis

Rational Exponents. Squaring both sides of the equation yields. and to be consistent, we must have

RANDOM-NUMBER GUESSING AND THE FIRST DIGIT PHENOMENON1

33 DISCOUNTED PRESENT VALUE

Geometric Series and Annuities

T: Here is a fruit machine with 3 DIALS and with 20 SYMBOLS on each dial. Each of the 20 symbols is equally likely to occur on each dial.

Lesson 5: Percent Change Bellringer

Logarithmic and Exponential Equations

31 Misleading Graphs and Statistics

Transcription:

Teacher Page 1 NUMB3RS Activity: We re Number 1! Topic: Benford s Law Grade Level: 9-12 Objective: Explore Benford s Law Time: 20-30 minutes Introduction In The Running Man, Charlie discusses how Benford s Law can be used to identify a fabricated set of data. Benford s Law describes the surprising fact that in many naturally occurring data sets described by a geometric progression, the leading digit is more likely to be 1 or 2 than any other number. (The leading digit of a number is the leftmost digit of a number, and cannot be zero.) In fact, approximately 30% of the time, the leading digit will be 1. This phenomenon was first discovered in 1881 by astronomer Simon Newcomb, when he discovered that the beginning pages of his book of logarithms were more worn than the others. Dr. Frank Benford, a physicist at the General Electric Company rediscovered this in 1938 and hypothesized that numbers with low leading digits are more prevalent than others. Over a period of six years, he analyzed over 20,000 values, including such diverse data as scientific constants, stock values, lake depths and numbers occurring in newsprint. When his research supported his hypothesis, he developed the following formula for the probability that the leading digit of a number is d: pd ( ) = log 1+ d Some restrictions apply on the use of Benford s law. The set of numbers cannot be assigned values (such as a social security or phone numbers), cannot have a uniform distribution of digits (such as lottery numbers), and should not have a preset maximum or minimum. Because Benford s work assumes that the data can be described by a geometric progression, two common applications are approximating population growth and financial information. However, other types of data sets such as house numbers and Fibonacci numbers have shown the same surprising results. Many people assume that numbers will have a uniform and random distribution, and that all digits are equally as likely to be the leading digit of a number. That is, most people would not consider Benford s Law when falsifying documents. Because of this, the IRS uses Benford s Law to help determine if the numbers on a tax form need further review.

Teacher Page 2 Discuss with Students 1. You are assigned to compile a list of 100 numbers that have been printed in newspapers during the week. Do you think that some numbers will appear more often than others as the leading digit? 2. Suppose half of the students in your class were told to make up a list of 100 values. If you were one of the students assigned to make up values, but don t want to be found out, how could you make your list believable? 3. In this activity, you will generate a hypothetical set of data for the cost of gas. If gas costs $1.00 at time t = 0 and we assume 5% yearly inflation, write a function c(t) that can be used to find the cost of gas in t years. Discuss with Students Answers: 1. Answers will vary. Students will examine this question in the activity. 2. Answers will vary. Discuss with students the idea that the leading digits of numbers are not equally distributed. 3. c(t)=e 0.05t Student Page Answers: 1a.$1.11; $1.28 1b. t = 14 1c. Cost (in dollars per $2.00 $3.00 $4.00 $5.00 $6.00 $7.00 $8.00 $9.00 $10.00 gallon) t (in years) 14 22 28 33 36 39 42 44 47 1d 1e. Cost Range Number of years $1.00 $1.99 $2.00 $2.99 $3.00 $3.99 $4.00 $4.99 $5.00 $5.99 $6.00 $6.99 $7.00 $7.99 $8.00 $8.99 14 8 6 5 3 3 3 2 3 $9.00 $9.99 % of time 30% 17% 13% 11% 6% 6% 6% 4% 6% 1f. For the cost to increase from $1.00 to $2.00, the cost must increase by 100%. However, to increase from $2.00 to $3.00, the cost only increases by 50%. Similarly, the percentage increase continues to go down as the cost in dollars increases. So, the prices spend the longest amount of time in the $1.00 $1.99 range. 2a. d 1 2 3 4 5 6 7 8 9 p(d) 0.301 0.176 0.125 0.097 0.079 0.067 0.058 0.051 0.046 2b. The values in the two charts are roughly the same. The differences arise because time values were rounded to the closest year in Question 1. If that time period were smaller, the results would be closer to those from Question 2. 3. Larry s data set does not follow Benford s Law, so it may have been fabricated.

Student Page 1 Name: Date: NUMB3RS Activity: We re Number 1! In The Running Man, Charlie discusses a law called Benford s Law, which can be used to identify falsified information by analyzing the leading digits of numbers in a data set. The leading digit of a number is the leftmost digit; for example, the leading digit of 392 is 3 and the leading digit of 7,684 is 7. Many people think that in a large set of numbers, the leading digits are fairly evenly distributed (that is, there will be roughly the same number of 1s, 2s, 3s, etc. as leading digits). However, Benford s Law describes the surprising fact that in many naturally occurring data sets, the leading digit is more likely to be 1 or 2 than any other number. To introduce Benford s Law, we will look at an example involving the price of gasoline. 1. Assume that at some point in time (defined as t = 0), gas was priced at $1.00 per gallon. For simplicity, use a yearly inflation rate of 5%. The cost per gallon of gas in t years is given by the formula c(t) = e 0.05t. a. What will the cost of gasoline per gallon be in 2 years? In 5 years? b. Determine the first year that the cost per gallon will be at least $2.00. c. Complete the chart below to find the number of years t it will take for the cost per gallon to exceed each dollar amount. Cost (in dollars per $2.00 $3.00 $4.00 $5.00 $6.00 $7.00 $8.00 $9.00 $10.00 gallon) t (in years) 14 22 d. Determine the number of years that the price per gallon will be within the ranges shown in the table. Use this data to find the total number of years that the cost per gallon will be less than $10.00. Cost Range $1.00 $1.99 $2.00 $2.99 Number of years 14 8 % of time 30% $3.00 $3.99 $4.00 $4.99 $5.00 $5.99 $6.00 $6.99 $7.00 $7.99 $8.00 $8.99 $9.00 $9.99 e. The table now shows information about gasoline costs for 47 years. Find the percent of time, rounded to the nearest whole number, that the cost of gasoline was in each cost range. f. Why is the percent of the time for the $1.00 $1.99 range more than any other range? (Hint: Think of the dollar amount of the cost increase.)

Student Page 2 2. Benford developed the following formula for the probability that the leading digit of a number is d: pd ( ) = log 1+ d a. Use the formula to complete the chart below. Round to the nearest thousandth. d 1 2 3 4 5 6 7 8 9 p(d) b. How do these probabilities compare to your answers to Question 1d? Explain why there are differences in your answers. 3. Suppose that Charlie has data sets submitted by three different people, and he suspects that one person has fabricated his data. After counting the occurrences of leading digits in each number of each person s data set, Charlie records the results in the table below. Use Benford s Law to determine which set of data is most likely to have been fabricated. Leading Digit Number of Occurrences Larry Carl Mark 1 320 390 452 2 192 230 264 3 176 164 185 4 160 120 147 5 320 103 117 6 102 86 102 7 100 75 87 8 110 66 78 9 120 60 68

The goal of this activity is to give your students a short and simple snapshot into a very extensive math topic. TI and NCTM encourage you and your students to learn more about this topic using the extensions provided below and through your own independent research. Extensions For the Student Benford s Law applies to Fibonacci numbers. Write a program or use a spreadsheet to analyze the first digit frequencies for the first n Fibonacci numbers. Just for fun, and to help convince yourself that Benford s Law is true, collect your own set of data and analyze the frequencies of the leading digits. Remember that there are restrictions to the types of data you should use the set of numbers cannot be assigned values (such as a social security or phone numbers), cannot by definition have a uniform distribution of digits (such as lottery numbers), and should not have a preset maximum or minimum. Benford s work assumes that the data can be described by a geometric progression. Imagine a graph of such data, where the x is the occurrence and y is the observed value. A set of data forms a geometric curve from the points (1, 10) to (1000, 100). Write an equation for the function and develop the first digit frequencies, which should agree with Benford s Law. Use your calculator to draw a graph of pd ( ) = log 1+ d. Explain how the shape of the graph describes what you have already learned about the probabilities of the leading digits of numbers. Additional Resources Mark J. Nigrini, PhD, of Saint Michael's College, has done much work with Benford s Law. His Web site includes links to many papers and lectures. http://www.nigrini.com/benford's_law.htm A description of the law and an introduction to the calculus and statistics behind the formula can be found at http://mathworld.wolfram.com/benfordslaw.html. Another resource for the mathematics behind the formula can be found at http://www.mathpages.com/home/kmath302/kmath302.htm. This Web site contains an article about Benford s Law that appeared in the New York Times in August 1998. http://www.math.yorku.ca/who/faculty/brettler/bc_98/benford.html