Assignment #1: Spreadsheets and Basic Data Visualization Sample Solution



Similar documents
Lab 11: Budgeting with Excel

Advanced Excel Charts : Tables : Pivots : Macros

Coins, Presidents, and Justices: Normal Distributions and z-scores

Pulling a Random Sample from a MAXQDA Dataset

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs

Unit 9 Describing Relationships in Scatter Plots and Line Graphs

SOAL-SOAL MICROSOFT EXCEL 1. The box on the chart that contains the name of each individual record is called the. A. cell B. title C. axis D.

Microsoft Excel 2010 Part 3: Advanced Excel

Data exploration with Microsoft Excel: analysing more than one variable

In This Issue: Excel Sorting with Text and Numbers

How to Make the Most of Excel Spreadsheets

Mathematics Task Arcs

AP CALCULUS AB 2007 SCORING GUIDELINES (Form B)

Microsoft Excel 2010 Training. Use Excel tables to manage information

Algebra 2 C Chapter 12 Probability and Statistics

This file contains 2 years of our interlibrary loan transactions downloaded from ILLiad. 70,000+ rows, multiple fields = an ideal file for pivot

Mathematical goals. Starting points. Materials required. Time needed

Math Games For Skills and Concepts

Microsoft Excel Basics

The Taxman Game. Robert K. Moniot September 5, 2003

Create Charts and Graphs with Excel By Lorrie Jackson

Spreadsheets Hop-around Cards

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple.

CISC - Curriculum & Instruction Steering Committee. California County Superintendents Educational Services Association

Fractions as Numbers INTENSIVE INTERVENTION. National Center on. at American Institutes for Research

Statistics Revision Sheet Question 6 of Paper 2

Descriptive Statistics

Proposal Metrics Dashboard. What Gets Measured Gets Done

Hypothesis Testing: Two Means, Paired Data, Two Proportions

Charts, Tables, and Graphs

Non-Parametric Tests (I)

Data Analysis with Microsoft Excel 2003

Intermediate PowerPoint

Information visualization examples

Excel Database Management

Creating A Grade Sheet With Microsoft Excel

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

+ = has become. has become. Maths in School. Fraction Calculations in School. by Kate Robinson

Infographics in the Classroom: Using Data Visualization to Engage in Scientific Practices

Creating an Excel XY (Scatter) Plot

Probabilistic Strategies: Solutions

Easy Calculations in Excel

Using Formulas, Functions, and Data Analysis Tools Excel 2010 Tutorial

To launch the Microsoft Excel program, locate the Microsoft Excel icon, and double click.

ECDL / ICDL Spreadsheets Syllabus Version 5.0

Excel: Analyze PowerSchool Data

By: Peter K. Mulwa MSc (UoN), PGDE (KU), BSc (KU)

Formulas, Functions and Charts

ACADEMIC TECHNOLOGY SUPPORT

Engineering Problem Solving and Excel. EGN 1006 Introduction to Engineering

RIFIS Ad Hoc Reports

Test A. Calculator not allowed. Mathematics test. First name. Last name. School. DfE no. KEY STAGE LEVELS

Commutative Property Grade One

I PUC - Computer Science. Practical s Syllabus. Contents

Figure 1: Choose your Excel output format.

Data Interpretation QUANTITATIVE APTITUDE

Coding & Data Skills for Communicators Dr. Cindy Royal Texas State University - San Marcos School of Journalism and Mass Communication

AN INTEGRATED APPROACH TO TEACHING SPREADSHEET SKILLS. Mindell Reiss Nitkin, Simmons College. Abstract

ASSIGNMENT 4 PREDICTIVE MODELING AND GAINS CHARTS

CHAPTER 11: SALES REPORTING

Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.

EXCEL PIVOT TABLE David Geffen School of Medicine, UCLA Dean s Office Oct 2002

Using games to support. Win-Win Math Games. by Marilyn Burns

Radiometric Dating Lab By Vicky Jordan

Tom wants to find two real numbers, a and b, that have a sum of 10 and have a product of 10. He makes this table.

"Excel with Excel 2013: Pivoting with Pivot Tables" by Venu Gopalakrishna Remani. October 28, 2014

Analyzing Data Using Excel

2030 Districts Performance Metrics Toolkit

Computer Training Centre University College Cork. Excel 2013 Pivot Tables

Search help. More on Office.com: images templates. Here are some basic tasks that you can do in Microsoft Excel 2010.

Excel Unit 4. Data files needed to complete these exercises will be found on the S: drive>410>student>computer Technology>Excel>Unit 4

GCSE Business Studies

Mathematics Content: Pie Charts; Area as Probability; Probabilities as Percents, Decimals & Fractions

Unit 7 The Number System: Multiplying and Dividing Integers

Sta 309 (Statistics And Probability for Engineers)

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

Data exploration with Microsoft Excel: univariate analysis

ECO 199 B GAMES OF STRATEGY Spring Term 2004 PROBLEM SET 4 B DRAFT ANSWER KEY

Data Analysis. Using Excel. Jeffrey L. Rummel. BBA Seminar. Data in Excel. Excel Calculations of Descriptive Statistics. Single Variable Graphs

25 Integers: Addition and Subtraction

ISEB Assessments Year 8 Level 2 Maths Test 1 Author: ISEB

USING EXCEL TO MANAGE REVENUE CYCLE. Submitted to: Dr. Richard Jerz. May 4, Tammy A. Pauwels

The Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175)

Fundamentals of Probability

Finding the last cell in an Excel range Using built in Excel functions to locate the last cell containing data in a range of cells.

Chapter 3 RANDOM VARIATE GENERATION

Lesson 1: Readiness for Fractions (5 Bars and Fraction Bingo) Your group may select elements from one or both lesson plans.

Using Excel in Research. Hui Bian Office for Faculty Excellence

Assessment For The California Mathematics Standards Grade 2

Dealing with Data in Excel 2010

Describing, Exploring, and Comparing Data

Creating Graphs. Learning Objective-To create graphs that show data.

Additional information >>> HERE <<< Calendars and Calculators by Spreadsheet123 - Scam or Work?

SPREADSHEETS. TIP! Whenever you get some new data, save it under a new name! Then if you mess things up, you can always go back to the original.

Simple Regression Theory II 2010 Samuel L. Baker

Mental Questions. Day What number is five cubed? 2. A circle has radius r. What is the formula for the area of the circle?

Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:

1.6 The Order of Operations

Probability Distributions

REPEATED TRIALS. The probability of winning those k chosen times and losing the other times is then p k q n k.

Transcription:

Assignment #1: Spreadsheets and Basic Data Visualization Sample Solution Part 1: Spreadsheet Data Analysis Problem 1. Football data: Find the average difference between game predictions and actual outcomes, across all games. The HomeScore and AwayScore columns are the actual scores, while the Prediction column is an estimate of how much the home team will win or lose by, i.e., it s an estimate of HomeScore minus AwayScore. For example, if the Home team beats the Away team by 15 points and the Prediction is 10, then the difference between Prediction and outcome is 5; if the Home team loses to the Away team by 8 points and the Prediction is 2, then the difference between Prediction and outcome is 6. In arithmetic terms, Outcome = (HomeScore AwayScore), and Difference = (Prediction Outcome). Give a single positive or negative number for the average difference, then explain how you manipulated the spreadsheet to arrive at the number. 0.96875 Explanation: Let column H contain the Difference using formula =(G2 (D2 F2)), expanded appropriately through all the other rows. The answer is given by =AVERAGE(H2:H). Note the minus sign in the answer. Problem 2. Football data: Which weeks have the smallest and largest point spreads on average across all games played in that week across all three years? The point spread is the difference between the higher and lower actual scores, i.e., HomeScore AwayScore, always a positive number. (Hint: You might want to use the =abs() function.) Give the smallest and largest point spread weeks, along with the average point spread across all games played in that week in all years, then explain how you manipulated the spreadsheet to arrive at the results. Week 5 has the smallest average point spread with value 9.325 Week 17 has the largest average point spread with value 13.93333 Explanation: Let column H contain the Absolute Difference using formula =ABS(D2 F2), expanded appropriately through all the other rows. Then create a pivot table (referencing the whole table, including column H) where the Rows = Week, Values = Column H AVERAGE. Then, sort by AVERAGE of column H. Problem 3. Football data: Find the teams that scored the least and the most total points. Make sure to include both home and away games when computing total points. Give the two teams and their total points, then explain how you manipulated the spreadsheet to arrive at the answer. Team with least points: Cleveland, 378 points Team with most points: Minnesota, 1352 points

Explanation: In column H use formula =SUMIF(C$2:C,C2,D$2:D) to obtain the sum of HomeScores grouped by matching the Home column with itself. In column I use formula =SUMIF(E$2:E,C2,F$2:F) to obtain the sum of AwayScores grouped by matching the Home column with Away column. In column J use formula =(H2+I2). Sort by column J to find the Home team with the highest and lowest values in column J. Problem 4. Schoolkids data: What is the overall ranking of Grades, Sports, Looks, and Money in perceived popularity impact across all of the students? Note that every row represents one student s ranking from 1 (most important) to 4 (least important) on how important the four factors (Grades, Sports, Looks and Money) are in impacting popularity. State the four factors in order of overall importance (the factor with the highest importance should be first), and explain how you manipulated the spreadsheet to arrive at the answer. In descending order of importance: Sports, Looks, Grades, Money Explanation: Let =AVERAGE(G2:G), =AVERAGE(H2:H), =AVERAGE(I2:I), =AVERAGE(J2:J) be the average rating of Grades, Sports, Looks, and Money. Simply order the four columns by the average rating from lowest to highest. Note that AVERAGE may be replaced with SUM to use the sum of ratings in this problem, since the population size is the same for all Grades, Sports, Looks and Money. Problem 5. Schoolkids data: Which area type puts the least emphasis overall on Looks: Rural, Suburban, or Urban? Give your answer and explain how you manipulated the spreadsheet to arrive at your answer. Suburban Explanation: Create a pivot table where Rows = Type, Values = Looks AVERAGE. Pick the type with the highest average. Note that you may not use SUM for this problem, since the population size differs for each type. For example, say there are 10 students in Urban schools, all of whom gave Looks ratings of 1, and there s only 1 student in Suburban schools, who gave Looks a rating of 4. Using SUM would give Urban a total rating of 10 (less important) and Suburban a total rating of 4 (more important), while obviously it s the other way around. Problem 6. Schoolkids data: Find the school with the highest girl to boy ratio. Provide the school name and the ratio, and explain how you manipulated the spreadsheet to arrive at your answer. Elm, with ratio 3.2 Explanation: Create a pivot table where Rows = School, Columns = Gender, Values = Gender COUNTA. In the pivot table sheet, add a column D=(C2/B2) and manually pick the school with the highest girl to boy ratio. Problem 7. Schoolkids data: Considering only boys who are older than 10, what is their most common goal? Give the goal and the number of boys older than 10 who have that goal, then explain how you manipulated the spreadsheet to arrive at the answer.

Grades, with 60 students. Explanation: Use the filter view to filter Gender=Boy and Age>10. Copy the resulting table to a new sheet, add Sports, Grades, Popular to three rows in Column K, then in Column L use formula =COUNTIF(F$2:F, K2). Alternatively, in column K use formula =UNIQUE(F2:F) expanded to other rows, and in column L use formula =COUNTIFS(A$2:A, "boy", C$2:C, ">10", F$2:F, L2). Part 2: Data Visualization Problem 8. Football bar graph: For Arizona, Atlanta, Baltimore, Buffalo, and Carolina, create a bar graph showing their average score as a home team and average score as an away team. Problem 9. Schoolkids bar graph: Create a stacked bar graph that shows, for each of the three school Types (Rural, Suburban, Urban), the total number of students divided by how many have the three goals (Sports, Popular, Grades). Your graph should have three bars, with each bar divided into three colors.

Problem 10. Schoolkids pie chart: Create a pie chart showing the relative percentage of students attending schools that are Rural, Suburban, and Urban. Problem 11. Schoolkids pie chart: Create a pie chart that shows the relative percentage of students who rank as most important Grades, Sports, Looks, and Money.

Problem 12. Football pie chart: Every game is either a home win (HomeScore > AwayScore), an away win (HomeScore < AwayScore), or a tie (HomeScore = AwayScore). Create a pie chart that shows the relative percentage of home wins, away wins, and ties across all games played. (Hint: you may find the =if() function helpful.) Problem 13. Football scatterplot: Create a scatterplot where the x axis is the Prediction and the y axis is the actual outcome (HomeScore minus AwayScore). There should be one dot in the scatterplot for every game played. What do you conclude from the scatterplot? Explanation: Predictions are somewhat better than random, since we can see that very generally as the outcome goes up so does the prediction. But there s a wide range, i.e., there are still a lot of bad predictions!