Big Data and Modernizing Federal Statistics: Update
|
|
|
- Albert Carpenter
- 10 years ago
- Views:
Transcription
1 Big Data and Modernizing Federal Statistics: Update Bill Bostic Associate Director Economic Programs Directorate Ron Jarmin Ph.D. Assistant Director, Research and Methodology Directorate August
2 Big Data Trends and Challenges Trends Increasingly data-driven economy Individuals are increasingly mobile Technology changes data uses Stakeholder expectations are changing Agency budgets and staffing remain flat/reduced. The next generation of official statistics Utilize broad sources of information Increase granularity, detail, and timeliness Reduce cost & burden Maintain confidentiality and security Measure impact of a global economy Multi-disciplinary challenges : Computation, statistics, informatics, social science, policy 2
3 MIT Workshop Series Objectives Convene experts in computer science, social science, statistics, informatics and business Explore challenges to building the next generation of official statistics Identify new opportunities for using big data to augment official statistics core computational and methodological challenges ongoing research that should inform the Big Data research program 3
4 Workshop Organizers Series Conveners Census: Cavan Capps, Ron Prevost MIT: Micah Altman Workshop conveners First Workshop: Ron Duych (DOT), Joy Sharp (DOT) Second Workshop Amy O Hara, Laura McKenna, Robin Bachman Third Workshop: Peter Miller, Benjamin Reist, Michael Thieme Workshop Sponsors William Bostic, Ron Jarmin 4
5 Workshop Coverage Approach Examine a set of broad topical questions through a specific case Link broad issues to specific approach case Link specific challenge of case to broad challenge Acquisition Topics Acquisition Data Sources Using New forms of Information for Official Economic Statistics [August 3-4] Access Big Data Challenges Retention Access -- Privacy Challenges Location Confidentiality and Official Surveys [October 5-6] Analysis Analysis Inference Challenges Transparency and Inference [December 7-8] 5
6 Preliminary Observations from First Workshop Topic: Sources of Economic Big Data Use Case: Commodity Flow Survey Observations: Different classes of decisions require different sources of data: Designed survey data contributes baseline data for decisions about infrastructure and strategic planning Transaction based big data could contribute frequency and granularity of estimates In big data, data sources are stakeholders Businesses need to react quickly and predict the future and need frequently updated detailed data Critical to provide a value proposition to business Critical to develop a trust relationship Some Potential sources ERP and DRP operations data EDI Mobile Phone Traffic Data 6
7 Privacy and Big Data (Preview) Topic: Location Privacy Use Cases: LEHD Google Now Sample questions: How can technical innovations such as differential privacy be adapted to protect public micro-data and statistical estimates? How can we develop a modern privacy risk index for official data releases? 7
8 Expected Outcomes Workshop Slides [August, October, December] Workshop Executive Summaries [September, October, January] Big Data White Paper [February] projects.informatics.mit.edu/bigdataworkshops 8
9 Retail Big Data Project Goal To explore the use of Big Data to supplement existing monthly/annual retail surveys to fill in data gaps and increase relevance. The primary focus is to try to produce subnational geographic area estimates more frequently than once every five years (from the Economic Census). 9
10 Areas of Focus Point of Sale (POS) / Scanner Data (from companies like NPD, Nielsen, IRI) Credit Card Transaction Data (from credit card companies, 3 rd party payment processors, banks, etc.) Direct Feeds from Companies 10
11 Background on NPD Data 36 Months - January 2012 through December 2014 Aggregated Sales by Geography, Product, Type of Merchant (Not Consistent) Geography (varied by dataset) Census regions Metro areas Product Level Aggregation by SKU Type of merchant (jewelry and watches only) Names of Companies included in aggregated totals 11
12 Evaluation Steps Analyzed NPD data to identify potential errors prior to use Errors in geographic coding Missing data Compared NPD data with Census Bureau estimates to obtain a rough assessment of coverage and to determine if the NPD data could serve as a potentially informative predictor Aggregate levels (monthly or yearly totals) Period-to-period changes (e.g., current-to-prior month, current-to-prior quarter) Census Bureau Data Used Monthly Retail Trade Survey Annual Retail Trade Survey 2012 Economic Census Business Register 12
13 Major Findings 3 Datasets didn t help us achieve our goal Discrepancies Between NPD Data and Census Official Data Geographic area differences Less than full coverage of retail stores Differences in industry/product classification Under-representation of small firms Jewelry dataset highly correlated with national estimates from MRTS 13
14 Next Steps with POS Data Explore the jewelry time series from NPD to better understand using big data in production models Continue discussions with scanner data companies on obtaining more useful data Evaluate the use of big data sources across all parts of the survey life cycle 14
15 Credit Card Transaction Data Credit card transaction data (MasterCard, 3 rd party processors, banks, etc.) Joint effort with BEA Test file obtained from MasterCard Monthly Data from January 2009 through March 2015 Selected retail and service industry breakouts for 3 states Research to be completed by January 2016 Discussions with other companies 15
16 Data Feeds from Companies Data feeds directly from companies Started meeting with companies Received favorable feedback Will proceed with working out details Will test with a few companies in 2017 Economic Census 16
17 Questions for CSAC Committee What is the reaction to name we have selected for new R&M Center? Is Census Bureau still headed in the right direction for its Big Data research? What could derail the Census Bureau's Big Data effort going forward? 17
Big Data and Modernizing Federal Statistics: Update
Big Data and Modernizing Federal Statistics: Update Bill Bostic Associate Director Economic Programs Directorate Ron Jarmin Ph.D. Assistant Director, Research and Methodology Directorate April 16, 2015
Big Data Landscape, 2015-2020
Big Data Landscape, 2015-2020 Brian C. Moyer, Director Global Conference on Big Data for Official Statistics October 20, 2015 Where are we now? Official statistics use a variety of big data sources Current
Principles and Guidelines on Confidentiality Aspects of Data Integration Undertaken for Statistical or Related Research Purposes
Principles and Guidelines on Confidentiality Aspects of Data Integration Undertaken for Statistical or Related Research Purposes These Principles and Guidelines were endorsed by the Conference of European
QUARTERLY RETAIL E-COMMERCE SALES 1 ST QUARTER 2016
FOR IMMEDIATE RELEASE TUESDAY, MAY 17, 2016, AT 10:00 A.M. EDT Rebecca DeNale (Data Collection and Data Products): (301) 763-2713 Deanna Weidenhamer (Methodology): (301) 763-7186 CB16-80 QUARTERLY RETAIL
Department of the Interior Open Data FY14 Plan
Department of the Interior Open Data FY14 Plan November 30, 2013 Contents Overview Introduction DOI Open Data Objectives Transparency Governance and Culture Change Open Data Plan FY14 Quarterly Schedule
How To Understand The Data Collection Of An Electricity Supplier Survey In Ireland
COUNTRY PRACTICE IN ENERGY STATISTICS Topic/Statistics: Electricity Consumption Institution/Organization: Sustainable Energy Authority of Ireland (SEAI) Country: Ireland Date: October 2012 CONTENTS Abstract...
Navigating the Future at the Census Bureau
Navigating the Future at the Census Bureau Dr. Nancy Potok Deputy Director and Chief Operating Officer Council for Community and Economic Research (C2ER) Annual Conference June 6, 2014 Navigating the Future
U.S. Department of the Treasury. Treasury IT Performance Measures Guide
U.S. Department of the Treasury Treasury IT Performance Measures Guide Office of the Chief Information Officer (OCIO) Enterprise Architecture Program June 2007 Revision History June 13, 2007 (Version 1.1)
Big Data: Tackling New Projects and Exploring New Sources
Big Data: Tackling New Projects and Exploring New Sources Dennis Fixler BEA Advisory Committee Meeting 13 November 2015 Background Using Big Data not a big change for BEA Definition of big data: non-survey
THE U.S. ECONOMIC CENSUS MOVING FORWARD
THE U.S. ECONOMIC CENSUS MOVING FORWARD Jack B. Moody International Workshop on Economic Census Business Registers and Integrated Economic Statistics Aguascalientes, Mexico 29 September 1 October, 2015
Youth Crime, Restorative Justice, and GIS - Lee County, Florida Authors: Richard Faris, John Bizelli
Youth Crime, Restorative Justice, and GIS - Lee County, Florida Authors: Richard Faris, John Bizelli Abstract The Lee County Department of Human Services (DHS) is using GIS to analyze underlying geographic
ECLAC Economic Commission for Latin America and the Caribbean
1 FOR PARTICIPANTS ONLY REFERENCE DOCUMENT DDR/2 22 June 2012 ENGLISH ORIGINAL: SPANISH ECLAC Economic Commission for Latin America and the Caribbean Eleventh meeting of the Executive Committee of the
Film Scanner The term Film Scanner can refer to a dedicated slide and negative film scanner or to a capture type scanner.
Film Scanner The term Film Scanner can refer to a dedicated slide and negative film scanner or to a capture type scanner. Negative Scanner The term Negative Scanner can refer to a dedicated slide and negative
Big Data: Uses and Limitations
Big Data: Uses and Limitations Nathaniel Schenker Associate Director for Research and Methodology National Center for Health Statistics Centers for Disease Control and Prevention Presentation for discussion
Report of the 2015 Big Data Survey. Prepared by United Nations Statistics Division
Statistical Commission Forty-seventh session 8 11 March 2016 Item 3(c) of the provisional agenda Big Data for official statistics Background document Available in English only Report of the 2015 Big Data
Documentation of statistics for International Trade in Service 2016 Quarter 1
Documentation of statistics for International Trade in Service 2016 Quarter 1 1 / 11 1 Introduction Foreign trade in services describes the trade in services (imports and exports) with other countries.
Developing the SMEs Innovative Capacity Using a Big Data Approach
Economy Informatics vol. 14, no. 1/2014 55 Developing the SMEs Innovative Capacity Using a Big Data Approach Alexandra Elena RUSĂNEANU, Victor LAVRIC The Bucharest University of Economic Studies, Romania
TxDOT Project 0-6697-CTR: Integration of Data Sources to Optimize Freight Transportation in Texas
0-6697-CTR-P2 COMPANION POWERPOINT PRESENTATION TO UNITY DATABASE TxDOT Project 0-6697-CTR: Integration of Data Sources to Optimize Freight Transportation in Texas DECEMBER 2013; PUBLISHED SEPTEMBER 2014
Portfolio Company Performance Analysis and Reporting Automation
Portfolio Company Performance Analysis and Reporting Automation Providing transparent and accurate performance data to investors, partners and auditors is becoming increasingly important, if not critical
ByteMobile Insight. Subscriber-Centric Analytics for Mobile Operators
Subscriber-Centric Analytics for Mobile Operators ByteMobile Insight is a subscriber-centric analytics platform that provides mobile network operators with a comprehensive understanding of mobile data
Moody s Analytics Solutions for the Asset Manager
ASSET MANAGER Moody s Analytics Solutions for the Asset Manager Moody s Analytics Solutions for the Asset Manager COVERING YOUR ENTIRE WORKFLOW Moody s is the leader in analyzing and monitoring credit
MNsure Compliance Program Strategic Plan. December 17, 2014
MNsure Compliance Program Strategic Plan December 17, 2014 Page 2 of 12 TABLE OF CONTENTS Introduction... 3 Compliance Program Mission... 3 Compliance Department Mission... 3 Regulatory Profile... 4 Key
BIG Data and OFFICIAL Statistics
BIG Data and OFFICIAL Statistics International Year of Statistics November 13, 2013 Michael W. Horrigan Associate Commissioner Office of Prices and Living Conditions Big Data and Official Statistics What
Big Data in Retail Big Data Analytics Central to Customer Acquisition and Retention Strategies in Retail
Big Data in Retail Big Data Analytics Central to Customer Acquisition and Retention Strategies in Retail MAA7-67 September 2014 Contents Section Slide Numbers Executive Summary 4 Methodology 6 Fundamentals
Small Steps Towards Big Data Ric Clarke, Australian Bureau of Statistics
Small Steps Towards Big Data Ric Clarke, Australian Bureau of Statistics ECB Workshop on Using Big Data, 7-8 April 2014 1 Agenda Review some basic Big Data concepts Describe the Big Data opportunity for
Big Data, Official Statistics and Social Science Research: Emerging Data Challenges
Big Data, Official Statistics and Social Science Research: Emerging Data Challenges Professor Paul Cheung Director, United Nations Statistics Division Building the Global Information System Elements of
The Economics of Digitization: An Agenda for NSF. By Shane Greenstein, Josh Lerner, and Scott Stern
The Economics of Digitization: An Agenda for NSF By Shane Greenstein, Josh Lerner, and Scott Stern This work is licensed under the Creative Commons Attribution-NoDerivs 3.0 Unported License. To view a
Household health care spending: comparing the Consumer Expenditure Survey and the National Health Expenditure Accounts Ann C.
Household health care spending: comparing the Consumer Expenditure Survey and the National Health Expenditure Accounts Ann C. Foster Health care spending data produced by the Federal Government include
Table of Contents. FY 2013-2018 BLS Strategic Plan... 2. Our Strategies and Goals... 6. Strategy Framework... 8. Strategy 1 (Products)...
Table of Contents FY 2013-2018 BLS Strategic Plan... 2 Our Strategies and Goals... 6 Strategy Framework... 8 Strategy 1 (Products)... 8 Strategy 2 (Product and Process Improvement)...10 Strategy 3 (Customers)...12
Producing Regional Profiles Based on Administrative and Statistical Data in New Zealand
Proceedings 59th ISI World Statistics Congress, 25-30 August 2013, Hong Kong (Session STS021) p.1489 Producing Regional Profiles Based on Administrative and Statistical Data in New Zealand Michael Slyuzberg
Impact of Connected Automated Vehicles on Traffic Management Centers (TMCs)
Impact of Connected Automated Vehicles on Traffic Management Centers (TMCs) Automated Vehicles Symposium 2015 Breakout Session Impact of Connected and Automated Vehicles on Traffic Management Systems and
Credit Cards and Payment Efficiency 1
Credit Cards and Payment Efficiency 1 Stan Sienkiewicz August 2001 Summary: On May 22, 2001, the Payment Cards Center of the Federal Reserve Bank of Philadelphia sponsored a workshop on the role of interchange
Five Questions to Ask Your Mobile Ad Platform Provider. Whitepaper
June 2014 Whitepaper Five Questions to Ask Your Mobile Ad Platform Provider Use this easy, fool-proof method to evaluate whether your mobile ad platform provider s targeting and measurement are capable
Consultation. Author: Briony Krikorian-Slade
================================================================ HMRC: Tackling the hidden economy: Extension of datagathering powers Consultation Response from The UK Cards Association 13 October 2015
A Framework for Prioritizing Economic Statistics Programs
A Framework for Prioritizing Economic Statistics Programs Thomas L. Mesenbourg, Ronald H. Lee Associate Director for Economic Programs Senior Financial Advisor Office of the Associate Director for Economic
Traffic Management Centers in a Connected Vehicle Environment
Traffic Management Centers in a Connected Vehicle Environment Project Plan Conducted by Kimley-Horn and Associates, Inc. (PI: Lisa Burgess), Noblis (PI: Mike McGurrin), DGD Enterprises (PI: Dorothy Drinkwater),
Integrated Marketing Community
Integrated Marketing Community Omnichannel Strategies in the Financial Sector Margot Vaughan, MasterCard We will be starting at the top of the hour. You will not hear anything until we start. Integrated
Glassdoor Survey: How to Recruit Healthcare Professionals. A Strategic Guide for Talent Acquisition Professionals
2014 Glassdoor Survey: How to Recruit Healthcare Professionals A Strategic Guide for Talent Acquisition Professionals Survey Overview Recruiting healthcare professionals in today s highly competitive recruiting
POLAR IT SERVICES. Business Intelligence Project Methodology
POLAR IT SERVICES Business Intelligence Project Methodology Table of Contents 1. Overview... 2 2. Visualize... 3 3. Planning and Architecture... 4 3.1 Define Requirements... 4 3.1.1 Define Attributes...
Increasing Demand Insight and Forecast Accuracy with Demand Sensing and Shaping. Ganesh Wadawadigi, Ph.D. VP, Supply Chain Solutions, SAP
Increasing Demand Insight and Forecast Accuracy with Demand Sensing and Shaping Ganesh Wadawadigi, Ph.D. VP, Supply Chain Solutions, SAP Legal disclaimer The information in this presentation is confidential
How To Understand The Financial Crisis
White Paper September 2009 Top ten reports every insurance executive needs Executive insight with IBM Cognos software 2 Contents 5 The top 10 reports 5 The Production, Underwriting and Claims Overview
**MT op» ^chv. Adapted Privacy Impact Assessment. Google Analytics. March 19, 2012. Contact
**MT op» ^chv March 19, 2012 Contact Departmental Privacy Office U.S. Department of the Interior 1849 C Street NW Mail Stop MIB-7456 Washington, DC 20240 202-208-1605 [email protected] ^ March 19,2012
Quality and Methodology Information
29 July 2015 Issued by Office for National Statistics, Government Buildings, Cardiff Road, Newport, NP10 8XG Media Office 08456 041858 Business Area 01633 455923 Lead Statistician: Michael Hardie Quality
Effectively using SOC 1, SOC 2, and SOC 3 reports for increased assurance over outsourced operations. kpmg.com
Effectively using SOC 1, SOC 2, and SOC 3 reports for increased assurance over outsourced operations kpmg.com b Section or Brochure name Effectively using SOC 1, SOC 2, and SOC 3 reports for increased
Establishing and Monitoring Statistical Standards in USDA-NASS
Establishing and Monitoring Statistical Standards in USDA-NASS William L. Arends National Agricultural Statistics Service, U.S. Department of Agriculture, Room 5041A, 1400 Independence Ave. S.W., Washington
DETERMINANTS OF A HEALTHCARE INFORMATION SYSTEM
DETERMINANTS OF A HEALTHCARE INFORMATION SYSTEM Assistant Professor PhD Alexandar Kostadinovski, Faculty of Economics, "Goce Delchev University - Stip, [email protected]; Assistant Professor
Uses of Big Data for Official Statistics: Privacy, Incentives, Statistical Challenges, and Other Issues. Abstract
Uses of Big Data for Official Statistics: Privacy, Incentives, Statistical Challenges, and Other Issues Discussion Paper by Steve Landefeld Senior Advisor to the United Nations Statistics Division International
Delivering new insights and value to consumer products companies through big data
IBM Software White Paper Consumer Products Delivering new insights and value to consumer products companies through big data 2 Delivering new insights and value to consumer products companies through big
3 rd Party Vendor Risk Management
3 rd Party Vendor Risk Management Session 402 Tuesday, June 9, 2015 (11 to 12pm) Session Objectives The need for enhanced reporting on vendor risk management Current outsourcing environment Key risks faced
2. Issues using administrative data for statistical purposes
United Nations Statistical Institute for Asia and the Pacific Seventh Management Seminar for the Heads of National Statistical Offices in Asia and the Pacific, 13-15 October, Shanghai, China New Zealand
The Leadership Mystery Defining Leadership Success through Competency Modeling and Workforce Analytics
viapeople Insight - Whitepaper The Leadership Mystery Defining Leadership Success through Competency Modeling and Workforce Analytics Karen N. Caruso, Ph.D. Amanda Seidler, Ph.D. The Leadership Mystery:
At NCCI, producing and delivering
AN OVERVIEW OF DATA QUALITY by Michael B. Spears Chief Officer NCCI Holdings, Inc. ENSURING DATA QUALITY ACROSS THE WORKERS COMPENSATION INDUSTRY REPRESENTS A TREMENDOUS INVESTMENT OF TIME, RESOURCES,
Databases & Data Infrastructure. Kerstin Lehnert
+ Databases & Data Infrastructure Kerstin Lehnert + Access to Data is Needed 2 to allow verification of research results to allow re-use of data + The road to reuse is perilous (1) 3 Accessibility Discovery,
A Pragmatic Guide to Big Data & Meaningful Privacy. kpmg.be
A Pragmatic Guide to Big Data & Meaningful Privacy kpmg.be From predicting criminal behavior to medical breakthroughs, from location-based restaurant recommendations to customer churn predictions, the
Agenda. Strategic Succession Planning: Building Your Bench Strength. The Numbers say. SP Defined* The Art of Choosing Positions
Agenda Strategic Succession Planning: Building Your Bench Strength The Business Case for Succession Planning The Wedding: Succession Planning meets Leadership Development Top 10 Ideas for Building Your
MOHAWK VALLEY POPULATION HEALTH IMPROVEMENT PROGRAM. Data Plan. As of 9/3/2015
MOHAWK VALLEY POPULATION HEALTH IMPROVEMENT PROGRAM Data Plan As of 9/3/2015 Bassett Healthcare Network, Bassett Research Institute, 1 Atwell Rd, Cooperstown, NY 13326 Overview: In order to meet Mohawk
MASTERCARD ADVISORS (MCA) PUBLIC SECTOR BIG DATA ANALYTICS & LESSONS LEARNED FROM THE PRIVATE SECTOR
MASTERCARD ADVISORS (MCA) PUBLIC SECTOR BIG DATA ANALYTICS & LESSONS LEARNED FROM THE PRIVATE SECTOR AUGUST 24, 2015 www.mastercardadvisors.com/solutions Table Contents 2 Big Data Overview: What does it
