Making Good Use of Data at Hand: Government Data Projects. Mark C. Cooke, Ph.D. Tax Management Associates, Inc.



Similar documents
Blazent IT Data Intelligence Technology:

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat

Digital Collections as Big Data. Leslie Johnston, Library of Congress Digital Preservation 2012

How To Handle Big Data With A Data Scientist

Data Mining & Data Stream Mining Open Source Tools

Predictive Analytics for Procurement Lead Time Forecasting at Lockheed Martin Space Systems

Government Data Analytics Center

Recovering Business Rules from Legacy Source Code for System Modernization

Cisco Data Preparation

CRITICAL MANUFACTURING

Whitepaper. 5 Dos and Don ts of Embedded Analytics.

Simple Service Modeling FAQs TrueSight Operations Management (BPPM) versions 9.5 and /31/2014

An Introduction to KeyLines and Network Visualization

BEYOND BI: Big Data Analytic Use Cases

PROPOSAL To Develop an Enterprise Scale Disease Modeling Web Portal For Ascel Bio Updated March 2015

Manifest for Big Data Pig, Hive & Jaql

What the Hell is Big Data?

Bruhati Technologies. About us. ISO 9001:2008 certified. Technology fit for Business

WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat

Increase Revenue THE JOURNEY TO BIG DATA. Gary Evans. CTO EMC Ireland. Twitter.com/Gary3vans. Copyright 2013 EMC Corporation. All rights reserved.

Professional Organization Checklist for the Computer Science Curriculum Updates. Association of Computing Machinery Computing Curricula 2008

NEXT GENERATION ARCHIVE MIGRATION TOOLS

Big Data 101: Harvest Real Value & Avoid Hollow Hype

CONNECTING DATA WITH BUSINESS

Blueprints for Big Data Success

Government Technology Trends to Watch in 2014: Big Data

The Analytics Value Chain Key to Delivering Value in IoT

Three Open Blueprints For Big Data Success

Hadoop Submitted in partial fulfillment of the requirement for the award of degree of Bachelor of Technology in Computer Science

Scalability and Performance Report - Analyzer 2007

Microsoft Enterprise Search for IT Professionals Course 10802A; 3 Days, Instructor-led

The Lab and The Factory

Pennsylvania Geospatial Data Sharing Standards (PGDSS) V 2.5

Predictive Analytics

Introduction Predictive Analytics Tools: Weka

Unified Data Integration Across Big Data Platforms

White Paper. Unified Data Integration Across Big Data Platforms

Navigating Big Data business analytics

Data and Machine Architecture for the Data Science Lab Workflow Development, Testing, and Production for Model Training, Evaluation, and Deployment

Continuous Integration. Motio Portfolio Existing IBM Cognos Solutions Flexibility, Agility & Control In Deployments Free Stuff!

Setting the Standard for Safe City Projects in the United States

Paxata Security Overview

Overview. The Knowledge Refinery Provides Multiple Benefits:

Unequalled Physical Security Information Management Software

ACTIVITY & LOCATION BASED ANALYTICS APPLICATIONS

DATAMEER WHITE PAPER. Beyond BI. Big Data Analytic Use Cases

KNIME Enterprise server usage and global deployment at NIBR

Mike Maxey. Senior Director Product Marketing Greenplum A Division of EMC. Copyright 2011 EMC Corporation. All rights reserved.

Real-Time Big Data Analytics + Internet of Things (IoT) = Value Creation

Vertafore Analytics: Turning Raw Data Into Revenue A CASE STUDY

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes

Implementing a New Technology: FPS Successes, Challenges, and Best Practices

Copyright 2013 Splunk Inc. Introducing Splunk 6

A Visualization is Worth a Thousand Tables: How IBM Business Analytics Lets Users See Big Data

Extend your analytic capabilities with SAP Predictive Analysis

Worldwide Advanced and Predictive Analytics Software Market Shares, 2014: The Rise of the Long Tail

HSD. W Business Analytics (M.Sc.) IT in Business Analytics. IT Applications in Business Analytics SS2016 / 01 Introduction Thomas Zeutschler

Tools for Managing and Measuring the Value of Big Data Projects

ABSTRACT INTRODUCTION OVERVIEW OF POSTGRESQL AND POSTGIS SESUG Paper RI-14

Program Evaluation Oversight Committee

Big Data for Investment Research Management

How To Learn To Use Big Data

BigMemory and Hadoop: Powering the Real-time Intelligent Enterprise

Laurence Liew General Manager, APAC. Economics Is Driving Big Data Analytics to the Cloud

Information Architecture

Unified Batch & Stream Processing Platform

BIG DATA What it is and how to use?

Our Raison d'être. Identify major choice decision points. Leverage Analytical Tools and Techniques to solve problems hindering these decision points

SAS Fraud Framework for Banking

Integrating a Big Data Platform into Government:

MARKETING ANALYTICS AS A SERVICE

Load DynamiX Storage Performance Validation: Fundamental to your Change Management Process

Parallel Analysis and Visualization on Cray Compute Node Linux

CLASS SPECIFICATION. Business Intelligence Supervisor

Hexaware E-book on Predictive Analytics

BIG DATA APPLIANCES. July 23, TDWI. R Sathyanarayana. Enterprise Information Management & Analytics Practice EMC Consulting

SAP HANA Cloud Platform. Technical Overview Uwe Heinz

BIG DATA & ANALYTICS. Transforming the business and driving revenue through big data and analytics

Data Mining and Analytics in Realizeit

DATA VISUALIZATION: CONVERTING INFORMATION TO DECISIONS DAVID FRONING, PRINCIPAL PRODUCT MANAGER

Making critical connections: predictive analytics in government

BCIT COMPUTING offers courses and credentials in SIX related information technology sectors

Predictive analytics for the business analyst: your first steps with SAP InfiniteInsight

Lecture 32 Big Data. 1. Big Data problem 2. Why the excitement about big data 3. What is MapReduce 4. What is Hadoop 5. Get started with Hadoop

SAS Information Delivery Portal: Organize your Organization's Reporting

ECON 424/CFRM 462 Introduction to Computational Finance and Financial Econometrics

Kai Wähner. The Next-Generation BPM for a Big Data World: Intelligent Business Process Management Suites (ibpms)

Data Science Certificate Program

Transcription:

Making Good Use of Data at Hand: Government Data Projects Mark C. Cooke, Ph.D. Tax

Tax Management Associates Privately held company serving state and local government Markets across eighteen (18) states and more than 500 clients 34 years in business Staff of ~135 Four national offices HQ in Charlotte, NC Making Good Use of Data Mark C Cooke - Tax

Tax Management Associates Services include: Staff of credentialed specialized auditors for various local tax types Technology solutions, including SaaS applications Taxscribe (online tax listing service), CAVS (modeling application) Data-based solutions for business intelligence, fraud detection, and revenue optimization Making Good Use of Data Mark C Cooke - Tax

Data Solutions Problem: Govt s legacy data collection, storage, and management systems Problem: Data is across departments and agencies Problem: Governments have no direct access to data Making Good Use of Data Mark C Cooke - Tax

Data Solutions Revenue and Tax Fire and Police Governance 2011 2012 Force Budget 911 Voter Services Revenues Crime Econ. Dev. Making Good Use of Data Mark C Cooke - Tax

Open Data Concept Open Data 1. Govt s collect enormous quantities of useful data 2. Data made available to a wide audience will leverage insights from industry and academia 3. Open Data and Business Intelligence can be consumed internally by Govts as well Making Good Use of Data Mark C Cooke - Tax

Open Data Making Good Use of Data Mark C Cooke - Tax

Data Scientist

Doing Data the Old Way Data is locked inside systems :-( Software systems are designed to wrap a Graphical User Interface (GUI) around data The GUI functionality, historically, has to be programmed to produce reports, views, and analysis The GUI is driven by the sole purpose of the software. But the data has many purposes

Open Data Way Forward Using data in ways for which it was never intended Connecting data across multiple platforms Using data for novel insight Better governance through using data at hand and rapid development of analytics

Data Science Real Property Personal Property Permits Text Based Data Sales/Use Billing & Collections Police & Fire Expenditures Insight (Business Intelligence) + Answers

Data Science What is the output? Business Intelligence (BI) or actionable information that drives business decisions through insight Actionable insights from existing data Visualizations - making it consumable to a nonspecialist audience According to Friedman (2008) the "main goal of data visualization is to communicate information clearly and effectively through graphical means. http://en.wikipedia.org/wiki/data_visualization

The advantages of Knime: Read data in from multiple resources in real time, re-executing analyses on demand Simple GUI-based analysis environment for nonprogramming oriented users Resulting data can be written out to tables or to visualizations, depending on the context Web-portal allows non-technical end-users to consume output

The advantages of Knime: Rapid development environment Very powerful processing, handling large datasets on commodity hardware Allows for 100% data samples up to millions of elements row-wise Nodes provide access to complex algorithms for statistical or machine learning approaches

Knime Integrates with R R integration is key to expanding the data analysis and visualization capabilities of Knime R supports data ingestion of complex files (including ESRI) R supports complex data manipulation and statistical analysis R supports a wide variety of highly customizable visualizations So, what is R, exactly?

R Project for Statistical Computing www.r-project.org R is an open source scripting language which can be run inside Knime, but also within a command line environment independently Several GUI interfaces for R exist such as R Studio, a group that provides software for using R as well as training and extension packages (www.rstudio.com) Community contributions make up the bulk of R packages, which now total more than 4,700

Applications Case examples for working with county data: Combine real property data with 911 services so that responders can know the size, shape, and details of a property Identify holes in the tax base for entities which may be reporting one tax type but not others 100% sample of revenue impact from policy changes Productivity analyses on units produced over time Revenue resources and time series of annual revenue cycles across the entire revenue base, compared year on year Crime patterns for research and predictive policing

Demonstrations Data: Florida 67 Counties > 1.24 million personal property accounts

Demonstrations Data: database state change table (events table) > 90k events Output produced 200-300% performance improvement

Questions? Thank you for your time and attention. I am always happy to discuss data, so please feel free to reach out to any of the contact information below. Mark C Cooke Mark.Cooke@tma1.com 704.847.1234 (office) 704.953.6349 (cell) www.linkedin.com/in/markccooke