Big data and corrections: what s the big issue? Corrections Technology Association June 4, 2013



Similar documents
Achieving Business Value through Big Data Analytics Philip Russom

Big Data and Your Data Warehouse Philip Russom

Big Data and Analytics in Government

Evolving Data Warehouse Architectures

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

Big Data and Trusted Information

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

The Enterprise Data Hub and The Modern Information Architecture

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Data Refinery with Big Data Aspects

High-Performance Analytics

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap

The Next Wave of Data Management. Is Big Data The New Normal?

Testing Big data is one of the biggest

Big Data Are You Ready? Jorge Plascencia Solution Architect Manager

Big Data Analytics: Today's Gold Rush November 20, 2013

HP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica

The Future of Data Management

This Symposium brought to you by

HDP Hadoop From concept to deployment.

An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise

Luncheon Webinar Series May 13, 2013

What do Big Data & HAVEn mean? Robert Lejnert HP Autonomy

Big Data and Advanced Analytics Applications and Capabilities Steven Hagan, Vice President, Server Technologies

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

Introducing Oracle Exalytics In-Memory Machine

BIG DATA: FIVE TACTICS TO MODERNIZE YOUR DATA WAREHOUSE

Big Data Analytics. Copyright 2011 EMC Corporation. All rights reserved.

Il mondo dei DB Cambia : Tecnologie e opportunita`

Integrating Hadoop. Into Business Intelligence & Data Warehousing. Philip Russom TDWI Research Director for Data Management, April

Architecting for the Internet of Things & Big Data

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Big Data for Investment Research Management

BEYOND BI: Big Data Analytic Use Cases

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

How the oil and gas industry can gain value from Big Data?

Oracle Big Data Strategy Simplified Infrastrcuture

Getting Started Practical Input For Your Roadmap

UNIFY YOUR (BIG) DATA

Agile BI With SQL Server 2012

Big Data Use Cases Update

Traditional BI vs. Business Data Lake A comparison

Are You Ready for Big Data?

The 3 questions to ask yourself about BIG DATA

IBM Data Warehousing and Analytics Portfolio Summary

Transforming the Telecoms Business using Big Data and Analytics

Session 1: IT Infrastructure Security Vertica / Hadoop Integration and Analytic Capabilities for Federal Big Data Challenges

BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

How To Use Big Data For Business

Big Data Analytics- Innovations at the Edge

Are You Ready for Big Data?

Next presentation starting soon Business Analytics using Big Data to gain competitive advantage

DATAMEER WHITE PAPER. Beyond BI. Big Data Analytic Use Cases

DATA VISUALIZATION: CONVERTING INFORMATION TO DECISIONS DAVID FRONING, PRINCIPAL PRODUCT MANAGER

Native Connectivity to Big Data Sources in MSTR 10

Hadoop and Relational Database The Best of Both Worlds for Analytics Greg Battas Hewlett Packard

Cloudera Enterprise Data Hub in Telecom:

Master big data to optimize the oil and gas lifecycle

Well packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

Big Data Big Data/Data Analytics & Software Development

NoSQL for SQL Professionals William McKnight

An Oracle White Paper October Oracle: Big Data for the Enterprise

#TalendSandbox for Big Data

Executive Summary... 2 Introduction Defining Big Data The Importance of Big Data... 4 Building a Big Data Platform...

Hortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc All Rights Reserved

BIG DATA TECHNOLOGY. Hadoop Ecosystem

Big Data at Cloud Scale

Extend your analytic capabilities with SAP Predictive Analysis

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

INTELLIGENT BUSINESS STRATEGIES WHITE PAPER

Exploiting Data at Rest and Data in Motion with a Big Data Platform

2015 Analyst and Advisor Summit. Advanced Data Analytics Dr. Rod Fontecilla Vice President, Application Services, Chief Data Scientist

Big Data & Analytics for Semiconductor Manufacturing

Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data

Data Integration Checklist

Big Data Technologies Compared June 2014

How To Use Big Data For Telco (For A Telco)

Big Data: What You Should Know. Mark Child Research Manager - Software IDC CEMA

TE's Analytics on Hadoop and SAP HANA Using SAP Vora

CitusDB Architecture for Real-Time Big Data

Innovative technology for big data analytics

Beyond Watson: The Business Implications of Big Data

HP HAVEn: See the big picture in Big Data

Solving big data problems in real-time with CEP and Dashboards - patterns and tips

Cloud Integration and the Big Data Journey - Common Use-Case Patterns

QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM

Understanding traffic flow

BAO & Big Data Overview Applied to Real-time Campaign GSE. Joel Viale Telecom Solutions Lab Solution Architect. Telecom Solutions Lab

Business Intelligence for Big Data

Modern Data Warehouse

Ganzheitliches Datenmanagement

Reference Architecture, Requirements, Gaps, Roles

Transcription:

Big data and corrections: what s the big issue? Corrections Technology Association June 4, 2013 1

Big is a relative term What is considered "big data" varies depending on the capabilities of the organization managing the set. For some organizations, facing hundreds of gigabytes of data for the first time may trigger a need to reconsider data management options. For others, it may take tens or hundreds of terabytes before data size becomes a significant consideration. 2

Corrections by the numbers PEW Center on the States June 2012, The High Cost of Corrections in America: Infographic 3

Corrections by the numbers PEW Center on the States June 2012, The High Cost of Corrections in America: Infographic 4

Corrections by the numbers PEW Center on the States June 2012, The High Cost of Corrections in America: Infographic 5

At year s end 2011, nearly 1 in every 34 adult residents in the U.S. was under some form of correctional supervision* (approximately 7 million) Assume each individual has a circle of 5 contacts worth noting; approximately 35 million individuals to be aware of (relationship mapping) More than 10% of the population *Correctional Populations in the United States, 2011 (Lauren E. Glaze, BJS Statistician, and Erika Parks, BJS Intern) 6

Justice in the data Applying a data-driven lens to improve the efficiency and effectiveness of the criminal justice system has a potentially huge incentive: Improving the bottom lines of state budgets around the country 7

Making decisions 8

Corrections ranks among the highest expenditures in state budgets In 2012, corrections was No. 4 in California State budget It costs $45,000 per year in New Jersey to incarcerate someone The costs of incarceration just prior to trial alone is $9 billion Cost of local systems nationally is estimated by the Department of Justice at more than $130 billion a year 9

The four V s: understanding the trend V V V V olume elocity alue ariety Typically, big data refers to high volumes of information Like water from a fire hose, data is constantly flowing into agencies at intense rates High volumes of data produced at accelerated rates Non-traditional data sets associated with big data, such as video and audio files, structured and unstructured content, spatial tweets, blogs, wiki, and social media entries 10

Big data in corrections: where s the data? Within prisons and jails: structured data Inmate cell assignments, assessments, money transfers and purchases, rules violations, associates, property, program enrollments/outcomes, grievances, health appointments and encounters, pill line. Incidents, consumables, asset management, payroll, system/internet usage Within prisons and jails: unstructured data Document imaging, email, SharePoint, PDF reports Photos: mugshots, tattoos, evidence, visitors, staff Video: surveillance, committee hearings, visits, court hearings, health services Audio: inmate telephone conversations, hearings Biometrics Add the following for community corrections: GPS tracking, case notes, co-workers and associates Internet usage, cellular records (parolee and staff), email, social media 11

Corrections in the digital age Social Media and the Internet

Cyber threats and cyber supervision 13

Social networking Supervising offenders online 14

Securing facilities Investigating online 15 Photo by Wired Photostream under a Creative Commons license

Social media policy Supervising staff online justice.vic.gov.au/socialmedia Statistics sourced from: ComScore State of the Internet, February 2011 16

Investing in a comprehensive information management strategy will result in: better early warning, to enable faster response real-time awareness, to know what s happening on the ground now real-time feedback, perhaps most important, to see what s not happening versus what was intended 17

Information overload It s happening now! 18

The returns for better applying technology in criminal justice extend far beyond reducing crime or costs, to something that government officials are sworn to uphold: justice. Anne Milgram Former Attorney General of New Jersey

What is big data? 20

How much is too much?

A fundamental shift is underway $ CRM, SCM, ERP Transactional Data Core Transaction Systems Human-friendly Information Mobile Audio Texts Social Media Velocity Big Data Volume Other Operational Systems + New technologies Variety Value Analytical Environment Predefined reporting, dashboards and analytics to: Measure/monitor the business Analyze and improve operations Video Images Email Requiring filters for meaning (context, relevance, urgency) to: Operate efficiently Protect the population Improve best practices through trends and public policy Tools for meaning-based computing and real-time analytics processing HW and Appliances to power, process and store big data Cloud and Security for emerging demands to share information System integration services to help clients align, architect and accelerate the shift 22

Leverage data explosion 23

Data capture Technology innovation Technology Feature Value Scale-out Columnar 24 Massive Parallel Processing, Shared-nothing Node fail-over Structured data stored by column Linear scalability, critical SLA Industry standard components Performance on large data volumes Effective Compression In-memory Data permanent in memory Performance on data streams and cubes In-database analytics Complex event processing Search Human Lanaguage and Rich Media Processing Real-time Visualization Content management systems Execute close to data Analyse and react to stream event data Index and search unstructured and structured data Text, audio, video analysis Graphic display of complex data Support variety of formats May not support SQL or ACID Performance on large data volumes Near real-time response to business process Analyze unstructured data Meaning based computing Sentiment rating, other ratings Human perception, faster results Store and process unstructured data Service management Event processing Complex event processing Data acquisition Structured data Semistructured data Unstructured data Governance is one of the main drivers behind Big Data Analytics Rules engine Data transformation Master data Derive metadata and index Match and combine Populate repositories Portfolio management SOA services Repository Data governance Relational DBMS Data warehouse Non-relational DBMS (HDFS, Hbase,...) Non-relational DBMS (content mgt systems) Applications Operations management Data Virtualization Data mart (OLAP cube) Real-time analytical RDBMS Analysis and reporting Rules generator Data mining engine SQL analytics engine Search engine NoSQL (MapReduc) engine Data audit, balance and control Big Data Reference Architecture Visualization Static and OLAP reports Dashboards and alerts Statistical analysis Predictive analysis

Storing data Emerging technology addresses challenges Volume Variety Velocity Hadoop distributions Yes Yes Hadoop distributions Content management systems Yes Yes Real-time analytical RDBMS Yes Yes NoSQL SQL Traditional RDBMS for OLTP/OLAP Content management systems Processing data Real-time analytical RDBMS Enterprise Business Control Systems Control BCS* Operations Operations staff ODS* Model Builders Strategy EDW* 25

The challenges of unmanaged information Missed opportunity Increased risk Cost and complexity Social Media Video Email HRMS Supply Chain Management/ Inventory Mgmt Procurement Texts Messages ERP CRM Images Word, Excel Audio Transactional Data Logs Clickstream Data 26 90% 10% Meaningful intelligent information insight requires analysis of 100% of information

Imagine a world where a conversation with your information drives intelligent business decisions Social Media Video Audio Texts Messages Transactional Data Transactional Data Intelligent information insight across structured, semistructured and unstructured data Word, Excel Logs Clickstream Data Do 40% more of this Right time, intelligent decision making Images Email MGD Logs Clickstream Data Positive ROI (Return on Information) 27 Meaningful intelligent information insight requires analysis of 100% of information

Why is processing human information different? Human Information is made up of ideas, is diverse, and has context. Ideas don t exactly match like data does; they have distance. Human Information is not static it s dynamic and lives everywhere. Legacy techniques have all fallen short. Social Media Video Audio Email Texts Mobile Transactional Data Documents Search Engine Images IT/OT 28

Rich-media data Unstructured text data M ixed structure data Unknown-structure data WWW Semi-structured text data Structured text data SAN/Drop z one ODS EDW Data marts Hadoop Access-in-place Native analytics Udx extensions R- Functions Hadoop Uns truc tured data s tore (HDFS) M ap-reduc e Vertic a Analy tics RDBM S Hadoop extended tools M eaning-based analytics (Autonomy IDOL) NotOnly SQL solutions Autonomy value-add applications Analytic tools BI/ Visualiz ation tools ETL applications Analytics: Optimization of Labor and Post Positions Data: Incident data Time of day Location within prisons Officer post coverage Analyze for patterns and trends Result: Adjust shifts and optimize post positions Labor Optimization Input (From Demand Forecaster) Revenue Forecast Unit Costs & Rates C&B On - Boarding Severance $ Rate cards Utilization Workforce Inventory Operating Limits Sites Offshore % Regular/CTW Contingent %, 3P% Job Codes Optimization Model Best - shoring, 3P Regular vs CTW Hiring and WFR Promotion & Transformation Redeployment Output Financial Projection Operational Plan Strategy Analysis 29

Analytics: monitoring and surveillance Unstructured Data: Arrest reports Pre-sentence investigations Court documents Criminal history Psychological evaluations Incident reports, Grievance and appeal documents Education assessments Correspondence Inmate requests to staff 30 Word, Excel Filtered Content File Shares Records DBs Relationship Analysis Meaning-based correlation Result: Identify safety risks Informed decisions in inmate cell and work assignments Control of movements Concept-based, context aware Real-time self learning Document training

Data capture Unstructured and structured analysis Logical architecture Service management Portfolio management Operations management Event processing Complex event processing Data acquisition Human Language Rich Media Structured data Semistructured data External data Internal data Unstructure d data Rules engine Capture Data transformation Master data Derive metadata and Data index quality Master data mgt Match and Integration combine Aggregation Populate repositories SOA services Repository Raw Data Repositor y Relational DBMS data warehouse Applications Non-relational DBMS Relational (e.g. HDFS, DBMS Hbase,...) Staging Integration Non-relational DBMS Enterprise (content DW mgt systems) Applications Sentiment, Mark-up and Integrate Data Virtualization Data mart (e.g. OLAP Data cube) marts OLAP Real-time cubes analytical RDBMS Analysis and reporting Rules generator Analytical Data mining Data Mart engine SQL analytics engine Reporting Search Dashboards engine OLAP Statistical analysis NoSQL (e.g. Data mining MapReduce) Visualization engine Visualization Visualization Analysis Static and OLAP reports Dashboards and alerts Statistical analysis Predictive analysis Governance Data governance Data audit, balance and control 31

Big data discovery Structured and unstructured data samples Web content Telemetrics Semi-structured Discovery environment Client data Ingestion Insights lead to more questions Business operations Efficiency Cost Target Compliance Risk Intelligent networks Analysis and discovery Patterns insights and ideas Phase 1 Phase 2 Phase 3 New and revised operational and analytical processes to be implemented in the enterprise Phase 4 32

Business enablement Enabling the journey to information optimization Stage 1 - Information Aware Early stage of IO journey Primary focus at executive insight Stage 2 - Information Empowered Transitioning from managing to exploiting information Recognition of information as strategic enterprise asset Stage 3 - Information Enabled Emerging business, technology partnership App-focused vs. infofocused Stage 4 - Information Driven Analytics-driven decisions Business partners w IT Fully incorporating structured and unstructured data Stage 5 - Information Transformed Winning as information leader Ability to value info as a new model Info as strategic lever throughout enterprise 33 Information technology

Business enablement Enabling the journey to information optimization Stage 1 Information Aware Stage 2 Information Empowered Stage 3 Information enabled Stage 4 Information driven Stage 5 Information transformed 34 Information technology

Bright spots in corrections analytics Category of Data Mining/Analytics Efficiency and Cost Avoidance Public Health Safety and Risk Avoidance Example Use Cases Position and post coverage Inmate transportation optimization Incident avoidance, inhibit policy violations Rehabilitation program effectiveness Patient safety Illness outbreak control Relationship analysis for investigations, surveillance 35

Challenges of climbing the maturity model Variety, velocity, volume, time to value 36 90% of digital content created by 2015 will be mixed data types¹ 48% Increase in the projected volume of digital content worldwide¹ 75% 86% of currently deployed data of corporations cannot warehouses will not scale sufficiently deliver the right information, to meet new information velocity at right time, to support and complexity of demands by 2016² enterprise outcomes all of the time³ ¹Source: IDC Predictions 2012: Competing for 2020 ²Source: Gartner - The State of Data Warehousing in 2012 ³Source: Coleman Parkes Survey Nov 2012

Information optimization enables Delivering meaningful outcomes through insights and analytics across all data Understanding and leveraging data as a primary strategic asset Implementing an information governance model that covers data ownership, metadata, security, and visibility of data Discovering patterns and developing insights through predictive analytics to help close the gap between strategy and execution Enabling better decisions and outcomes by providing the right information, at the right time, by the right method Data Visibility Insights Decisions 37

Questions?

Speaker contact information Speaker Company Phone Email Stephen McHugh Chris Litton HP Diana Zavala HP Sierra Systems 908.420.7343 stephen.mchugh@hp.com 250.882.0207 ChristopherLitton@SierraSystems.com 703.728.6441 diana.zavala@hp.com John Ward HP 916.801.9687 john.ward2@hp.com 39