Big Value Data, Not Just Big Data!



Similar documents
Fujitsu Big Data Software Use Cases

Best Practice of Flood Hazard Map in Japan

NTT DATA Big Data Reference Architecture Ver. 1.0

Lessons learned from the tsunami disaster caused by the 2011 Great East Japan Earthquake and improvements in JMA's tsunami warning system


Earthquake hazard mapping for community resilience in Japan

12 th World Telecommunication/ICT Indicators Symposium (WTIS-14)

CRS 610 Ventura County Flood Warning System Website

Maintenance and Management of JR East Civil Engineering Structures

Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, Viswa Sharma Solutions Architect Tata Consultancy Services

Preparation. Preparation. Step 2 Prepare an emergency kit. Step 1 Prepare your emergency plan. Step 4 Tune into warnings

Global Flood Alert System (GFAS)

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank

Tohoku University and the Great East Japan Earthquake Our Role, Responsibility and Mission. Susumu SATOMI President, Tohoku University

The Challenges of Geospatial Analytics in the Era of Big Data

INSTALLATION OF AN AUTOMATED EARLY WARNING SYSTEM FOR ANNOTTO BAY

The Big Data Paradigm Shift. Insight Through Automation

Natural Disaster Impact on Business and Communities in Taiwan. Dr. Chung-Sheng Lee. NCDR Chinese Taipei

ABSTRACT INTRODUCTION PURPOSE

CELL PHONE TRACKING. Index. Purpose. Description. Relevance for Large Scale Events. Options. Technologies. Impacts. Integration potential

Technology Implications of an Instrumented Planet presented at IFIP WG 10.4 Workshop on Challenges and Directions in Dependability

National Disaster Management Institute

Recent activities on Big Data Assimilation in Japan

Raul F. Chong Senior program manager Big data, DB2, and Cloud IM Cloud Computing Center of Competence - IBM Toronto Lab, Canada

Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA

Statistical Challenges with Big Data in Management Science

Using Big Data for Crisis Management. Mohammad Khaled AL Hassan

The Key in Driving the Future

Communications network damage by the Great East Japan Earthquake and securing communications

Smart City Resilience Learning from Emergency Response and Coordination in Japan

Visual Acuity. Hearing. Height and Weight. Blood Pressure MEASURED VALUE

Monitoring and Early Management of Emergences: New Instruments

Improved Warnings for Natural Hazards: A Prototype System for Southern California

Industry Impact of Big Data in the Cloud: An IBM Perspective

Attributes and Objectives of Social Media. What is Social Media? Maximize Reach with Social Media

Big Data. Fast Forward. Putting data to productive use

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

Flooding Fast Facts. flooding), seismic events (tsunami) or large landslides (sometime also called tsunami).

Flood Damage Statistics in Japan - What is required for mainstreaming DM?

IoE-Based Rio Operations Center Improves Safety, Traffic Flow, Emergency Response Capabilities

ESTIMATING THE COSTS OF EMERGENCY SERVICES DURING FLOOD EVENTS

Bruhati Technologies. About us. ISO 9001:2008 certified. Technology fit for Business

Graduate School of Disaster Prevention Kangwon National University.

Providing drivers with actionable intelligence can minimize accidents, reduce driver claims and increase your bottom line. Equip motorists with the

Setting the Standard for Safe City Projects in the United States

Flood risk assessment through a detailed 1D/2D coupled model

Appendix J Online Questionnaire

Stormwater Control Measures for Tokyo

How To Understand The Power Of The Internet Of Things

From Big Data to Smart Data Thomas Hahn

ABSG Consulting, Tokyo, Japan 2. Professor, Kogakuin University, Tokyo, Japan 3

Facility Monitoring Services for More Efficient Maintenance of Social Infrastructure

IBM Big Green Innovations Environmental R&D and Services

Big Data overview. Livio Ventura. SICS Software week, Sept Cloud and Big Data Day

SESSION 8: GEOGRAPHIC INFORMATION SYSTEMS AND MAP PROJECTIONS

MES and Industrial Internet

Understanding Diseases and Treatments with Canadian Real-world Evidence

How To Make Data Streaming A Real Time Intelligence

The Real-time Monitoring System of Social Big Data for Disaster Management

Master big data to optimize the oil and gas lifecycle

Japan-World Bank Program for Mainstreaming DRM in Developing Countries

Flood Emergency Response Planning: How to Protect Your Business from a Natural Disaster RIC005

Development of GIS-Based Flood-Simulation Software and Application to Flood-Risk Assessment

Software Engineering for Big Data. CS846 Paulo Alencar David R. Cheriton School of Computer Science University of Waterloo

Big Data Analytics. Prof. Dr. Lars Schmidt-Thieme

Gov 3.0. Driving e-government through social, mobile, analytics and the cloud. Microsoft CityNext

Estimation of Travel Demand and Network Simulators to Evaluate Traffic Management Schemes in Disaster

Advanced Methods for Pedestrian and Bicyclist Sensing

Basic system of measures for flood damage mitigation in Japan. Preparedness for major floods

FLOODALERT: A SIMPLIFIED RADAR-BASED EWS FOR URBAN FLOOD WARNING

Exploiting Data at Rest and Data in Motion with a Big Data Platform

Big Data Analytics. Bringing Value out of Volume

YOU VS THE SENSORS. Six Requirements for Visualizing the Internet of Things. Dan Potter Chief Marketing Officer, Datawatch Corporation

PROGRAM DIRECTOR: Arthur O Connor Contact: URL : THE PROGRAM Careers in Data Analytics Admissions Criteria CURRICULUM Program Requirements

Doppler Traffic Flow Sensor For Traveler Information Systems. October,

ICT. Overview. The Information and Communications Industry Japan s Largest Industry. Contribution to real GDP by industry (2010)

The following was presented at DMT 14 (June 1-4, 2014, Newark, DE).

Unlocking the Intelligence in. Big Data. Ron Kasabian General Manager Big Data Solutions Intel Corporation

Data Science & Big Data Practice

Harnessing the Data Flood: Oracle s Visionary Platform from Device to Data Center. Chris Baker Senior Vice President Worldwide ISV/OEM Java Sales

Challenges for Big Data Applications in Japan: Hopes and Concerns

Are You Ready for Big Data?

HAZARD RISK ASSESSMENT, MONITORING, MAINTENANCE AND MANAGEMENT SYSTEM (HAMMS) FOR LANDSLIDE AND FLOOD. Mohd. Nor Desa, Rohayu and Lariyah, UNITEN

Big Data and Analytics: Getting Started with ArcGIS. Mike Park Erik Hoel

City Deploys Big Data BI Solution to Improve Lives and Create a Smart-City Template

Smarter Transportation Management

Transcription:

Big Value Data, Not Just Big Data! Dr. Joseph Reger Chief Technology Officer Fujitsu Technology Solutions 0 Copyright 2013 FUJITSU LIMITED

Powers of Ten (SI) Source: Wikipedia 1 Copyright 2013 FUJITSU LIMITED

Big Data? Search Engines Hadoop 100,000 Flights/day 1 billion users 200 million pictures/day 1 billion PCs Billions of requests per day 45 million servers Social Network Scan Sensor Data Business Data Image Analysis Complex Event Processing Data Layer Management Points of Interest Route search Area Managements Congestions Forecast Personal Profiles 650 million smartphones Traffic information Billion objects per hour Billions of measurements per day 1 billion rides per day 100s of millions locations 1 billion cars 2 Copyright 2013 FUJITSU LIMITED

New (Re)Sources, New Technologies Large Data Sets Big Data Powerful hardware Lots of sources New, affordable tools Unstructured Data New analytics Real time 3 Copyright 2013 FUJITSU LIMITED

New use cases Agriculture Big Data Home Healthcare Traffic & Transport Energy Maintenance & Repair Marketing Safety, Public Safety 4 Copyright 2013 FUJITSU LIMITED

Big Value Data! Social Networking Services & Sensor Data Business Data 5 Copyright 2013 FUJITSU LIMITED

Big Data? Search Engines Hadoop 100,000 Flights/day 1 billion users 200 million pictures/day 1 billion PCs 45 million servers Billions of requests per day Social Network Scan Sensor Data Business Data Image Analysis Complex Event Processing Data Layer Management Points of Interest Route search Area Managements Congestions Forecast Personal Profiles 650 million smartphones Billion objects per hour Traffic information 1 billion rides per day 100s of millions locations Billions of measurements per day 1 billion cars 6 Copyright 2013 FUJITSU LIMITED

Big Data research @ Fujitsu Data processing Fast processing of large-scale data Big Data Statistics / history Social media Twitter, blogs, etc. Sensor data GPS, weather, etc. Open data Collection Distributed data collection Privacy security Linked open data Analysis Data/text mining Analysis platform Optimization Simulation Application Transportation Energy Disaster recovery Marketing Topics: Social media analysis Analytic templates Optimal area discovery Social simulation Topics: Transportation simulation Smart grid Event detection from SNS Topics: Incremental data processing Parallel complex event processing Stream data aggregation Faster, more intelligent, more secure technologies 7 Copyright 2013 FUJITSU LIMITED

Application Areas Healthcare Retail Manufacturing Government Agriculture Sectors Usage Disease Prevention New Drug Development Multi-channel Sales Customer Behavior Behavior Management Asset Management Traffic Management Homeland Security Production Efficiency Livestock Reproduction Management Bio Technology SCM New Service M2M Crime Prevention Inventory Management 8 Copyright 2013 FUJITSU LIMITED

Research on Big Data use cases by Fujitsu Laboratories & Fujitsu Limited Kozo Otsuka Technology Office Fujitsu Technology Solutions 9 Copyright 2013 FUJITSU LIMITED

Early detection of lifestyle related diseases Accurate prediction by the variety of health related data analysis Medical record data Health checkup data Vital data 12 M 0.8 M ¼ M over 5 years over 5 years from sampling for ½ year Early detection of signs of habits leading to lifestyle diseases Link to expert advice for preventive actions Example: Diabetic patients & candidates in Japan: 33% of male, 23% of female adults (2011) Health Crime Rain Tsunami 10 Copyright 2013 FUJITSU LIMITED

Prediction method Fujitsu developed methodology Conventional diag. param. HbA1c Blood Glucose Reg. health checkup data Serum creatinine HDL cholesterol BMI Platelet count γ-gt(γ-gtp) Abdom. Circum. GOT(AST) MCH Total protein MCV White blood cell + GPT(ALT) MCHC Serum uric acid Diastolic blood pressure Neutral fat LDL cholesterol Total cholesterol Systolic blood pressure Hematocrit Hemoglobin content Medical record data Diag./Treatment Prescriptions Machine Learning to create rules 2,000-dimensions Finer distinctions for accurate prediction by high dimensional rules Health Crime Rain Tsunami 11 Copyright 2013 FUJITSU LIMITED

Example: predict diabetes Test conducted targeting Fujitsu employee volunteers (26,000) Fujitsu employees (Including diabetes patients) 1. Collect and sort relevant data Analytics Historical medical record data Historical health checkup data Historical vital data 2. Analyze all data to build highly probable diabetes model Vital Med. rec. Checkup Vital Med. rec. Checkup Outcome Probability HIGH 3. Feed target individual s data 4. Compare against model 5. Diabetics probability Reduce medical care cost by accurate prediction Health Crime Rain Tsunami 12 Copyright 2013 FUJITSU LIMITED

Extremely high population density in Tokyo Tokyo population density https://en.wikipedia.org/wiki/greater_tokyo_area Over 12 million people in Tokyo metropolitan area (= ca. 10% of the total Japanese population) Over 14,000 people / km 2 in Tokyo city (ref. ca. 4,400 people / km 2 in Munich, Germany) Small personal space, higher risk for human conflicts Over 6 million people in Tokyo metropolitan area using smartphones during commute, office & private hours Huge # of social media data TOKYO Metropolitan http://www.targetmap.com/viewer.aspx?reportid=5845 Health Crimes Rain Tsunami 13 Copyright 2013 FUJITSU LIMITED

Map of criminal activities Visualization of social network information Use Twitter (40 mil. tweets / day in Japan) as huge number of event sensors Create database of the detected events mapped to geographic locations Filtering and selecting tweets for a target topic Classify selected tweets into sub-categories Identify locations of the events in the tweets Criminal activity map Select tweets related to crimes 1 2 Correlate crime types, time & locations Overlay the correlation data onto a map 3 Health Crimes Rain Tsunami 14 Copyright 2013 FUJITSU LIMITED

Map of criminal activities test run Visualization of criminal activity related tweets Showing the semantic analysis & machine learning phase Contents Handle Date Typ., Loc. Showing the mapping of criminal activities onto the Tokyo map Health Crimes Rain Tsunami 15 Copyright 2013 FUJITSU LIMITED

Japan very vulnerable to climate change Natural disaster due to rainfall Ca. 4 Billion U.S. dollars of property damage annually caused by flooding & inundation Over 1,100 landslides annually Over 100mm per hour precipitation from torrential rain Precipitation increasing every year Strong need for early warnings & preventions http://www.mlit.go.jp/river/basic_info/english/pdf/conf_01-0.pdf https://en.wikipedia.org/wiki/geography_of_japan Health Crime Rain Tsunami 16 Copyright 2013 FUJITSU LIMITED

Finer granularity of rain observation in Japan Leverage XRAIN * radar Compared to the conventional: 5x more frequent data (1 min) 16x finer mesh resolution (250m) 3D scan raindrop information Over 100 times data increase Over 500K records per minute per zone (w/ up to 4 radars) More precise & more real-time Short-period aggregation (e.g.1h) Map data OpenStreetMap Long-period aggregation (e.g.3h) Disaster-warning area weak Rainfall coverage area(1km-mesh) Evacuation required strong 0 30 60 100 200 (mm/h) Finer rainfall measurement (250m mesh) Map data OpenStreetMap 5km-mesh rainfall data by XRAIN (Source: Water & Disaster Mgmt. Bureau, Ministry of Land, Infrastructure, Transport and Tourism) With Fujitsu big data processing: Aggregation of up to 100 mil. records within 10+ secs., updated every 1 min. Real-time aggregation of total rainfall since the 1 st drop for each mesh Detect potential disaster areas w/ the fast data aggregation *XRAIN = X-band MP radar developed by NIED* for MLIT* *NIED = National Research Institute for Earth Science and Disaster Prevention, Japan *MLIT = Ministry of Land, Infrastructure, Transport and Tourism *C-band radar = currently, the most popular weather radar type in the world Health Crime Rain Tsunami http://www.raingain.eu/sites/default/files/maesaka_seminar_ecole_ponts_2_july_2012.pdf https://ams.confex.com/ams/35radar/webprogram/manuscript/paper191685/35radar_maesaka.pdf 17 Copyright 2013 FUJITSU LIMITED

Estimate status quo w/ Social Network Services Use a certain tendency from large SNS data to identify the status quo More precise analysis of location information contained in SNS through Area of life analysis to overcome the following challenges: Only 0.5%(*) of SNS posts including GPS information Only 30%(*) of posts containing landmark information(e.g. town name vs. neighborhood) Only coarse resolution (municipal area, state, prefecture) as SNS user location profile Unreliable ties between contents of posts and user location profiles (e.g. hearsay vs. real experience) * According to the analysis on Tweeter tweets conducted by Fujitsu Labs SNS User s posts User s history of posts Disaster related posts + Area of life (e.g. City B) Area of Life Draw rough prediction of the area where each user spends his/her daily life from the landmarks in the past posts - anonymously Flood? Flood! Submerged! Map data OpenStreetMap Flood! Flood! Terrible! Submerged? Submerged! Flood occurs in City B Witness, observation Hearsay from others Reports from media ReTweet Higher potential of disaster status quo estimated by the large # of posts regarding towns/citys Health Crime Rain Tsunami 18 Copyright 2013 FUJITSU LIMITED

Big Data use: Disaster alert w/ SNS & XRAIN Analysis of rainfall Time Line XRAIN SNS Rainfall Data posts for heavy rain Detecting Disasters Effect on people, logistics, etc. slight Rain Fall severe Alert XRAIN detected heavy rain Monitoring an increase of posts on a specific topic (rain) for the location identified by XRAIN Detecting Disasters * XRAIN data supplied courtesy of Ministry of Land, Infrastructure, Transportation and Tourism of Japan Test run in 2012 for towns in Osaka & Kyoto prefs. raised alerts 3 h earlier than conventional warnings Health Crime Rain Tsunami 19 Copyright 2013 FUJITSU LIMITED

2011.03.11 Tohoku Earthquake and Tsunami Damages caused by Tsunami Epicentral area ca. 500km long (N-S) and 200km wide (E-W) Max. 14.8m Tsunami height, up to 40m Tsunami run-up height 535km 2 of land inundated by Tsunami in Tohoku & Kanto region ca. 129,000 buildings destroyed ca. 15,850 fatalities & 3,282 missing Over 20,000 cars swept away a lot of them were in traffic jams Over 179 mil. tweets in that week, many asking for help www.jiji.com Earthquake simulation initiative by the Japanese government 1) Earthquake & Tsunami simulation: predict power & speed 2) Building response simulation: predict effects on infra. 3) Evacuation activity simulation: protect human lives www.jiji.com Health Crime Rain Tsunami 20 Copyright 2013 FUJITSU LIMITED

Japan surrounded by fault lines Fault-line slides trigger Tsunami 1) Earthquake & Tsunami simulation: predict power & speed 2011 年 の 日 本 の 地 震 分 布 図 Japan earthquakes 2011 Visualization map (2012-01-01). The impact underestimated by the existing Tsunami warning system during Tohoku earthquake on Mar. 11 th, 2011 No sensor to measure the amount of fault line slide no accurate way to predict the Tsunami speed & power, yet Urgent need for the alternative real-time Tsunami prediction system leveraging: Sensors on the surface and the bottom of the ocean (GPS buoy, seabed wave gauges, coastal tide gauges, etc.) More accurate real-time analysis on the source area of Tsunami and the Tsunami source model using the new & accurate sensor data input (above) http://www.youtube.com/watch?feature=player_embedded&v=ekp5ca2sm28 1 st Step: Solid simulation algorithm based on Tohoku earthquake data Yves Descatoire http://www.earthobservatory.sg/media/news-and-features/294-japan2011.html Health Crime Rain Tsunami 21 Copyright 2013 FUJITSU LIMITED

Zoom-up on triangular grids Big Data use: Simulation for accurate early warning 1) Earthquake & Tsunami simulation: predict power & speed Research on real-time & high-res. simulation for more accurate warning Map data is from Google map. Using K computer ( 京 ) Simulation of the 2011 Great Tohoku tsunami Simulation resolution ranges over 5 m 405 m The inundated region shown w/ black dotted line 5 m resolution used in the red boxes in this simulation focusing on Sendai city x = 5 m x = 5 m x = 15 m Non-linear simulation to solve the Tsunami wave & flow ca. 16 mil. triangular grids & finer grids over Sendai ca. 16 k calc. steps in 120 min. long simulation 23 min. for calc. w/ ca. 9.3 % of K computer total core # Health Crime Rain Tsunami More accurate & faster Tsunami warning Oishi et al. (2013, JpGU) 22 Copyright 2013 FUJITSU LIMITED

Japan many cities facing the ocean Impact by the Tsunami water flow Improve the accuracy on estimation of building damages by Tsunami Improve the accuracy on estimation of river/canal overflow by Tsunami flowing upstream direction Balance between robust buildings vs. water flow direction & speed Help the city/town infrastructure planning to secure the evaluation paths (road, bridges, etc.) Helps to better design the evacuation facilities and the way to get there 2) Building response simulation: predict effects on infra. 2 st Step: Solid 3D simulation of the water flow to predict damages Health Crime Rain Tsunami 23 Copyright 2013 FUJITSU LIMITED

Big Data use: Simulation for disaster prevention 2) Building response simulation: predict effects on infra. Accurate 3D replication of invading wave from offshore to shallow sea This video is a demonstration of simulation technique. It is not for an estimation of a damage by tsunami. (left: Yokohama, right: Tokyo) 3D simulation for wide-area using K computer ( 京 ) Smoothed-particle hydrodynamic simulation w/ 400 million particles The potential use of these research results are: to design levees & evacuation shelters to develop guidelines for hazard maps and evacuation routes More effective disaster prevention planning through 3D simulation Health Crime Rain Tsunami 24 Copyright 2013 FUJITSU LIMITED

Big Value Data! Social Networking Services & Sensor Data Business Data 25 Copyright 2013 FUJITSU LIMITED

"The Rock" The Rock Thomas Stearns Eliot Where is the Life we have lost in living? Where is the wisdom we have lost in knowledge? Where is the knowledge we have lost in information? (1888 1965) publisher, playwright, literary born American naturalized British subject in 1927 Nobel prize in literature in 1948 Source: Wikipedia 26 Copyright 2013 FUJITSU LIMITED

DIKW-Hierarchy Where is the WISDOM we have lost in knowledge? Where is the KNOWLEDGE we have lost in information? Where is the INFORMATION we have lost in DATA? 27 Copyright 2013 FUJITSU LIMITED

28 Copyright 2013 FUJITSU LIMITED