Cloud computing based big data ecosystem and requirements



Similar documents
SERIES Y: GLOBAL INFORMATION INFRASTRUCTURE, INTERNET PROTOCOL ASPECTS AND NEXT-GENERATION NETWORKS Cloud Computing

Data in the IoT: considerations on opportunities, challenges and standardization perspectives

JOURNAL OF OBJECT TECHNOLOGY

Research of Postal Data mining system based on big data

Potential standardization items for the cloud computing in SC32

Certified Information Professional 2016 Update Outline

COMP9321 Web Application Engineering

Cloud and Big Data Standardisation

Addressing Risk Data Aggregation and Risk Reporting Ben Sharma, CEO. Big Data Everywhere Conference, NYC November 2015

Fight fire with fire when protecting sensitive data

Overview NIST Big Data Working Group Activities

Big Data and Analytics: Challenges and Opportunities

SOA, Cloud Computing & Semantic Web Technology: Understanding How They Can Work Together. Thomas Erl, Arcitura Education Inc. & SOA Systems Inc.

IoT/M2M standardization activities in ITU T. Yoshinori Goto, NTT

ITU- T Focus Group Cloud Compu2ng

The Concept of Big Data Reference Model

Master Data Management Architecture

Certified Information Professional (CIP) Certification Maintenance Form

Salesforce Certified Data Architecture and Management Designer. Study Guide. Summer 16 TRAINING & CERTIFICATION

Paxata Security Overview

Cloud Based Document Management

BIG DATA TECHNOLOGY. Hadoop Ecosystem

HOW TO DO A SMART DATA PROJECT

Information Access Platforms: The Evolution of Search Technologies

Information Security, PII and Big Data

Hadoop Data Hubs and BI. Supporting the migration from siloed reporting and BI to centralized services with Hadoop

Session 4 Cloud computing for future ICT Knowledge platforms

Bringing Strategy to Life Using an Intelligent Data Platform to Become Data Ready. Informatica Government Summit April 23, 2015

How To Make Sense Of Data With Altilia

Big Data Analytics Roadmap Energy Industry

REFLECTIONS ON THE USE OF BIG DATA FOR STATISTICAL PRODUCTION

What Cloud computing means in real life

Report on the Dagstuhl Seminar Data Quality on the Web

Data Mining Governance for Service Oriented Architecture

Transforming the Telecoms Business using Big Data and Analytics

conceptsearching Prepared by: Concept Searching 8300 Greensboro Drive, Suite 800 McLean, VA USA

"Data Manufacturing: A Test Data Management Solution"

Clinical Data Management (Process and practical guide) Nguyen Thi My Huong, MD. PhD WHO/RHR/SIS

This Symposium brought to you by

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Testing Big data is one of the biggest

Enabling the Big Data Commons through indexing of data and their interactions

Mitel Professional Services Catalog for Contact Center JULY 2015 SWEDEN, DENMARK, FINLAND AND BALTICS RELEASE 1.0

The 5G Infrastructure Public-Private Partnership

Data Modeling for Big Data

IBM Analytics Make sense of your data

A Symptom Extraction and Classification Method for Self-Management

Trustworthiness of Big Data

BIG Big Data Public Private Forum

Data Virtualization for Agile Business Intelligence Systems and Virtual MDM. To View This Presentation as a Video Click Here

Federated Application Centric Infrastructure (ACI) Fabrics for Dual Data Center Deployments

10426: Large Scale Project Accounting Data Migration in E-Business Suite

Image Data, RDA and Practical Policies

Disclosure Requirements of CloudCode Software

Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2

3 MUST-HAVES IN PUBLIC SECTOR INFORMATION GOVERNANCE

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Splunk Enterprise Log Management Role Supporting the ISO Framework EXECUTIVE BRIEF

How To Use Data Analysis To Get More Information From A Computer Or Cell Phone To A Computer

Publishing Linked Data Requires More than Just Using a Tool

The Telematics Application Innovation Based On the Big Data. China Telecom Transportation ICT Application Base(Shanghai)

Using Enterprise Content Management Principles to Manage Research Assets. Kelly Mannix, Manager Deloitte Consulting Perth, WA.

Increase Agility and Reduce Costs with a Logical Data Warehouse. February 2014

Data collection architecture for Big Data

Technical. Overview. ~ a ~ irods version 4.x

Data Quality in Information Integration and Business Intelligence

REQUEST FOR INFORMATION RFI No.: FOR FRAUD ANALYTICS SOFTWARE

A Knowledge Management Framework Using Business Intelligence Solutions

Agenda. Overview. Federation Requirements. Panlab IST Teagle for Partners

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON

locuz.com Big Data Services

The future: Big Data, IoT, VR, AR. Leif Granholm Tekla / Trimble buildings Senior Vice President / BIM Ambassador

Architecture Principles

Big Data? Definition # 1: Big Data Definition Forrester Research

ITU-T Kaleidoscope Conference Innovations in NGN. Managing NGN using the SOA Philosophy. Y. Fun Hu University of Bradford

White Paper. Software Development Best Practices: Enterprise Code Portal

Using LSI for Implementing Document Management Systems Turning unstructured data from a liability to an asset.

Big Data & Security. Aljosa Pasic 12/02/2015

Cloud Thinking. Simplifying Big Data Processing. Rui L. Aguiar, Diogo Gomes Universidade de Aveiro - Portugal

IMPLEMENTATION OF AN ELECTRONIC DOCUMENT MANAGEMENT SYSTEM

FROM DATA STORE TO DATA SERVICES - DEVELOPING SCALABLE DATA ARCHITECTURE AT SURS. Summary

TrustedX: eidas Platform

Agile Business Intelligence Data Lake Architecture

Knowledgent White Paper Series. Developing an MDM Strategy WHITE PAPER. Key Components for Success

Reduce Cost, Time, and Risk ediscovery and Records Management in SharePoint

Open & Big Data for Life Imaging Technical aspects : existing solutions, main difficulties. Pierre Mouillard MD

Service Oriented Architecture and the DBA Kathy Komer Aetna Inc. New England DB2 Users Group. Tuesday June 12 1:00-2:15

BigData Flipkart. Raju Shetty Dir. of Engg, Data Platform

Big Data lisää älyä tiedosta

The X-Factor in Data-Centric Security. Webinar, Tuesday July 14 th 2015

Content Marketing Architecture & Semantics Driving Organic Traffic Growth from Search While Improving CTR Michael Kirchhoff Director of SEO/Product

Big Data Analytics Nokia

Dynamic Data Analytics for the Internet of Things: Challenges and Opportunities

ITU WORK ON INTERNET OF THINGS

Tales of Empirically Understanding and Providing Process Support for Migrating to Clouds

De la Business Intelligence aux Big Data. Marie- Aude AUFAURE Head of the Business Intelligence team Ecole Centrale Paris. 22/01/14 Séminaire Big Data

Big Data & Its Importance

ENVIRONMENTAL PRESSURES DRIVING AN EVOLUTION IN FILE STORAGE

CONCEPTCLASSIFIER FOR SHAREPOINT

NIST Big Data Public Working Group

Transcription:

Cloud computing based big data ecosystem and requirements Yongshun Cai ( 蔡 永 顺 ) Associate Rapporteur of ITU T SG13 Q17 China Telecom Dong Wang ( 王 东 ) Rapporteur of ITU T SG13 Q18 ZTE Corporation

Agenda ITU T progress with big data Big data characteristics and challenges Cloud computing based big data ecosystem Use cases of big data applications & services Requirements and capabilities of cloud computing based big data Conclusion

2013, July: ITU T progress with big data o Initiation of 1 st big data working item Y.Bigdata reqts (Requirements and capabilities for cloud computing based big data) by ITU T SG13 Q17 Overview of cloud computing based big data; Big Data system context and its activities; Cloud computing based big data requirements, capabilities, use cases and scenarios o Targeted to be consented in July 2015 2013, November: o 2014, July: o ITU T watch report Big Data: Big today, normal tomorrow by TSAG Initiation of 2 nd big data working item Y.IoT BigData reqts (Specific requirements and capabilities of the Internet of Things for Big Data) by ITU T SG13 Q2 2015, April: o Initiation of further big data working items: o Y.BigDataEX reqts (Big Data Exchange Requirements and Framework for Telecom Big data)) o Y. Suppl.BigData RoadMap (Supplement on Big Data Standardization Roadmap) o Y.BDaaS arch ((Functional architecture of Big Data as a Service)

Big data definition and characteristics Definition Big data: a category of technologies and services where the capabilities provided to collect, store, search, share, analyse and visualize data which have the characteristics of high volume, high velocity and high variety. Characteristics Volume (large): refers to the amount of data collected, stored, analyzed and visualized, which need big data technologies to be resolved. Variety (Diverse): refers to different data types and data formats that are performed by big data applications and platforms. Velocity (High): refers to both how fast the data is being collected and how fast the data is processed to deliver expected results.

Big data challenges Heterogeneity and incompleteness Data processed using big data technologies can miss some attributes or introduce noise in data transmission. Even after data cleaning and error correction, some incompleteness and errors in data are likely to remain. Scale Managing large and rapidly increasing volumes of data is a challenging issue for data processing. In the past, the data scale challenge was mitigated by evolution of processing and storage resources. Timeliness The acquisition rate and timeliness, to effectively find elements within limited time that meet a specified criterion in a large data set, are new challenges faced by data processing. Privacy Data about human individuals, such as demographic information, internet activities, commutation patterns, social interactions, energy or water consumption, are being collected and analysed for different purposes.

Cloud computing based big data ecosystem The roles in cloud computing can support the three main roles of big data as followings; CSN: data provider; CSP: big data application provider; CSP: big data infrastructure provider; CSC: big data service customer. 6 interne Orange

Use cases 1 Personalization customized service o Each web access activity might be remains as logs of records of visiting Web sites o The CSP:BDIP store relevant information of the Web service user s activities o The CSC:BDAP may know the patterns or preference of the user through analysis of the user s activities

Use cases 2 Intelligent transport big data analysis o The traffic of vehicles and the status of roads are collected in real time. The big data applications for intelligent transport systems are built. o Real time and accurate route navigation o Accurate scheduling for bus departures o Automatic recognition of violations of laws on road traffic

Requirements & capabilities 1 Data collection Data source recognition, which offers the capabilities to locate the data sources and detect the types of data being exposed; Data adaptation, which offers the capabilities to transform and organize the data being accepted with targeted structured and attributes (numbering, location, ownerships, etc); Data transmission, which offers the capabilities to transfer data sets from one location to another keeping the integrity and consistency. Data extraction, which offers the capabilities to extract the semi structured data and unstructured data; Data import, which offers the capabilities to import large amount of static data and real time data; Data integration, which offers the capabilities to integrate data from different data sources (different data type or schema) using by metadata or ontology;

Requirements & capabilities 2 Data storage Data registration, which offers the capabilities to create, update and delete the metadata with corresponding changes to data storage; Note In case of non structured data registration, data registration component can request the transforming raw data to semi structured data (e.g. JSON, BSON) for data registration; Data persistence, which offers the capabilities to store data with low redundancy and low cost and converged with processing capability; Data semantic intellectualization, which offers the capabilities to define semantic relationships among different data set for knowledge sharing; Data access, which offers the capabilities to access data through multiple interfaces, such as API; Data indexing, which offers the capabilities to generate and update index for data sets; Data duplication and backup, which offers the capabilities to duplicate and make backup for data sets;

Requirements & capabilities 3 Data analysis Data preparation, which offers the capabilities to transform data into a form that can be analyzed. This capability includes exploring, de noising, changing, and shaping the data; Data analysis, which offers the capabilities to investigate, inspect, and model data in order to discover useful information; Data visualization, which offers the capabilities to display and present data analysis results; Workflow automation, which offers the automation processes, in whole or part, during which data or functions are passed from one step to another for action, according to a set of procedural rules; Analysis algorithm adaption, which offers the capabilities to apply algorithms of classification, regression, clustering, association rules, ranking and so on according to the requirements;

Requirements & capabilities 4 Data management Data provenance, which offers the capabilities to manage information pertaining to any source of data including the party or parties involved in generation, introduction and/or mash up processes for data; Data preservation, which offers the capabilities to manage the series of activities necessary to ensure continued access to data for as long as necessary; Data privacy, which offers the capabilities to manage the right of individuals to control or influence what information related to them; Data security, which offers the capabilities to handle of the network and service aspects of security, including administrative, operational and maintenance issues; Data ownership, which offers the capabilities to manage digital right of data possession, disposition according to the change of data status (e.g. data integration); Processing task track and control, which offers the information such as success of the job and task, running time and resource utilization and etc.

Conclusion Y.Bigdata reqts Requirements and capabilities for cloud computing based big data First ITU T Big data related recommendation reusing definitions provided in the Cloud vocabulary Rec. ITU T Y.3500 ISO/IEC 17888 and roles in the cloud computing reference Rec. ITU T Y.3502 ISO/IEC 17789 This Recommendation is expected to be delivered in Q3 2015 User view approach: Big data system context with activities Cloud computing based big data ecosystem and activities Usecase to functions derivation: Use cases of cloud computing in support of big data Use cases of cloud computing based big data as analysis services (BDaaS) Main requirements / capabilities cover the whole process of big data processing: Data collection Data storage/persistence Data analyse Data visualization/application Data management/security/privacy As a basis or reference for other big data recommendations Y.BDaaS arch Y.IoT BigData reqts Y.BigDataEX reqts..

Meeting plan in 2015 July 13 23, Geneva, Rapporteur meeting Nov 30 Dec 11, Geneva, Plenary meeting Welcome to join us http://www.itu.int/en/itu T/studygroups/2013 2016/13/Pages/default.aspx

Q&A Thank you for your attention