Overview NIST Big Data Working Group Activities

Size: px
Start display at page:

Download "Overview NIST Big Data Working Group Activities"

Transcription

1 Overview NIST Big Working Group Activities and Big Architecture Framework (BDAF) by UvA Yuri Demchenko SNE Group, University of Amsterdam Big Analytics Interest Group 17 September 2013, 2nd RDA Plenary

2 Outline Overview NIST Big Working Group (NBD-WG) activities and deliverables Proposed Big Architecture Framework (BDAF) Models and Big Lifecycle Big Infrastructure (BDI) Discussion: Liaison and information exchange with NIST BD-WG Disclaimer: Presented here information about NIST Big Working Group (NBD-WG) and images from the NBD-WG working documents are not official position of the NBD-WG and are solely the authors opinion. CWG- BDA NIST BD-WG and UvA BDAF Slide_2

3 NIST Big Working Group (NBD-WG) Deliverables target September September initial draft documents 30 September Workshop and F2F meeting Activities: Conference calls every day 17-19:00 (CET) by subgroup - Big Definition and Taxonomies Requirements (chair: Geoffrey Fox, Indiana Univ) Big Security Reference Architecture Technology Roadmap BigdataWG mailing list and useful documents Input documents Big Reference Architecture Requirements for 21 usecases NIST BD-WG and UvA BDAF 3

4 NIST Proposed Reference Architecture (before July 2013) Obviously not data centric Doesn t make data (lifecycle) management clear [ref] NIST Big WG mailing list discussion NIST BD-WG and UvA BDAF 4

5 Big Ecosystem Reference Architecture (By Microsoft) [ref] Initial contribution July 2013 [ref] Big Ecosystem Reference Architecture (Microsoft) NIST BD-WG and UvA BDAF 5

6 Capability Service Abstraction Cloud C. Framework Transitional Framework SaaS PaaS IaaS Software Platform Cluster Hardware (Storage, Networking, etc.) Management Security & Privacy Management System Management Lifecycle Management NIST Reference Architecture version 0.0 (August 2013) Sources V A R I E T Y V E L O C I T Y V O L U M E Capability Management Service Abstraction Transformation Collection Curation Analytic Visualization Access Usage Service Abstraction Usage R E P O R T RENDER IN G R E T R I E V E NIST BD-WG and UvA BDAF 6

7 NIST Reference Architecture version 0.1 (September 2013) I N F O R M AT I O N F L O W / V A L U E C H A I N System Manager or Vertical Orchestrator System Service Abstraction DATA DATA SW SW Capabilities Service Abstraction Provider Capabilities Provider Service Abstraction Big Framework Transformation Provider Collection Curation Scalable Applications (analytic tools, etc.) Legacy Applications Scalable Platforms (databases, etc.) Analytics Visualization Access Usage Service Abstraction DATA SW Legacy Platforms Scalable Infrastructures (VM cluster, etc.) Legacy Infrastructures Consumer Hardware (Storage, Networking, etc.) K E Y : DATA SW Service Use Big Information Flow SW Tools and Algorithms Transfer I T S TA C K / V A L U E C H A I N NIST BD-WG and UvA BDAF 7

8 Big Architecture Framework (BDAF) by the University of Amsterdam Big definition: from 5+1Vs to 5 parts Big Architecture Framework (BDAF) components NIST BD-WG and UvA BDAF 8

9 Improved: 5+1 V s of Big Variety Structured Unstructured Multi-factor Probabilistic Linked Dynamic Volume Terabytes Records/Arch Tables, Files Distributed 6 Vs of Big Velocity Batch Real/near-time Processes Streams Value Generic Big Properties Volume Variety Velocity Acquired Properties (after entering system) Value Changing data Changing model Linkage Variability Trustworthiness Authenticity Origin, Reputation Availability Accountability Correlations Statistical Events Hypothetical Veracity Variability Veracity NIST BD-WG and UvA BDAF 9

10 Big Definition: From 5+1V to 5 Parts (1) (1) Big Properties: 5V Volume, Variety, Velocity, Value, Veracity Additionally: Dynamicity (Variability) (2) New Models Lifecycle and Variability linking, provenance and referral integrity (3) New Analytics Real-time/streaming analytics, interactive and machine learning analytics (4) New Infrastructure and Tools High performance Computing, Storage, Network Heterogeneous multi-provider services integration New Centric (multi-stakeholder) service models New Centric security models for trusted infrastructure and data processing and storage (5) Source and Target High velocity/speed data capture from variety of sensors and data sources delivery to different visualisation and actionable systems and consumers Full digitised input and output, (ubiquitous) sensor networks, full digital control NIST BD-WG and UvA BDAF 10

11 Big Definition: From 5V to 5 Parts (2) Refining Gartner definition Big ( Intensive) Technologies are targeting to process (1) highvolume, high-velocity, high-variety data (sets/assets) to extract intended data value and ensure high-veracity of original data and obtained information that demand cost-effective, innovative forms of data and information processing (analytics) for enhanced insight, decision making, and processes control; all of those demand (should be supported by) new data models (supporting all data states and stages during the whole data lifecycle) and new infrastructure services and tools that allows also obtaining (and processing data) from a variety of sources (including sensor networks) and delivering data in a variety of forms to different data and information consumers and devices. (1) Big Properties: 5V (2) New Models (3) New Analytics (4) New Infrastructure and Tools (5) Source and Target NIST BD-WG and UvA BDAF 11

12 Big Nature: Origin and consumers (target) Big Origin Science Telecom Industry Business Living Environment, Cities Social media and networks Healthcare Big Target Use Scientific discovery New technologies Manufacturing, processes, transport Personal services, campaigns Living environment support Healthcare support NIST BD-WG and UvA BDAF 12

13 Big Nature: Origin and consumers (targets) Scietific Discovery New Technology Manufactur Transport Personal services, campaigns Living Environmnt, Infrastruct, Utility Science Telecom Industry Business Living environment, Cities Social media, networks Healthcare support Healthcare Rich information on usecases is available from the NIST document store NIST BD-WG and UvA BDAF 13

14 Moving to -Centric Models and Technologies Current IT and communication technologies are host based or host centric Any communication or processing are bound to host/computer that runs software Especially in security: all security models are host/client based Big requires new data-centric models location, search, access variability and lifecycle integrity and identification centric security and access control NIST BD-WG and UvA BDAF 14

15 Defining Big Architecture Framework Existing attempts don t converge to consistent view: ODCA, TMF, NIST See Big Architecture Framework (BDAF) by UvA Architecture Framework and Components for the Big Ecosystem. Draft Version Architecture vs Ecosystem Big undergo a number of transformations during their lifecycle Big fuel the whole transformation chain sources and data consumers, target data usage Multi-dimensional relations between models and data driven processes Infrastructure components and data centric services Architecture vs Architecture Framework (Stack) Separates concerns and factors Control and Management functions, orthogonal factors Architecture Framework components are inter-related NIST BD-WG and UvA BDAF 15

16 Big Architecture Framework (BDAF) for Big Ecosystem (BDE) (1) Models, Structures, Types formats, non/relational, file systems, etc. (2) Big Management Big Lifecycle (Management) Model Big transformation/staging Provenance, Curation, Archiving (3) Big Analytics and Tools Big Applications Target use, presentation, visualisation (4) Big Infrastructure (BDI) Storage, Compute, (High Performance Computing,) Network Sensor network, target/actionable devices Big Operational support (5) Big Security security in-rest, in-move, trusted processing environments NIST BD-WG and UvA BDAF 16

17 Big Architecture Framework (BDAF) Aggregated Relations between components (2) Col: Used By Row: Requires This Models & Structures Managmnt & Lifecycle Big Infrastruct & Operations Big Analytics & Applications Big Security Models Structrs Managmnt & Lifecycle Big Infrastr & Operations Big Analytics & Applicatn Big Security NIST BD-WG and UvA BDAF 17

18 Big Ecosystem:, Lifecycle, Infrastructure Consumer Source Collection& Registration Filter/Enrich, Classification Analytics, Modeling, Prediction Delivery, Visualisation Big Target/Customer: Actionable/Usable Target users, processes, objects, behavior, etc. Big Source/Origin (sensor, experiment, logdata, behavioral data) Big Analytic/Tools Federated Access and Delivery Infrastructure (FADI) Storage General Purpose Compute General Purpose High Performance Computer Clusters Storage Specialised bases Archives (analytics DB, In memory, operstional) Management categories: categories: metadata, categories: metadata, (un)structured, metadata, (un)structured, (non)identifiable (un)structured, (non)identifiable (non)identifiable Intercloud multi-provider heterogeneous Infrastructure Security Infrastructure Network Infrastructure Internal Infrastructure Management/Monitoring NIST BD-WG and UvA BDAF 18

19 Big Infrastructure and Analytic Tools Big Target/Customer: Actionable/Usable Target users, processes, objects, behavior, etc. Big Source/Origin (sensor, experiment, logdata, behavioral data) Big Analytic/Tools Analytics: Refinery, Linking, Fusion Analytics Applications : Link Analysis Cluster Analysis Entity Resolution Complex Analysis Federated Access and Delivery Infrastructure (FADI) Analytics : Realtime, Interactive, Batch, Streaming Storage General Purpose Compute General Purpose High Performance Computer Clusters Storage Specialised bases Archives Management categories: categories: metadata, categories: metadata, (un)structured, metadata, (un)structured, (non)identifiable (un)structured, (non)identifiable (non)identifiable Intercloud multi-provider heterogeneous Infrastructure Security Infrastructure Network Infrastructure Internal Infrastructure Management/Monitoring NIST BD-WG and UvA BDAF 19

20 Consumer Analitics Application Transformation/Lifecycle Model Model (1) Model (1) (inter)linking? Persistent ID Identification Privacy, Opacity Storage Model (3) Common Model? Variety and Variability Semantic Interoperability Model (4) Source Collection& Registration Filter/Enrich, Classification Analytics, Modeling, Prediction Delivery, Visualisation repurposing, Analitics re-factoring, Secondary processing Does Model changes along lifecycle or data evolution? Identifying and linking data Persistent identifier Traceability vs Opacity Referral integrity NIST BD-WG and UvA BDAF 20

21 Scientific Lifecycle Management (SDLM) Model DB Lifecycle Model in e-science User Researcher discovery Curation (including retirement and clean up) recycling Re-purpose archiving Raw Experimental Structured Scientific linkage to papers archiving Project/ Experiment Planning collection and filtering analysis sharing/ publishing End of project Linkage Issues Persistent Identifiers (PID) ORCID (Open Researcher and Contributor ID) Lined Re-purpose Clean up and Retirement Ownership and authority Detainment Links Open Public Use Metadata & Mngnt NIST BD-WG and UvA BDAF 21

22 Evolutional/Hierarchical Model Actionable Papers/Reports Archival Usable Processed (for target use) Processed (for target use) Processed (for target use) Classified/Structured Classified/Structured Classified/Structured Raw Common Model? interlinking? Fits to Graph data type? Metadata Referrals Control information Policy patterns NIST BD-WG and UvA BDAF 22

23 Additional Information Existing proposed Big architectures NIST BD-WG and UvA BDAF 23

24 Industry Initiatives to define Big (Architecture) Open Center Alliance (ODCA) Information as a Service (INFOaaS) TMF Big Analytics Reference Architecture Research Alliance (RDA) All data related aspects, but not Infrastructure and tools LexisNexis HPCC Systems NIST BD-WG and UvA BDAF 24

25 ODCA INFOaaS Information as a Service Using integrated/unified storage New DB/storage technologies allow storing data during all lifecycle [ref] Open Center Alliance Master Usage model: Information as a Service, Rev org/docs/information_as_a_servic e_master_usage_model_rev1.0.p df NIST BD-WG and UvA BDAF 25

26 ODCA Example INFOaaS Architecture Core and Information Components Integration and Distribution Components Presentation and Information Delivery Components Control and Support Components NIST BD-WG and UvA BDAF 26

27 TMF Big Analytics Architecture [ref] TR202 Big Analytics Reference Model. Version 1.9, April NIST BD-WG and UvA BDAF 27

28 LexisNexis Vision for Analytics Supercomputer (DAS) [ref] [ref] HPCC Systems: Introduction to HPCC (High Performance Computer Cluster), Author: A.M. Middleton, LexisNexis Risk Solutions, Date: May 24, 2011 NIST BD-WG and UvA BDAF 28

29 LexisNexis HPCC System Architecture ECL Enterprise Control Language THOR Processing Cluster ( Refinery) Roxie Rapid Delivery Engine [ref] HPCC Systems: Introduction to HPCC (High Performance Computer Cluster), Author: A.M. Middleton, LexisNexis Risk Solutions, Date: May 24, 2011 NIST BD-WG and UvA BDAF 29

30 IBM GBS Business Analytics and Optimisation (2011). b6acfbd608c0/document/aa78f77c-0d57-4f41-a923-50e5c6374b6d/media&ei=yrknubjmnm_liwkqhocqbq&usg=afqjcnf_xu6aifcahlf4266xxnhkfkatlw&sig2=j8jifv_md5dnzfql0spvrg&bvm=bv ,d.cGE September 2013, RDA- NIST BD-WG and UvA BDAF 30

Cloud and Big Data Standardisation

Cloud and Big Data Standardisation Cloud and Big Data Standardisation EuroCloud Symposium ICS Track: Standards for Big Data in the Cloud 15 October 2013, Luxembourg Yuri Demchenko System and Network Engineering Group, University of Amsterdam

More information

Big Data Standardisation in Industry and Research

Big Data Standardisation in Industry and Research Big Data Standardisation in Industry and Research EuroCloud Symposium ICS Track: Standards for Big Data in the Cloud 15 October 2013, Luxembourg Yuri Demchenko System and Network Engineering Group, University

More information

Big Data Architecture Research at UvA

Big Data Architecture Research at UvA Big Architecture Research at UvA Yuri Demchenko System and Network Engineering Group, University of Amsterdam ISO/IEC SGBD Big Technologies Workshop Part of ISO/IEC Big Study Group meeting 13-16 May Outline

More information

Defining the Big Data Architecture Framework (BDAF)

Defining the Big Data Architecture Framework (BDAF) Defining the Big Architecture Framework (BDAF) Outcome of the Brainstorming Session at the University of Amsterdam Yuri Demchenko (facilitator, reporter), SNE Group, University of Amsterdam 17 July 2013,

More information

Defining Architecture Components of the Big Data Ecosystem

Defining Architecture Components of the Big Data Ecosystem Defining Architecture Components of the Big Data Ecosystem Yuri Demchenko SNE Group, University of Amsterdam 2 nd BDDAC2014 Symposium, CTS2014 Conference 19-23 May 2014, Minneapolis, USA Outline Big Data

More information

Big Security for Big Data: Addressing Security Challenges for the Big Data Infrastructure

Big Security for Big Data: Addressing Security Challenges for the Big Data Infrastructure Big Security for Big : Addressing Security Challenges for the Big Infrastructure Yuri Demchenko SNE Group, University of Amsterdam On behalf of Yuri Demchenko, Canh Ngo, Cees de Laat, Peter Membrey, Daniil

More information

Big Data course and Learning Model for Online education (LMO) at the Laureate Online Education (University of Liverpool)

Big Data course and Learning Model for Online education (LMO) at the Laureate Online Education (University of Liverpool) Big Data course and Learning Model for Online education (LMO) at the Laureate Online Education (University of Liverpool) Yuri Demchenko (University of Amsterdam) Emanuel Gruengard (Laureate Online Education)

More information

Instructional Model for Building effective Big Data Curricula for Online and Campus Education

Instructional Model for Building effective Big Data Curricula for Online and Campus Education Instructional Model for Building effective Big Data Curricula for Online and Campus Education Big Data course and Learning Model for Online education (LMO) at the Laureate Online Education (University

More information

Defining Architecture Components of the Big Data Ecosystem

Defining Architecture Components of the Big Data Ecosystem Defining Architecture Components of the Big Data Ecosystem Yuri Demchenko, Cees de Laat System and Network Engineering Group University of Amsterdam Amsterdam, The Netherlands e-mail: {y.demchenko, C.T.A.M.deLaat}@uva.nl

More information

Big Data and Data Intensive Science:

Big Data and Data Intensive Science: Big Data and Data Intensive Science: Infrastructure and Other Challenges Yuri Demchenko SNE Group, University of Amsterdam EENET Conference 4 October 2013, Tartu, Outline Big Data and Data Intensive Science

More information

Architecture Framework and Components for the Big Data Ecosystem

Architecture Framework and Components for the Big Data Ecosystem System and Network Engineering Group, UvA UNIVERSITEIT VAN AMSTERDAM System and Network Engineering Architecture Framework and Components for the Big Data Ecosystem Draft Version 0.2 Yuri Demchenko, Canh

More information

Big Data and Data Intensive (Science) Technologies

Big Data and Data Intensive (Science) Technologies Big Data and Data Intensive (Science) Technologies Yuri Demchenko System and Network Engineering, UvA CN CN CN CN CN CN CN CN CN CN CN 2014, UvA Big Data and Data Science 1 Outline Big Data and Data Intensive

More information

Open Cloud exchange (OCX)

Open Cloud exchange (OCX) Open Cloud exchange (OCX) Draft Proposal and Progress GN3plus JRA1 Task 2 - Network Architectures for Cloud Services Yuri Demchenko SNE Group, University of Amsterdam 10 October 2013, GN3plus Symposium,

More information

Instructional Model for Building effective Big Data Curricula for Online and Campus Education

Instructional Model for Building effective Big Data Curricula for Online and Campus Education Instructional Model for Building effective Big Data Curricula for Online and Campus Education 1 Yuri Demchenko, 2 Emanuel Gruengard, 1,3 Sander Klous, 1 University of Amsterdam System and Network Engineering

More information

NIST Big Data Phase I Public Working Group

NIST Big Data Phase I Public Working Group NIST Big Data Phase I Public Working Group Reference Architecture Subgroup May 13 th, 2014 Presented by: Orit Levin Co-chair of the RA Subgroup Agenda Introduction: Why and How NIST Big Data Reference

More information

Survey of Big Data Architecture and Framework from the Industry

Survey of Big Data Architecture and Framework from the Industry Survey of Big Data Architecture and Framework from the Industry NIST Big Data Public Working Group Sanjay Mishra May13, 2014 3/19/2014 NIST Big Data Public Working Group 1 NIST BD PWG Survey of Big Data

More information

Big Data Challenges for e-science Infrastructure

Big Data Challenges for e-science Infrastructure Big Challenges for e-science Infrastructure Yuri Demchenko, SNE Group, University of Amsterdam AAA-Study Project COINFO2012 Conference 24-25 November 2012, Nanjing, China 23-25 November 2012, Nanjing Big

More information

NIST Big Data Public Working Group

NIST Big Data Public Working Group NIST Big Data Public Working Group Requirements May 13, 2014 Arnab Roy, Fujitsu On behalf of the NIST BDWG S&P Subgroup S&P Requirements Emerging due to Big Data Characteristics Variety: Traditional encryption

More information

DRAFT NIST Big Data Interoperability Framework: Volume 5, Architectures White Paper Survey

DRAFT NIST Big Data Interoperability Framework: Volume 5, Architectures White Paper Survey NIST Special Publication 1500-5 DRAFT NIST Big Data Interoperability Framework: Volume 5, Architectures White Paper Survey NIST Big Data Public Working Group Reference Architecture Subgroup Draft Version

More information

EDISON: Coordination and cooperation to establish new profession of Data Scientist for European Research and Industry

EDISON: Coordination and cooperation to establish new profession of Data Scientist for European Research and Industry EDISON: Coordination and cooperation to establish new profession of Data Scientist for European Research and Industry Yuri Demchenko University of Amsterdam EDISON Education for Data Intensive Science

More information

NIST Big Data Interoperability Framework: Volume 5, Architectures White Paper Survey

NIST Big Data Interoperability Framework: Volume 5, Architectures White Paper Survey NIST Special Publication 1500-5 NIST Big Data Interoperability Framework: Volume 5, Architectures White Paper Survey Final Version 1 NIST Big Data Public Working Group Reference Architecture Subgroup This

More information

Research Data Alliance: Current Activities and Expected Impact. SGBD Workshop, May 2014 Herman Stehouwer

Research Data Alliance: Current Activities and Expected Impact. SGBD Workshop, May 2014 Herman Stehouwer Research Data Alliance: Current Activities and Expected Impact SGBD Workshop, May 2014 Herman Stehouwer The Vision 2 Researchers and innovators openly share data across technologies, disciplines, and countries

More information

Data Centric Systems (DCS)

Data Centric Systems (DCS) Data Centric Systems (DCS) Architecture and Solutions for High Performance Computing, Big Data and High Performance Analytics High Performance Computing with Data Centric Systems 1 Data Centric Systems

More information

Standard Big Data Architecture and Infrastructure

Standard Big Data Architecture and Infrastructure Standard Big Data Architecture and Infrastructure Wo Chang Digital Data Advisor Information Technology Laboratory (ITL) National Institute of Standards and Technology (NIST) wchang@nist.gov May 20, 2016

More information

Addressing Big Data Issues in Scientific Data Infrastructure

Addressing Big Data Issues in Scientific Data Infrastructure Addressing Big Issues in Scientific Infrastructure Yuri Demchenko, SNE Group, University of Amsterdam 21 May 2013, BDDAC2013 Workshop CTS2013 San Diego Outline Background to Big research at SNE/UvA Big

More information

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning How to use Big Data in Industry 4.0 implementations LAURI ILISON, PhD Head of Big Data and Machine Learning Big Data definition? Big Data is about structured vs unstructured data Big Data is about Volume

More information

IBM Solution Framework for Lifecycle Management of Research Data. 2008 IBM Corporation

IBM Solution Framework for Lifecycle Management of Research Data. 2008 IBM Corporation IBM Solution Framework for Lifecycle Management of Research Data Aspects of Lifecycle Management Research Utilization of research paper Usage history Metadata enrichment Usage Pattern / Citation Collaboration

More information

Big Data in Clouds. Cloud based platforms and tools for Big Data applications. Data Centric Computing UvA workshop 3 November 2015.

Big Data in Clouds. Cloud based platforms and tools for Big Data applications. Data Centric Computing UvA workshop 3 November 2015. Big Data in Clouds Cloud based platforms and tools for Big Data applications UvA workshop 3 November 2015 Yuri Demchenko Cloudscape for Big Data 1 Outline Big Data and Need for new paradigm? Cloud Computing

More information

Cloud computing based big data ecosystem and requirements

Cloud computing based big data ecosystem and requirements Cloud computing based big data ecosystem and requirements Yongshun Cai ( 蔡 永 顺 ) Associate Rapporteur of ITU T SG13 Q17 China Telecom Dong Wang ( 王 东 ) Rapporteur of ITU T SG13 Q18 ZTE Corporation Agenda

More information

Big Data in the Cloud for Education Institutions

Big Data in the Cloud for Education Institutions Big Data in the Cloud for Education Institutions Pradit Songsangyos 1 Rajamangala University of Technology Suvarnabhumi, Phranakhon Si Ayutthaya, Thailand 1 pradit.s@rmutsb.ac.th Prachyanun Nilsook 2 King

More information

ISO/IEC JTC1 SC32. Next Generation Analytics Study Group

ISO/IEC JTC1 SC32. Next Generation Analytics Study Group November 13, 2013 ISO/IEC JTC1 SC32 Next Generation Analytics Study Group Title: Author: Project: Status: Big Data Efforts Keith W. Hare Discussion Paper References: 1/6 1 NIST Big Data Public Working

More information

NIST Big Data PWG & RDA Big Data Infrastructure WG: Implementation Strategy: Best Practice Guideline for Big Data Application Development

NIST Big Data PWG & RDA Big Data Infrastructure WG: Implementation Strategy: Best Practice Guideline for Big Data Application Development NIST Big Data PWG & RDA Big Data Infrastructure WG: Implementation Strategy: Best Practice Guideline for Big Data Application Development Wo Chang wchang@nist.gov National Institute of Standards and Technology

More information

locuz.com Big Data Services

locuz.com Big Data Services locuz.com Big Data Services Big Data At Locuz, we help the enterprise move from being a data-limited to a data-driven one, thereby enabling smarter, faster decisions that result in better business outcome.

More information

Big Data and Complex Networks Analytics. Timos Sellis, CSIT Kathy Horadam, MGS

Big Data and Complex Networks Analytics. Timos Sellis, CSIT Kathy Horadam, MGS Big Data and Complex Networks Analytics Timos Sellis, CSIT Kathy Horadam, MGS Big Data What is it? Most commonly accepted definition, by Gartner (the 3 Vs) Big data is high-volume, high-velocity and high-variety

More information

The InterNational Committee for Information Technology Standards INCITS Big Data

The InterNational Committee for Information Technology Standards INCITS Big Data The InterNational Committee for Information Technology Standards INCITS Big Data Keith W. Hare JCC Consulting, Inc. April 2, 2015 Who am I? Senior Consultant with JCC Consulting, Inc. since 1985 High performance

More information

EDISON Education for Data Intensive Science to Open New science frontiers

EDISON Education for Data Intensive Science to Open New science frontiers H2020 INFRASUPP-4 CSA Project EDISON Education for Data Intensive Science to Open New science frontiers Yuri Demchenko University of Amsterdam Outline Consortium members EDISON Project Concept and Objectives

More information

Data collection architecture for Big Data

Data collection architecture for Big Data Data collection architecture for Big Data a framework for a research agenda (Research in progress - ERP Sense Making of Big Data) Wout Hofman, May 2015, BDEI workshop 2 Big Data succes stories bias our

More information

ISSN:2321-1156 International Journal of Innovative Research in Technology & Science(IJIRTS)

ISSN:2321-1156 International Journal of Innovative Research in Technology & Science(IJIRTS) Nguyễn Thị Thúy Hoài, College of technology _ Danang University Abstract The threading development of IT has been bringing more challenges for administrators to collect, store and analyze massive amounts

More information

COMP9321 Web Application Engineering

COMP9321 Web Application Engineering COMP9321 Web Application Engineering Semester 2, 2015 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 11 (Part II) http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid=2411

More information

ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V

ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V WHITE PAPER Create the Data Center of the Future Accelerate

More information

Big Data and Analytics: Challenges and Opportunities

Big Data and Analytics: Challenges and Opportunities Big Data and Analytics: Challenges and Opportunities Dr. Amin Beheshti Lecturer and Senior Research Associate University of New South Wales, Australia (Service Oriented Computing Group, CSE) Talk: Sharif

More information

Defining Generic Architecture for Cloud Infrastructure as a Service Model

Defining Generic Architecture for Cloud Infrastructure as a Service Model Defining Generic Architecture for Cloud Infrastructure as a Service Model Yuri Demchenko 1 University of Amsterdam Science Park 904, Amsterdam, The Netherlands E-mail: y.demchenko@uva.nl Cees de Laat University

More information

90% of your Big Data problem isn t Big Data.

90% of your Big Data problem isn t Big Data. White Paper 90% of your Big Data problem isn t Big Data. It s the ability to handle Big Data for better insight. By Arjuna Chala Risk Solutions HPCC Systems Introduction LexisNexis is a leader in providing

More information

Addressing Big Data Issues in Scientific Data Infrastructure

Addressing Big Data Issues in Scientific Data Infrastructure Addressing Big Data Issues in Scientific Data Infrastructure Yuri Demchenko, Paola Grosso, Cees de Laat System and Network Engineering Group University of Amsterdam Amsterdam, The Netherlands e-mail: {y.demchenko,

More information

Bringing Strategy to Life Using an Intelligent Data Platform to Become Data Ready. Informatica Government Summit April 23, 2015

Bringing Strategy to Life Using an Intelligent Data Platform to Become Data Ready. Informatica Government Summit April 23, 2015 Bringing Strategy to Life Using an Intelligent Platform to Become Ready Informatica Government Summit April 23, 2015 Informatica Solutions Overview Power the -Ready Enterprise Government Imperatives Improve

More information

Future Trends in Big Data

Future Trends in Big Data Future Trends in Big Data NIST Big Data Public Working Group IEEE Big Data Workshop October 27, 2014 David Boyd, Chief Technology Officer L-3 Data Tactics david.boyd@l-3com.com David Boyd CTO, L-3 Data

More information

Cloud Standards - A Telco Perspective

Cloud Standards - A Telco Perspective Cloud Standards - A Telco Perspective Abdellatif Benjelloun Touimi abdellatif.benjelloun@huawei.com Corporate Standards Department www.huawei.com TEN YEARS OF CONNECTING EUROPE HUAWEI TECHNOLOGIES CO.,

More information

Big Data Systems and Interoperability

Big Data Systems and Interoperability Big Data Systems and Interoperability Emerging Standards for Systems Engineering David Boyd VP, Data Solutions Email: dboyd@incadencecorp.com Topics Shameless plugs and denials What is Big Data and Why

More information

Big Data Analytics Platform @ Nokia

Big Data Analytics Platform @ Nokia Big Data Analytics Platform @ Nokia 1 Selecting the Right Tool for the Right Workload Yekesa Kosuru Nokia Location & Commerce Strata + Hadoop World NY - Oct 25, 2012 Agenda Big Data Analytics Platform

More information

EL Program: Smart Manufacturing Systems Design and Analysis

EL Program: Smart Manufacturing Systems Design and Analysis EL Program: Smart Manufacturing Systems Design and Analysis Program Manager: Dr. Sudarsan Rachuri Associate Program Manager: K C Morris Strategic Goal: Smart Manufacturing, Construction, and Cyber-Physical

More information

Arnab Roy Fujitsu Laboratories of America and CSA Big Data WG

Arnab Roy Fujitsu Laboratories of America and CSA Big Data WG Arnab Roy Fujitsu Laboratories of America and CSA Big Data WG 1 The Big Data Working Group (BDWG) will be identifying scalable techniques for data-centric security and privacy problems. BDWG s investigation

More information

Top Ten Security and Privacy Challenges for Big Data and Smartgrids. Arnab Roy Fujitsu Laboratories of America

Top Ten Security and Privacy Challenges for Big Data and Smartgrids. Arnab Roy Fujitsu Laboratories of America 1 Top Ten Security and Privacy Challenges for Big Data and Smartgrids Arnab Roy Fujitsu Laboratories of America 2 User Roles and Security Concerns [SKCP11] Users and Security Concerns [SKCP10] Utilities:

More information

Enterprise Application Enablement for the Internet of Things

Enterprise Application Enablement for the Internet of Things Enterprise Application Enablement for the Internet of Things Prof. Dr. Uwe Kubach VP Internet of Things Platform, P&I Technology, SAP SE Public Internet of Things (IoT) Trends 12 50 bn 40 50 % Devices

More information

Big Data, Integration and Governance: Ask the Experts

Big Data, Integration and Governance: Ask the Experts Big, Integration and Governance: Ask the Experts January 29, 2013 1 The fourth dimension of Big : Veracity handling data in doubt Volume Velocity Variety Veracity* at Rest Terabytes to exabytes of existing

More information

Hur hanterar vi utmaningar inom området - Big Data. Jan Östling Enterprise Technologies Intel Corporation, NER

Hur hanterar vi utmaningar inom området - Big Data. Jan Östling Enterprise Technologies Intel Corporation, NER Hur hanterar vi utmaningar inom området - Big Data Jan Östling Enterprise Technologies Intel Corporation, NER Legal Disclaimers All products, computer systems, dates, and figures specified are preliminary

More information

A Strawman Model. NIST Cloud Computing Reference Architecture and Taxonomy Working Group. January 3, 2011

A Strawman Model. NIST Cloud Computing Reference Architecture and Taxonomy Working Group. January 3, 2011 A Strawman Model NIST Cloud Computing Reference Architecture and Taxonomy Working Group January 3, 2011 Objective Our objective is to define a neutral architecture consistent with NIST definition of cloud

More information

Big Data & Security. Aljosa Pasic 12/02/2015

Big Data & Security. Aljosa Pasic 12/02/2015 Big Data & Security Aljosa Pasic 12/02/2015 Welcome to Madrid!!! Big Data AND security: what is there on our minds? Big Data tools and technologies Big Data T&T chain and security/privacy concern mappings

More information

How To Create A Data Science System

How To Create A Data Science System Enhance Collaboration and Data Sharing for Faster Decisions and Improved Mission Outcome Richard Breakiron Senior Director, Cyber Solutions Rbreakiron@vion.com Office: 571-353-6127 / Cell: 803-443-8002

More information

Data Science Body of Knowledge (DS-BoK): Approach and Initial

Data Science Body of Knowledge (DS-BoK): Approach and Initial EDISON Discussion Document Data Science Body of Knowledge (DS-BoK): Approach and Initial version Project acronym: EDISON Project full title: Education for Data Intensive Science to Open New science frontiers

More information

Transforming the Telecoms Business using Big Data and Analytics

Transforming the Telecoms Business using Big Data and Analytics Transforming the Telecoms Business using Big Data and Analytics Event: ICT Forum for HR Professionals Venue: Meikles Hotel, Harare, Zimbabwe Date: 19 th 21 st August 2015 AFRALTI 1 Objectives Describe

More information

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: 2454-2377 Vol. 1, Issue 6, October 2015. Big Data and Hadoop

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: 2454-2377 Vol. 1, Issue 6, October 2015. Big Data and Hadoop ISSN: 2454-2377, October 2015 Big Data and Hadoop Simmi Bagga 1 Satinder Kaur 2 1 Assistant Professor, Sant Hira Dass Kanya MahaVidyalaya, Kala Sanghian, Distt Kpt. INDIA E-mail: simmibagga12@gmail.com

More information

Reference Architecture, Requirements, Gaps, Roles

Reference Architecture, Requirements, Gaps, Roles Reference Architecture, Requirements, Gaps, Roles The contents of this document are an excerpt from the brainstorming document M0014. The purpose is to show how a detailed Big Data Reference Architecture

More information

OPEN DATA CENTER ALLIANCE USAGE MODEL: Cloud Maturity Model Rev. 2.0

OPEN DATA CENTER ALLIANCE USAGE MODEL: Cloud Maturity Model Rev. 2.0 OPEN DATA CENTER ALLIANCE USAGE MODEL: Cloud Maturity Model Rev. 2.0 Table of Contents Legal Notice...3 Executive Summary...4 Overview of the Cloud Maturity Model...5 Description of the Cloud Maturity

More information

Agenda 4/21/2015. 1. Big Data Level Set 2. Who are we? 3. What do we do? 4. What have we done so far? 5. What are we working on? 6.

Agenda 4/21/2015. 1. Big Data Level Set 2. Who are we? 3. What do we do? 4. What have we done so far? 5. What are we working on? 6. Wilco van Ginkel, Co-Chair BDWG Agenda 1. Big Data Level Set 2. Who are we? 3. What do we do? 4. What have we done so far? 5. What are we working on? 6. Q&A 1 Big Data Level Set 2 Word on the street Sliding

More information

Exploiting Data at Rest and Data in Motion with a Big Data Platform

Exploiting Data at Rest and Data in Motion with a Big Data Platform Exploiting Data at Rest and Data in Motion with a Big Data Platform Sarah Brader, sarah_brader@uk.ibm.com What is Big Data? Where does it come from? 12+ TBs of tweet data every day 30 billion RFID tags

More information

Technology Implications of an Instrumented Planet presented at IFIP WG 10.4 Workshop on Challenges and Directions in Dependability

Technology Implications of an Instrumented Planet presented at IFIP WG 10.4 Workshop on Challenges and Directions in Dependability Technology Implications of an Instrumented Planet presented at IFIP WG 10.4 Workshop on Challenges and Directions in Dependability Nick Bowen Colin Harrison IBM June 2008 1 Background Global Technology

More information

11:06. Transformation From People serving Structures to Networks serving People. Montag, 08. Dezember 2014

11:06. Transformation From People serving Structures to Networks serving People. Montag, 08. Dezember 2014 Transformation From People serving Structures to Networks serving People 11:06 Montag, 08. Dezember 2014 With People Wisdom of the crowd Preventive Autonomous Transreality Network Integrated Pretime Harald

More information

AppSymphony White Paper

AppSymphony White Paper AppSymphony White Paper Secure Self-Service Analytics for Curated Digital Collections Introduction Optensity, Inc. offers a self-service analytic app composition platform, AppSymphony, which enables data

More information

Big Data Terminology - Key to Predictive Analytics Success. Mark E. Johnson Department of Statistics University of Central Florida F2: Statistics

Big Data Terminology - Key to Predictive Analytics Success. Mark E. Johnson Department of Statistics University of Central Florida F2: Statistics Big Data Terminology - Key to Predictive Analytics Success Mark E. Johnson Department of Statistics University of Central Florida F2: Statistics Outline Big Data Phenomena Terminology Role Background on

More information

DAMA NY DAMA Day October 17, 2013 IBM 590 Madison Avenue 12th floor New York, NY

DAMA NY DAMA Day October 17, 2013 IBM 590 Madison Avenue 12th floor New York, NY Big Data Analytics DAMA NY DAMA Day October 17, 2013 IBM 590 Madison Avenue 12th floor New York, NY Tom Haughey InfoModel, LLC 868 Woodfield Road Franklin Lakes, NJ 07417 201 755 3350 tom.haughey@infomodelusa.com

More information

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies Big Data: Global Digital Data Growth Growing leaps and bounds by 40+% Year over Year! 2009 =.8 Zetabytes =.08

More information

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance IBM s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice

More information

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON Overview * Introduction * Multiple faces of Big Data * Challenges of Big Data * Cloud Computing

More information

Securing Big Data Learning and Differences from Cloud Security

Securing Big Data Learning and Differences from Cloud Security Securing Big Data Learning and Differences from Cloud Security Samir Saklikar RSA, The Security Division of EMC Session ID: DAS-108 Session Classification: Advanced Agenda Cloud Computing & Big Data Similarities

More information

SQLstream Blaze and Apache Storm A BENCHMARK COMPARISON

SQLstream Blaze and Apache Storm A BENCHMARK COMPARISON SQLstream Blaze and Apache Storm A BENCHMARK COMPARISON 2 The V of Big Data Velocity means both how fast data is being produced and how fast the data must be processed to meet demand. Gartner The emergence

More information

BIG DATA PUBLIC PRIVATE FORUM

BIG DATA PUBLIC PRIVATE FORUM BIG DATA PUBLIC PRIVATE FORUM Agenda 09:00-10:30 9:00-9:20 9:20-9:55 9:55-10:30 The Big Project Results (Session 1) - The Big Project - Welcome and Introduction Nuria De Lama (ATOS Spain) - Key Technology

More information

Enabling the Big Data Commons through indexing of data and their interactions

Enabling the Big Data Commons through indexing of data and their interactions biomedical and healthcare Data Discovery Index Ecosystem Enabling the Big Data Commons through indexing of and their interactions 2 nd BD2K all-hands meeting Bethesda 11/12/15 Aims 1. Help users find accessible

More information

Towards a Thriving Data Economy: Open Data, Big Data, and Data Ecosystems

Towards a Thriving Data Economy: Open Data, Big Data, and Data Ecosystems Towards a Thriving Data Economy: Open Data, Big Data, and Data Ecosystems Volker Markl volker.markl@tu-berlin.de dima.tu-berlin.de dfki.de/web/research/iam/ bbdc.berlin Based on my 2014 Vision Paper On

More information

How To Build A Cloud Platform

How To Build A Cloud Platform Cloud Platforms: Concepts, Definitions, Architectures and Open Issues Samir Tata, Institut Mines-Télécom Télécom SudParis Institut Mines-Télécom Outline Concepts & Definitions Architectures Standards Open

More information

Data Refinery with Big Data Aspects

Data Refinery with Big Data Aspects International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 7 (2013), pp. 655-662 International Research Publications House http://www. irphouse.com /ijict.htm Data

More information

IBM & Cloud Computing. Smarter Planet. John Easton UK & Ireland Cloud Computing Technical Leader

IBM & Cloud Computing. Smarter Planet. John Easton UK & Ireland Cloud Computing Technical Leader Cloud Getting hold Computing of the cloud: for a & Cloud Computing Smarter Planet John Easton UK & Ireland Cloud Computing Technical Leader Copyright Corporation 2010 Cloud is important to Corporation

More information

GO BEYOND DATA Real-time Analytics for Application Performance Management

GO BEYOND DATA Real-time Analytics for Application Performance Management GO BEYOND DATA Real-time Analytics for Application Performance Management Yury Oleynik Data Analyst Modern applications Agenda Monitoring challenges INSTANA apploach Instana, Inc. Proprietary and Confidential

More information

Information Security, PII and Big Data

Information Security, PII and Big Data ITU Workshop on ICT Security Standardization for Developing Countries (Geneva, Switzerland, 15-16 September 2014) Information Security, PII and Big Data Edward (Ted) Humphreys ISO/IEC JTC 1/SC 27 (WG1

More information

Big Data Analytics. Chances and Challenges. Volker Markl

Big Data Analytics. Chances and Challenges. Volker Markl Volker Markl Professor and Chair Database Systems and Information Management (DIMA), Technische Universität Berlin www.dima.tu-berlin.de Big Data Analytics Chances and Challenges Volker Markl DIMA BDOD

More information

The Internet of Things and Big Data: Intro

The Internet of Things and Big Data: Intro The Internet of Things and Big Data: Intro John Berns, Solutions Architect, APAC - MapR Technologies April 22 nd, 2014 1 What This Is; What This Is Not It s not specific to IoT It s not about any specific

More information

Building the Internet of Things Jim Green - CTO, Data & Analytics Business Group, Cisco Systems

Building the Internet of Things Jim Green - CTO, Data & Analytics Business Group, Cisco Systems Building the Internet of Things Jim Green - CTO, Data & Analytics Business Group, Cisco Systems Brian McCarson Sr. Principal Engineer & Sr. System Architect, Internet of Things Group, Intel Corp Mac Devine

More information

ASCETiC Whitepaper. Motivation. ASCETiC Toolbox Business Goals. Approach

ASCETiC Whitepaper. Motivation. ASCETiC Toolbox Business Goals. Approach ASCETiC Whitepaper Motivation The increased usage of ICT, together with growing energy costs and the need to reduce greenhouse gases emissions call for energy-efficient technologies that decrease the overall

More information

Cloud Computing Standards: Overview and first achievements in ITU-T SG13.

Cloud Computing Standards: Overview and first achievements in ITU-T SG13. Cloud Computing Standards: Overview and first achievements in ITU-T SG13. Dr ITU-T, Chairman of Cloud Computing Working Party, SG 13 Future Networks Orange Labs Networks, Cloud & Future Networks Standard

More information

International Journal of Advancements in Research & Technology, Volume 3, Issue 5, May-2014 18 ISSN 2278-7763. BIG DATA: A New Technology

International Journal of Advancements in Research & Technology, Volume 3, Issue 5, May-2014 18 ISSN 2278-7763. BIG DATA: A New Technology International Journal of Advancements in Research & Technology, Volume 3, Issue 5, May-2014 18 BIG DATA: A New Technology Farah DeebaHasan Student, M.Tech.(IT) Anshul Kumar Sharma Student, M.Tech.(IT)

More information

Emerging Approaches in a Cloud-Connected Enterprise: Containers and Microservices

Emerging Approaches in a Cloud-Connected Enterprise: Containers and Microservices Emerging Approaches in a -Connected Enterprise: Containers and Microservices Anil Karmel Co-Founder and CEO, C2 Labs Co-Chair, NIST Security Working Group akarmel@c2labs.com @anilkarmel Emerging Technologies

More information

Session 4 Cloud computing for future ICT Knowledge platforms

Session 4 Cloud computing for future ICT Knowledge platforms ITU Workshop on "Future Trust and Knowledge Infrastructure", Phase 1 Geneva, Switzerland, 24 April 2015 Session 4 Cloud computing for future ICT Knowledge platforms Olivier Le Grand, Senior Standardization

More information

Standards for Big Data in the Cloud

Standards for Big Data in the Cloud Standards for Big Data in the Cloud James Kobielus Chair, CSCC Big Data Working Group Big Data Evangelist, Senior Program Director, Product Marketing, Big Data Analytics, IBM jgkobiel@us.ibm.com 15 October

More information

Cloud Ready Data: Speeding Your Journey to the Cloud

Cloud Ready Data: Speeding Your Journey to the Cloud Cloud Ready Data: Speeding Your Journey to the Cloud Hybrid Cloud first Born to the cloud 3 Am I part of a Cloud First organization? Am I part of a Cloud First agency? The cloud applications questions

More information

The New World of Data. Don Strickland President, Strickland & Associates

The New World of Data. Don Strickland President, Strickland & Associates The New World of Data Don Strickland President, Strickland & Associates THE NEW WORLD OF DATA 1900 1950 2000 Physical Infrastructure Labor Capital Physical Infrastructure Labor Capital Physical Infrastructure

More information

Decision Support Optimization through Predictive Analytics - Leuven Statistical Day 2010

Decision Support Optimization through Predictive Analytics - Leuven Statistical Day 2010 Decision Support Optimization through Predictive Analytics - Leuven Statistical Day 2010 Ernst van Waning Senior Sales Engineer May 28, 2010 Agenda SPSS, an IBM Company SPSS Statistics User-driven product

More information

Trust and Dependability in Cloud Computing

Trust and Dependability in Cloud Computing Trust and Dependability in Cloud Computing Claus Pahl IC4 Principal Investigator November 7 th, 2013 Research Philosophy design for growth design for best service provision design for widest acceptance

More information

Standardization Requirements Analysis on Big Data in Public Sector based on Potential Business Models

Standardization Requirements Analysis on Big Data in Public Sector based on Potential Business Models , pp. 165-172 http://dx.doi.org/10.14257/ijseia.2014.8.11.15 Standardization Requirements Analysis on Big Data in Public Sector based on Potential Business Models Suwook Ha 1, Seungyun Lee 2 and Kangchan

More information

Search and Data Mining: Techniques. Applications Anya Yarygina Boris Novikov

Search and Data Mining: Techniques. Applications Anya Yarygina Boris Novikov Search and Data Mining: Techniques Applications Anya Yarygina Boris Novikov Introduction Data mining applications Data mining system products and research prototypes Additional themes on data mining Social

More information