Big Data and Complex Networks Analytics. Timos Sellis, CSIT Kathy Horadam, MGS



Similar documents
Exploiting the power of Big Data

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: Vol. 1, Issue 6, October Big Data and Hadoop

How To Make Data Streaming A Real Time Intelligence

ANALYTICS IN BIG DATA ERA

locuz.com Big Data Services

Information Management course

Exploiting Data at Rest and Data in Motion with a Big Data Platform

Are You Ready for Big Data?

3rd International Symposium on Big Data and Cloud Computing Challenges (ISBCC-2016) March 10-11, 2016 VIT University, Chennai, India

Software Engineering for Big Data. CS846 Paulo Alencar David R. Cheriton School of Computer Science University of Waterloo

The University of Jordan

Professional Organization Checklist for the Computer Science Curriculum Updates. Association of Computing Machinery Computing Curricula 2008

Big Data R&D Initiative

ISSN: International Journal of Innovative Research in Technology & Science(IJIRTS)

Big Data Analytics. Lucas Rego Drumond

UNIVERSITY OF INFINITE AMBITIONS. MASTER OF SCIENCE COMPUTER SCIENCE DATA SCIENCE AND SMART SERVICES

Take the Red Pill: Becoming One with Your Computing Environment using Security Intelligence

Search and Data Mining: Techniques. Applications Anya Yarygina Boris Novikov

1 st Symposium on Colossal Data and Networking (CDAN-2016) March 18-19, 2016 Medicaps Group of Institutions, Indore, India

Timo Elliott VP, Global Innovation Evangelist SAP SE or an SAP affiliate company. All rights reserved. 1

Big Data & Analytics: Your concise guide (note the irony) Wednesday 27th November 2013

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

Technology Implications of an Instrumented Planet presented at IFIP WG 10.4 Workshop on Challenges and Directions in Dependability

Are You Ready for Big Data?

Big Data Use Cases Update

Cloud and Big Data Standardisation

VIEWPOINT. High Performance Analytics. Industry Context and Trends

Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum

Pulsar Realtime Analytics At Scale. Tony Ng April 14, 2015

A New Era Of Analytic

Big Data a threat or a chance?

Data Mining + Business Intelligence. Integration, Design and Implementation

Data-intensive HPC: opportunities and challenges. Patrick Valduriez

ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V

Automated Machine Learning For Autonomic Computing

TRANSFORM BIG DATA INTO ACTIONABLE INFORMATION

Overview NIST Big Data Working Group Activities

Big Data and Analytics: Challenges and Opportunities

Challenges for Data Driven Systems

Industry 4.0 and Big Data

Big Data Analytics and Healthcare

BIOINF 585 Fall 2015 Machine Learning for Systems Biology & Clinical Informatics

Where is... How do I get to...

Safe Harbor Statement

Timo Elliott VP, Global Innovation Evangelist SAP SE or an SAP affiliate company. All rights reserved. 1

CIS 4930/6930 Spring 2014 Introduction to Data Science Data Intensive Computing. University of Florida, CISE Department Prof.

Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics

Sanjeev Kumar. contribute

Master of Science in Health Information Technology Degree Curriculum

Big Data Driven Knowledge Discovery for Autonomic Future Internet

IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper

Big Data Analytic and Mining with Machine Learning Algorithm

MES and Industrial Internet

IC05 Introduction on Networks &Visualization Nov

Big Data-ready, Secure & Sovereign Cloud

BIG DATA: CHALLENGES AND OPPORTUNITIES IN LOGISTICS SYSTEMS

Statistics for BIG data

Thinking small about big data: Privacy considerations for the public sector Shaun Brown Partner, nnovation LLP

The 4 Pillars of Technosoft s Big Data Practice

YOU VS THE SENSORS. Six Requirements for Visualizing the Internet of Things. Dan Potter Chief Marketing Officer, Datawatch Corporation

Smart City Australia

Data Analytics as a Service

Zero-in on business decisions through innovation solutions for smart big data management. How to turn volume, variety and velocity into value

LEVERAGING BIG DATA ANALYTICS TO REDUCE SECURITY INCIDENTS A use case in Finance Sector

Horizontal IoT Application Development using Semantic Web Technologies

Big Data, Physics, and the Industrial Internet! How Modeling & Analytics are Making the World Work Better."

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Statistical Challenges with Big Data in Management Science

Training for Big Data

Enhancing Cybersecurity with Big Data: Challenges & Opportunities

BIG DATA STRATEGY. Rama Kattunga Chair at American institute of Big Data Professionals. Building Big Data Strategy For Your Organization

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc])

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance

Big Data in Subsea Solutions

Big Data Analytics. Prof. Dr. Lars Schmidt-Thieme

Acting on the Deluge of Newly Created Automation Data:

SQLstream Blaze and Apache Storm A BENCHMARK COMPARISON

SMARTPHONES & BIG DATA. Daniel Nelson Head of Enterprise Development, daniel.nelson@braintreepayments.

Intelligent Business Operations

Deploying Big Data to the Cloud: Roadmap for Success

Introduction to Data Mining and Machine Learning Techniques. Iza Moise, Evangelos Pournaras, Dirk Helbing

Operational Intelligence: Real-Time Business Analytics for Big Data Philip Russom

Collaborations between Official Statistics and Academia in the Era of Big Data

Managing big data for smart grids and smart meters

Value of. Clinical and Business Data Analytics for. Healthcare Payers NOUS INFOSYSTEMS LEVERAGING INTELLECT

Reimagining Business with SAP HANA Cloud Platform for the Internet of Things

COMP9321 Web Application Engineering

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Transcription:

Big Data and Complex Networks Analytics Timos Sellis, CSIT Kathy Horadam, MGS

Big Data What is it? Most commonly accepted definition, by Gartner (the 3 Vs) Big data is high-volume, high-velocity and high-variety information assets that demand cost-effective, innovative forms of information processing for enhanced insight and decision making. 2

Big Data some stats high-volume, high-velocity and high-variety > 2 million emails sent Every minute (http://www.domo.com/blog/blog /2012/06/08/how-much-data-iscreated-every-minute/) 34,722 likes 100,000 tweets 571 websites added 250,000 items sold on amazon $272,020 spend on web shopping 3

Complex Networks What is it? Network with significant topological features common in real-world networks eg most technological, biological and social networks Rapidly expanding field bringing together mathematics, engineering, computer science, sociology, epidemiology, physics, biology. 4

Big Data and Complex Network Synergies Both share interesting properties Large scale (volume) Complexity (variety) Dynamics (velocity) Interesting analytics algorithms Many applications with both characteristics (social networks, utility networks, security, etc) 5

Big Data - Research Issues (1) Main stream Infrastructure and Architectures (New large scale data architectures, Cloud architectures) Models (Data representation, storage, and retrieval) and Data Access (Query processing and optimization, Privacy, Security) 6

Big Data - Research Issues (2) Complex Data Analytics Computational, mathematical, statistical, and algorithmic techniques for modelling high dimensional data, large graphs, and complex (interrelated) data Learning, inference, prediction, and knowledge discovery for large volumes of dynamic data sets Data retrieval and data mining to facilitate pattern discovery, trend analysis and anomaly detection Dimensionality reduction, sparse data 7

Big Data - Research Issues (3) Highly Streaming Data Positional streams Social network data Mobile app data Game data 8

Big Data - Research Issues (4) Data Integration Findability and search Information fusion of multiple data sources Semantic integration Recommendation systems 9

Networks- Research Issues (1) Analytics Mathematical models of simpler networks do not show the significant topological features. Network structure and community detection Knowledge discovery, especially of characteristic small communities (motifs) in large networks Bipartite networks 10

Networks- Research Issues (2) Dynamics Algorithm development: machine learning, high dimensional data, large networks New topological, statistical techniques Eg. persistent homology: track connectivity changes RMIT could be a national leader if we could develop this further 11

Networks- Research Issues (3) Detection and Prediction Identification of influential or hidden nodes or communities across networks Structural anomaly detection (via supervised or unsupervised learning) Model transmission or flow through network 00 50 Correlation=94%!! Data Fit 0 06 June 2001 1st June 2002 1st June 2003 1st June 2004 1st June 2005 1st June 2006 1st June 2007 1st June 2008 1st June 2009 1st June 2010 1st June 2011 1st June 2001 Fitting period year Extrapolation 12

Networks- Research Issues (4) Location and Spatial Networks Prioritised habitats 13

Possible Research Themes (1) Situation Awareness applications (Disaster Management, Fault detection) Resource Management applications (Ecology, environment, power network management) Public Health applications (Epidemics, medical records) Financial and Forensic applications (Fraud detection, money laundering) Smart cities applications (Transport, Energy) 14

Possible Research Themes (1) Security applications (Biometrics, computer and information security) Positioning Technologies applications (Agriculture, Forest health, real-time tracks, large mobile networks) Education (Learning analytics) 15

RMIT today High-interest, cutting-edge and well-funded research in: Large scale Data Integration Data quality, etc Sensor networks Data driven complex networks, Sensor network data, Distributed Sensor Networks Complex Networks/Graphs network/graph models and structure detection, graph mining, network/graph analysis, prediction, identification and security Positioning apps/technologies Power and Transport networks, network analysis for detecting possible problems, streamed metering data, real time analytics 16

RMIT today - Examples Former Employees Current Employees Insiders Contractors Trusted Business Partners Cloud Providers Anomaly detection Money laundering Epidemic spread Smart metering Biometric Identification 17

RMIT tomorrow Foster collaboration between many disciplines towards large scale information management. For example, planners, designers and technologists can collaborate on designing buildings fitted with sensors using intelligent optimisation techniques. Plan for a major collaborative effort, like a CRC. Build long term partnerships with key international and national public and private organizations. 18

Preliminary SWOT analysis Strengths 1. Infrastructure/data management 2. Complex network dynamics 3. Location based services 4. Information retrieval 5. Optimization 6. Theoretical analysis Opportunities 1. NICTA funding potential for RMIT centre 2. Cover different application areas, compared to on-going activities 3. Identify a short term impact opportunity 4. Identify an opportunity that can attract an industry sector (e.g. logistics, energy and positioning/mobile applications) Weaknesses 1. No major results/history in the area 2. Big data and complex networks on its own is not recognised as an RMIT strength Threats 1. A couple of CoE proposals submitted 2. Some other on-going efforts (CRCs, government CoE) 3. Fragmentation based on disciplines, due to cultural difference 19