UniGR Workshop: Big Data «The challenge of visualizing big data»



Similar documents
Search and Data Mining: Techniques. Applications Anya Yarygina Boris Novikov

Big Data in Pictures: Data Visualization

Introduction to Oracle Business Intelligence Standard Edition One. Mike Donohue Senior Manager, Product Management Oracle Business Intelligence

Chapter -5 SCALABILITY AND AVAILABILITY

Interactive Visual Data Analysis in the Times of Big Data

Exploration and Visualization of Post-Market Data

Sanjeev Kumar. contribute

GENASIS System Architecture

Zhenping Liu *, Yao Liang * Virginia Polytechnic Institute and State University. Xu Liang ** University of California, Berkeley

1 st Symposium on Colossal Data and Networking (CDAN-2016) March 18-19, 2016 Medicaps Group of Institutions, Indore, India

Sustainable Development with Geospatial Information Leveraging the Data and Technology Revolution

A Professional Big Data Master s Program to train Computational Specialists

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April ISSN

Cloud and Big Data Standardisation

second level university master Academic Year 2013/14 QoLexity Measuring, Monitoring and Analysis of Quality of Life and its Complexity

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON

IC05 Introduction on Networks &Visualization Nov

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

MEng, BSc Computer Science with Artificial Intelligence

Data Warehouse: Introduction

A Platform for Supporting Data Analytics on Twitter: Challenges and Objectives 1

It Takes a Village to Raise a Machine Learning Model. Lucian

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

MEng, BSc Applied Computer Science

3rd International Symposium on Big Data and Cloud Computing Challenges (ISBCC-2016) March 10-11, 2016 VIT University, Chennai, India

Summary of 1 st ESA symposium on big Earth observing data

Big Data Text Mining and Visualization. Anton Heijs

MEDICAL DATA MINING. Timothy Hays, PhD. Health IT Strategy Executive Dynamics Research Corporation (DRC) December 13, 2012

Information Visualization WS 2013/14 11 Visual Analytics

Disributed Query Processing KGRAM - Search Engine TOP 10

A Review of Data Mining Techniques

PEER REVIEW HISTORY ARTICLE DETAILS VERSION 1 - REVIEW. Dingcheng Li Mayo Clinic, USA 20-Dec-2015

Comparative Analysis of the Main Business Intelligence Solutions

PRACTICAL DATA MINING IN A LARGE UTILITY COMPANY

BIG DATA Alignment of Supply & Demand Nuria de Lama Representative of Atos Research &

EL Program: Smart Manufacturing Systems Design and Analysis

CHAPTER 1 INTRODUCTION

Position Paper for W3C Web and Automotive Workshop. Marius Spika, Mark Beckmann

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

#jenkinsconf. Jenkins as a Scientific Data and Image Processing Platform. Jenkins User Conference Boston #jenkinsconf

Cloud-based Infrastructures. Serving INSPIRE needs

Using Summingbird for aggregating eye tracking data to find patterns in images in a multi-user environment

HPC technology and future architecture

Mag. Vikash Kumar, Dr. Anna Fensel SEMANTIC DATA ANALYTICS AS A BASIS FOR ENERGY EFFICIENCY SERVICES

E6893 Big Data Analytics Lecture 2: Big Data Analytics Platforms

Outlines. Business Intelligence. What Is Business Intelligence? Data mining life cycle

Ganzheitliches Datenmanagement

bigdata Managing Scale in Ontological Systems

Overview NIST Big Data Working Group Activities

RESEARCH ON THE FRAMEWORK OF SPATIO-TEMPORAL DATA WAREHOUSE

Graph Database Performance: An Oracle Perspective

Application Development. A Paradigm Shift

Cloud Computing and the Future of Internet Services. Wei-Ying Ma Principal Researcher, Research Area Manager Microsoft Research Asia

An Interface Design for Future Cloud-based Visualization Services

Search and Data Mining: Techniques. Introduction Anna Yarygina Boris Novikov

Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers

Dynamic Network Analyzer Building a Framework for the Graph-theoretic Analysis of Dynamic Networks

Final Report - HydrometDB Belize s Climatic Database Management System. Executive Summary

Mastering Big Data. Steve Hoskin, VP and Chief Architect INFORMATICA MDM. October 2015

Fusion Applications Overview of Business Intelligence and Reporting components

Complexities of Simulating a Hybrid Agent-Landscape Model Using Multi-Formalism

Visual Analytics and Data Mining

What is Visualization? Information Visualization An Overview. Information Visualization. Definitions

Open & Big Data for Life Imaging Technical aspects : existing solutions, main difficulties. Pierre Mouillard MD

An Introduction to SAS Enterprise Miner and SAS Forecast Server. André de Waal, Ph.D. Analytical Consultant

ADVANCED VISUALIZATION

IO Informatics The Sentient Suite

The Big Data Paradigm Shift. Insight Through Automation

Gain insight, agility and advantage by analyzing change across time and space.

Performing a data mining tool evaluation

Introduction to Data Mining and Business Intelligence Lecture 1/DMBI/IKI83403T/MTI/UI

Big Data and Advanced Analytics Technologies for the Smart Grid

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

Unified Batch & Stream Processing Platform

Database preservation toolkit:

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat

WROX Certified Big Data Analyst Program by AnalytixLabs and Wiley

Emerging Geospatial Trends The Convergence of Technologies. Jim Steiner Vice President, Product Management

Graduate Co-op Students Information Manual. Department of Computer Science. Faculty of Science. University of Regina

Utilizing spatial information systems for non-spatial-data analysis

NSF Workshop: High Priority Research Areas on Integrated Sensor, Control and Platform Modeling for Smart Manufacturing

Chapter ML:XI. XI. Cluster Analysis

European Data Infrastructure - EUDAT Data Services & Tools

Principles for Working with Big Data"

Demonstration of an Automated Integrated Test Environment for Web-based Applications

Using Visual Analytics to Enhance Data Exploration and Knowledge Discovery in Financial Systemic Risk Analysis: The Multivariate Density Estimator

LDIF - Linked Data Integration Framework

Information Management course

Visualization methods for patent data

Curriculum of the research and teaching activities. Matteo Golfarelli

Scalable End-User Access to Big Data HELLENIC REPUBLIC National and Kapodistrian University of Athens

European Archival Records and Knowledge Preservation Database Archiving in the E-ARK Project

Big Data Mining Services and Knowledge Discovery Applications on Clouds

Visualization for Network Traffic Monitoring & Security

The Visualization Pipeline

The Service Revolution software engineering without programming languages

INFORMING A INFORMATION DISCOVERY TOOL FOR USING GESTURE

The Trials and Tribulations and ultimate success of parallelisation using Hadoop within the SCAPE project

Supporting Collaborative Grid Application Development Within The E-Science Community p. 1

Transcription:

Dept. ISC Informatics, Systems & Collaboration UniGR Workshop: Big Data «The challenge of visualizing big data» Dr Ir Benoît Otjacques Deputy Scientific Director ISC

The Future is Data-based Can we help? 2

Who we are +/- 30 members (all MSc, MEng or PhD in Computer Science) Network of Partners from Luxembourg and abroad Funding from Ministry of Research EU / National research programs (FNR) Contract research (private/public) Outputs Fundamental research Applied Research ISC Scientific Papers R&D Studies Proof-of-concept Prototypes Professional Applications

Mission use of computer science to ease the understanding of complex big data coming from multiple and heterogeneous sources by primarily using visual representations accessed via any type of devices in various contexts of use. Only software applications layer (hardware, network not included)

Scope Data Provisionin g More than Data: Consider Meta-data More than Preprocessing: Visual Analytics Data Processing & Analysis Interactive Visualization of Data More than Graphics: Usable software tools One of the largest team in Europe focused on this topic (> 20 permanent positions) Software Tools Delivery

What we do CAD/CAM Scientific Vis Virtual Reality Computer Graphics Medical Imaging

What we do Visual Analytics Data Analytics Infovis Abstract Data Visual Data Mining

What we do www.calluna.lu

What we do Domain agnostic

What we do Business / Science Field expert How to analyse my network of friends? Field Question Web-based app with interactive visualization of social network contacts Solution usable on the field Our Group Mixed teams How to analyse network data? Generic Problem Instantiate a Generic Solution Multi-level graph drawing with semantic labelling Reuse / Adapt / Invent Potential generic solution(s) Graph drawing, dynamic graphs, adjacency matrices, graph clustering

Infovis & Visual Analytics User Interaction Raw Data Formatted & Structured Data Processed Data Visual Representation Data Acquisition Data Analysis & Mining Algorithms Drawing & Rendering Algorithms User with a problem to solve What does Big Data change?

What s the problem? 2 major challenges in Visual Analytics Scalability Dynamics Small, Mid-sized Big Static Data Well studied Open issues type A Dynamic Data Open issues type B Highly challenging (A and B) >> A+B

What s the VA problem? It s Big! Big Static Data Heterogeneous high volume data sources Scalability of data provisioning HW/SW infrastructure Scalability of mining algorithms Scalability of visual representations Software engineering issues How to run queries on distributed systems to explore big data sets? How to visualize a million multi-variate items on a screen? How to lower the time needed to run a clustering algorithm on xgbytes? How to design an interactive user interface loading big data in < 1 sec?

What s the VA problem? It s Big! What if data processing is running in the background? What if the user wants seamless nagivation in the data set? Can this map be generated in <0.1 sec on a classic laptop? How a competing algo scales 36000 French Communes on a single screen Weighted by population size, spatially constrained

What s the VA problem? Data changes! Dynamic Mid-sized Data Heterogeneous data streams Dynamic data provisioning HW/SW infrastructure Evolution of mining algorithms Evolution of visual representations Software engineering issues How to aggregate data streams? How to visualize a continuously changing data structure? How to adapt clustering algorithms to consider dynamic data? How to design an interactive user interface continuously fed by data?

What s the VA problem? Data changes! Clustering of streams V(t 1 ) V(t 2 ) V(t 3 ) V(t i ) V(t i+1 ) V(t n ) time W(t 1 ) W(t 2 ) W(t 3 ) W(t i ) W(t i+1 ) W(t n ) C1(t i ) C2(t i ) Update frequency? C1a(t i+1 ) C1b(t i+1 ) C3(t i ) Mental map? What if a MDS projection must be computed in real time to visualize the clusters? What if the user wants to adapt clustering parameters at run time? What if the connexion to a data stream is lost?

My God! Data are big and are changing! Big Dynamic Data Solutions for type A and type B problems often do not work for (A and B) problems Pre-computation (batch mode) available for big static data sets streams? Real time fusion of data streams still possible if 10 n heterogeneous streams? Stability of mental maps of the user? Aggregation strategy for multiscale data wrt time and wrt space? What if the user device is a smartphone with poor computing resources?

My God! Data are big and are changing! How/when to update it? How/when to compute it? How not to loose the user? How to interact with it?

Enabling decisions through Visual Analytics Big Data Visual Analytics techniques Data Provisioning Batch Interactive Streaming Big Systems Rethinking/adapt existing algorithms / techniques w.r.t Big Data 19

Collaborations Your Scientific / Business Problem Data Provisioning is an issue Data Visualization is an issue Data Analytics is an issue You need a software tool to do this Probably we should discuss together

Before joining ISC, its members were there

Big Data Visualization

Conclusion We are here today to join our respective forces to face a BIG challenge

Contact: Dr Ir Benoît Otjacques otjacque@lippmann.lu