Visual Mining for Big Data




Visual Mining for Big Data
Big Dive, June 21st, 2013
Alessandro Piglia, Kairos3D

Where do we come from?
Kairos3D comes from real-time 3D graphics:
- Serious Games (virtual visits, training for industry operators, ...)
- Highly immersive visualization (up to CAVE environments)

the BIG idea: Visual Mining
Use interactive 3D technology to enable Big Data analytics:
- Visually represent your Big Data sets
- View thousands of variables at the same time
- Interactively analyze single data items or groups
- Visually discover data patterns and correlations and gain insights


«positioning»

What the project is
- SW platform developed by Kairos3D
- Fully C++, heavily using GPU processing
- Packaged as a generic API (still evolving)
- Based on proprietary code, integrating open-source modules
- Main library: OpenSceneGraph (3D engine)
- Derived from experience on applications with huge data sets
- See examples and demos

And what the project is NOT
- A complete Big Data platform
  - We only focus on the presentation layer (visualization and analysis)
  - We don't directly access the (big) data store
  - We rely on other tools for querying and preliminary normalization
- A commercial tool
  - Mainly used internally or by trained partners to create applications
- A cloud-based tool
  - It is a multi-platform client application; it does not run in a Web browser (yet...)

Long-term goals (potential)
- Create a generic data model (and related file format)
  - Allow data to be input easily from any source
- Generalize visual metaphors and keep adding new ones
  - Provide different representations for different data structures
- Generalize analytics functionality
  - Open to scripting and/or plugin creation
- Implement wizards to quickly assemble everything
Which would mean: quickly create any app
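One way to picture the "generic data model" and metadata mapping goal is a small table that says which data field drives which visual channel. The sketch below is only an illustration in Python, not the Kairos3D API (which is C++ and not shown in these slides). The class `ChannelMapping`, the function `to_glyph`, and the field names are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class ChannelMapping:
    """Maps one data field onto one visual channel (hypothetical model)."""
    field: str     # name of the input field, e.g. "speed_kmh"
    channel: str   # visual channel it drives, e.g. "height" or "color"
    lo: float      # expected minimum of the field
    hi: float      # expected maximum of the field

    def apply(self, record: Dict[str, float]) -> float:
        # Normalize the field value to [0, 1] and clamp out-of-range values.
        span = self.hi - self.lo
        t = (record[self.field] - self.lo) / span if span else 0.0
        return min(1.0, max(0.0, t))


def to_glyph(record: Dict[str, float], mappings: List[ChannelMapping]) -> Dict[str, float]:
    """Turn one data record into normalized visual channel values."""
    return {m.channel: m.apply(record) for m in mappings}


# Illustrative mapping for a road-traffic record (field names are assumptions).
mappings = [
    ChannelMapping("speed_kmh", "height", 0.0, 130.0),
    ChannelMapping("vehicles", "color", 0.0, 200.0),
]
glyph = to_glyph({"speed_kmh": 65.0, "vehicles": 50.0}, mappings)
```

Keeping the mapping as data rather than code is what would let a wizard or script assemble a new view without touching the rendering layer.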

Roadmap (potential)
[Diagram: any data (txt, xml, xls, network, db), metadata mapping, analytics functions, metaphor library]

Example 1: Time Series
The problem: visualize historical series for road traffic data
- Big volumes: over 16,000 values every minute
- Other info to integrate: event database
The goals:
- Spot anomalies in traffic flow
- Try to correlate events and anomalies
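To give a feel for the numbers and the two goals, here is a minimal Python sketch. It is not how the Kairos3D tool works: the slides describe spotting anomalies visually, and the trailing-window rule, the function names, and the sample data below are assumptions made for illustration.

```python
import statistics

# Volume from the slide: 16,000 values per minute, scaled to one day.
VALUES_PER_MINUTE = 16_000
values_per_day = VALUES_PER_MINUTE * 60 * 24  # 23,040,000 values per day


def flag_anomalies(series, window=5, threshold=3.0):
    """Return indices whose value lies more than `threshold` standard
    deviations from the mean of the preceding `window` values."""
    flagged = []
    for i in range(window, len(series)):
        past = series[i - window:i]
        mean = statistics.fmean(past)
        std = statistics.pstdev(past)
        if std > 0 and abs(series[i] - mean) > threshold * std:
            flagged.append(i)
    return flagged


def correlate(anomalies, events, tolerance=2):
    """For each anomaly index (minute), list events within `tolerance` minutes."""
    return {
        a: [name for minute, name in events if abs(minute - a) <= tolerance]
        for a in anomalies
    }


# Made-up flow for one sensor, one value per minute, with a drop at minute 6.
flow = [100, 102, 98, 101, 99, 100, 40, 101, 99, 100]
events = [(5, "accident reported"), (9, "road works")]  # hypothetical event database rows

anomalies = flag_anomalies(flow)
matches = correlate(anomalies, events)
```

With this sample data the drop at minute 6 is flagged and matched to the event logged one minute earlier.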


Example 2: 3D CMDB
The problem: visualize a complex IT infrastructure
- Big volumes: thousands of items in a hierarchy
- Clustered in hundreds of groups (subnets, IT processes, ...)
- Other info to integrate: monitoring data (system status)
The goals:
- Check overall IT organization and spot potential issues
- Try to correlate malfunctions to their system causes
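The hierarchy plus monitoring status described above can be sketched as a tree in which each group shows the worst status found beneath it. This Python sketch is only an assumed data structure, not the actual CMDB schema or the Kairos3D implementation. The names `Item`, `rolled_up_status`, `failing_paths`, and the sample infrastructure are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Tuple

# Assumed status levels, ordered by severity.
SEVERITY = {"ok": 0, "warning": 1, "critical": 2}


@dataclass
class Item:
    """One configuration item or group in the hierarchy (hypothetical model)."""
    name: str
    status: str = "ok"
    children: List["Item"] = field(default_factory=list)


def rolled_up_status(item: Item) -> str:
    """Worst status found on the item itself or on any descendant."""
    worst = item.status
    for child in item.children:
        child_status = rolled_up_status(child)
        if SEVERITY[child_status] > SEVERITY[worst]:
            worst = child_status
    return worst


def failing_paths(item: Item, path: Tuple[str, ...] = ()) -> Iterator[Tuple[str, ...]]:
    """Yield the path from the root to every item whose own status is not ok."""
    here = path + (item.name,)
    if item.status != "ok":
        yield here
    for child in item.children:
        yield from failing_paths(child, here)


# Made-up infrastructure: two subnets with three monitored servers.
subnet_a = Item("subnet-a", children=[Item("web-01"), Item("db-01", "critical")])
subnet_b = Item("subnet-b", children=[Item("mail-01", "warning")])
root = Item("datacenter", children=[subnet_a, subnet_b])

overall = rolled_up_status(root)
paths = list(failing_paths(root))
```

Rolling status up the tree is what lets a top-level view highlight a group, while the path list points back to the item that caused it.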


Example 3

The problem: genomic data complexity is increasing
- Modern DNA sequencing produces billions of values per sample
- New era of cloud-based systems for managing, analyzing and sharing genomic data
  - MORE DATA per single sample
  - MORE SAMPLES in clouds
  - MORE TOOLS for researchers
  - MORE data PERSPECTIVES
- The analysis process is fragmented, as relevant resources are scattered among a plethora of different software tools and databases
- Scientists need to analyze the structure and dynamics of a number of related variables

Who has the problem?
Genomic research trend:
- 2000: Human Genome published. The beginning of the digital age of molecular research.
- TODAY: more than 30,000 biomedical workgroups are publishing analysis on genomic data.
- TOMORROW: there will be 10X as many workgroups switching to genomics for their research.

GenomeCruzer: what is the value
- 3D big data visualization has the potential to dramatically increase the volume of cancer research and shorten the path to cures
- Makes the analysis process accessible to a wider range of researchers, even those with no bio-software skills, such as biologists and physicians
- The tool ultimately slashes the timelines of analysis and allows unsupervised, fast data analysis
- Unique environment where the whole data set can be visualized and explored, together with its data patterns and relations
- Expand the current reach of the software to attack new markets / new segments (i.e. agrigenomics / personal genomics)

GenomeCruzer today
- Preliminary release rolled out in production environment @ the Institute for Cancer Research and Treatment at Candiolo (Torino - Italy)
- Free discovery version completed and available for download (includes 3 case studies) @ http://genomecruzer.com
- Patent pending
"Thanks for showing us the fantastic software"
  黎文雁 Wenyan Li, BGI-Europe
"I really enjoyed the demo and very much like a 3D approach to looking at this complex data"
  Lukas J Smink, PhD, Manager, Regional Marketing EMEA, Illumina
"The amazing thing is the speed at which we are exploring huge datasets and discovering features we never noticed before"
  Dr Andrea Bertotti, researcher @ the Institute for Cancer Research and Treatment
"Dr Enzo Medico has captivated an entire hall with his presentation about GenomeCruzer"
  Dr Ovidiu Balacescu, The Oncology Institute Cluj-Napoca

GenomeCruzer today
- Award for the AACR Annual Meeting 2013
- Ongoing evaluation of discovery release: great feedback received after poster presentation and demo at the second annual TCGA Scientific Symposium, November 27-28, 2012, in Washington, D.C.
- GenomeCruzer evaluation (full TCGA datasets analysis + MBI samples data analysis)

Kairos3D s.r.l.
Corso Casale 297 Bis - 10132 Torino, Italy
VAT number: 10190870013
info@kairos3d.it
thank you!