Data Analytics at NICTA. Stephen Hardy National ICT Australia (NICTA)

Size: px
Start display at page:

Download "Data Analytics at NICTA. Stephen Hardy National ICT Australia (NICTA) shardy@nicta.com.au"

Transcription

1 Data Analytics at NICTA Stephen Hardy National ICT Australia (NICTA) NICTA Copyright 2013

2 Outline Big data = science! Data analytics at NICTA Discrete Finite Infinite Machine Learning for the natural sciences NICTA Copyright

3 Data, Data, Everywhere 3

4 Evolution vs. Revolution Statistics Machine Learning Computer Science problems Personal techniques techniques Societal Challenges Enterprise problems problems Government Scientific Challenges techniques Analysis of data to prove or disprove hypotheses = science!! 4

5 Not just the data Data Scale Infrastructure Algorithmic complexity Machine learning toolkits Graphical models Volume Analytics Engines SQL / NoSQL Graph learning Deep learning Velocity Variety Distributed computation File systems Random forests Nonparametric statistics Big Data Data Analytics Big Analytics 5

6 What is NICTA? Australia s National Centre of Excellence in Information and Communication Technology 700 Staff, 5 labs, $100m/y revenue NICTA objectives Research Excellence in ICT Wealth Creation for Australia Transforming Industry $3bn/y direct impact on GDP from projects New Industries Eleven spin-outs, working with ICT SMEs Skills and Capacity 17 University partners, 280 PhD Students NICTA Copyright 2010

7 Data Analytics: A summary Discrete ℵ P(n i ) Events People Finite R n P(x i ) Signals Location Infinite I P( f i ) Spatial Fields Temporal Fields NICTA Copyright

8 NICTA Data Analytics (1) Discrete ℵ P(n i ) Events, People, Text, Gene Sequences Scoobi data mining / Active learning Energy constrained machine learning Edge-distributed learning Offer targeting Risk Estimation Behaviour prediction Biomedical texts Opinion Watch Event Watch Machine learning for Natural Language Processing Patent analysis Biomedical informatics Sentiment analysis Xenome GWIS Efficient compressed storage and search for sequence data Bioinfomatics NICTA Copyright

9 Event watch Demo Sentiment Analysis 40,000 world lexicon Part of Speech Sentiment Key phase extractor Named Entity Recognition LDA: Latent Dirichlet Allocation Differential topic modeling Supervised LDA 9

10 Key technology - Topic modeling Document 5 Document 4 Document 3 A B C D Vocabulary Document 2 Document 1 1 Probability distribution Topic A Probability distribution 2 Probability distribution Topic B Probability distribution 3 Probability distribution 4 Probability distribution Topic C Probability distribution 5 Probability distribution Topic D Probability distribution Documents consist of words Documents are modeled as a mixture of topics Words are associated with topics Latent Dirichlet Allocation learns the distributions and allocates every word in each document to a topic 10

11 NICTA Data Analytics (2) Finite R n P(x i ) Signals, Location, Genetics SparSNP Efficient distributed sparse regression method Disease expression Cri$cal(Water(Mains( Non-parametric Bayesian methods Preventative Maintenance Structural(Health(Monitoring( distributed, autonomous, real-time data with classification / clustering Fault Prediction Service optimisation SmartGrid( NICTA Copyright

12 12

13 Machine Learning Process Existing data NICTA s analysis Cond. Assessment Age Type Material Size Length Failures Soil Pressure Location Weather and many more Hierarchical Beta Process Risk / age Risk / type Risk / size Age profile Complex data mix Accurate Improved prediction Data Driven prediction from multiple existing data sources Dynamic model update and aggregation 13

14 Improvement on failure prediction Use break records for modelling building Use break record for testing Multiple factors Laid year, material, size, coating, and soil Failures detected Wollongong NICTA Weibull Length of condition assessment NICTA Weibull NICTA NICTA COPYRIGHT Copyright zoom in (2.5%) 14

15 Risk Map Risk ranking of pipes based on likelihood of failure Red = highest Top 10% pipes 10% ~ 40% pipes 40% ~ 60% pipes Last 40% pipes Actual breaks in the following year Blue = lowest

16 NICTA Data Analytics (3) Infinite I P( f i ) Spatial Fields, Temporal Fields Renewable Energy Solar Energy Forecast Software Geothermal( Groundwater( Did you know failure to predict solar energy production will mean we won t fully capture available solar resources? The Problem Electricity grids around the world were not designed to manage large fluctuations of supply in power generation. Traditional forms of power supply such as coal-fired stations provide a stable, non-fluctuating form of power supply. However, the energy we receive from the sun is much more unpredictable and grids are not designed to cope with the dynamic nature of renewable energy production. Data Fusion with Current prediction methods are not accurate enough the suburb level and not fine-grained enough (i.e. uncertainty estimation currently a matter of days, not minutes). Current methods also require expensive (up to $75,000) and obtrusive equipment in a large area to collect the required data. Resource exploration Soils( ((((((((Air(quality( Solar! Impact google.com.au/images Non-parametric Bayesian methods en.wikipedia.org Resource management NICTA aims to lower the costs of solar monitoring systems to allow for fast, affordable forecast systems to be installed all over Australia. Specifically, we aim to: Develop low-cost devices ($500) that measure current levels of rooftop solar power production by monitoring 150 households across the ACT. Technical Contact Business Contact Utilise low-cost sky cameras ($250) to detect cloud cover. From these images, NICTA s researchers will project the motion of the clouds and estimate the 'darkness' of their shadows, thereby predicting their inhibitive effect on power output. Develop software that will predict solar energy production by suburb within minutes and hours rather than days. Transparent Machine Learning Resource discovery Plant system diversity Non-linear laser physics Big(Data(Knowledge(Discovery( NICTA Copyright 2013 Collaborators The Solar Energy Forecast Software project is part of NICTA s Security and Environment Business Team, providing security for people, resources and critical systems. Research Excellence in ICT Wealth Creation for Australia 16

17 Engineered Geothermal Systems

18 Geophysical Data Gravity Magnetics Core Samples Temperature Reflection Seismic Magnetotellurics Gravity Gradiometry Down-hole Geophysics Stress Porosity Passive Seismic Micro Seismic...

19 Distributions of geologies Magneto-Telleurics Seismic Magnetism Gravity Probability Distribution

20 Results fusing gravity & boreholes Predicted mean density and uncertainty 20

21 Reuse Statistics Machine Learning Computer Science problems Personal techniques techniques Societal Challenges Enterprise problems problems Government Scientific Challenges techniques How can we apply new techniques of machine learning / analytics to science? 21

22 Machine Learning in the Natural Sciences Big Data Knowledge Discovery Science and Industry Endowment Fund (www.sief.org) project Collaboration between NICTA (machine learning) SIRCA (big data) Sydney Uni (plate tectonics) Macquarie Uni (forest ecosystems, non-linear laser physics) How do we make machine learning easier to use in the natural sciences?

23 The End

BIOINF 585 Fall 2015 Machine Learning for Systems Biology & Clinical Informatics http://www.ccmb.med.umich.edu/node/1376

BIOINF 585 Fall 2015 Machine Learning for Systems Biology & Clinical Informatics http://www.ccmb.med.umich.edu/node/1376 Course Director: Dr. Kayvan Najarian (DCM&B, kayvan@umich.edu) Lectures: Labs: Mondays and Wednesdays 9:00 AM -10:30 AM Rm. 2065 Palmer Commons Bldg. Wednesdays 10:30 AM 11:30 AM (alternate weeks) Rm.

More information

HT2015: SC4 Statistical Data Mining and Machine Learning

HT2015: SC4 Statistical Data Mining and Machine Learning HT2015: SC4 Statistical Data Mining and Machine Learning Dino Sejdinovic Department of Statistics Oxford http://www.stats.ox.ac.uk/~sejdinov/sdmml.html Bayesian Nonparametrics Parametric vs Nonparametric

More information

Challenges for Data Driven Systems

Challenges for Data Driven Systems Challenges for Data Driven Systems Eiko Yoneki University of Cambridge Computer Laboratory Quick History of Data Management 4000 B C Manual recording From tablets to papyrus to paper A. Payberah 2014 2

More information

Introduction to Machine Learning. Speaker: Harry Chao Advisor: J.J. Ding Date: 1/27/2011

Introduction to Machine Learning. Speaker: Harry Chao Advisor: J.J. Ding Date: 1/27/2011 Introduction to Machine Learning Speaker: Harry Chao Advisor: J.J. Ding Date: 1/27/2011 1 Outline 1. What is machine learning? 2. The basic of machine learning 3. Principles and effects of machine learning

More information

An Introduction to Data Mining

An Introduction to Data Mining An Introduction to Intel Beijing wei.heng@intel.com January 17, 2014 Outline 1 DW Overview What is Notable Application of Conference, Software and Applications Major Process in 2 Major Tasks in Detail

More information

Australian Curriculum: Science

Australian Curriculum: Science Australian Curriculum: Science Scope and Sequence by Strands (-10) This document presents scope and sequence documents arranged by the Strands of Science as a Human Endeavour, Science Inquiry Skills and

More information

Scope and Sequence Interactive Science grades 6-8

Scope and Sequence Interactive Science grades 6-8 Science and Technology Chapter 1. What Is Science? 1. Science and the Natural World 2.Thinking Like a Scientist 3. Scientific Inquiry Scope and Sequence Interactive Science grades 6-8 Chapter 2. Science,

More information

Course Requirements for the Ph.D., M.S. and Certificate Programs

Course Requirements for the Ph.D., M.S. and Certificate Programs Health Informatics Course Requirements for the Ph.D., M.S. and Certificate Programs Health Informatics Core (6 s.h.) All students must take the following two courses. 173:120 Principles of Public Health

More information

Big Data Text Mining and Visualization. Anton Heijs

Big Data Text Mining and Visualization. Anton Heijs Copyright 2007 by Treparel Information Solutions BV. This report nor any part of it may be copied, circulated, quoted without prior written approval from Treparel7 Treparel Information Solutions BV Delftechpark

More information

CPO Science and the NGSS

CPO Science and the NGSS CPO Science and the NGSS It is no coincidence that the performance expectations in the Next Generation Science Standards (NGSS) are all action-based. The NGSS champion the idea that science content cannot

More information

MS1b Statistical Data Mining

MS1b Statistical Data Mining MS1b Statistical Data Mining Yee Whye Teh Department of Statistics Oxford http://www.stats.ox.ac.uk/~teh/datamining.html Outline Administrivia and Introduction Course Structure Syllabus Introduction to

More information

Short-term Machine-learning-based Forecasting of Distributed Solar Energy Production

Short-term Machine-learning-based Forecasting of Distributed Solar Energy Production Short-term Machine-learning-based Forecasting of Distributed Solar Energy Production Dr Stephen Gould Fellow, Research School of Computer Science Australian National University (ANU) and Senior Researcher,

More information

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning How to use Big Data in Industry 4.0 implementations LAURI ILISON, PhD Head of Big Data and Machine Learning Big Data definition? Big Data is about structured vs unstructured data Big Data is about Volume

More information

Big Data and Marketing

Big Data and Marketing Big Data and Marketing Professor Venky Shankar Coleman Chair in Marketing Director, Center for Retailing Studies Mays Business School Texas A&M University http://www.venkyshankar.com venky@venkyshankar.com

More information

Machine Learning for Data Science (CS4786) Lecture 1

Machine Learning for Data Science (CS4786) Lecture 1 Machine Learning for Data Science (CS4786) Lecture 1 Tu-Th 10:10 to 11:25 AM Hollister B14 Instructors : Lillian Lee and Karthik Sridharan ROUGH DETAILS ABOUT THE COURSE Diagnostic assignment 0 is out:

More information

Machine Learning and Data Analysis overview. Department of Cybernetics, Czech Technical University in Prague. http://ida.felk.cvut.

Machine Learning and Data Analysis overview. Department of Cybernetics, Czech Technical University in Prague. http://ida.felk.cvut. Machine Learning and Data Analysis overview Jiří Kléma Department of Cybernetics, Czech Technical University in Prague http://ida.felk.cvut.cz psyllabus Lecture Lecturer Content 1. J. Kléma Introduction,

More information

CS 2750 Machine Learning. Lecture 1. Machine Learning. CS 2750 Machine Learning.

CS 2750 Machine Learning. Lecture 1. Machine Learning.  CS 2750 Machine Learning. Lecture 1 Machine Learning Milos Hauskrecht milos@cs.pitt.edu 539 Sennott Square, x-5 http://www.cs.pitt.edu/~milos/courses/cs75/ Administration Instructor: Milos Hauskrecht milos@cs.pitt.edu 539 Sennott

More information

Machine Learning: Overview

Machine Learning: Overview Machine Learning: Overview Why Learning? Learning is a core of property of being intelligent. Hence Machine learning is a core subarea of Artificial Intelligence. There is a need for programs to behave

More information

testo dello schema Secondo livello Terzo livello Quarto livello Quinto livello

testo dello schema Secondo livello Terzo livello Quarto livello Quinto livello Extracting Knowledge from Biomedical Data through Logic Learning Machines and Rulex Marco Muselli Institute of Electronics, Computer and Telecommunication Engineering National Research Council of Italy,

More information

PeerEnergyCloud Trading Renewable Energies

PeerEnergyCloud Trading Renewable Energies PeerEnergyCloud Trading Renewable Energies Jochen Frey, Boris Brandherm, and Jörg Baus German Research Center for Artificial Intelligence GmbH, Stuhlsatzenhausweg 3, 66123 Saarbrücken, Germany {frey,brandherm,baus}@dfki.de

More information

BIG DATA What it is and how to use?

BIG DATA What it is and how to use? BIG DATA What it is and how to use? Lauri Ilison, PhD Data Scientist 21.11.2014 Big Data definition? There is no clear definition for BIG DATA BIG DATA is more of a concept than precise term 1 21.11.14

More information

Graduate Co-op Students Information Manual. Department of Computer Science. Faculty of Science. University of Regina

Graduate Co-op Students Information Manual. Department of Computer Science. Faculty of Science. University of Regina Graduate Co-op Students Information Manual Department of Computer Science Faculty of Science University of Regina 2014 1 Table of Contents 1. Department Description..3 2. Program Requirements and Procedures

More information

Partnership to Improve Solar Power Forecasting

Partnership to Improve Solar Power Forecasting Partnership to Improve Solar Power Forecasting Venue: EUPVSEC, Paris France Presenter: Dr. Manajit Sengupta Date: October 1 st 2013 NREL is a national laboratory of the U.S. Department of Energy, Office

More information

Scalable Machine Learning to Exploit Big Data for Knowledge Discovery

Scalable Machine Learning to Exploit Big Data for Knowledge Discovery Scalable Machine Learning to Exploit Big Data for Knowledge Discovery Una-May O Reilly MIT MIT ILP-EPOCH Taiwan Symposium Big Data: Technologies and Applications Lots of Data Everywhere Knowledge Mining

More information

An Overview of Knowledge Discovery Database and Data mining Techniques

An Overview of Knowledge Discovery Database and Data mining Techniques An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,

More information

Master's projects at ITMO University. Daniil Chivilikhin PhD Student @ ITMO University

Master's projects at ITMO University. Daniil Chivilikhin PhD Student @ ITMO University Master's projects at ITMO University Daniil Chivilikhin PhD Student @ ITMO University General information Guidance from our lab's researchers Publishable results 2 Research areas Research at ITMO Evolutionary

More information

CS 2750 Machine Learning. Lecture 1. Machine Learning. http://www.cs.pitt.edu/~milos/courses/cs2750/ CS 2750 Machine Learning.

CS 2750 Machine Learning. Lecture 1. Machine Learning. http://www.cs.pitt.edu/~milos/courses/cs2750/ CS 2750 Machine Learning. Lecture Machine Learning Milos Hauskrecht milos@cs.pitt.edu 539 Sennott Square, x5 http://www.cs.pitt.edu/~milos/courses/cs75/ Administration Instructor: Milos Hauskrecht milos@cs.pitt.edu 539 Sennott

More information

Beginning the journey to smart water companies starting with water networks

Beginning the journey to smart water companies starting with water networks Beginning the journey to smart water companies starting with water networks Paul Rutter, Water Innovation Manager, Thames Water Joby Boxall, Professor of Water Infrastructure Engineering, The University

More information

Solar Irradiance Forecasting Using Multi-layer Cloud Tracking and Numerical Weather Prediction

Solar Irradiance Forecasting Using Multi-layer Cloud Tracking and Numerical Weather Prediction Solar Irradiance Forecasting Using Multi-layer Cloud Tracking and Numerical Weather Prediction Jin Xu, Shinjae Yoo, Dantong Yu, Dong Huang, John Heiser, Paul Kalb Solar Energy Abundant, clean, and secure

More information

Decision Support Optimization through Predictive Analytics - Leuven Statistical Day 2010

Decision Support Optimization through Predictive Analytics - Leuven Statistical Day 2010 Decision Support Optimization through Predictive Analytics - Leuven Statistical Day 2010 Ernst van Waning Senior Sales Engineer May 28, 2010 Agenda SPSS, an IBM Company SPSS Statistics User-driven product

More information

Big Data Analytics for SCADA

Big Data Analytics for SCADA ENERGY Big Data Analytics for SCADA Machine Learning Models for Fault Detection and Turbine Performance Elizabeth Traiger, Ph.D., M.Sc. 14 April 2016 1 SAFER, SMARTER, GREENER Points to Convey Big Data

More information

Behavior Analysis in Crowded Environments. XiaogangWang Department of Electronic Engineering The Chinese University of Hong Kong June 25, 2011

Behavior Analysis in Crowded Environments. XiaogangWang Department of Electronic Engineering The Chinese University of Hong Kong June 25, 2011 Behavior Analysis in Crowded Environments XiaogangWang Department of Electronic Engineering The Chinese University of Hong Kong June 25, 2011 Behavior Analysis in Sparse Scenes Zelnik-Manor & Irani CVPR

More information

Predictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD

Predictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD Predictive Analytics Techniques: What to Use For Your Big Data March 26, 2014 Fern Halper, PhD Presenter Proven Performance Since 1995 TDWI helps business and IT professionals gain insight about data warehousing,

More information

Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA

Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA http://kzhang6.people.uic.edu/tutorial/amcis2014.html August 7, 2014 Schedule I. Introduction to big data

More information

PRIORITISING WATER PIPES FOR CONDITION ASSESSMENT WITH DATA ANALYTICS

PRIORITISING WATER PIPES FOR CONDITION ASSESSMENT WITH DATA ANALYTICS PRIORITISING WATER PIPES FOR CONDITION ASSESSMENT WITH DATA ANALYTICS Bin Li 1, Bang Zhang 1, Zhidong Li 1, Yang Wang 1, Fang Chen 1, Dammika Vitanage 2 1. National ICT Australia, Sydney, NSW 2. Sydney

More information

Big Data and Complex Networks Analytics. Timos Sellis, CSIT Kathy Horadam, MGS

Big Data and Complex Networks Analytics. Timos Sellis, CSIT Kathy Horadam, MGS Big Data and Complex Networks Analytics Timos Sellis, CSIT Kathy Horadam, MGS Big Data What is it? Most commonly accepted definition, by Gartner (the 3 Vs) Big data is high-volume, high-velocity and high-variety

More information

Sensor Devices and Sensor Network Applications for the Smart Grid/Smart Cities. Dr. William Kao

Sensor Devices and Sensor Network Applications for the Smart Grid/Smart Cities. Dr. William Kao Sensor Devices and Sensor Network Applications for the Smart Grid/Smart Cities Dr. William Kao Agenda Introduction - Sensors, Actuators, Transducers Sensor Types, Classification Wireless Sensor Networks

More information

Grade Stand Sub-Strand Standard Benchmark GRADE 6

Grade Stand Sub-Strand Standard Benchmark GRADE 6 Grade Stand Sub-Strand Standard Benchmark OF OF OF A. Scientific World View B. Scientific Inquiry C. Scientific Enterprise understand that science is a way of knowing about the world that is characterized

More information

Course Requirements for the Ph.D., M.S. and Certificate Programs

Course Requirements for the Ph.D., M.S. and Certificate Programs Course Requirements for the Ph.D., M.S. and Certificate Programs PhD Program The PhD program in the Health Informatics subtrack inherits all course requirements of the Informatics PhD program, that is,

More information

BigData@Chalmers Machine Learning Business Intelligence, Culturomics and Life Sciences

BigData@Chalmers Machine Learning Business Intelligence, Culturomics and Life Sciences BigData@Chalmers Machine Learning Business Intelligence, Culturomics and Life Sciences Devdatt Dubhashi LAB (Machine Learning. Algorithms, Computational Biology) D&IT Chalmers Entity Disambiguation

More information

Ambiata.com. Personalisation with Predictive Analytics Dr Rami Mukhtar National ICT Australia May 2013

Ambiata.com. Personalisation with Predictive Analytics Dr Rami Mukhtar National ICT Australia May 2013 Ambiata.com Personalisation with Predictive Analytics Dr Rami Mukhtar National ICT Australia May 2013 Personalisation in Enterprise Commoditisation Customers Today Real-time influence External unpredictable

More information

INTRODUCTION TO MACHINE LEARNING

INTRODUCTION TO MACHINE LEARNING Why are you here? What is Machine Learning? Why are you taking this course? INTRODUCTION TO MACHINE LEARNING David Kauchak CS 451 Fall 2013 What topics would you like to see covered? Machine Learning is

More information

Predictive modelling around the world 28.11.13

Predictive modelling around the world 28.11.13 Predictive modelling around the world 28.11.13 Agenda Why this presentation is really interesting Introduction to predictive modelling Case studies Conclusions Why this presentation is really interesting

More information

Information Management course

Information Management course Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 01 : 06/10/2015 Practical informations: Teacher: Alberto Ceselli (alberto.ceselli@unimi.it)

More information

ICT Perspectives on Big Data: Well Sorted Materials

ICT Perspectives on Big Data: Well Sorted Materials ICT Perspectives on Big Data: Well Sorted Materials 3 March 2015 Contents Introduction 1 Dendrogram 2 Tree Map 3 Heat Map 4 Raw Group Data 5 For an online, interactive version of the visualisations in

More information

PREDICTIVE AND OPERATIONAL ANALYTICS, WHAT IS IT REALLY ALL ABOUT?

PREDICTIVE AND OPERATIONAL ANALYTICS, WHAT IS IT REALLY ALL ABOUT? PREDICTIVE AND OPERATIONAL ANALYTICS, WHAT IS IT REALLY ALL ABOUT? Derek Vogelsang 1, Alana Duncker 1, Steve McMichael 2 1. MWH Global, Adelaide, SA 2. South Australia Water Corporation, Adelaide, SA ABSTRACT

More information

Cloud tracking with optical flow for short-term solar forecasting

Cloud tracking with optical flow for short-term solar forecasting Cloud tracking with optical flow for short-term solar forecasting Philip Wood-Bradley, José Zapata, John Pye Solar Thermal Group, Australian National University, Canberra, Australia Corresponding author:

More information

Why big data? Lessons from a Decade+ Experiment in Big Data

Why big data? Lessons from a Decade+ Experiment in Big Data Why big data? Lessons from a Decade+ Experiment in Big Data David Belanger PhD Senior Research Fellow Stevens Institute of Technology dbelange@stevens.edu 1 What Does Big Look Like? 7 Image Source Page:

More information

Big Data Analytic Paradigms -From PCA to Deep Learning

Big Data Analytic Paradigms -From PCA to Deep Learning The Intersection of Robust Intelligence and Trust in Autonomous Systems: Papers from the AAAI Spring Symposium Big Data Analytic Paradigms -From PCA to Deep Learning Barnabas K. Tannahill Aerospace Electronics

More information

Curriculum Map Earth Science - High School

Curriculum Map Earth Science - High School September Science is a format process to use Use instruments to measure Measurement labs - mass, volume, to observe, classify, and analyze the observable properties. density environment. Use lab equipment

More information

Introduction to Data Mining

Introduction to Data Mining Introduction to Data Mining 1 Why Data Mining? Explosive Growth of Data Data collection and data availability Automated data collection tools, Internet, smartphones, Major sources of abundant data Business:

More information

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc])

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc]) 299 REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc]) (See also General Regulations) Any publication based on work approved for a higher degree should contain a reference

More information

Data Isn't Everything

Data Isn't Everything June 17, 2015 Innovate Forward Data Isn't Everything The Challenges of Big Data, Advanced Analytics, and Advance Computation Devices for Transportation Agencies. Using Data to Support Mission, Administration,

More information

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics. Course Catalog In order to be assured that all prerequisites are met, students must acquire a permission number from the education coordinator prior to enrolling in any Biostatistics course. Courses are

More information

Machine Learning and Data Mining. Fundamentals, robotics, recognition

Machine Learning and Data Mining. Fundamentals, robotics, recognition Machine Learning and Data Mining Fundamentals, robotics, recognition Machine Learning, Data Mining, Knowledge Discovery in Data Bases Their mutual relations Data Mining, Knowledge Discovery in Databases,

More information

PROGRAM DIRECTOR: Arthur O Connor Email Contact: URL : THE PROGRAM Careers in Data Analytics Admissions Criteria CURRICULUM Program Requirements

PROGRAM DIRECTOR: Arthur O Connor Email Contact: URL : THE PROGRAM Careers in Data Analytics Admissions Criteria CURRICULUM Program Requirements Data Analytics (MS) PROGRAM DIRECTOR: Arthur O Connor CUNY School of Professional Studies 101 West 31 st Street, 7 th Floor New York, NY 10001 Email Contact: Arthur O Connor, arthur.oconnor@cuny.edu URL:

More information

Interpretation of Data (IOD) Score Range

Interpretation of Data (IOD) Score Range These Standards describe what students who score in specific score ranges on the Science Test of ACT Explore, ACT Plan, and the ACT college readiness assessment are likely to know and be able to do. 13

More information

Is a Data Scientist the New Quant? Stuart Kozola MathWorks

Is a Data Scientist the New Quant? Stuart Kozola MathWorks Is a Data Scientist the New Quant? Stuart Kozola MathWorks 2015 The MathWorks, Inc. 1 Facts or information used usually to calculate, analyze, or plan something Information that is produced or stored by

More information

Statistics for BIG data

Statistics for BIG data Statistics for BIG data Statistics for Big Data: Are Statisticians Ready? Dennis Lin Department of Statistics The Pennsylvania State University John Jordan and Dennis K.J. Lin (ICSA-Bulletine 2014) Before

More information

Sense Making in an IOT World: Sensor Data Analysis with Deep Learning

Sense Making in an IOT World: Sensor Data Analysis with Deep Learning Sense Making in an IOT World: Sensor Data Analysis with Deep Learning Natalia Vassilieva, PhD Senior Research Manager GTC 2016 Deep learning proof points as of today Vision Speech Text Other Search & information

More information

Network Machine Learning Research Group. Intended status: Informational October 19, 2015 Expires: April 21, 2016

Network Machine Learning Research Group. Intended status: Informational October 19, 2015 Expires: April 21, 2016 Network Machine Learning Research Group S. Jiang Internet-Draft Huawei Technologies Co., Ltd Intended status: Informational October 19, 2015 Expires: April 21, 2016 Abstract Network Machine Learning draft-jiang-nmlrg-network-machine-learning-00

More information

Enhance Collaboration and Data Sharing for Faster Decisions and Improved Mission Outcome

Enhance Collaboration and Data Sharing for Faster Decisions and Improved Mission Outcome Enhance Collaboration and Data Sharing for Faster Decisions and Improved Mission Outcome Richard Breakiron Senior Director, Cyber Solutions Rbreakiron@vion.com Office: 571-353-6127 / Cell: 803-443-8002

More information

Statistics Graduate Courses

Statistics Graduate Courses Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.

More information

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014 RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer

More information

MIDLAND ISD ADVANCED PLACEMENT CURRICULUM STANDARDS AP ENVIRONMENTAL SCIENCE

MIDLAND ISD ADVANCED PLACEMENT CURRICULUM STANDARDS AP ENVIRONMENTAL SCIENCE Science Practices Standard SP.1: Scientific Questions and Predictions Asking scientific questions that can be tested empirically and structuring these questions in the form of testable predictions SP.1.1

More information

Computer Animation and Visualisation. Lecture 1. Introduction

Computer Animation and Visualisation. Lecture 1. Introduction Computer Animation and Visualisation Lecture 1 Introduction 1 Today s topics Overview of the lecture Introduction to Computer Animation Introduction to Visualisation 2 Introduction (PhD in Tokyo, 2000,

More information

Nagarjuna College Of

Nagarjuna College Of Nagarjuna College Of Information Technology (Bachelor in Information Management) TRIBHUVAN UNIVERSITY Project Report on World s successful data mining and data warehousing projects Submitted By: Submitted

More information

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc])

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc]) 305 REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc]) (See also General Regulations) Any publication based on work approved for a higher degree should contain a reference

More information

COPYRIGHTED MATERIAL. Contents. List of Figures. Acknowledgments

COPYRIGHTED MATERIAL. Contents. List of Figures. Acknowledgments Contents List of Figures Foreword Preface xxv xxiii xv Acknowledgments xxix Chapter 1 Fraud: Detection, Prevention, and Analytics! 1 Introduction 2 Fraud! 2 Fraud Detection and Prevention 10 Big Data for

More information

Virtual Reality Scientific Visualisation - A Solution for Big Data Analysis of the Block Cave Mining System

Virtual Reality Scientific Visualisation - A Solution for Big Data Analysis of the Block Cave Mining System Virtual Reality Scientific Visualisation - A Solution for Big Data Analysis of the Block Cave Mining System James Tibbett, Fidelis Suorineni, Bruce Hebblewhite and Alex Colebourn What is Block Caving?

More information

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank Agenda» Overview» What is Big Data?» Accelerates advances in computer & technologies» Revolutionizes data measurement»

More information

Promises and Pitfalls of Big-Data-Predictive Analytics: Best Practices and Trends

Promises and Pitfalls of Big-Data-Predictive Analytics: Best Practices and Trends Promises and Pitfalls of Big-Data-Predictive Analytics: Best Practices and Trends Spring 2015 Thomas Hill, Ph.D. VP Analytic Solutions Dell Statistica Overview and Agenda Dell Software overview Dell in

More information

SYSTEMS, CONTROL AND MECHATRONICS

SYSTEMS, CONTROL AND MECHATRONICS 2015 Master s programme SYSTEMS, CONTROL AND MECHATRONICS INTRODUCTION Technical, be they small consumer or medical devices or large production processes, increasingly employ electronics and computers

More information

An Introduction to Advanced Analytics and Data Mining

An Introduction to Advanced Analytics and Data Mining An Introduction to Advanced Analytics and Data Mining Dr Barry Leventhal Henry Stewart Briefing on Marketing Analytics 19 th November 2010 Agenda What are Advanced Analytics and Data Mining? The toolkit

More information

Introduction to Data Mining

Introduction to Data Mining Introduction to Data Mining Jay Urbain Credits: Nazli Goharian & David Grossman @ IIT Outline Introduction Data Pre-processing Data Mining Algorithms Naïve Bayes Decision Tree Neural Network Association

More information

Data, Measurements, Features

Data, Measurements, Features Data, Measurements, Features Middle East Technical University Dep. of Computer Engineering 2009 compiled by V. Atalay What do you think of when someone says Data? We might abstract the idea that data are

More information

Medical Big Data Interpretation

Medical Big Data Interpretation Medical Big Data Interpretation Vice president of the Xiangya Hospital, Central South University The director of the ministry of mobile medical education key laboratory Professor Jianzhong Hu BIG DATA

More information

Big Data Big Knowledge?

Big Data Big Knowledge? EBPI Epidemiology, Biostatistics and Prevention Institute Big Data Big Knowledge? Torsten Hothorn 2015-03-06 The end of theory The End of Theory: The Data Deluge Makes the Scientific Method Obsolete (Chris

More information

Principles of Data Mining by Hand&Mannila&Smyth

Principles of Data Mining by Hand&Mannila&Smyth Principles of Data Mining by Hand&Mannila&Smyth Slides for Textbook Ari Visa,, Institute of Signal Processing Tampere University of Technology October 4, 2010 Data Mining: Concepts and Techniques 1 Differences

More information

Search and Data Mining: Techniques. Applications Anya Yarygina Boris Novikov

Search and Data Mining: Techniques. Applications Anya Yarygina Boris Novikov Search and Data Mining: Techniques Applications Anya Yarygina Boris Novikov Introduction Data mining applications Data mining system products and research prototypes Additional themes on data mining Social

More information

From Big Data to Smart Data Thomas Hahn

From Big Data to Smart Data Thomas Hahn Siemens Future Forum @ HANNOVER MESSE 2014 From Big to Smart Hannover Messe 2014 The Evolution of Big Digital data ~ 1960 warehousing ~1986 ~1993 Big data analytics Mining ~2015 Stream processing Digital

More information

Traffic Prediction and Analysis using a Big Data and Visualisation Approach

Traffic Prediction and Analysis using a Big Data and Visualisation Approach Traffic Prediction and Analysis using a Big Data and Visualisation Approach Declan McHugh 1 1 Department of Computer Science, Institute of Technology Blanchardstown March 10, 2015 Summary This abstract

More information

Introduction to machine learning and pattern recognition Lecture 1 Coryn Bailer-Jones

Introduction to machine learning and pattern recognition Lecture 1 Coryn Bailer-Jones Introduction to machine learning and pattern recognition Lecture 1 Coryn Bailer-Jones http://www.mpia.de/homes/calj/mlpr_mpia2008.html 1 1 What is machine learning? Data description and interpretation

More information

The Next Generation Science Standards (NGSS) Correlation to. EarthComm, Second Edition. Project-Based Space and Earth System Science

The Next Generation Science Standards (NGSS) Correlation to. EarthComm, Second Edition. Project-Based Space and Earth System Science The Next Generation Science Standards (NGSS) Achieve, Inc. on behalf of the twenty-six states and partners that collaborated on the NGSS Copyright 2013 Achieve, Inc. All rights reserved. Correlation to,

More information

Smart Science Lessons and Middle School Next Generation Science Standards

Smart Science Lessons and Middle School Next Generation Science Standards Smart Science Lessons and Middle School Next Generation Science Standards You have chosen the right place to find great science learning and, beyond learning, how to think. The NGSS emphasize thinking

More information

Search and Data Mining: Techniques. Introduction Anna Yarygina Boris Novikov

Search and Data Mining: Techniques. Introduction Anna Yarygina Boris Novikov Search and Data Mining: Techniques Introduction Anna Yarygina Boris Novikov Data Analytics: Conference Sections Fundamentals for data analytics Mechanisms and features Big Data Huge data Target analytics

More information

Animation. Intelligence. Business. Computer. Areas of Focus. Master of Science Degree Program

Animation. Intelligence. Business. Computer. Areas of Focus. Master of Science Degree Program Business Intelligence Computer Animation Master of Science Degree Program The Bachelor explosive of growth Science of Degree from the Program Internet, social networks, business networks, as well as the

More information

MSCA 31000 Introduction to Statistical Concepts

MSCA 31000 Introduction to Statistical Concepts MSCA 31000 Introduction to Statistical Concepts This course provides general exposure to basic statistical concepts that are necessary for students to understand the content presented in more advanced

More information

The University of Jordan

The University of Jordan The University of Jordan Master in Web Intelligence Non Thesis Department of Business Information Technology King Abdullah II School for Information Technology The University of Jordan 1 STUDY PLAN MASTER'S

More information

Data Science in Action

Data Science in Action + Data Science in Action Peerapon Vateekul, Ph.D. Department of Computer Engineering, Faculty of Engineering, Chulalongkorn University + Outlines 2 Data Science & Data Scientist Data Mining Analytics with

More information

Machine learning for algo trading

Machine learning for algo trading Machine learning for algo trading An introduction for nonmathematicians Dr. Aly Kassam Overview High level introduction to machine learning A machine learning bestiary What has all this got to do with

More information

ElegantJ BI. White Paper. The Competitive Advantage of Business Intelligence (BI) Forecasting and Predictive Analysis

ElegantJ BI. White Paper. The Competitive Advantage of Business Intelligence (BI) Forecasting and Predictive Analysis ElegantJ BI White Paper The Competitive Advantage of Business Intelligence (BI) Forecasting and Predictive Analysis Integrated Business Intelligence and Reporting for Performance Management, Operational

More information

雲 端 運 算 願 景 與 實 現 馬 維 英 博 士 微 軟 亞 洲 研 究 院 常 務 副 院 長

雲 端 運 算 願 景 與 實 現 馬 維 英 博 士 微 軟 亞 洲 研 究 院 常 務 副 院 長 雲 端 運 算 願 景 與 實 現 馬 維 英 博 士 微 軟 亞 洲 研 究 院 常 務 副 院 長 Important Aspects of the Cloud Software as a Service (SaaS) Platform as a Service (PaaS) Infrastructure as a Service (IaaS) Information and Knowledge

More information

MSCA 31000 Introduction to Statistical Concepts

MSCA 31000 Introduction to Statistical Concepts MSCA 31000 Introduction to Statistical Concepts This course provides general exposure to basic statistical concepts that are necessary for students to understand the content presented in more advanced

More information

AORC Technical meeting 2014

AORC Technical meeting 2014 http : //www.cigre.org C4-1106 AORC Technical meeting 2014 Operational Issues and Solutions for Photovoltaic Power Generation Facilities T. KISHI, K. INOUE, T. SEKI Nishimu Electronics Industries Co.,

More information

The 4 Pillars of Technosoft s Big Data Practice

The 4 Pillars of Technosoft s Big Data Practice beyond possible Big Use End-user applications Big Analytics Visualisation tools Big Analytical tools Big management systems The 4 Pillars of Technosoft s Big Practice Overview Businesses have long managed

More information

CS Master Level Courses and Areas COURSE DESCRIPTIONS. CSCI 521 Real-Time Systems. CSCI 522 High Performance Computing

CS Master Level Courses and Areas COURSE DESCRIPTIONS. CSCI 521 Real-Time Systems. CSCI 522 High Performance Computing CS Master Level Courses and Areas The graduate courses offered may change over time, in response to new developments in computer science and the interests of faculty and students; the list of graduate

More information

ANALYTICS IN BIG DATA ERA

ANALYTICS IN BIG DATA ERA ANALYTICS IN BIG DATA ERA ANALYTICS TECHNOLOGY AND ARCHITECTURE TO MANAGE VELOCITY AND VARIETY, DISCOVER RELATIONSHIPS AND CLASSIFY HUGE AMOUNT OF DATA MAURIZIO SALUSTI SAS Copyr i g ht 2012, SAS Ins titut

More information

Master of Science in Computer Science

Master of Science in Computer Science Master of Science in Computer Science Background/Rationale The MSCS program aims to provide both breadth and depth of knowledge in the concepts and techniques related to the theory, design, implementation,

More information

HUAWEI Advanced Data Science with Spark Streaming. Albert Bifet (@abifet)

HUAWEI Advanced Data Science with Spark Streaming. Albert Bifet (@abifet) HUAWEI Advanced Data Science with Spark Streaming Albert Bifet (@abifet) Huawei Noah s Ark Lab Focus Intelligent Mobile Devices Data Mining & Artificial Intelligence Intelligent Telecommunication Networks

More information