BIG DATA @ EMITLAB & CIDSE. K. Selçuk Candan




BIG DATA @ EMITLAB & CIDSE K. Selçuk Candan

Name: K. Selçuk Candan
Professor of computer science and engineering, CIDSE, ASU
Senior Sustainability Scientist, Global Institute of Sustainability
Director, Enterprise, Media, and Information Technologies Labs (EmitLab)

EmitLab
Sriram Rathinavelu (MS)
Mijung Kim
Jung Hyun Kim
Yash Garg (MS)
Parth Nagarkar
Mithila Nagendra
Shengyu Huang
Sicong Liu
Xilun Chen
Rosaria Rossini (U. Torino)
Xinsheng Liu
Maria Luisa Sapino, Professor (U. Torino)
Claudio Schifanella, Post-doc (U. Torino)
Antonio Penta, Post-doc (U. Torino)

What do I do?
Executive Committee member, ACM Special Interest Group on Management of Data (SIGMOD)
Associate editor, IEEE Transactions on Multimedia
Associate editor, the Very Large Data Bases journal (2005-2012)
Associate editor, Journal of Multimedia
General Chair, IEEE International Conference on Cloud Engineering (IC2E) 2015
Workshops Chair, International Conference on Extending Database Technology (EDBT) 2014
Organizing Committee Member, ACM SIG Multimedia Conference 2013
Panels Chair, Very Large Databases (VLDB) Conference 2012
Publicity Chair, ACM SIG Multimedia Conference 2012
General Chair, ACM SIGMOD Conference 2012
General Chair, ACM SIG Multimedia Conference 2011
Program Group leader, ACM SIG Management of Data (SIGMOD) Conference 2010
PC Chair, ACM International Conference on Image and Video Retrieval (CIVR) 2010
PC Chair, Workshop on Information & Software as Services (WISS) 2010
Chair, Workshop on Information & Software as Services (WISS) 2009
Chair, Workshop on Real-Time Business Intelligence (RTBI) 2009
PC Chair, ACM Workshop on Ambient Media Computing (iwam) 2009
PC Chair, ACM SIG Multimedia Conference 2008

What do I do?
How can we provide the relevant data/information to the right person/application, fast?

Data, data
[Figure: data volume in exabytes (2^60 bytes); labels: 400GB per person, 200GB per person]

Data in the real world?
Application domains: energy, rehabilitation, training, security, smart-offices, smart-rooms, production, life-sciences, defense, sports, robotics, elderly-care, retail, child-care, supply-chain, entertainment, personal-data management, transportation, education, space exploration, pet-care, health-care, arts, business/enterprise, sciences, advertisement
VOLUME: Cisco estimates we'll see 1.3 zettabytes of traffic annually over the internet in 2016.
VELOCITY: Sensors from a Boeing jet engine create 20 terabytes of data every hour.
VARIETY: 500 terabytes of new data of all forms are ingested in Facebook every day.

Data challenges
Cisco estimates we'll see 1.3 zettabytes of traffic annually over the internet in 2016.
Sensors from a Boeing jet engine create 20 terabytes of data every hour.
500 terabytes of new data of all forms are ingested in Facebook every day.
IST: [I]mprecision, [S]parsity, (lack of) [T]rust
3Vs: [V]olume, [V]elocity, [V]ariety
HMLE: [H]igh-dimensional, [M]ulti-modal, inter-[L]inked, [E]volving
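The three figures quoted on this slide use different units and time windows, which makes them hard to compare directly. The short Python sketch below converts each one to a sustained rate in bytes per second. It assumes decimal (SI) prefixes and a 365-day year; the slides do not say which convention is intended, and the names `rates` and `bytes_per_second` are illustrative only.

```python
# Assumption: decimal (SI) prefixes; the slides do not specify SI vs. binary.
TB = 10**12   # terabyte
ZB = 10**21   # zettabyte

SECONDS_PER_HOUR = 3600
SECONDS_PER_DAY = 24 * SECONDS_PER_HOUR
SECONDS_PER_YEAR = 365 * SECONDS_PER_DAY  # assumption: non-leap year


def bytes_per_second(total_bytes, period_seconds):
    """Average rate needed to move total_bytes within period_seconds."""
    return total_bytes / period_seconds


# Figures as quoted on the slide, each averaged over its own time window.
rates = {
    "internet traffic (1.3 ZB/year)": bytes_per_second(1.3 * ZB, SECONDS_PER_YEAR),
    "jet engine sensors (20 TB/hour)": bytes_per_second(20 * TB, SECONDS_PER_HOUR),
    "Facebook ingest (500 TB/day)": bytes_per_second(500 * TB, SECONDS_PER_DAY),
}

for name, rate in rates.items():
    print(f"{name}: {rate / 10**9:,.2f} GB/s")
```

Under these assumptions, the per-engine sensor stream and the Facebook ingest figure come out at the same order of magnitude (a few gigabytes per second), while the internet-wide traffic estimate is several thousand times larger.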


Big Data Systems space
Data Management
Data Analytics
Dimensionality reduction/feature selection
Classification, clustering
Summarization
Visual analytics
Feature extraction/media analysis
Temporal/spatial analysis
Text Analysis/NLP
Web/social networks
Recommender systems
Scalable/real time
Performance and Scalability
Consistency, quality, cleaning
Data models
Data Organization
Data and Schema Integration
Cloud, DaaS
Data Streaming
Parallel/Distributed DM
MapReduce/Hadoop
Pregel/Hama
Other parallel DBMS
Multitenant, Virtualization
Security, privacy, assurance
Mobile, Sensor
Visualization
Extraction, filtering
Row-stores
Column Stores
Key-value stores
NoSQL
Relational
OO
XML
Spatial
Temporal
Sequence
Graph
Fuzzy/uncertain
Text, image, video

Research Overview
Ongoing Grants/Projects:
[NSF] RanKloud: Data Partitioning and Resource Allocation Strategies for Scalable Multimedia and Social Media Analysis
[NSF] National Science Digital Library (NSDL) Middleware for Network- and Context-aware Recommendations
[NSF] One Size Does Not Fit All: Empowering the User with User-Driven Integration
[NSF] The Complexities of Ecological and Social Diversity: A Long-Term Perspective
[with JCI, NSF] Data Analysis and Optimization for Building Energy Management
[with SHESC, NSF] Data Management for Real-Time Data Driven Epidemic Simulations
NSF-IGERT: Person-centered Technologies and Practices for Individuals with Disabilities
Newer/Other Efforts:
[with West Point] SHARK: Searching Huge Attribute and Relational Knowledgebases
Data management techniques for supporting scalable, real-time integration, analysis, and retrieval of large data sets

CS Faculty working on Data
Name                 Title                 Area(s) of specialization as they relate to proposed concentration
K. Selçuk Candan     Professor             Databases and data management
Hasan Davulcu        Assoc. Professor      Databases and data extraction
Huan Liu             Professor             Data mining and analysis
Ross Maciejewski     Assistant Professor   Data visualization
Jieping Ye           Assoc. Professor      Data analysis
Rao Kambhampati      Professor             Data integration, data cleaning
Chitta Baral         Professor             Knowledge representation, NLP
Dijiang Huang        Assoc. Professor      Data clouds

Relevant faculty at CIDSE/ASU
1. Gail-Joon Ahn: risk management, access control, and security architecture for distributed systems
2. Ron Askin: scheduling, operations research; applied statistics
3. Chitta Baral: knowledge representation, bioinformatics, and text analysis
4. Rida Bazzi: distributed computing, fault tolerance, dynamic schema update in data clouds
5. K. Selçuk Candan: scalable data management, integration and retrieval, data management and processing systems, multimedia retrieval, accessibility
6. Partha Dasgupta: distributed systems, security, and resilience
7. Sandeep Gupta: parallel and distributed computing, data centers, energy-efficient, reliable data dissemination, and caching
8. Dijiang Huang: security, virtualization, mobile cloud computing
9. Subbarao Kambhampati: data integration, data cleaning, and planning
10. Baoxin Li: statistical inference for visual tracking, feature selection for data/sensor fusion, image/video retrieval
11. Huan Liu: data mining, machine learning, feature selection, classification, subspace clustering, and social computing
12. Ross Maciejewski: geo-spatial and spatio-temporal visualization, visual analytics for healthcare/pandemics, law enforcement
13. Pitu Mirchandani: water distribution systems, urban planning, transportation, forecasting, dynamic systems, remote sensing
14. Sethuraman Panchanathan: ubiquitous multimedia analysis, accessibility
15. Andrea Richa: ad hoc networks, algorithms, self-organizing systems, wireless communication
16. George Runger: statistical learning, process control, data mining for massive, multivariate data sets
17. Arunabha Sen: network analysis, social, biological, transportation, communication networks
18. Esma Gel: applied probability techniques for modeling, design and control of production systems and supply chain
19. Hari Sundaram: multi-media and social-media analytics
20. Yalin Wang: data visualization, medical imaging, statistical pattern recognition
21. Peter Wonka: data visualization, geo-spatial visualization, modelling, image analysis
22. Teresa Wu: decision making under uncertainty, biomedical informatics
23. Guoliang Xue: privacy, smart grid, cloud computing, network science
24. Steve Yau: service-based systems, information assurance, security, QoS monitoring
25. Jieping Ye: machine learning, data mining, dimensionality reduction, biomedical informatics
26. Nong Ye: cyber- and network security

Relevant faculty at CIDSE/ASU