Doing Multidisciplinary Research in Data Science



Similar documents
Introduction to Predictive Analytics. Dr. Ronen Meiri

Introduction to the Mathematics of Big Data. Philippe B. Laval

The Big Deal about Big Data. Mike Skinner, CPA CISA CITP HORNE LLP

Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Age of Big data. Presented by: Mohammad Iqbal BCM -2014

CIS 4930/6930 Spring 2014 Introduction to Data Science Data Intensive Computing. University of Florida, CISE Department Prof.

CAP4773/CIS6930 Projects in Data Science, Fall 2014 [Review] Overview of Data Science

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

Big Analytics: A Next Generation Roadmap

Applications for Business Intelligence, Predictive Analytics and Big Data

BIG DATA CHALLENGES AND PERSPECTIVES

So Just What Is Big Data? James E. Tcheng, MD, FACC, FSCAI

Of all the data in recorded human history, 90 percent has been created in the last two years. - Mark van Rijmenam, Think Bigger, 2014

Now, Next and the Future: IT, Big Data and other Implications for RIM. Presented by Michael S. Smith /

BIG DATA DAY BAKU 2015

HP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica

Open source Google-style large scale data analysis with Hadoop

What happens when Big Data and Master Data come together?

Open source large scale distributed data management with Google s MapReduce and Bigtable

BIG DATA: ARE YOU READY? Andy Kyiet Demand Flow Intelligence May, 2013

Big data and its transformational effects

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

Big Data a threat or a chance?

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank

Big Data Explained. An introduction to Big Data Science.

A Survey on Big Data Concepts and Tools

SCALABLE FILE SHARING AND DATA MANAGEMENT FOR INTERNET OF THINGS

DATA EXPERTS MINE ANALYZE VISUALIZE. We accelerate research and transform data to help you create actionable insights

Digital Earth: Big Data, Heritage and Social Science

Transforming the Telecoms Business using Big Data and Analytics

Big Data Buzzwords From A to Z. By Rick Whiting, CRN 4:00 PM ET Wed. Nov. 28, 2012

BIG DATA TRENDS AND TECHNOLOGIES

Next presentation starting soon Business Analytics using Big Data to gain competitive advantage

BIRT in the World of Big Data

SECURITY MEETS BIG DATA. Achieve Effectiveness And Efficiency. Copyright 2012 EMC Corporation. All rights reserved.

Game On: How Information is Changing the Rules of Insurance

Big Data: What You Should Know. Mark Child Research Manager - Software IDC CEMA

Exploiting Data at Rest and Data in Motion with a Big Data Platform

AN INTRO TO DATA MANAGEMENT

How To Use Big Data In Healthcare

Big Data Analytics. Prof. Dr. Lars Schmidt-Thieme

From Internet Data Centers to Data Centers in the Cloud

Large-Scale Data Processing

Pervasive Location Analytics and A Billion Dollar Opportunity. Jitender Aswani, Portfolio Strategist, Business Analytics, SAP

Copyright 2014, Neudesic. All rights reserved.

Changing the face of Business Intelligence & Information Management

So What s the Big Deal?

The Data Engineer. Mike Tamir Chief Science Officer Galvanize. Steven Miller Global Leader Academic Programs IBM Analytics

Analyzing Big Data with AWS

Big Data Analytics: Collecting, Analyzing and Decision Making

Why Big Data Analytics?

Big Data. Sonovate QuickView Series #3

Majed Al-Ghandour, PhD, PE, CPM Division of Planning and Programming NCDOT 2016 NCAMPO Conference- Greensboro, NC May 12, 2016

Sunnie Chung. Cleveland State University

HDP Enabling the Modern Data Architecture

WHAT IS BIG DATA? David Bechtold

THE AGE OF BIG DATA. Chula DataScience

Analytics in the Cloud. Peter Sirota, GM Elastic MapReduce

A New Era Of Analytic

Industry Impact of Big Data in the Cloud: An IBM Perspective

A U T H O R S : G a n e s h S r i n i v a s a n a n d S a n d e e p W a g h Social Media Analytics

Impact of Big Data in Oil & Gas Industry. Pranaya Sangvai Reliance Industries Limited 04 Feb 15, DEJ, Mumbai, India.

THE REAL-TIME OPERATIONAL VALUE OF BIG DATA MATT DAVIES

BEYOND POINT AND CLICK THE EXPANDING DEMAND FOR CODING SKILLS BURNING GLASS TECHNOLOGIES JUNE 2016

The Next Wave of Data Management. Is Big Data The New Normal?

CSC590: Selected Topics BIG DATA & DATA MINING. Lecture 2 Feb 12, 2014 Dr. Esam A. Alwagait

Intro to Big Data and Business Intelligence

Application and practice of parallel cloud computing in ISP. Guangzhou Institute of China Telecom Zhilan Huang

Oracle Big Data for Dummies

Big Data Realities Hadoop in the Enterprise Architecture

Sentiment Analysis on Big Data

Big Data and Industrial Internet

Data Centric Computing Revisited

What Is Big Data? Craig C. Douglas University of Wyoming

Clustering Big Data. Anil K. Jain. (with Radha Chitta and Rong Jin) Department of Computer Science Michigan State University November 29, 2012

Peter Rakers De Verstoring van Big Data

COMP9321 Web Application Engineering

Big Data: Study in Structured and Unstructured Data

The HP IT Transformation Story

SAP Makes Big Data Real Real Time. Real Results.

Chapter 1. Contrasting traditional and visual analytics approaches

Transcription:

Doing Multidisciplinary Research in Data Science Assoc.Prof. Abzetdin ADAMOV CeDAWI - Center for Data Analytics and Web Insights Qafqaz University aadamov@qu.edu.az http://ce.qu.edu.az/~aadamov 16 May 2015

Digital Universe Volume of Digital Data 2003 5 exabytes from beginning of civilization 2005 130 exabytes 2008 480.000 petabytes (PB) 2009 800.000 PB 2010 1200 000 PB or 1.2 zettabyte (ZB) 2011 1.8 ZB 2012 2.7 ZB 2014 ~ 6.2 ZB Expected to reach 44 ZB by 2020 Every day now we create as much information as we did from the dawn of civilization up until 2003 IDC's Digital Universe Study

Big Measures for Big Data kilobyte (kb) 10 3 2 10 megabyte (MB) 10 6 2 20 gigabyte (GB) 10 9 2 30 terabyte (TB) 10 12 2 40 petabyte (PB) 10 15 2 50 exabyte (EB) 10 18 2 60 zettabyte (ZB) 10 21 2 70 yottabyte (YB) 10 24 2 80

Why Data Grows so Fast? Data is produced by: Social media, Sensor Data, Software and App Logs, Smartphones - media, Public Web, Radio-frequency identification readers, Archives

Internet Penetration Note: Internet stats for December 2001 Avarage Internet usage ın the world 8% - 500 Million - 2001

Foundations of the Web Note: Internet stats for January 2014 Avarage Internet usage ın the world 42% - 3.0 Billion - 2014

Social Networking Top 15 Most Popular Social Networking Sites January 2015 1,310,000,000 - Estimated Unique Monthly Visitors 2 - Compete Rank 25,500,000 - Estimated Unique Monthly Visitors 346 - Compete Rank 12,000,000 - Estimated Unique Monthly Visitors 617 - Compete Rank 284,000,000 - Estimated Unique Monthly Visitors 24 - Compete Rank 20,500,000 - Estimated Unique Monthly Visitors 605 - Compete Rank 7,500,000 - Estimated Unique Monthly Visitors 838 - Compete Rank 343,000,000 - Estimated Unique Monthly Visitors 19,500,000 - Estimated Unique Monthly Visitors 447 - Compete Rank 5,400,000 - Estimated Unique Monthly Visitors 122 - Compete Rank 347,000,000 - Estimated Unique Monthly Visitors 44 - Compete Rank 17,500,000 - Estimated Unique Monthly Visitors *NA* - Compete Rank 3,000,000 - Estimated Unique Monthly Visitors 451 - Compete Rank 70,500,000 - Estimated Unique Monthly Visitors 51 - Compete Rank 12,500,000 - Estimated Unique Monthly Visitors 127 - Compete Rank 2,500,000 - Estimated Unique Monthly Visitors 1,596 - Compete Rank

What happens each Second online 25 Terabytes transferred through across Internet 2 Website created (172 000 per day) 9 Website created (172 000 per day) 1 800 000 SPAM emails sent 4 100 Photos posted on Facebook (355 mln per day) 5 000 Instagram photos uploaded 1 500 Skype calls made 4 000 Tweets tweeted 10 000 Dropbox files uploaded 45 000 Google searches made (3.5 bln per day) 92 000 YouTube videos viewed 55 000 Facebook likes

Problem with Moore s Law The number of transistors that can be placed on an integrated circuit doubles every 18 months to two years It s predicted to reach its limit with existing technology in 2020 Cutting the size of a transistor to a single atom may defeat that concept The Digital Universe is growing much more faster than Processing Power

Big Data and Data Science War is ninety percent information. Napoleon Bonaparte

Big Data vs. Data Science What is "Big Data" anyway? What does "Data Science" mean? What is the relationship between Big Data and Data Science? Is Data Science the science of Big Data? Is Data Science only the stuff going on in companies like Google and Facebook and tech companies? Why do many people refer to Big Data as crossing disciplines (astronomy, finance, tech, etc.) and to data science as only taking place in tech? Just how big is big? Or is it just a relative term?

What Big Data is and isn t?

What Big Data is and isn t? Computing + Internet = Big Data Big Data is not new technology Big Data is not just about size Big Data is not Business Intelligence (BI) Big Data is not Solution by itself! Big Data is mostly marketing brand

What is Data Science? Data Science is not just a rebranding of statistics or machine learning Data Science is a child born in the first decade of the 21st century of the mature parental disciplines of scientific methods, data and software engineering, statistics, and visualization.

Interdisciplinary Subfields of Computer Science Artificial Intelligence, Machine Learning, Statistics, Applied Mathematics, Text Mining, Database Systems, Business Intelligence, Computational Linguistics, Natural Language Processing (NLP), Information Theory And Information Technology, Signal Processing, Probability Models, Statistical Learning, Data Mining, Data Engineering, Pattern Recognition and Learning, Information Visualization, Predictive Analytics, Uncertainty Modeling, Data Warehousing, Data Compression, Computer Programming, High Performance Computing, Distributed Systems, Information Extraction, Cloud Computing, Computer Vision

Jobs Derived from Big Data Chief Data Officer, Big Data Solution Architect, Big Data Platform Engineer, Big Data Analyst, Big Data Analytics Business Consultant, Big Data Software Designer, Big Data Consultant, Hadoop Architects, Consultant Hadoop Developer, Senior Analytics Manager, Data & Reporting Analyst, Analytics Analyst (Big Data) By 2018, the United States alone could face a shortage of 140,000 to 190,000 people with deep analytical skills Forbes - Where Big Data Jobs Will be in 2015

Data Science in Medicine Data alone won t change the world. It s the people that use data to make better decisions.

Data Science in Sports Big Data & Data Analytics Help Germany Score the World Cup

Data Science in Politics Obama s victory confirmed the value of using technology and data analytics. During the 1,5 year prior over 1.000 paid staff worked on the campaign, well over 10.000s volunteers and in total more than 100 data analysis who ran more than 66,000 computer simulations every day.

Data Science Application Direct Marketing, Online Advertising, Credit Scoring and Risk Management Help Desk Management Fraud Detection Search Ranking Product Recommendation Predicting Unusual Behavior Customer Retention in Telecom Data-driven decision making (DDD)

Big Data Management Life-Cycle Data Acquisition Data Repository Data Processing Data Analytics Data Visualization - Web Crawling - Data Mining - Information Retrieval -. - Apache Hadoop - HDFS - Microsoft Azure - Amazon EC2 - Parsing - Indexing - Searching - Ranking - NLP -. - R Programming - Python - RapidMiner - Weka -. Big Data Management involves Data Science and Data Engineering areas for implementing Data Mining Techniques

Quotes on Big Data If you torture the data long enough, it will confess. Ronald Coase, Economist He who search for pearls must dive below John Dryden

Thank you info@cedawi.org fb.com/cedawi www.cedawi.org