1 Doing Multidisciplinary Research in Data Science Assoc.Prof. Abzetdin ADAMOV CeDAWI - Center for Data Analytics and Web Insights Qafqaz University 16 May 2015
2 Digital Universe Volume of Digital Data exabytes from beginning of civilization exabytes petabytes (PB) PB PB or 1.2 zettabyte (ZB) ZB ZB 2014 ~ 6.2 ZB Expected to reach 44 ZB by 2020 Every day now we create as much information as we did from the dawn of civilization up until 2003 IDC's Digital Universe Study
3 Big Measures for Big Data kilobyte (kb) megabyte (MB) gigabyte (GB) terabyte (TB) petabyte (PB) exabyte (EB) zettabyte (ZB) yottabyte (YB)
4 Why Data Grows so Fast? Data is produced by: Social media, Sensor Data, Software and App Logs, Smartphones - media, Public Web, Radio-frequency identification readers, Archives
5 Internet Penetration Note: Internet stats for December 2001 Avarage Internet usage ın the world 8% Million
6 Foundations of the Web Note: Internet stats for January 2014 Avarage Internet usage ın the world 42% Billion
8 What happens each Second online 25 Terabytes transferred through across Internet 2 Website created ( per day) 9 Website created ( per day) SPAM s sent Photos posted on Facebook (355 mln per day) Instagram photos uploaded Skype calls made Tweets tweeted Dropbox files uploaded Google searches made (3.5 bln per day) YouTube videos viewed Facebook likes
9 Problem with Moore s Law The number of transistors that can be placed on an integrated circuit doubles every 18 months to two years It s predicted to reach its limit with existing technology in 2020 Cutting the size of a transistor to a single atom may defeat that concept The Digital Universe is growing much more faster than Processing Power
10 Big Data and Data Science War is ninety percent information. Napoleon Bonaparte
11 Big Data vs. Data Science What is "Big Data" anyway? What does "Data Science" mean? What is the relationship between Big Data and Data Science? Is Data Science the science of Big Data? Is Data Science only the stuff going on in companies like Google and Facebook and tech companies? Why do many people refer to Big Data as crossing disciplines (astronomy, finance, tech, etc.) and to data science as only taking place in tech? Just how big is big? Or is it just a relative term?
12 What Big Data is and isn t?
13 What Big Data is and isn t? Computing + Internet = Big Data Big Data is not new technology Big Data is not just about size Big Data is not Business Intelligence (BI) Big Data is not Solution by itself! Big Data is mostly marketing brand
14 What is Data Science? Data Science is not just a rebranding of statistics or machine learning Data Science is a child born in the first decade of the 21st century of the mature parental disciplines of scientific methods, data and software engineering, statistics, and visualization.
15 Interdisciplinary Subfields of Computer Science Artificial Intelligence, Machine Learning, Statistics, Applied Mathematics, Text Mining, Database Systems, Business Intelligence, Computational Linguistics, Natural Language Processing (NLP), Information Theory And Information Technology, Signal Processing, Probability Models, Statistical Learning, Data Mining, Data Engineering, Pattern Recognition and Learning, Information Visualization, Predictive Analytics, Uncertainty Modeling, Data Warehousing, Data Compression, Computer Programming, High Performance Computing, Distributed Systems, Information Extraction, Cloud Computing, Computer Vision
16 Jobs Derived from Big Data Chief Data Officer, Big Data Solution Architect, Big Data Platform Engineer, Big Data Analyst, Big Data Analytics Business Consultant, Big Data Software Designer, Big Data Consultant, Hadoop Architects, Consultant Hadoop Developer, Senior Analytics Manager, Data & Reporting Analyst, Analytics Analyst (Big Data) By 2018, the United States alone could face a shortage of 140,000 to 190,000 people with deep analytical skills Forbes - Where Big Data Jobs Will be in 2015
17 Data Science in Medicine Data alone won t change the world. It s the people that use data to make better decisions.
18 Data Science in Sports Big Data & Data Analytics Help Germany Score the World Cup
19 Data Science in Politics Obama s victory confirmed the value of using technology and data analytics. During the 1,5 year prior over paid staff worked on the campaign, well over s volunteers and in total more than 100 data analysis who ran more than 66,000 computer simulations every day.
20 Data Science Application Direct Marketing, Online Advertising, Credit Scoring and Risk Management Help Desk Management Fraud Detection Search Ranking Product Recommendation Predicting Unusual Behavior Customer Retention in Telecom Data-driven decision making (DDD)
21 Big Data Management Life-Cycle Data Acquisition Data Repository Data Processing Data Analytics Data Visualization - Web Crawling - Data Mining - Information Retrieval -. - Apache Hadoop - HDFS - Microsoft Azure - Amazon EC2 - Parsing - Indexing - Searching - Ranking - NLP -. - R Programming - Python - RapidMiner - Weka -. Big Data Management involves Data Science and Data Engineering areas for implementing Data Mining Techniques
22 Quotes on Big Data If you torture the data long enough, it will confess. Ronald Coase, Economist He who search for pearls must dive below John Dryden
BUY BIG DATA IN RETAIL Table of contents What is Big Data?... How Data Science creates value in Retail... Best practices for Retail. Case studies... 3 7 11 1. Social listening... 2. Cross-selling... 3.
For Big Data Analytics There s No Such Thing as Too Big The Compelling Economics and Technology of Big Data Computing March 2012 By: 4syth.com Emerging big data thought leaders Forsyth Communications 2012.
Cost aware real time big data processing in Cloud Environments By Cristian Montero Under the supervision of Professor Rajkumar Buyya and Dr. Amir Vahid A minor project thesis submitted in partial fulfilment
Big Data + Predictive Analytics = Actionable Business Insights: Consider Big Data as the Most Important Thing for Business since the Internet Adapted from the forthcoming book, Business Innovation in the
EXECUTIVE SUMMARY Big Data is not an uncommon term in the technology industry anymore. It s of big interest to many leading IT providers and archiving companies. But what is Big Data? While many have formed
SPECIAL ADVERTISING SECTION businessweek.com/adsections DATA ANALYTICS: GO BIG OR GO HOME S1 THE ERA OF BIG DATA a time when petabytes of information on consumer behavior and countless other topics fly
Data science and the transformation of the financial industry Financial Institutions www.managementsolutions.com Design and Layout Marketing and Communication Department Management Solutions Photographs
INDEPENDENT TECHNOLOGY RESEARCH SECTOR UPDATE NOV 2013 SOFTWARE Big Data Analytics EXTRACTING INSIGHTS FROM EXABYTES Analytics is entering a new era Amidst the hype surrounding Big Data, a perfect storm
21 st Century Investment Themes August 2012: Episode 5 Big data: an industrial revolution in data AT A GLANCE Big data is the term given to large and typically unstructured data sets that are difficult
An IDC White Paper - sponsored by EMC The Expanding Digital Universe A Forecast of Worldwide Information Growth Through 2010 March 2007 John F. Gantz, Project Director David Reinsel Christopher Chute Wolfgang
Emergence and Taxonomy of Big Data as a Service Benoy Bhagattjee Working Paper CISL# 2014-06 May 2014 Composite Information Systems Laboratory (CISL) Sloan School of Management, Room E62-422 Massachusetts
How to embrace Big Data A methodology to look at the new technology Contents 2 Big Data in a nutshell 3 Big data in Italy 3 Data volume is not an issue 4 Italian firms embrace Big Data 4 Big Data strategies
White paper Proactive Planning for.. Big Data.. In government, Big Data presents both a challenge and an opportunity that will grow over time. Executive Summary Consider this list of government-adopted
American Journal of Engineering Research (AJER) e-issn : 2320-0847 p-issn : 2320-0936 Volume-03, Issue-05, pp-266-270 www.ajer.org Research Paper Open Access Convergence of Big Data and Cloud Sreevani.Y.V.
INTELLIGENT BUSINESS STRATEGIES W H I T E P A P E R Architecting A Big Data Platform for Analytics By Mike Ferguson Intelligent Business Strategies October 2012 Prepared for: Table of Contents Introduction...
why your hr department needs big data why your hr department needs big data 2 Introduction Big Data is a term that increasingly is used to describe the emerging industry of analyzing multiple databases
Principles of E-Commerce I: Business and Technology. (PoE1) Focus: Big Data Platforms Prof. Roberto V. Zicari with support of Todor Ivanov, Marten Rosselli and Dr. Karsten Tolle 2015 SS Principles of E-Commerce
32 Big Data: present and future Big Data: present and future Mircea Răducu TRIFU, Mihaela Laura IVAN University of Economic Studies, Bucharest, Romania firstname.lastname@example.org, email@example.com
May 2011 Big data: The next frontier for innovation, competition, and productivity The McKinsey Global Institute The McKinsey Global Institute (MGI), established in 1990, is McKinsey & Company s business
Program in association with IBM POST GRADUATE PROGRAM IN BUSINESS ANALYTICS AND BIG DATA I A true holistic Data Science Program I The emerging need of Techno Management & Cross Functional skills will be
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 4, Number 1 (2014), pp. 33-40 International Research Publications House http://www. irphouse.com /ijict.htm Big Data
International Journal of Computer Science and Applications, Technomathematics Research Foundation Vol. 11, No. 3, pp. 116 127, 2014 ANALYTICS ON BIG AVIATION DATA: TURNING DATA INTO INSIGHTS RAJENDRA AKERKAR
IBM Global Business Services Business Analytics and Optimization In collaboration with Saïd Business School at the University of Oxford Executive Report IBM Institute for Business Value Analytics: Real-world
Big Data: Powering the Next Industrial Revolution Author: Abhishek Mehta April 2011 p2 Executive Summary Data is a key raw material for a variety of socioeconomic business systems. Unfortunately, the ability
Benchmarking Large-Scale Data Management Insights from Presentation Confidentiality Statement The materials in this presentation are protected under the confidential agreement and/or are copyrighted materials