Big Data Analytics: Challenges and What Computational Intelligence

Size: px
Start display at page:

Download "Big Data Analytics: Challenges and What Computational Intelligence"

Transcription

1 Big Data Analytics: Challenges and What Computational Intelligence Techniques May Offer Ah-Hwee Tan ( School of Computer Engineering Nanyang Technological University Big Data Analytics Symposium Big Data Analytics Symposium London, UK 13 September 2013

2 Outline Big Data Analytics Computational Intelligence Techniques Web Data Analytics Flexible Organizer for Competitive Intelligence (FOCI) Web Information Fusion and Associative Discovery Analytics for Active Living for Elderly

3 The Era of Big Data Big data refers to collection of data sets so large and complex that t exceed dthe competence of commonly used IT systems in terms of processing space and/or time.

4 Sources of Big Data Traditionally, mostly produced in scientific fields such as astronomy, meteorology, genomics physics, biology, and environmental research. With rapid development of IT technology and the consequent decrease of cost on collecting and storing data, big data has been generated from almost every industry and sector as well as governmental department, including retail, finance, banking, security, audit, electric power, healthcare. Recently, big data over the Web (big Web data for short), which includes all the context data, such as, user generated contents, browser/search log data, deep web data, etc.

5 Examples of Big Data (Source: Wikipedia) Walmart handles more than 1 million customer transactions every hour, which h is imported into databases estimated t to contain more than 2.5 petabytes (2560 terabytes) of data the equivalent of 167 times the information contained in all the books in the US Library of Congress. Facebook handles 50 billion photos from its user base. FICO Falcon Credit Card Fraud Detection System protects 2.1 billion active accounts world-wide. Windermere Real Estate uses anonymous GPS signals from nearly 100 million drivers to help new home buyers determine their typical drive times to and from work throughout various times of the day.

6 Examples of Big Data (Source: Wikipedia) NASA Center for Climate Simulation (NCCS) stores 32 petabytes of climate observations and simulations on the Discover supercomputing cluster. Utah Data Center is a data center currently being constructed cted by the United States National Security Agency. When finished, the facility will handle yottabytes of information collected by NSA over the Internet. Value Metric 1000 kb kilobyte MB megabyte GB gigabyte TB terabyte PB petabyte EB exabyte ZB zettabyte YB yottabyte

7 Money of Big Data (Source: Wikipedia) "Big data" have increased the demand of information management specialists Software AG, Oracle Corporation, IBM, Microsoft, SAP, EMC, and HP have spent more than $15 billion on software firms specializing in data management and analytics. In 2010, this industry on its own was worth more than $100 billion and was growing at almost 10 percent a year: about twice as fast as the software business as a whole.

8 Market of Big Data (Source: Wikipedia) Developed economies make increasing use of data- intensive technologies. There are 4.6 billion mobilephone subscriptions worldwide and there are between 1 billion and 2 billion people accessing the internet The world's effective capacity to exchange information through telecommunication networks was 281 petabytes in 1986, 471 petabytes in 1993, 2.2 exabytes in 2000, 65 exabytes in 2007 [14] and it is predicted that the amount of traffic flowing over the internet will reach 667 exabytes annually by [5]

9 Big Data Market Segments (Report by Transparency Market Research) Segmentation of the big data market by components, by applications and by geography. The different components included are software and services, hardware and storage. Software and services segment dominates the components market whereas storage segment will be the fastest growing segment for the next 5 years owing to the perpetual growth in the data generated.

10 Big Data Market Segment by Applications Covered eight applications namely financial services, manufacturing, healthcare, telecommunication, government, retail and media & entertainment and others in the application segment. Financial Services, healthcare and the government sector are the top three contributors of the big data market and together held more than 55% of the big data market in Media and Entertainment t t and the healthcare sectors will grow at high CAGR of nearly 42% from 2012 to The growth in data in the form of video, images, and games is driving the media and entertainment segment. Read more:

11 Challenges of Big Data Volume Size in the order of petabytes, exabytes, Velocity Time sensitive data, data that grow exponentially or even in rates that overwhelm the wellknown Moore's Law Variety Value Metric 1000 kb kilobyte MB megabyte GB gigabyte TB terabyte PB petabyte EB exabyte ZB zettabyte YB yottabyte From structured data into semi-structured and completely unstructured data of different types, such as text, image, audio, video, click streams, log files,

12 Deeper Issues of Big Data (The additional 3Vs) Validity Is the data correct and accurate for the intended usage? Veracity Are the results meaningful for the given problem space? Volatility How long do you need to look/store this data?

13 Computational Intelligence Neural Networks (IJCNN) Brain-like mathematical models for pattern recognition, memory, and association discovery Examples: Perceptron, BP, SVM, SOM, ART, Fuzzy Systems (IEEE-FUZZ) Fuzzy operators for handling non-discrete reasoning Examples: FNN, Fuzzy C-Means,

14 Computational Intelligence Evolutionary Computing (CEC) Classes of heuristic algorithms repeatedly search for good solutions by mimicking the process of natural evolution Commonly used for optimization and search problems Examples: Genetic Algo, Memetic Algo,

15 Flagship Events of Computational Intelligence World Congress on Computational Intelligence (Australia 2012, Beijing 2014) IEEE Symposium on Computational Intelligence (Singapore 2013, Florida, USA 2014) IEEE Symposium on Computational Intelligence in Big Data (IEEE CIBD'2014)

16 Examples of Use of CI in Big Data Data size and feature space adaptation Uncertainty modeling in learning from big data Distributed learning techniques in uncertain environment Uncertainty in cloud computing Distributed ib parallel l computation Feature selection/extraction in big data Sample selection based on uncertainty Incremental Learning Manifold Learning on big data Uncertainty techniques in big data classification/clustering Imbalance learning on big data Active learning on big data Random weight networks on big data Transfer learning on big data

17 Self-Organizing i Neural Networks for Personalized Web Intelligence Towards Personalized Web Intelligence g Ah-Hwee Tan, Hwee-Leng Ong, Hong Pan, Jamie Ng, Qiu-Xiang Li Knowledge and Information Systems 18 (2004)

18 Workflow for Web Data Analytics Search Getting the information Organize (clustering/categorizing) Putting things in perspectives Analyze (data mining) Discover hidden knowledge Share (knowledge management) Saving for reference and sharing Track Constant monitoring

19 Approaches to Organizing/Analyzing Clustering Organizing information into groups based on similarity functions and thresholds e.g. BullsEye, NorthernLight, Vivisimo Categorization Organizing information into a predefined set of classes e.g. Yahoo!, Autonomy Knowledge Server Which is better?

20 Clustering Pros Unsupervised/self-organizing, require no training or predefinition of classes Able to identify new themes Cons Users have no control Ever changing cluster structure Difficult to navigate and track

21 Categorization Pros Good control on classes Every info assigned to one or more classes of interests Cons Require learning (supervised) and/or definition of classification rules/knowledge Every info has to be assigned to one or more classes Good control but lack flexibility to handle new information

22 User-configurable Clustering (Tan & Pan, PAKDD 2002) Information organization ation and content management Online incremental clustering + userdefined structure (preferences) Reduces to a clustering system if no user indication given Allows personalization in a direct, intuitive, and interactive manner Control + flexibility

23 ARAM for Personalized Information Management Information Clusters F 2 F 1 a F 1 b a a - x x b - b + + Information Vector A B Preference Vector

24 Flexible Organizer for Competitive Intelligence (FOCI) A platform for gathering, organizing, tracking, analyzing, and sharing competitive information Natural way of turning raw search results into personalized CI portfolios Multilingual enabled with Multilingual Efficient Analyzer Domain localization (Technology) Patented and licensed to many companies

25 FOCI User Interface

26 FOCI Architecture Intranet/ Internet User s CI Portfolio Domain-Specific Knowledge Content Gathering Content Management Content Publishing Content Analysis Front End Visu ualization

27 Personalized Content Management Portfolio created through Search Unsupervised clustering (ARAM Pattern Channel A) Loop Personalization by users (ARAM Pattern Channel B) Reorganization of clusters (ARAM Pattern Channel A&B) Saving of personalized portfolio Tracking of new information

28 Personalization Functions Marking/labeling (selected) clusters Personal interpretation Inserting Clusters Indicate preference on groupings Merging clusters Indicate preferences on similarities Splitting clusters... Indicate preferences on differences

29 Information Clustering A portfolio created by a meta-search of 4 search engines with a query on Text Mining

30 A Personalized Portfolio after <=19 personalization operations (mainly labeling and creating clusters)

31 Organizing g New Information Without the Personalized Portfolio Based on Personalized Portfolio 42 new documents from DirectHit, Netscape, and BusinessWire

32 Summary A fusion neural network algorithm, called fusion ART, has been proposed for integrating clustering and categorization Has been applied to competitivee intelligence on the web. Comparing with existing works, fusion ART has advantages in Personalization fusion ART performs analysis and organization of data based on user preferences Low time complexity fusion ART performs real-time search and match of patterns resulting in a linear time complexity Incremental clustering manner fusion ART may adapt to dynamic web multimedia li data set by incrementally clustering new patterns based on the learnt cluster structure without referring to the old data. 3 2

33 Heterogeneous Data Co-clustering for Social Media Data Theme Discovery and Mining Lei Meng, Ah-Hwee Tan and Dong Xu IEEE Transactions on Knowledge and Data Engineering,

34 Introduction The popularity p of social websites leads to greatly increase of web multimedia documents Massive number Billions of images and articles online Diversity Diverse content and booming emerging topics Multi-modal descriptors images, text, category, tags, comments Category Birds Images Wild, bird, beach, tree, vacation, animal, mar, sunny, playa, nayarit, arena,ave, water, vacaciones, hollyday, pelicano. Keywords from Surrounding text 34

35 Introduction Clustering of web multimedia data is challenging Scalability to big data Difficulty in integrating multi-modal feature data Ambiguity in deciding the number of categories Rich but noisy meta-information semantic gap of images, noisy tags Birds Beach Wild, bird, beach, tree, vacation, animal, mar, sunny, playa, nayarit, arena, ave, water, vacaciones, hollyday, pelicano. Ocean, blue, sea, summer, vacation, sun, man, beach, water, yellow, fun, sand, play, funny, adult, humor, lifestyle, sunny, resort. 35

36 Problem Statement We define the theme discovery of web multimedia data as a heterogeneous data co-clustering problem, which h identifies the semantic categories of data patterns through the fusion and recognition of multiple types of features. Apple Multiple Descriptions Category Tag Apple Fruits Products Movies User Description Surrounding text 36

37 Proposed Approach A self-organizing neural network approach to Heterogeneous Data Co-clustering Based on Fusion Adaptive Resonance Theory (Fusion ART) Fuse arbitrary number of feature modalities Adaptively tune the weights for different feature modalities Two different learning function for primary data, such as images and articles, and meta-information to handle short and noisy text t Incremental fast learning Do not need to give the number of clusters 37

38 Experiments NUS-WIDE data set images of 18 categories Visual features: Grid color moment, Edge direction histogram, and wavelet texture Textual features of surrounding text: t 1142 words (7 words per image on average) 20 Newsgroups data set text documents of 10 categories Textual features of document content: over 60k words (800 words per document on average) Textual features of category: 3 labels per document on average 38

39 Experiments on NUS-WIDE Data Set Evaluation on weight adaptation across channels for visual and textual features Performance Comparison with fixed weight values GHF-ART with the adaptively tuned weight values γ_sa achieves the best performance in 5 classes and the overall performance, and achieves close performance with the best results obtained by fixed weight values 39

40 Experiments on NUS-WIDE Data Set Tracking of the change in weight values of γ _SA Textual features of surrounding text are assigned higher weights than visual features The value of γ_sa stabilizes in [0.7, 0.8] with the increase of patterns Big fluctuation may be resulted by the generation of new clusters 40

41 Experiments on NUS-WIDE Data Set Clustering Performance comparison with existing algorithms in terms of weighted average precision, cluster entropy ( H l H cluster), class entropy ( class ), purity and rand index (RI) GHF-ART achieves the best performance in terms of all the evaluation measures With supervisory information, GHF-ART(SS) consistently obtains better performance 41

42 Experiments on NUS-WIDE Data Set Time complexity analysis GHF-ART and Fusion ART incur very small increase of time cost For images, GHF-ART complete the clustering process in 10 seconds 42

43 Experiments on 20 Newsgroups Data Set Clustering performance comparison using document content and category information Both GHF-ART and GHF-ART(SS) outperform other algorithms in all the evaluation measures GHF-ART has a 5% gain than Fusion ART in terms of Average Precision, Purity and Rand Index. Comparing with other unsupervised algorithms, GHF-ART achieves around 80% in Average Precision, Purity and Rand Index while other algorithms typically obtain less than 75% 43

44 Summary A Heterogeneous data co-clustering algorithm, called GHF- ART, is proposed to discover the themes of web multimedia data via their rich but heterogeneous descriptors. Comparing with existing works, GHF-ART has advantages in Strong noise immunity A learning function of meta-information is proposed to handle noise Adaptive channel weighting ihti A well-defined dfi dweighting i algorithm is proposed to identify the important feature modalities for a better fusion of multi-modal features for overall similarity measure; Low time complexity GHF-ART performs real-time search and match of patterns resulting in a linear time complexity for big data; Incremental clustering manner GHF-ART may adapt to dynamic web multimedia data set by incrementally clustering new patterns based on the learnt cluster structure without referring to the old data. 44

45 Research Centre of Excellence in Ati Active LIving for the elderly (LILY) Aging in Place: Opportunities and Challenges Ah-Hwee Tan ( School of Computer Engineering Nanyang Technological University JOINT UBC-NTU RESEARCH CENTRE

46 Aging g in Place the ability to live in one's own home and community safely, independently, and comfortably, regardless of age, income, or ability level - Center for Disease Control, Dec

47 Motivation Global aging population creates silver challenges Most adults would prefer to age in place 78 percent of adults between the ages of 50 and 64 report that they would prefer to stay in their current residence as they age Growing elderly population will be living independently in own homes Vital to transform future homes into intelligent human-centered environment for the elderly Golden opportunities for innovating assistive technologies for aging in place 47

48 A Basic Scenario of Tender Care for Agingin-place Unobtrusive Sensing Social Signal Processing Context Aware Auto Tagging Social Cognitive Network Unobtrusive sensing device detects: the elder keeps walking around at an irregular pace. Social signal processing indicates: the elder has been silent for an unusually long time. Cognitive Analysis result Your mother may be feeling anxious now I need to call my mother now

49 Silver Challenges 49

50 Vision To enable elderly l to maintain i an active, healthy and engaging life style in their own homes supported by an age-friendly intelligent environment, providing all- round comprehensive tender care Round-the-clock day-to-day health and wellness monitoring i Cognitive Support and recommendation to products and services Companionship and emotional support Support for maintaining/stimulating social interaction 50

51 Design Consideration and Challenges How to perform unobtrusive monitoring? - Mobile sensing, activity tracking How to provide all-around around comprehensive care? - Physical, cognitive, emotional, social, sustainability How to maintain ubiquitous access and interaction? - Cross platform, multimedia, multimodal How to provide friendly, personal touch? - Adaptive user modeling, mood detection - Proactive, natural interactioni 51

52 Approach and Methodology To support active living of elderlies through an intelligent multi-agent environment with ubiquitous access, natural interface, and alldd comprehensive rounded care Key Technologies Unobtrusive sensing and social signal processing Activity pattern and user modeling Information and service recommendation Proactive stimulation and natural interaction 52

53 A Multi-Agent Collaborative Care Environment Isabel (Personal Nurse) Alfred (The Butler) Small talk User modeling Social and travel advisory Frank (Robot Dog) Activity sensing Pattern modeling Small talk Recommendations for healthcare products and services 53

54 Why Multi-Agent? Unobtrusive sensing and monitoring agents of different characteristics and capabilities Ubiquitous i access to information i and services agents in different platforms and locations Comprehensive tender care agents with different domain knowledge and functions Three s a party more opportunities for cognitive stimulation and social interaction 54

55 Comprehensive Tender Care Physical Support Activity tracking, safety and wellness monitoring Cognitive i Support information i and recommendation on (healthcare) products, services, skills and activities t Emotional Support mood detection, affective support, small talk Social Support companionship and connection to family and friends (old and new) through sms, s and facebooks etc 55

56 Unobtrusive Sensing and Ubiquitous Access to Services unobtrusive in-home real-time data collection and contextual social signal processing - Essential to better understand and cater to the elderly s l needs. Sensing bio sensing, motion sensors, wearable/mobile sensors for health monitoring and activity tracking Cross Platform Large screen interactive display, mobile handheld devices, physical robots Multimedia text, audio, video 56

57 Adaptive User Modelling Identity and profile Interests and preferences Behaviour model: Time, space, activity it Knowledge and skills Social network: Familyand friends Meth0ds for Model Building Explicit: User specification Implicit: User actions, choices, conversation 57

58 Cognitive Support: Product/Service Recommendation Domain knowledge: Healthcare, Travel, Cooking Delivery modes: - Question & Answer - Proactive recommendation - Conversation PersonalTouch: Personalized, Context sensitive, small talks 58

59 Challenges in Big Living Analytics Volume huge amount of data through bio sensing, motion sensors, wearable/mobile sensors for health monitoring and activity tracking Velocity 24x7 real time sensing, sense making, decision making, service recommendation Variety information integration and knowledge sharing from cross platform, multimedia unstructured data - text, audio, video, gestures 59

60 Research Centre of Excellence in Active LIving for the elderly (LILY) Thank you! JOINT UBC-NTU RESEARCH CENTRE

Personalized Information Management for Web Intelligence

Personalized Information Management for Web Intelligence Personalized Information Management for Web Intelligence Ah-Hwee Tan Kent Ridge Digital Labs 21 Heng Mui Keng Terrace, Singapore 119613 Email: ahhwee@krdl.org.sg Abstract Web intelligence can be defined

More information

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Wayne W. Eckerson Director of Research, TechTarget Founder, BI Leadership Forum Business Analytics

More information

The Big Deal about Big Data. Mike Skinner, CPA CISA CITP HORNE LLP

The Big Deal about Big Data. Mike Skinner, CPA CISA CITP HORNE LLP The Big Deal about Big Data Mike Skinner, CPA CISA CITP HORNE LLP Mike Skinner, CPA CISA CITP Senior Manager, IT Assurance & Risk Services HORNE LLP Focus areas: IT security & risk assessment IT governance,

More information

Big Data: Image & Video Analytics

Big Data: Image & Video Analytics Big Data: Image & Video Analytics How it could support Archiving & Indexing & Searching Dieter Haas, IBM Deutschland GmbH The Big Data Wave 60% of internet traffic is multimedia content (images and videos)

More information

Doing Multidisciplinary Research in Data Science

Doing Multidisciplinary Research in Data Science Doing Multidisciplinary Research in Data Science Assoc.Prof. Abzetdin ADAMOV CeDAWI - Center for Data Analytics and Web Insights Qafqaz University aadamov@qu.edu.az http://ce.qu.edu.az/~aadamov 16 May

More information

category field F 2 feature field feature field ART b ART a F b 1 F 1 a - x a x b b w

category field F 2 feature field feature field ART b ART a F b 1 F 1 a - x a x b b w FOCI: A Personalized Web Intelligence System Ah-Hwee Tan, Hwee-Leng Ong, Hong Pan, Jamie Ng, Qiu-Xiang Li Kent Ridge Digital Labs 21 Heng Mui Keng Terrace, Singapore 119613, email: fahhwee, hweeleng, panhong,

More information

Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics

Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics Dr. Liangxiu Han Future Networks and Distributed Systems Group (FUNDS) School of Computing, Mathematics and Digital Technology,

More information

CIS492 Special Topics: Cloud Computing د. منذر الطزاونة

CIS492 Special Topics: Cloud Computing د. منذر الطزاونة CIS492 Special Topics: Cloud Computing د. منذر الطزاونة Big Data Definition No single standard definition Big Data is data whose scale, diversity, and complexity require new architecture, techniques, algorithms,

More information

HP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica

HP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica HP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica So What s the market s definition of Big Data? Datasets whose volume, velocity, variety

More information

A New Era Of Analytic

A New Era Of Analytic Penang egovernment Seminar 2014 A New Era Of Analytic Megat Anuar Idris Head, Project Delivery, Business Analytics & Big Data Agenda Overview of Big Data Case Studies on Big Data Big Data Technology Readiness

More information

So Just What Is Big Data? James E. Tcheng, MD, FACC, FSCAI

So Just What Is Big Data? James E. Tcheng, MD, FACC, FSCAI So Just What Is Big Data? James E. Tcheng, MD, FACC, FSCAI Disclosures James E. Tcheng, MD, FACC, FSCAI Affiliations / Financial Relationships / Other RWI ACC Chair, Informatics and Health IT Task Force

More information

Semi-Supervised Heterogeneous Fusion for Multimedia Data Co-clustering. Lei Meng, Ah-Hwee Tan, Senior Member, IEEE and Dong Xu, Member, IEEE

Semi-Supervised Heterogeneous Fusion for Multimedia Data Co-clustering. Lei Meng, Ah-Hwee Tan, Senior Member, IEEE and Dong Xu, Member, IEEE 1 Semi-Supervised Heterogeneous Fusion for Multimedia Data Co-clustering Lei Meng, Ah-Hwee Tan, Senior Member, IEEE and Dong Xu, Member, IEEE Abstract Co-clustering is a commonly used technique for tapping

More information

Data Refinery with Big Data Aspects

Data Refinery with Big Data Aspects International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 7 (2013), pp. 655-662 International Research Publications House http://www. irphouse.com /ijict.htm Data

More information

Big Data a threat or a chance?

Big Data a threat or a chance? Big Data a threat or a chance? Helwig Hauser University of Bergen, Dept. of Informatics Big Data What is Big Data? well, lots of data, right? we come back to this in a moment. certainly, a buzz-word but

More information

Introduction to Engineering Using Robotics Experiments Lecture 17 Big Data

Introduction to Engineering Using Robotics Experiments Lecture 17 Big Data Introduction to Engineering Using Robotics Experiments Lecture 17 Big Data Yinong Chen 2 Big Data Big Data Technologies Cloud Computing Service and Web-Based Computing Applications Industry Control Systems

More information

Clustering Big Data. Anil K. Jain. (with Radha Chitta and Rong Jin) Department of Computer Science Michigan State University November 29, 2012

Clustering Big Data. Anil K. Jain. (with Radha Chitta and Rong Jin) Department of Computer Science Michigan State University November 29, 2012 Clustering Big Data Anil K. Jain (with Radha Chitta and Rong Jin) Department of Computer Science Michigan State University November 29, 2012 Outline Big Data How to extract information? Data clustering

More information

Congrats to Game Winners. How can computation use data to solve problems? What topics have we covered in CS 202? Part 1: Completed!

Congrats to Game Winners. How can computation use data to solve problems? What topics have we covered in CS 202? Part 1: Completed! CS 202: Introduction to Computation " UNIVERSITY of WISCONSIN-MADISON Computer Sciences Department Professor Andrea Arpaci-Dusseau How can computation use data to solve problems? Congrats to Game Winners

More information

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: 2454-2377 Vol. 1, Issue 6, October 2015. Big Data and Hadoop

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: 2454-2377 Vol. 1, Issue 6, October 2015. Big Data and Hadoop ISSN: 2454-2377, October 2015 Big Data and Hadoop Simmi Bagga 1 Satinder Kaur 2 1 Assistant Professor, Sant Hira Dass Kanya MahaVidyalaya, Kala Sanghian, Distt Kpt. INDIA E-mail: simmibagga12@gmail.com

More information

Are You Ready for Big Data?

Are You Ready for Big Data? Are You Ready for Big Data? Jim Gallo National Director, Business Analytics February 11, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?

More information

Introduction to the Mathematics of Big Data. Philippe B. Laval

Introduction to the Mathematics of Big Data. Philippe B. Laval Introduction to the Mathematics of Big Data Philippe B. Laval Fall 2015 Introduction In recent years, Big Data has become more than just a buzz word. Every major field of science, engineering, business,

More information

Majed Al-Ghandour, PhD, PE, CPM Division of Planning and Programming NCDOT 2016 NCAMPO Conference- Greensboro, NC May 12, 2016

Majed Al-Ghandour, PhD, PE, CPM Division of Planning and Programming NCDOT 2016 NCAMPO Conference- Greensboro, NC May 12, 2016 Big Data! Majed Al-Ghandour, PhD, PE, CPM Division of Planning and Programming NCDOT 2016 NCAMPO Conference- Greensboro, NC May 12, 2016 Big Data: Data Analytical Tools for Decision Support 2 Outline Introduce

More information

www.pwc.com/oracle Next presentation starting soon Business Analytics using Big Data to gain competitive advantage

www.pwc.com/oracle Next presentation starting soon Business Analytics using Big Data to gain competitive advantage www.pwc.com/oracle Next presentation starting soon Business Analytics using Big Data to gain competitive advantage If every image made and every word written from the earliest stirring of civilization

More information

Are You Ready for Big Data?

Are You Ready for Big Data? Are You Ready for Big Data? Jim Gallo National Director, Business Analytics April 10, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?

More information

What happens when Big Data and Master Data come together?

What happens when Big Data and Master Data come together? What happens when Big Data and Master Data come together? Jeremy Pritchard Master Data Management fgdd 1 What is Master Data? Master data is data that is shared by multiple computer systems. The Information

More information

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank Agenda» Overview» What is Big Data?» Accelerates advances in computer & technologies» Revolutionizes data measurement»

More information

BIG DATA: ARE YOU READY? Andy Kyiet Demand Flow Intelligence May, 2013

BIG DATA: ARE YOU READY? Andy Kyiet Demand Flow Intelligence May, 2013 BIG DATA: ARE YOU READY? Andy Kyiet Demand Flow Intelligence May, 2013 PERSONAL BACKGROUND Founder of the first specialist Service Management & Helpdesk System provider in Europe Past President of AFSMI

More information

Statistics for BIG data

Statistics for BIG data Statistics for BIG data Statistics for Big Data: Are Statisticians Ready? Dennis Lin Department of Statistics The Pennsylvania State University John Jordan and Dennis K.J. Lin (ICSA-Bulletine 2014) Before

More information

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK A SURVEY ON BIG DATA ISSUES AMRINDER KAUR Assistant Professor, Department of Computer

More information

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014 RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer

More information

Now, Next and the Future: IT, Big Data and other Implications for RIM. Presented by Michael S. Smith / http://about.me/mikessmith

Now, Next and the Future: IT, Big Data and other Implications for RIM. Presented by Michael S. Smith / http://about.me/mikessmith Now, Next and the Future: IT, Big Data and other Implications for RIM Agenda for This Afternoon Now: What trends are creating implications within the profession? Next: Why is IT now concerned about RIM?

More information

Medical Big Data Interpretation

Medical Big Data Interpretation Medical Big Data Interpretation Vice president of the Xiangya Hospital, Central South University The director of the ministry of mobile medical education key laboratory Professor Jianzhong Hu BIG DATA

More information

BIG DATA CHALLENGES AND PERSPECTIVES

BIG DATA CHALLENGES AND PERSPECTIVES BIG DATA CHALLENGES AND PERSPECTIVES Meenakshi Sharma 1, Keshav Kishore 2 1 Student of Master of Technology, 2 Head of Department, Department of Computer Science and Engineering, A P Goyal Shimla University,

More information

UNDERSTANDING THE BIG DATA PROBLEMS AND THEIR SOLUTIONS USING HADOOP AND MAP-REDUCE

UNDERSTANDING THE BIG DATA PROBLEMS AND THEIR SOLUTIONS USING HADOOP AND MAP-REDUCE UNDERSTANDING THE BIG DATA PROBLEMS AND THEIR SOLUTIONS USING HADOOP AND MAP-REDUCE Mr. Swapnil A. Kale 1, Prof. Sangram S.Dandge 2 1 ME (CSE), First Year, Department of CSE, Prof. Ram Meghe Institute

More information

The Scientific Data Mining Process

The Scientific Data Mining Process Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In

More information

Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA

Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA http://kzhang6.people.uic.edu/tutorial/amcis2014.html August 7, 2014 Schedule I. Introduction to big data

More information

Applications for Business Intelligence, Predictive Analytics and Big Data

Applications for Business Intelligence, Predictive Analytics and Big Data Finance, Management, & Operations Applications for Business Intelligence, Predictive Analytics and Big Data Patrick Bogan, Chief Information Officer, Fuzion Analytics Kyle Korzenowski, Chief Information

More information

BIG DATA & SOCIAL INNOVATION KENNETH THOMAS, CLIENT MANAGER

BIG DATA & SOCIAL INNOVATION KENNETH THOMAS, CLIENT MANAGER BIG DATA & SOCIAL INNOVATION KENNETH THOMAS, CLIENT MANAGER 1 MAKING THE RIGHT DECISSION AT THE RIGHT PLACE AT THE RIGHT TIME 2 THE DATA MULTIPLIER EFFECT AT WORK BUSINESS DRIVEN HUMAN DRIVEN MACHINE DRIVEN

More information

Is Big Data a Big Deal? What Big Data Does to Science

Is Big Data a Big Deal? What Big Data Does to Science Is Big Data a Big Deal? What Big Data Does to Science Netherlands escience Center Wilco Hazeleger Wilco Hazeleger Student @ Wageningen University and Reading University Meteorology PhD @ Utrecht University,

More information

Sunnie Chung. Cleveland State University

Sunnie Chung. Cleveland State University Sunnie Chung Cleveland State University Data Scientist Big Data Processing Data Mining 2 INTERSECT of Computer Scientists and Statisticians with Knowledge of Data Mining AND Big data Processing Skills:

More information

Chapter 1. Contrasting traditional and visual analytics approaches

Chapter 1. Contrasting traditional and visual analytics approaches Chapter 1 Understanding Big Data Analytics In This Chapter Defining Big Data Understanding Big Data Analytics Contrasting traditional and visual analytics approaches The era of Big Data is upon us. The

More information

Is a Data Scientist the New Quant? Stuart Kozola MathWorks

Is a Data Scientist the New Quant? Stuart Kozola MathWorks Is a Data Scientist the New Quant? Stuart Kozola MathWorks 2015 The MathWorks, Inc. 1 Facts or information used usually to calculate, analyze, or plan something Information that is produced or stored by

More information

Big Data and its use for Communications Service Providers. IoT, Washington, DC April 9, 2014 Sanjay Mishra

Big Data and its use for Communications Service Providers. IoT, Washington, DC April 9, 2014 Sanjay Mishra Big Data and its use for Communications Service Providers IoT, Washington, DC April 9, 2014 Sanjay Mishra 4/9/2014 IoT, Washington, DC, Big Data & its use for CSP (c) 2014 1 Information and Communications

More information

CIS 4930/6930 Spring 2014 Introduction to Data Science Data Intensive Computing. University of Florida, CISE Department Prof.

CIS 4930/6930 Spring 2014 Introduction to Data Science Data Intensive Computing. University of Florida, CISE Department Prof. CIS 4930/6930 Spring 2014 Introduction to Data Science Data Intensive Computing University of Florida, CISE Department Prof. Daisy Zhe Wang Data Science Overview Why, What, How, Who Outline Why Data Science?

More information

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014 5 Keys to Unlocking the Big Data Analytics Puzzle Anurag Tandon Director, Product Marketing March 26, 2014 1 A Little About Us A global footprint. A proven innovator. A leader in enterprise analytics for

More information

Exploring Big Data in Social Networks

Exploring Big Data in Social Networks Exploring Big Data in Social Networks virgilio@dcc.ufmg.br (meira@dcc.ufmg.br) INWEB National Science and Technology Institute for Web Federal University of Minas Gerais - UFMG May 2013 Some thoughts about

More information

Information Management course

Information Management course Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 01 : 06/10/2015 Practical informations: Teacher: Alberto Ceselli (alberto.ceselli@unimi.it)

More information

Keywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance.

Keywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance. Volume 4, Issue 11, November 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Analytics

More information

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning How to use Big Data in Industry 4.0 implementations LAURI ILISON, PhD Head of Big Data and Machine Learning Big Data definition? Big Data is about structured vs unstructured data Big Data is about Volume

More information

The Challenge of Handling Large Data Sets within your Measurement System

The Challenge of Handling Large Data Sets within your Measurement System The Challenge of Handling Large Data Sets within your Measurement System The Often Overlooked Big Data Aaron Edgcumbe Marketing Engineer Northern Europe, Automated Test National Instruments Introduction

More information

Oracle Real Time Decisions

Oracle Real Time Decisions A Product Review James Taylor CEO CONTENTS Introducing Decision Management Systems Oracle Real Time Decisions Product Architecture Key Features Availability Conclusion Oracle Real Time Decisions (RTD)

More information

Autonomy Consolidated Archive

Autonomy Consolidated Archive Autonomy Consolidated Archive Dennis Wild Director SME, Information Governance and Archiving POWER PROTECT PROMOTE Meaning-Based Governance Files IM Audio Email Social Video SharePoint Archiving = Gain

More information

Data Centric Systems (DCS)

Data Centric Systems (DCS) Data Centric Systems (DCS) Architecture and Solutions for High Performance Computing, Big Data and High Performance Analytics High Performance Computing with Data Centric Systems 1 Data Centric Systems

More information

Transforming the Telecoms Business using Big Data and Analytics

Transforming the Telecoms Business using Big Data and Analytics Transforming the Telecoms Business using Big Data and Analytics Event: ICT Forum for HR Professionals Venue: Meikles Hotel, Harare, Zimbabwe Date: 19 th 21 st August 2015 AFRALTI 1 Objectives Describe

More information

Introduction to Data Mining

Introduction to Data Mining Introduction to Data Mining 1 Why Data Mining? Explosive Growth of Data Data collection and data availability Automated data collection tools, Internet, smartphones, Major sources of abundant data Business:

More information

Chapter 7. Using Hadoop Cluster and MapReduce

Chapter 7. Using Hadoop Cluster and MapReduce Chapter 7 Using Hadoop Cluster and MapReduce Modeling and Prototyping of RMS for QoS Oriented Grid Page 152 7. Using Hadoop Cluster and MapReduce for Big Data Problems The size of the databases used in

More information

Big Data Solutions. Portal Development with MongoDB and Liferay. Solutions

Big Data Solutions. Portal Development with MongoDB and Liferay. Solutions Big Data Solutions Portal Development with MongoDB and Liferay Solutions Introduction Companies have made huge investments in Business Intelligence and analytics to better understand their clients and

More information

How To Use Big Data Effectively

How To Use Big Data Effectively Why is BIG Data Important? March 2012 1 Why is BIG Data Important? A Navint Partners White Paper May 2012 Why is BIG Data Important? March 2012 2 What is Big Data? Big data is a term that refers to data

More information

Introduction. A. Bellaachia Page: 1

Introduction. A. Bellaachia Page: 1 Introduction 1. Objectives... 3 2. What is Data Mining?... 4 3. Knowledge Discovery Process... 5 4. KD Process Example... 7 5. Typical Data Mining Architecture... 8 6. Database vs. Data Mining... 9 7.

More information

Introduction to Predictive Analytics. Dr. Ronen Meiri ronen@dmway.com

Introduction to Predictive Analytics. Dr. Ronen Meiri ronen@dmway.com Introduction to Predictive Analytics Dr. Ronen Meiri Outline From big data to predictive analytics Predictive Analytics vs. BI Intelligent platforms What can we do with it. The modeling process. Example

More information

Big Data Challenges and Success Factors. Deloitte Analytics Your data, inside out

Big Data Challenges and Success Factors. Deloitte Analytics Your data, inside out Big Data Challenges and Success Factors Deloitte Analytics Your data, inside out Big Data refers to the set of problems and subsequent technologies developed to solve them that are hard or expensive to

More information

Information Security, PII and Big Data

Information Security, PII and Big Data ITU Workshop on ICT Security Standardization for Developing Countries (Geneva, Switzerland, 15-16 September 2014) Information Security, PII and Big Data Edward (Ted) Humphreys ISO/IEC JTC 1/SC 27 (WG1

More information

Big Data Analytics. Prof. Dr. Lars Schmidt-Thieme

Big Data Analytics. Prof. Dr. Lars Schmidt-Thieme Big Data Analytics Prof. Dr. Lars Schmidt-Thieme Information Systems and Machine Learning Lab (ISMLL) Institute of Computer Science University of Hildesheim, Germany 33. Sitzung des Arbeitskreises Informationstechnologie,

More information

CAP4773/CIS6930 Projects in Data Science, Fall 2014 [Review] Overview of Data Science

CAP4773/CIS6930 Projects in Data Science, Fall 2014 [Review] Overview of Data Science CAP4773/CIS6930 Projects in Data Science, Fall 2014 [Review] Overview of Data Science Dr. Daisy Zhe Wang CISE Department University of Florida August 25th 2014 20 Review Overview of Data Science Why Data

More information

Data Centric Computing Revisited

Data Centric Computing Revisited Piyush Chaudhary Technical Computing Solutions Data Centric Computing Revisited SPXXL/SCICOMP Summer 2013 Bottom line: It is a time of Powerful Information Data volume is on the rise Dimensions of data

More information

Example application (1) Telecommunication. Lecture 1: Data Mining Overview and Process. Example application (2) Health

Example application (1) Telecommunication. Lecture 1: Data Mining Overview and Process. Example application (2) Health Lecture 1: Data Mining Overview and Process What is data mining? Example applications Definitions Multi disciplinary Techniques Major challenges The data mining process History of data mining Data mining

More information

Hadoop Market - Global Industry Analysis, Size, Share, Growth, Trends, and Forecast, 2012 2018

Hadoop Market - Global Industry Analysis, Size, Share, Growth, Trends, and Forecast, 2012 2018 Transparency Market Research Hadoop Market - Global Industry Analysis, Size, Share, Growth, Trends, and Forecast, 2012 2018 Buy Now Request Sample Published Date: July 2013 Single User License: US $ 4595

More information

Big Analytics: A Next Generation Roadmap

Big Analytics: A Next Generation Roadmap Big Analytics: A Next Generation Roadmap Cloud Developers Summit & Expo: October 1, 2014 Neil Fox, CTO: SoftServe, Inc. 2014 SoftServe, Inc. Remember Life Before The Web? 1994 Even Revolutions Take Time

More information

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time SCALEOUT SOFTWARE How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time by Dr. William Bain and Dr. Mikhail Sobolev, ScaleOut Software, Inc. 2012 ScaleOut Software, Inc. 12/27/2012 T wenty-first

More information

A Survey on Big Data Concepts and Tools

A Survey on Big Data Concepts and Tools A Survey on Big Data Concepts and Tools D. Rajasekar 1, C. Dhanamani 2, S. K. Sandhya 3 1,3 PG Scholar, 2 Assistant Professor, Department of Computer Science and Engineering, Sri Krishna College of Engineering

More information

Big Data Analytics. The Hype and the Hope* Dr. Ted Ralphs Industrial and Systems Engineering Director, COR@L Laboratory

Big Data Analytics. The Hype and the Hope* Dr. Ted Ralphs Industrial and Systems Engineering Director, COR@L Laboratory Big Data Analytics The Hype and the Hope* Dr. Ted Ralphs Industrial and Systems Engineering Director, COR@L Laboratory * Source: http://www.economistinsights.com/technology-innovation/analysis/hype-and-hope/methodology

More information

Analyzing Big Data: The Path to Competitive Advantage

Analyzing Big Data: The Path to Competitive Advantage White Paper Analyzing Big Data: The Path to Competitive Advantage by Marcia Kaplan Contents Introduction....2 How Big is Big Data?................................................................................

More information

MLg. Big Data and Its Implication to Research Methodologies and Funding. Cornelia Caragea TARDIS 2014. November 7, 2014. Machine Learning Group

MLg. Big Data and Its Implication to Research Methodologies and Funding. Cornelia Caragea TARDIS 2014. November 7, 2014. Machine Learning Group Big Data and Its Implication to Research Methodologies and Funding Cornelia Caragea TARDIS 2014 November 7, 2014 UNT Computer Science and Engineering Data Everywhere Lots of data is being collected and

More information

SURVEY REPORT DATA SCIENCE SOCIETY 2014

SURVEY REPORT DATA SCIENCE SOCIETY 2014 SURVEY REPORT DATA SCIENCE SOCIETY 2014 TABLE OF CONTENTS Contents About the Initiative 1 Report Summary 2 Participants Info 3 Participants Expertise 6 Suggested Discussion Topics 7 Selected Responses

More information

Smarter Planet evolution

Smarter Planet evolution Smarter Planet evolution 13/03/2012 2012 IBM Corporation Ignacio Pérez González Enterprise Architect ignacio.perez@es.ibm.com @ignaciopr Mike May Technologies of the Change Capabilities Tendencies Vision

More information

Big Data: Opportunities & Challenges, Myths & Truths 資 料 來 源 : 台 大 廖 世 偉 教 授 課 程 資 料

Big Data: Opportunities & Challenges, Myths & Truths 資 料 來 源 : 台 大 廖 世 偉 教 授 課 程 資 料 Big Data: Opportunities & Challenges, Myths & Truths 資 料 來 源 : 台 大 廖 世 偉 教 授 課 程 資 料 美 國 13 歲 學 生 用 Big Data 找 出 霸 淩 熱 點 Puri 架 設 網 站 Bullyvention, 藉 由 分 析 Twitter 上 找 出 提 到 跟 霸 凌 相 關 的 詞, 搭 配 地 理 位 置

More information

Impact of Big Data in Oil & Gas Industry. Pranaya Sangvai Reliance Industries Limited 04 Feb 15, DEJ, Mumbai, India.

Impact of Big Data in Oil & Gas Industry. Pranaya Sangvai Reliance Industries Limited 04 Feb 15, DEJ, Mumbai, India. Impact of Big Data in Oil & Gas Industry Pranaya Sangvai Reliance Industries Limited 04 Feb 15, DEJ, Mumbai, India. New Age Information 2.92 billions Internet Users in 2014 Twitter processes 7 terabytes

More information

Industry Impact of Big Data in the Cloud: An IBM Perspective

Industry Impact of Big Data in the Cloud: An IBM Perspective Industry Impact of Big Data in the Cloud: An IBM Perspective Inhi Cho Suh IBM Software Group, Information Management Vice President, Product Management and Strategy email: inhicho@us.ibm.com twitter: @inhicho

More information

Big Data Analytics: 14 November 2013

Big Data Analytics: 14 November 2013 www.pwc.com CSM-ACE 2013 Big Data Analytics: Take it to the next level in building innovation, differentiation and growth 14 About me Data analytics in the UK Forensic technology and data analytics in

More information

Bruhati Technologies. About us. ISO 9001:2008 certified. Technology fit for Business

Bruhati Technologies. About us. ISO 9001:2008 certified. Technology fit for Business Bruhati Technologies ISO 9001:2008 certified Technology fit for Business About us 1 Strong, agile and adaptive Leadership Geared up technologies for and fast moving long lasting With sound understanding

More information

Text Mining: The state of the art and the challenges

Text Mining: The state of the art and the challenges Text Mining: The state of the art and the challenges Ah-Hwee Tan Kent Ridge Digital Labs 21 Heng Mui Keng Terrace Singapore 119613 Email: ahhwee@krdl.org.sg Abstract Text mining, also known as text data

More information

A Strategic Approach to Unlock the Opportunities from Big Data

A Strategic Approach to Unlock the Opportunities from Big Data A Strategic Approach to Unlock the Opportunities from Big Data Yue Pan, Chief Scientist for Information Management and Healthcare IBM Research - China [contacts: panyue@cn.ibm.com ] Big Data or Big Illusion?

More information

Web Mining using Artificial Ant Colonies : A Survey

Web Mining using Artificial Ant Colonies : A Survey Web Mining using Artificial Ant Colonies : A Survey Richa Gupta Department of Computer Science University of Delhi ABSTRACT : Web mining has been very crucial to any organization as it provides useful

More information

Big Data Performance Growth on the Rise

Big Data Performance Growth on the Rise Impact of Big Data growth On Transparent Computing Michael A. Greene Intel Vice President, Software and Services Group, General Manager, System Technologies and Optimization 1 Transparent Computing (TC)

More information

Age of Big data. Presented by: Mohammad Iqbal BCM -2014

Age of Big data. Presented by: Mohammad Iqbal BCM -2014 Age of Presented by: Mohammad Iqbal BCM -2014 Agenda Big? Big evolution from Big? Name Symbol Value Kilobyte KB 10^3 BIG DATA Megabyte MB 10^6 Gigabyte GB 10^9 Terabyte TB 10^12 Petabyte PB 10^15 So large

More information

Government Technology Trends to Watch in 2014: Big Data

Government Technology Trends to Watch in 2014: Big Data Government Technology Trends to Watch in 2014: Big Data OVERVIEW The federal government manages a wide variety of civilian, defense and intelligence programs and services, which both produce and require

More information

Big Data: Study in Structured and Unstructured Data

Big Data: Study in Structured and Unstructured Data Big Data: Study in Structured and Unstructured Data Motashim Rasool 1, Wasim Khan 2 mail2motashim@gmail.com, khanwasim051@gmail.com Abstract With the overlay of digital world, Information is available

More information

White Paper. Version 1.2 May 2015 RAID Incorporated

White Paper. Version 1.2 May 2015 RAID Incorporated White Paper Version 1.2 May 2015 RAID Incorporated Introduction The abundance of Big Data, structured, partially-structured and unstructured massive datasets, which are too large to be processed effectively

More information

So What s the Big Deal?

So What s the Big Deal? So What s the Big Deal? Presentation Agenda Introduction What is Big Data? So What is the Big Deal? Big Data Technologies Identifying Big Data Opportunities Conducting a Big Data Proof of Concept Big Data

More information

Big Data Introduction, Importance and Current Perspective of Challenges

Big Data Introduction, Importance and Current Perspective of Challenges International Journal of Advances in Engineering Science and Technology 221 Available online at www.ijaestonline.com ISSN: 2319-1120 Big Data Introduction, Importance and Current Perspective of Challenges

More information

Volume 3, Issue 8, August 2015 International Journal of Advance Research in Computer Science and Management Studies

Volume 3, Issue 8, August 2015 International Journal of Advance Research in Computer Science and Management Studies Volume 3, Issue 8, August 2015 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online at: www.ijarcsms.com An

More information

Unleashing the Potential of your Social Media and CRM Data. Markus Hirsch Sales Manager

Unleashing the Potential of your Social Media and CRM Data. Markus Hirsch Sales Manager Unleashing the Potential of your Social Media and CRM Data Markus Hirsch Sales Manager To get the RIGHT INFORMATION to the RIGHT PLACE, at the RIGHT TIME and put it in the RIGHT CONTEXT to make the WORLD

More information

ANALYTICS BUILT FOR INTERNET OF THINGS

ANALYTICS BUILT FOR INTERNET OF THINGS ANALYTICS BUILT FOR INTERNET OF THINGS Big Data Reporting is Out, Actionable Insights are In In recent years, it has become clear that data in itself has little relevance, it is the analysis of it that

More information

Introduction to Data Mining

Introduction to Data Mining Bioinformatics Ying Liu, Ph.D. Laboratory for Bioinformatics University of Texas at Dallas Spring 2008 Introduction to Data Mining 1 Motivation: Why data mining? What is data mining? Data Mining: On what

More information

Big Data Just Noise or Does it Matter?

Big Data Just Noise or Does it Matter? Big Data Just Noise or Does it Matter? Opportunities for Continuous Auditing Presented by: Solon Angel Product Manager Servers The CaseWare Group. Founded in 1988. An industry leader in providing technology

More information

Big Data. Fast Forward. Putting data to productive use

Big Data. Fast Forward. Putting data to productive use Big Data Putting data to productive use Fast Forward What is big data, and why should you care? Get familiar with big data terminology, technologies, and techniques. Getting started with big data to realize

More information

Introduction to Data Mining

Introduction to Data Mining Introduction to Data Mining Jay Urbain Credits: Nazli Goharian & David Grossman @ IIT Outline Introduction Data Pre-processing Data Mining Algorithms Naïve Bayes Decision Tree Neural Network Association

More information

Exploiting Data at Rest and Data in Motion with a Big Data Platform

Exploiting Data at Rest and Data in Motion with a Big Data Platform Exploiting Data at Rest and Data in Motion with a Big Data Platform Sarah Brader, sarah_brader@uk.ibm.com What is Big Data? Where does it come from? 12+ TBs of tweet data every day 30 billion RFID tags

More information

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW AGENDA What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story Hadoop PDW Our BIG DATA Roadmap BIG DATA? Volume 59% growth in annual WW information 1.2M Zetabytes (10 21 bytes) this

More information

Augmented Search for IT Data Analytics. New frontier in big log data analysis and application intelligence

Augmented Search for IT Data Analytics. New frontier in big log data analysis and application intelligence Augmented Search for IT Data Analytics New frontier in big log data analysis and application intelligence Business white paper May 2015 IT data is a general name to log data, IT metrics, application data,

More information

NetApp Big Content Solutions: Agile Infrastructure for Big Data

NetApp Big Content Solutions: Agile Infrastructure for Big Data White Paper NetApp Big Content Solutions: Agile Infrastructure for Big Data Ingo Fuchs, NetApp April 2012 WP-7161 Executive Summary Enterprises are entering a new era of scale, in which the amount of data

More information