An analysis of Big Data ecosystem from an HCI perspective.
Jay Sanghvi, Rensselaer Polytechnic Institute
For: Theory and Research in Technical Communication and HCI, Rensselaer Polytechnic Institute
Wednesday, December 5th 2012
Abstract
The potential benefits of Big Data are practical and significant, and some initial benefits have already been achieved. However, many technical and people-related challenges must still be addressed to fully exploit its potential. The sheer size of the data is the most obvious challenge, but there are others: heterogeneity in data type, representation, and semantic interpretation, and the rate at which the data needs to be processed. Two further important aspects are privacy and usability. This paper presents these challenges from an HCI perspective.
Table of Contents
1. What is Big Data
2. Applications and Benefits
3. Data Analysis Pipeline
4. Challenges
4.1 Fundamental challenges: Volume, Velocity, Variety
4.2 Technology related challenges: Technology usability, Application acumen, Provenance, Annotations, Cloud, Visualization
4.3 People related challenges: Data ownership, Ethics, Privacy
5. Conclusion
1. What is Big Data
Every day, we create 2.5 quintillion bytes of data, so much that 90% of the data in the world today has been created in the last two years alone. This data comes from everywhere: sensors used to gather climate information, posts to social media sites, digital pictures and videos, purchase transaction records, and cell phone GPS signals, to name a few. This data is big data. Big data is data that exceeds the processing capacity of conventional information systems. The data is too big, moves too fast, or doesn't fit the strictures of conventional information architectures. To gain value from this data, we must choose an alternative way to process it.

2. Applications and Benefits
Scientific research has been revolutionized by Big Data. The field of astronomy is being transformed from one where taking pictures of the sky was a large part of an astronomer's job to one where the pictures are all in a database already and the astronomer's task is to find interesting objects and phenomena in the database.

Big Data has the potential to revolutionize not just research, but also education. A recent detailed quantitative comparison of the approaches taken by 35 charter schools in NYC found that one of the top five policies correlated with measurable academic effectiveness was the use of data to guide instruction.

It is widely believed that the use of information technology can reduce the cost of healthcare while improving its quality, by making care more preventive and personalized and basing it on more extensive (home based) continuous monitoring. McKinsey estimates a savings of 300 billion dollars every year in the US alone.

Similarly, strong cases have been made for the value of Big Data in urban planning, intelligent transportation, environmental modeling, energy saving, smart materials, computational social sciences, financial systemic risk analysis, homeland security, computer security, and so on.
In 2010, enterprises and users stored more than 13 exabytes of new data; this is over 50,000 times the data in the Library of Congress. The potential value of global personal location data is estimated to be $700 billion to end users, and it can result in an up to 50% decrease in product development and assembly costs, according to a recent McKinsey report. McKinsey predicts an equally great effect of Big Data in employment, where 140, ,000 workers with deep analytical experience will be needed in the US; furthermore, 1.5 million managers will need to become data literate. Not surprisingly, the recent PCAST report on Networking and IT R&D identified Big Data as a research frontier that can accelerate progress across a broad range of
priorities. Even popular news media now appreciate the value of Big Data, as evidenced by coverage in the Economist, the New York Times, and National Public Radio.

3. Data Analysis Pipeline

3.1 Data Acquisition and Recording
Data is recorded from some data generating source such as human interaction, human behaviour, business transactions, nature, or scientific experiments and simulations, which today can easily and continuously produce petabytes of data.

3.2 Information Extraction and Cleaning
The information collected is almost never in a format ready for analysis. Consider, for example, pictures of deep space: we cannot leave the data in image form and still analyze it effectively. Rather, we require an information extraction process that pulls out the required information from the underlying sources and expresses it in a structured form suitable for analysis. Doing this correctly and completely is a continuing technical challenge. Note that this data also includes images and will in the future include video; such extraction is often highly application dependent. The instruments used for data capture are often biased under certain conditions (for example, pictures of deep space taken by a space telescope while it was in the radiation field of a meteor or another star may have been affected in a particular way), which makes it imperative to clean the data.
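The extraction and cleaning steps of Section 3.2 can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions: the comma-separated log format, the `Observation` record, and the `radiation_flag` field are all hypothetical stand-ins for whatever an actual instrument produces.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional


@dataclass
class Observation:
    """One structured record extracted from a raw instrument log line."""
    object_id: str
    brightness: float
    radiation_flag: bool  # hypothetical marker: reading taken in a radiation field


def extract(line: str) -> Optional[Observation]:
    """Parse 'object_id, brightness, flag'; return None if the line is malformed."""
    parts = [p.strip() for p in line.split(",")]
    if len(parts) != 3:
        return None
    try:
        brightness = float(parts[1])
    except ValueError:
        return None
    return Observation(parts[0], brightness, parts[2] == "1")


def clean(observations: Iterable[Observation]) -> List[Observation]:
    """Drop readings captured under conditions known to bias the instrument."""
    return [o for o in observations if not o.radiation_flag]


# Hypothetical raw input: two usable lines, one biased reading, one garbled line.
raw_lines = [
    "star-17, 4.2, 0",
    "star-18, 9.9, 1",
    "garbled line",
    "star-19, 3.1, 0",
]

extracted = [o for o in (extract(line) for line in raw_lines) if o is not None]
cleaned = clean(extracted)
print([o.object_id for o in cleaned])
```

Keeping extraction and cleaning as separate functions mirrors the text: the first turns raw material into structured records, the second applies knowledge about instrument bias, and that knowledge can change without touching the parser.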
3.3 Data Integration, Aggregation, and Representation
Given the heterogeneity of big data (in terms of what it represents, its format, granularity, semantic interpretation, and intent), it is not enough merely to record it and store it in a repository. We need to make sure that it is discoverable and that the larger community can actually use it. Adequate annotations help, but integration and aggregation remain challenging because of differences in experimental details and in the record structure of two or more data sets. Data analysis is significantly more challenging than just locating, identifying, understanding, and citing data. For effective large scale analysis, all these steps need to happen in a completely automated manner. This requires differences in data structure and semantics to be expressed in forms that are computer understandable and then automatically resolvable, i.e. we need an error-free difference resolution method that is independent of data structure.

Analysis is not simple even when only one data set is involved. There are many alternative ways to store the same information. Certain database designs have advantages over others for certain purposes, and possibly drawbacks for other purposes. Database design expertise is limited to a few qualified professionals. There exist no tools or frameworks that enable other professionals, such as domain scientists, to create effective database designs.

3.4 Query Processing, Data Modeling, and Analysis
This phase involves retrieving target data from heterogeneous, interrelated, redundant data sources; mining big data; cross checking conflicting cases; validating trustworthy relationships; identifying inherent clusters; and uncovering hidden relationships and models. Big Data enables the next generation of interactive data analysis with real time answers. Scaling complex query processing techniques to terabytes while enabling interactive response times is a major open research problem today.
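To make the integration problem of Section 3.3 concrete, the sketch below expresses differences in record structure and units as explicit, machine-readable mappings and then resolves them automatically. It assumes two hypothetical weather stations whose field names (`station_id`, `loc`, `temp_f`, and so on) are invented for illustration; a real system would need far richer semantics than a field-to-field table.

```python
from typing import Callable, Dict, List, Tuple

# A mapping says, for each field of the common schema, which source field
# it comes from and how to convert the value. All names are hypothetical.
Mapping = Dict[str, Tuple[str, Callable[[str], object]]]


def fahrenheit_to_celsius(value: str) -> float:
    """Convert a Fahrenheit reading (given as text) to Celsius, one decimal."""
    return round((float(value) - 32) * 5 / 9, 1)


STATION_A: Mapping = {
    "site": ("station_id", str),
    "temp_c": ("temperature_c", float),
}
STATION_B: Mapping = {
    "site": ("loc", str),
    "temp_c": ("temp_f", fahrenheit_to_celsius),
}


def to_common(record: Dict[str, str], mapping: Mapping) -> Dict[str, object]:
    """Rewrite one source record into the common schema."""
    return {
        target: convert(record[source])
        for target, (source, convert) in mapping.items()
    }


def integrate(sources: List[Tuple[List[Dict[str, str]], Mapping]]) -> List[Dict[str, object]]:
    """Merge several data sets, each paired with its own mapping."""
    merged: List[Dict[str, object]] = []
    for records, mapping in sources:
        merged.extend(to_common(r, mapping) for r in records)
    return merged


data_a = [{"station_id": "A1", "temperature_c": "21.5"}]
data_b = [{"loc": "B7", "temp_f": "68"}]
combined = integrate([(data_a, STATION_A), (data_b, STATION_B)])
print(combined)
```

The design choice worth noting is that the differences between the data sets live in data (the mappings), not in code, which is one way to read the requirement that they be "computer understandable".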
3.5 Interpretation
Analyzing Big Data is of limited value if users cannot understand the result. An expert decision maker, provided with the result of an analysis, has to interpret it. This interpretation involves examining all the assumptions made and retracing the analysis. There are also many possible sources of error: bugs in computer systems, assumptions built into the data models, and results based on erroneous data. For all of these reasons, no responsible user will cede authority to the computer system. The recent mortgage related shock to the financial system dramatically underscored the need for such decision maker diligence: rather than accept the stated solvency of a financial institution at face value, a decision maker has to examine
critically the many assumptions at multiple stages of analysis. Hence, it is not enough to provide just the results. Rather, one must provide supplementary information that explains how each result was derived and precisely what inputs it was based upon, i.e. the provenance of the (result) data. Systems with a rich variety of visualizations are important for conveying the results of queries in the way best understood by a particular set of people. Results need to be presented using powerful visualizations that assist interpretation and support user collaboration.

4. Challenges
Almost all the challenges for Big Data development and adoption are due to its three fundamental dimensions: Volume, Velocity and Variety.

4.1 Fundamental challenges
Volume: Enterprises are awash with ever-growing data of all types, easily amassing terabytes, even petabytes, of information per day.
Turn 12 terabytes of Tweets created each day into improved product sentiment analysis.
Convert 350 billion annual meter readings to better predict power consumption.

Velocity: Sometimes 2 minutes is too late. For time sensitive processes such as catching fraud, big data must be used as it streams into your enterprise in order to maximize its value.
Scrutinize 5 million trade events created each day to identify potential fraud.
Analyze 500 million daily call detail records in real time to predict customer churn faster.

Variety: Big data is any type of data: structured and unstructured data such as text, sensor data, audio, video, click streams, log files and more. New insights are found when analyzing these data types together.
Monitor hundreds of live video feeds from surveillance cameras to target points of interest.
Exploit the 80% data growth in images, video and documents to improve customer satisfaction.

4.2 Technology related challenges
Technology usability: Tremendous progress has been made in developing technologies and tools that make the benefits of big data accessible to even the smallest organizations. Because almost all of this development is open source and relatively young, there is huge scope for consolidation and standardization of technologies and tools.
Apache Hadoop is one of the open source projects that enable big data and has been the driving force behind its growing reach. Programming Hadoop means working with its Java APIs, many of which are known for their poor usability. Because the Apache Hadoop project is relatively young and constantly evolving, there isn't much focus on ease of learning, which makes the learning curve steep. This is one major hurdle for people willing to adopt these technologies. In fact, many promising startups have sprung up just to make these technologies simpler to understand and use. Another hurdle is that the alternative sub-technologies under Hadoop overlap or are mutually exclusive in the features they offer, so no single sub-technology is complete in itself, and implementing a given piece of functionality requires combining several of them.

Application acumen: As big data finds increasingly varied applications in more and more disciplines, the chances that an existing data set will be used for an unintended application are increasing. It is also difficult, if not downright impossible, to anticipate how a particular set of data collected today will be used, even if the application is in the same intended discipline. This inability results in unhelpful or inadequate annotations, provenance and metadata. In fact, the very definition of noise may change depending on the application. Joining two or more data sets, or joining data within the same data set, requires a thorough understanding of the intent of the various data manipulations. The people making these decisions also need to be proficient in understanding and manipulating the independent variables and in understanding how they relate to and affect the dependent variables. The analysis, the modelling and the interpretation of results are all functions of the analyst's proficiency with the tools and domain knowledge.
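The point about understanding the intent behind a join can be illustrated with a small sketch. The two data sets below (regional sales and regional rainfall) and their values are hypothetical; the sketch only shows how two common join semantics give different answers on the same inputs.

```python
from typing import Dict, List, Optional, Tuple


def inner_join(
    left: Dict[str, float], right: Dict[str, float]
) -> List[Tuple[str, float, float]]:
    """Keep only keys present in both data sets."""
    return [(k, left[k], right[k]) for k in sorted(left) if k in right]


def left_join(
    left: Dict[str, float], right: Dict[str, float]
) -> List[Tuple[str, float, Optional[float]]]:
    """Keep every key of the left data set; missing right values become None."""
    return [(k, left[k], right.get(k)) for k in sorted(left)]


# Hypothetical data: rainfall was never measured for the "west" region.
sales = {"north": 120.0, "south": 80.0, "west": 95.0}
rainfall = {"north": 30.5, "south": 12.0}

inner = inner_join(sales, rainfall)
outer = left_join(sales, rainfall)

print(len(inner), "rows after inner join")
print(len(outer), "rows after left join")
```

The inner join silently drops the "west" region, so a total computed from it understates sales; the left join keeps the region but forces the analyst to decide what a missing rainfall value means. Neither is wrong in itself, which is why the choice depends on the analyst's understanding of the application.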
Provenance: Storing information about the data at its source is not useful unless this information can be interpreted and carried along through the phases of the data analysis pipeline. For example, an error at one step can render all subsequent analysis useless. Only with suitable provenance can one easily identify all subsequent processing that depends on this step. We need research into generating suitable provenance and into data systems that transmit the provenance through the data analysis phases.

Annotations: Automatically generating the right metadata to describe what data is recorded and how it is measured is difficult. For example, in scientific experiments, considerable detail on the specific experimental conditions and procedures is required to interpret the results effectively, and it is important that the metadata be recorded with the observational data. We need research into generating suitable metadata and into data systems that carry the metadata
through data analysis phases.

Cloud: Big data and cloud technology go hand in hand. Big data needs clusters of servers for processing and huge storage space, both of which clouds can readily provide. Cloud services themselves are at an early stage, and we will see both increasing standardization and innovation over the next couple of years. The cloud services provided by the three major players today, Amazon, Google and Microsoft, differ in many respects and have different capabilities. A big data implementation on one of them may not be portable to another platform with exactly the same capabilities. In other words, such large implementations are locked in to a cloud service provider, and there is a huge switching cost.

Visualization: A Picture Is Worth 10,000 Rows! The best data visualizations are ones that expose something new about the underlying patterns and relationships contained within the data. Understanding those relationships, and so being able to observe them, is key to good decision making. The Periodic Table is a classic testament to the potential of visualization to reveal hidden relationships in even small data sets. Visualizations are a new set of languages that can be used to communicate. As big data applications mature, more complex forms of visualization will be invented to capture the complex relationships between the various dimensions, and equally well trained minds will be required to decode and interpret them.

4.3 People related challenges
Data ownership: In an age when each of us is constantly interacting with sensors, there is confusion over who owns data that is about you but has been recorded or sensed by someone else's sensor. Example: someone is in a road accident and is taken to hospital for treatment, and in the process a host of information is recorded about that person's behaviour, body characteristics, the activity they were involved in just before the accident, etc.
Does the owner of the sensors own the data, or does the person own this data?

Ethics: With very few people having access to big data technologies, is it justified that only a select few reap the benefits of the rich information derived from innocent looking datasets? Example: Is it justified when an insurance company, with the help of big data technologies and weather data sets (generated using public money), calculates the chances of drought or floods and correspondingly changes the insurance premiums and the fine print in the offer document? As more and more predictions are made using sophisticated techniques, and bigger and bigger decisions are based on these predictions, should the data miners and scientists be held responsible for losses arising from wrong predictions? Recently, in Italy, seven experts were sentenced to jail and ordered to pay heavy fines for giving false assurances before an earthquake that killed more than 300
people.

Privacy: With the varied innovations that Big Data is enabling, there is a fine line between being innovative and breaching someone's privacy. Is privacy breached only when there is a name attached to a set of disclosed attributes? Do we have to change our core value system to be able to fully benefit from big data?

5. Conclusion
Like it or not, we live in interesting times. Big data is powerful and disruptive. Like most other technologies, it is neutral; it is the applications that raise questions. There has been considerable progress on the technology front to enable big data. On the other hand, we have only just started to understand and resolve its implications for our lives and core values. The potential benefits of Big Data are practical and significant, and some initial benefits have already been achieved, but there are still many technical and people related challenges that need to be addressed to fully exploit its potential.

References and Citations:
ethics of big data/3/
scientists jailed earthquake aquila
Book: Privacy and Big Data by Terence Craig and Mary E. Ludloff
Book: Planning for Big Data by O'Reilly Radar Team
ComputerInteractionandVisualization.html
01.ibm.com/software/data/bigdata/
More informationTen Mistakes to Avoid
EXCLUSIVELY FOR TDWI PREMIUM MEMBERS TDWI RESEARCH SECOND QUARTER 2014 Ten Mistakes to Avoid In Big Data Analytics Projects By Fern Halper tdwi.org Ten Mistakes to Avoid In Big Data Analytics Projects
More informationHadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012. Viswa Sharma Solutions Architect Tata Consultancy Services
Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012 Viswa Sharma Solutions Architect Tata Consultancy Services 1 Agenda What is Hadoop Why Hadoop? The Net Generation is here Sizing the
More informationKeywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance.
Volume 4, Issue 11, November 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Analytics
More informationBig Data + Predictive Analytics = Actionable Business Insights: Consider Big Data as the Most Important Thing for Business since the Internet
Big Data + Predictive Analytics = Actionable Business Insights: Consider Big Data as the Most Important Thing for Business since the Internet Adapted from the forthcoming book, Business Innovation in the
More informationEXECUTIVE REPORT. Big Data and the 3 V s: Volume, Variety and Velocity
EXECUTIVE REPORT Big Data and the 3 V s: Volume, Variety and Velocity The three V s are the defining properties of big data. It is critical to understand what these elements mean. The main point of the
More informationCHAPTER SIX DATA. Business Intelligence. 2011 The McGraw-Hill Companies, All Rights Reserved
CHAPTER SIX DATA Business Intelligence 2011 The McGraw-Hill Companies, All Rights Reserved 2 CHAPTER OVERVIEW SECTION 6.1 Data, Information, Databases The Business Benefits of High-Quality Information
More informationBIG DATA & SOCIAL INNOVATION KENNETH THOMAS, CLIENT MANAGER
BIG DATA & SOCIAL INNOVATION KENNETH THOMAS, CLIENT MANAGER 1 MAKING THE RIGHT DECISSION AT THE RIGHT PLACE AT THE RIGHT TIME 2 THE DATA MULTIPLIER EFFECT AT WORK BUSINESS DRIVEN HUMAN DRIVEN MACHINE DRIVEN
More informationBIG Big Data Public Private Forum
DATA STORAGE Martin Strohbach, AGT International (R&D) THE DATA VALUE CHAIN Value Chain Data Acquisition Data Analysis Data Curation Data Storage Data Usage Structured data Unstructured data Event processing
More informationThe Missing Data Scientists. www.wipro.com
www.wipro.com The Missing Data Scientists P. Srinivasa Rao, Vice President & Global Business Head Analytics and Information Management, Wipro Technologies Table of Contents 1. Introduction...03 2. Why
More informationBIG DATA ANALYTICS For REAL TIME SYSTEM
BIG DATA ANALYTICS For REAL TIME SYSTEM Where does big data come from? Big Data is often boiled down to three main varieties: Transactional data these include data from invoices, payment orders, storage
More informationBIG DATA CHALLENGES AND PERSPECTIVES
BIG DATA CHALLENGES AND PERSPECTIVES Meenakshi Sharma 1, Keshav Kishore 2 1 Student of Master of Technology, 2 Head of Department, Department of Computer Science and Engineering, A P Goyal Shimla University,
More informationW H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract
W H I T E P A P E R Deriving Intelligence from Large Data Using Hadoop and Applying Analytics Abstract This white paper is focused on discussing the challenges facing large scale data processing and the
More informationBig Data better business benefits
Big Data better business benefits Paul Edwards, HouseMark 2 December 2014 What I ll cover.. Explain what big data is Uses for Big Data and the potential for social housing What Big Data means for HouseMark
More informationBig Data Integration: A Buyer's Guide
SEPTEMBER 2013 Buyer s Guide to Big Data Integration Sponsored by Contents Introduction 1 Challenges of Big Data Integration: New and Old 1 What You Need for Big Data Integration 3 Preferred Technology
More informationPre-Talk Talk. What does ESS look like as more of this CI arrives?
Cloud Computing Rob Fatland Microsoft Research For the MRC story: http://research.microsoft.com/azure WRF two years and counting: http://weatherservice.cloudapp.net Pre-Talk Talk What does ESS look like
More informationThe Big Picture on Big Data. Princeton Section 307 Dinner Meeting December 11, 2013 Richard Herczeg
The Big Picture on Big Data Princeton Section 307 Dinner Meeting December 11, 2013 Richard Herczeg Objective of Talk 1. Deliver a Primer on Big Data. 2. How does this emerging topic apply to Quality? 3.
More informationSmarter Planet evolution
Smarter Planet evolution 13/03/2012 2012 IBM Corporation Ignacio Pérez González Enterprise Architect ignacio.perez@es.ibm.com @ignaciopr Mike May Technologies of the Change Capabilities Tendencies Vision
More informationSecure Data Transmission Solutions for the Management and Control of Big Data
Secure Data Transmission Solutions for the Management and Control of Big Data Get the security and governance capabilities you need to solve Big Data challenges with Axway and CA Technologies. EXECUTIVE
More informationHow To Create A Data Science System
Enhance Collaboration and Data Sharing for Faster Decisions and Improved Mission Outcome Richard Breakiron Senior Director, Cyber Solutions Rbreakiron@vion.com Office: 571-353-6127 / Cell: 803-443-8002
More informationBIG DATA. - How big data transforms our world. Kim Escherich Executive Innovation Architect, IBM Global Business Services
BIG DATA - How big data transforms our world Kim Escherich Executive Innovation Architect, IBM Global Business Services 1 2 What happens? What is data? 340.282.366.920.938.463.463.374.607.431.768.211.456
More informationExploiting the power of Big Data
Exploiting the power of Big Data Timos Sellis School of Computer Science and Information Technology timos.sellis@rmit.edu.au ITECHLAW Asia-Pacific Conference, February 26-28, 2014 Melbourne Australia Timeline
More informationDecisyon/Engage. Connecting you to the voice of the market. Contacts. www.decisyon.com
Connecting you to the voice of the market Contacts www.decisyon.com Corporate Headquarters 795 Folsom Street, 1st Floor San Francisco, CA 94107 1 844-329-3972 European Office Viale P. L. Nervi Directional
More informationUnderstanding traffic flow
White Paper A Real-time Data Hub For Smarter City Applications Intelligent Transportation Innovation for Real-time Traffic Flow Analytics with Dynamic Congestion Management 2 Understanding traffic flow
More informationBig Data Analytics: Driving Value Beyond the Hype
Transportation Challenges and Opportunities: A Colloquia Series Fresh Approaches to Emerging Issues Big Data Analytics: Driving Value Beyond the Hype OCTOBER 2, 2012 CAMBRIDGE, MASSACHUSETTS WE ARE IN
More informationDatabricks. A Primer
Databricks A Primer Who is Databricks? Databricks was founded by the team behind Apache Spark, the most active open source project in the big data ecosystem today. Our mission at Databricks is to dramatically
More informationBig Data; Old News or New Hype? Marcel den Hartog, June 2012
Big Data; Old News or New Hype? Marcel den Hartog, June 2012 One of the first Big Data projects in 1964 The Ranger series of spacecraft were designed solely to take high-quality pictures of the Moon and
More informationValue from Big Data really?
Value from Big Data really? DAMA SA Chapter Meeting: Johannesburg 24 June 2014 Let s talk about Big Data! Page 2 Is Digital Transformation really happening? 1993 2013 Page 3 But before we do that; where
More informationHow To Handle Big Data With A Data Scientist
III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution
More informationBig Data Analytics in Space Exploration and Entrepreneurship
Space Society of Silicon Valley Big Data Analytics in Space Exploration and Entrepreneurship Tiffani Crawford, PhD January 14, 2015 Big Data Analytics Data Characteristics Large quantities of many data
More informationIBM Software Top tips for securing big data environments
IBM Software Top tips for securing big data environments Why big data doesn t have to mean big security challenges 2 Top Comprehensive tips for securing data big protection data environments for physical,
More informationATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V
ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V WHITE PAPER Create the Data Center of the Future Accelerate
More informationHow Big Data is Different
FALL 2012 VOL.54 NO.1 Thomas H. Davenport, Paul Barth and Randy Bean How Big Data is Different Brought to you by Please note that gray areas reflect artwork that has been intentionally removed. The substantive
More informationBig Data: Overview and Roadmap. 2015 eglobaltech. All rights reserved.
Big Data: Overview and Roadmap 2015 eglobaltech. All rights reserved. What is Big Data? Large volumes of complex and variable data that require advanced techniques and technologies to enable capture, storage,
More informationBIG DATA Impact on DMOs. TTRA June 21, 2013
BIG DATA Impact on DMOs TTRA June 21, 2013 What is BIG DATA? 1. Big data is a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or
More informationBIG DATA: BIG BOOST TO BIG TECH
BIG DATA: BIG BOOST TO BIG TECH Ms. Tosha Joshi Department of Computer Applications, Christ College, Rajkot, Gujarat (India) ABSTRACT Data formation is occurring at a record rate. A staggering 2.9 billion
More informationWHITEPAPER BIG DATA GOVERNANCE. How To Avoid The Pitfalls of Big Data Governance? www.analytixds.com
BIG DATA GOVERNANCE How To Avoid The Pitfalls of Big Data Governance? of The need to provide answers quickly... 3 You can t measure what you don t manage... 3 Aligning the overall architecture with the
More informationMiracle Integrating Knowledge Management and Business Intelligence
ALLGEMEINE FORST UND JAGDZEITUNG (ISSN: 0002-5852) Available online www.sauerlander-verlag.com/ Miracle Integrating Knowledge Management and Business Intelligence Nursel van der Haas Technical University
More informationA Visualization is Worth a Thousand Tables: How IBM Business Analytics Lets Users See Big Data
White Paper A Visualization is Worth a Thousand Tables: How IBM Business Analytics Lets Users See Big Data Contents Executive Summary....2 Introduction....3 Too much data, not enough information....3 Only
More informationNow, Next and the Future: IT, Big Data and other Implications for RIM. Presented by Michael S. Smith / http://about.me/mikessmith
Now, Next and the Future: IT, Big Data and other Implications for RIM Agenda for This Afternoon Now: What trends are creating implications within the profession? Next: Why is IT now concerned about RIM?
More informationINTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK BIG DATA HOLDS BIG PROMISE FOR SECURITY NEHA S. PAWAR, PROF. S. P. AKARTE Computer
More information