PLATYPUS SYMPOSIUM BIG DATA. Associate Professor Paul Kennedy University of Technology, Sydney
Transcription
1 PLATYPUS SYMPOSIUM BIG DATA Associate Professor Paul Kennedy University of Technology, Sydney
2 Big Data Associate Professor Paul Kennedy School of Software Faculty of Engineering & IT, UTS UTS Centre for Quantum Computation and Intelligent Systems
3 Big Data What is it? How is it used? Why is it important?
4 Big Data What is it?
5 Collecting Data Humans have always collected, checked and organised data 5500 years ago Sumerians marked tax records onto dried mud tablets Scientists have looked through microscopes and telescopes and drawn what they saw Market researchers ran surveys or had TV diaries Medical laboratories took dozens of measurements per patient Source: Walters Art Museum / Wikimedia Commons / Public Domain
6 Data Analysing Since then, people have sought ways to use the recorded information to improve their lives (financially, health,...) Understanding People can understand these amounts of data and maybe make predictions for the future But nowadays, there is a data explosion
7 Data explosion Most data now goes straight to computers without humans seeing it Tax records submitted electronically Telescopes operated remotely and digital images go to computer files Market and POS data go to data warehouses High-throughput technology makes simultaneous measurements of 1000s of genes per patient This deluge of data is useless to unaided people!
8 Is it really an explosion? 2011: 1.8 zettabytes of information created globally and expected to double each year = 200 billion 2-hour HD movies that one person could watch for 47 million years straight! From sensors, satellites, social media, mobile communications, RFID and enterprise applications Source: TechAmerica Foundation, 2012, Demystifying Big Data
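The comparison on this slide can be checked with a little arithmetic. The sketch below assumes a decimal zettabyte (10^21 bytes) and non-stop viewing with 365.25-day years; neither assumption is stated on the slide. Under those assumptions each movie works out to roughly 9 GB and the viewing time to about 46 million years, the same order of magnitude as the 47 million years quoted.

```python
# Back-of-envelope check of the slide's movie comparison.
ZETTABYTE = 10**21                 # assumption: decimal zettabyte, in bytes
total_bytes = 1.8 * ZETTABYTE      # information created in 2011 (from the slide)
movies = 200e9                     # 200 billion movies (from the slide)
hours_per_movie = 2                # 2-hour movies (from the slide)

# Implied size of one HD movie, in bytes.
bytes_per_movie = total_bytes / movies

# Years needed to watch every movie back to back without a break.
viewing_years = movies * hours_per_movie / (24 * 365.25)

print(f"{bytes_per_movie / 1e9:.0f} GB per movie")
print(f"{viewing_years / 1e6:.1f} million years of viewing")
```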
9 Big Data... Huge global interest currently Obama administration in 2012 announced $200m for Big Data R&D in US TechAmerica Foundation released report describing transformational power of Big Data and recommendations for training the huge number of data scientists and analysts urgently needed Source: TechAmerica Foundation, 2012, Demystifying Big Data
10 Big Data: large volumes of high velocity, complex and variable data that require advanced techniques and technologies to enable the capture, storage, distribution, management and analysis of the information Challenge: making sense of the data Opportunity: exploiting it to enhance business Source: TechAmerica Foundation, 2012, Demystifying Big Data
11 Helping to catch the backpacker killer Australia's most notorious serial murder case Early 1990s, 7 young backpackers murdered Police had developed a profile Huge dataset generated of vehicle records, gym memberships, gun licensing and police records 18 million suspects! Link analysis software from Sydney company NetMap Analytics narrowed list to hundreds then 32, which included the murderer - Ivan Milat
12
13 Big Data - 4 Vs Volume - the amount of data has increased more sources, higher resolution sensors Velocity - speed of production and change real time analysis gives improved decisions Variety - different formats and sources e.g. social media, video, chat, genomics Veracity - quality and provenance of data inconsistent, incomplete, ambiguous, latency
14 Data Structured rows & columns, like Excel spreadsheets 15% of data Unstructured generally human generated text, multimedia, audio 85% of data Semi-structured As for unstructured, but metadata or tags
15 From Data to Knowledge Data raw uninterpreted facts e.g. Tom, 20 years old, student Information relates items of Data together e.g. Tom is 20 years old Knowledge relates items of Information together Tom is 20 years old Tom pays > $1500 insurance Modelling the world (= generalising) [18-25] years old P(accident) = high
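The step from information about individuals to knowledge about a group (generalising to an age band) can be sketched as below. This is a minimal illustration only: the records, the band boundaries and the helper names `age_band` and `accident_rate_by_band` are hypothetical and do not come from the slides.

```python
from collections import defaultdict

# Hypothetical records: (age, had_accident). Not real data.
records = [
    (20, True), (22, True), (24, False), (19, True),
    (40, False), (45, False), (52, True), (38, False),
]

def age_band(age):
    """Map an individual's age to a band, mirroring the slide's [18-25] group."""
    return "18-25" if 18 <= age <= 25 else "other"

def accident_rate_by_band(rows):
    """Estimate P(accident) per age band as accidents / people in the band."""
    counts = defaultdict(lambda: [0, 0])  # band -> [accidents, people]
    for age, had_accident in rows:
        band = age_band(age)
        counts[band][0] += int(had_accident)
        counts[band][1] += 1
    return {band: accidents / people for band, (accidents, people) in counts.items()}

rates = accident_rate_by_band(records)
print(rates)
```

Each tuple is an item of data, pairing an age with an outcome gives information, and the per-band rate is a small model that generalises beyond any one person.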
16 Data Analytics
17 Data Analytics is the analysis of large databases to find novel, commercially valuable and exploitable patterns. Aim: discover meaningful insights and knowledge from data Discoveries expressed as models Data mining = process of building models
18 Models A model Captures the essence of the discovered knowledge Can assist in understanding the world Can be used to make predictions
19 Data Mining / Data Analytics Extremely large datasets Discovery of the non-obvious No specific hypothesis (compare to statistical hypothesis testing) Useful knowledge to improve processes Impossible to do manually Knowledge from the data in any way possible
20 Fitting to the business Understanding the business context and, more strongly, framing a business question Translating the business question into a data analytics question Collecting, understanding and processing data from across the business and possibly externally Building models and evaluating them Deploying the results in the business to deliver benefits Iterative process
21 Two main modelling approaches Unsupervised methods Model tries to make sense of the data Clustering, association rule mining Supervised methods Model learns a relationship between inputs and outputs from old data Model can then be used to predict output for new inputs Classification, prediction, regression Decision trees, neural networks, support vector machines, random forest
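The difference between the two approaches can be shown with a small pure-Python sketch. It is an illustration under stated assumptions, not the methods used in the talk: the ages and risk labels are hypothetical, `two_means` is a bare-bones k-means with k = 2 on one-dimensional data, and `nearest_neighbour` is a 1-nearest-neighbour classifier standing in for the supervised methods listed on the slide.

```python
from statistics import mean

def two_means(values, iterations=10):
    """Unsupervised: group unlabelled 1-D values into two clusters (k-means, k=2)."""
    centres = [min(values), max(values)]          # start from the extremes
    for _ in range(iterations):
        clusters = ([], [])
        for v in values:
            nearest = 0 if abs(v - centres[0]) <= abs(v - centres[1]) else 1
            clusters[nearest].append(v)
        # Move each centre to the mean of its cluster (keep it if the cluster is empty).
        centres = [mean(c) if c else centres[i] for i, c in enumerate(clusters)]
    return centres

def nearest_neighbour(train, x):
    """Supervised: predict the output for x from the closest labelled input."""
    _, label = min(train, key=lambda pair: abs(pair[0] - x))
    return label

# Unsupervised: only inputs are given; the model finds structure by itself.
ages = [19, 21, 23, 24, 45, 52, 58, 61]           # hypothetical ages
centres = two_means(ages)

# Supervised: old data pairs inputs with known outputs.
labelled = [(19, "high"), (21, "high"), (24, "high"),
            (45, "low"), (52, "low"), (61, "low")]  # hypothetical risk labels
prediction = nearest_neighbour(labelled, 22)

print(centres, prediction)
```

The clustering call never sees a label, while the classifier needs labelled history before it can predict an output for a new input.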
22 CRISP-DM Source: Kenneth Jensen / Wikimedia Commons / Public Domain Shearer. The CRISP-DM model: The new blueprint for data mining. Journal of Data Warehousing, 5(4):13-22, Fall 2000.
23 Attributes Instances Friends
24 Business Problem: Who has better access to other friends? structural component
25 Possible answer
26 Business Problem: Predict whether someone gets sunburned The Class The mining table
27 One possible answer: Characterisation of the type of person with a decision tree Hair Colour blonde ginger brown Lotion Lotion Lotion no yes no yes no sunburned no sunburned no no
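A decision tree of this kind is just a set of nested rules. The sketch below encodes one plausible reading of the flattened tree above (split on hair colour first, then on lotion use); the exact branches cannot be recovered with certainty from the transcript, so the rules and the function name `predict_sunburn` are assumptions made for illustration.

```python
def predict_sunburn(hair_colour: str, uses_lotion: bool) -> str:
    """Walk a hand-written decision tree: hair colour first, then lotion."""
    if hair_colour == "brown":
        # Assumed leaf: brown-haired people are not predicted to burn.
        return "no"
    if hair_colour in ("blonde", "ginger"):
        # Assumed subtree: lotion decides the outcome for lighter hair.
        return "no" if uses_lotion else "sunburned"
    raise ValueError(f"unknown hair colour: {hair_colour!r}")

for hair in ("blonde", "ginger", "brown"):
    for lotion in (False, True):
        print(hair, lotion, predict_sunburn(hair, lotion))
```

In practice the tree would be learned from the mining table rather than written by hand; the point here is only that the resulting model is readable as rules.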
28 Source: TechAmerica Foundation, 2012, Demystifying Big Data Information Flow
29 Big Data How is it used?
30 LinkedIn 2006: LinkedIn ~8M accounts Business problem: users not seeking out connections with others on the site enough Jonathan Goldman found a way to predict whose networks a given profile would land in Added a module to present to the user some people that they might possibly know, but were not in their network Achieved a click-through rate 30% higher than the rate obtained by other prompts to visit site pages Credited with increasing growth trajectory of LinkedIn Davenport & Patil, Data Scientist: The Sexiest Job of the 21st Century, Harvard Business Review, October 2012.
31 Predicting the 2012 US election result Nate Silver used predictive analytics & statistics to correctly predict outcomes of 50 out of 50 states from polling and related data Republican pundits were confident in their landslide-win predictions. Democrat pundits predicted razor-thin victory Shows the power of a data-centric approach over gut-feeling
32 The lion, the witch and the wardrobe Movie galaxies
33 Fellowship of the ring
34 The return of the king
35
36 Fraud Detection Large Australian project management website Few accounts laundered money through credit card transactions Historical data to predict accounts likely to be fraudsters Deployed to filter and mark potential fraudsters Accuracy ~98% First week detected a large Ukrainian fraud syndicate with >50 user accounts, two smaller groups from China and Vietnam and several other minor fraudsters Source: WikiCommons: Repro in book: Rosén: En ren historia. Ljungby, 1992.
37 Big Data Why is it important?
38 Makes use of the unstructured 85% of data that is otherwise unusable Potential for evidence-based data-driven decision-making Competitive edge e-business - after the buzz-word has faded, the principles will underpin society Affordable due to convergence of technology
39 Institute of Analytics Professionals of Australia Our mission is to unite, inform, support and promote analytics professionals in Australia. We provide information sources, a virtual community, a networking hub and a professional identity. We promote the benefits of analytics in modern business.
40 "Information is the oil of the 21st century, and analytics is the combustion engine." Peter Sondergaard, SVP, Gartner Research. Speech given at Gartner Symposium/ITxpo
41 Thanks & Questions... Associate Professor Paul Kennedy School of Software Faculty of Engineering & IT, UTS
More informationACEDS Membership Benefits Training, Resources and Networking for the E-Discovery Community
ACEDS Membership Benefits Training, Resources and Networking for the E-Discovery Community! Exclusive News and Analysis! Weekly Web Seminars! Podcasts! On- Demand Training! Networking! Resources! Jobs
More informationUsing reporting and data mining techniques to improve knowledge of subscribers; applications to customer profiling and fraud management
Using reporting and data mining techniques to improve knowledge of subscribers; applications to customer profiling and fraud management Paper Jean-Louis Amat Abstract One of the main issues of operators
More informationAre You Ready for Big Data?
Are You Ready for Big Data? Jim Gallo National Director, Business Analytics February 11, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?
More informationThe Data Discovery Revolution: Changing the Economics of Data Governance
The Data Discovery Revolution: Changing the Economics of Data Governance Data In the News: Data Consistency Problems Poor master data is causing problems for organizations trying to analyse data across
More informationMLg. Big Data and Its Implication to Research Methodologies and Funding. Cornelia Caragea TARDIS 2014. November 7, 2014. Machine Learning Group
Big Data and Its Implication to Research Methodologies and Funding Cornelia Caragea TARDIS 2014 November 7, 2014 UNT Computer Science and Engineering Data Everywhere Lots of data is being collected and
More informationThe Big Picture on Big Data. Princeton Section 307 Dinner Meeting December 11, 2013 Richard Herczeg
The Big Picture on Big Data Princeton Section 307 Dinner Meeting December 11, 2013 Richard Herczeg Objective of Talk 1. Deliver a Primer on Big Data. 2. How does this emerging topic apply to Quality? 3.
More informationSmarter Planet evolution
Smarter Planet evolution 13/03/2012 2012 IBM Corporation Ignacio Pérez González Enterprise Architect ignacio.perez@es.ibm.com @ignaciopr Mike May Technologies of the Change Capabilities Tendencies Vision
More informationData Refinery with Big Data Aspects
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 7 (2013), pp. 655-662 International Research Publications House http://www. irphouse.com /ijict.htm Data
More informationKey Findings Advanced, Predictive Analytics Breaking the Barriers to Adoption
Key Findings Advanced, Predictive Analytics Breaking the Barriers to Adoption January 2015 Vanguard Marketing International, Inc. Tel 480.488.5707 Advanced, Predictive Analytics Breaking the Barriers to
More informationMetrics that Matter Security Risk Analytics
Metrics that Matter Security Risk Analytics Rich Skinner, CISSP Director Security Risk Analytics & Big Data Brinqa rskinner@brinqa.com April 1 st, 2014. Agenda Challenges in Enterprise Security, Risk
More information