Introduction to Data Mining and Business Intelligence Lecture 1/DMBI/IKI83403T/MTI/UI



Similar documents
Introduction to Data Mining

Database Marketing, Business Intelligence and Knowledge Discovery

Data Mining: Concepts and Techniques

Introduction to Data Mining

Information Management course

Digging for Gold: Business Usage for Data Mining Kim Foster, CoreTech Consulting Group, Inc., King of Prussia, PA

DATA MINING AND WAREHOUSING CONCEPTS

ECLT 5810 E-Commerce Data Mining Techniques - Introduction. Prof. Wai Lam

Data Warehousing and Data Mining in Business Applications

A Knowledge Management Framework Using Business Intelligence Solutions

Introduction. A. Bellaachia Page: 1

Analance Data Integration Technical Whitepaper

DATA ANALYSIS USING BUSINESS INTELLIGENCE TOOL. A Thesis. Presented to the. Faculty of. San Diego State University. In Partial Fulfillment

Cis330. Mostafa Z. Ali

Loss Prevention Data Mining Using big data, predictive and prescriptive analytics to enpower loss prevention

DATA MINING TECHNIQUES SUPPORT TO KNOWLEGDE OF BUSINESS INTELLIGENT SYSTEM

Analance Data Integration Technical Whitepaper

SPATIAL DATA CLASSIFICATION AND DATA MINING

How To Turn Big Data Into An Insight

Introduction to Data Mining

Introduction to Data Mining and Machine Learning Techniques. Iza Moise, Evangelos Pournaras, Dirk Helbing

A Review of Data Mining Techniques

Business Intelligence Meets Business Process Management. Powerful technologies can work in tandem to drive successful operations

Technical Management Strategic Capabilities Statement. Business Solutions for the Future

Data Mining Analytics for Business Intelligence and Decision Support

Decision Support and Business Intelligence Systems. Chapter 1: Decision Support Systems and Business Intelligence

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

III JORNADAS DE DATA MINING

TRENDS IN DATA WAREHOUSING

Chapter 4 Getting Started with Business Intelligence

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

PUSH INTELLIGENCE. Bridging the Last Mile to Business Intelligence & Big Data Copyright Metric Insights, Inc.

Data Mining System, Functionalities and Applications: A Radical Review

Data Warehousing and Data Mining

Business Analytics and Data Visualization. Decision Support Systems Chattrakul Sombattheera

CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University

Outline. BI and Enterprise-wide decisions BI in different Business Areas BI Strategy, Architecture, and Perspectives

Next Generation Business Performance Management Solution

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

Class 2. Learning Objectives

International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April ISSN

Data Mining Introduction

Data Warehouse: Introduction

TRENDS IN THE DEVELOPMENT OF BUSINESS INTELLIGENCE SYSTEMS

Chapter 1 DECISION SUPPORT SYSTEMS AND BUSINESS INTELLIGENCE

CHAPTER SIX DATA. Business Intelligence The McGraw-Hill Companies, All Rights Reserved

Dashboards PRESENTED BY: Quaid Saifee Director, WIT Inc.

Pentaho Data Mining Last Modified on January 22, 2007

MDM for the Enterprise: Complementing and extending your Active Data Warehousing strategy. Satish Krishnaswamy VP MDM Solutions - Teradata

DATA MANAGEMENT FOR THE INTERNET OF THINGS

Data Mining for Successful Healthcare Organizations

Data Mining Solutions for the Business Environment

Demonstration of SAP Predictive Analysis 1.0, consumption from SAP BI clients and best practices

Class 10. Data Mining and Artificial Intelligence. Data Mining. We are in the 21 st century So where are the robots?

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Delivering Smart Answers!

Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers

IT and CRM A basic CRM model Data source & gathering system Database system Data warehouse Information delivery system Information users

Data Mining is sometimes referred to as KDD and DM and KDD tend to be used as synonyms

CHAPTER 1 INTRODUCTION

SAP Solution Brief SAP HANA. Transform Your Future with Better Business Insight Using Predictive Analytics

Dashboard Reporting Business Intelligence

Statistics 215b 11/20/03 D.R. Brillinger. A field in search of a definition a vague concept

Increasing the Business Performances using Business Intelligence

BUSINESS INTELLIGENCE. Keywords: business intelligence, architecture, concepts, dashboards, ETL, data mining

Data Mining and Exploration. Data Mining and Exploration: Introduction. Relationships between courses. Overview. Course Introduction

An Overview of Database management System, Data warehousing and Data Mining

SAP Manufacturing Intelligence By John Kong 26 June 2015

Web Data Mining: A Case Study. Abstract. Introduction

A TECHNICAL WHITE PAPER ATTUNITY VISIBILITY

Data Warehouse Overview. Srini Rengarajan

Explore the Possibilities

Index Contents Page No. Introduction . Data Mining & Knowledge Discovery

Big Data Analytics. Copyright 2011 EMC Corporation. All rights reserved.

MDM and Data Warehousing Complement Each Other

Using Tableau Software with Hortonworks Data Platform

[callout: no organization can afford to deny itself the power of business intelligence ]

IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH

Data Management Practices for Intelligent Asset Management in a Public Water Utility

CUSTOMER RELATIONSHIP MANAGEMENT (CRM) CII Institute of Logistics

Big Data Strategies Creating Customer Value In Utilities

The Business Value of Predictive Analytics

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Enterprise Data Quality

Data Warehouse (DW) Maturity Assessment Questionnaire

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP

Data Warehouse design

ORACLE TAX ANALYTICS. The Solution. Oracle Tax Data Model KEY FEATURES

Big Data for Investment Research Management

Business Intelligence, Analytics & Reporting: Glossary of Terms

Search and Data Mining: Techniques. Applications Anya Yarygina Boris Novikov

Global Headquarters: 5 Speen Street Framingham, MA USA P F

AMIS 7640 Data Mining for Business Intelligence

ISSN: (Online) Volume 3, Issue 4, April 2015 International Journal of Advance Research in Computer Science and Management Studies

Infor10 Corporate Performance Management (PM10)

THE ROLE OF BUSINESS INTELLIGENCE IN BUSINESS PERFORMANCE MANAGEMENT

Transcription:

Introduction to Data Mining and Business Intelligence Lecture 1/DMBI/IKI83403T/MTI/UI Yudho Giri Sucahyo, Ph.D, CISA (yudho@cs.ui.ac.id) Faculty of Computer Science, University of Indonesia

Objectives Motivation: Why data mining? What is data mining? Understand the drivers for BI initiatives in modern organizations Understand the structure, components, and process of BI 2

Motivation: Why data mining? Data explosion problem: Automated data collection tools and mature database technology lead to tremendous amounts of data stored in databases, data warehouses and other information repositories. We are drowning in data, but starving for knowledge! Data Mining: Extraction of interesting knowledge (rules, regularities, patterns, constraints) from data in large databases [JH]. Analysis of the large quantities of data that are stored in computers [DO], Alternative names KDD, knowledge extraction, data archeology, information harvesting, business intelligence, etc. 3

Data Rich but Information Poor Databases are too big Data Mining can help discover knowledge 4

Evolution of Database Technology Data Collection, database creation, network DBMS Relational data model, relational DBMS implementation RDBMS, advanced data models (extended-relational, OO, etc.) and application-oriented DBMS (spatial, scientific, engineering, etc.) Data mining and data warehousing, multimedia databases, and Web technology 5

Potential Applications See TSBD lecture notes Data Mining See Chapter 1 of DO Retailing Banking Credit Card Management Insurance Telecommunications Telemarketing Human Resource Management 6

Data Mining Should Not be Used Blindly Data mining find regularities from history, but history is not the same as the future. Association does not dictate trend nor causality. Some abnormal data could be caused by human. 7

Another view of BI BI is a broad field and it is viewed differently by different people. Common agreement on major components: A centralized repository of data data warehouse An end-user set of tools to create reports and queries from data and information and to analyze the data, information, and reports business analytics To find non-obvious relationship among large amounts of data data mining, for text text mining, for web web mining Business Performance Management (BPM) to set goals as metrics and standards and monitoring and measuring performance by using the BI methodology. 8

Drivers of BI Organizations are being compelled to capture, understand, and harness their data to support decision making in order to improve business operations Business cycle times are now extremely compressed; faster, more informed, and better decision making is therefore a competitive imperative Managers need the right information at the right time and in the right place Case Study 1: BI success story at Toyota Motor Company (Chapter 1 ET pg. 4-6). 9

Business Value of BI 10

Data Mining Functionality Association 11 From association, correlation, to causality Finding rules like A -> B Classification and Prediction Classify data based on the values ina classifying attribute Predict some unknown or missing attribute values based on other information Cluster analysis Group data to form new classes, e.g., cluster houses to find distribution patterns Outlier and exception data analysis Time series analysis (trend and deviation) Trend and deviation analysis: regression, sequential pattern, similiar sequences e.g. Stock analysis

Sarbanes-Oxley Act of 2002 (extracted from Gartner, Inc., 2004) The Sarbanes-Oxley Act of 2002 mandates drove one firm to implement a new financial performance management system, capable of meeting the new requirements to: Perform flawless analysis and compilation of thousands of transactions and journal entries. Balance more access to data with the need to control access to sensitive insider information. Deliver reports to the SEC in less time. 12

Sarbanes-Oxley Act of 2002 (extracted from Gartner, Inc., 2004)... continued Within the overarching goal of achieving financial-reporting compliance, these objectives included the following: Get more eyes on the data and KPI and build in strict security controls Provide live reports that allow people to drill down to the lowest level of transaction detail Proactively scour the financial databases for anomalies, using variance triggers Gather all financial data into a cohesive database Complement accounting and budgeting applications for flexible reporting, free-form investigation, and automated data analysis. BI can proactively alert specific individuals whenever an anomay is detected. 13

14 Now let us see some screenshots...

Dashboard 15

And another dashboard... 16

And another dashboard... 17

Financial Reporting 18

19 Back to theory...

KDD Process Interpretation/ Knowledge Evaluation Transformation Data Mining Preprocessin g Selection Target Data Data 20

Steps of a KDD Process Learning the application domain: 21 relevant prior knowledge and goals of application Creating a target data set: data selection Data cleaning and preprocessing: (may take 60% of effort!) Data reduction and projection: Find useful features, dimensionality/variable reduction, invariant representation. Choosing functions of data mining summarization, classification, regression, association, clustering. Choosing the mining algorithm(s) Data mining: search for patterns of interest Interpretation: analysis of results. visualization, transformation, removing redundant patterns, etc. Use of discovered knowledge

Teradata Advanced Analytics Methodology (similar to CRISP-DM) 22

Structure and Components of BI 23

Structure and Components of BI... continued Data Warehouse Data flows from operational systems (e.g., CRM, ERP) to a DW, which is a special database or repository of data that has been prepared to support decision-making applications ranging from those for simple reporting and querying to complex optimization Business Analytics/OLAP Software tools that allow users to create ondemand reports and queries and to conduct analysis of data 24

Structure and Components of BI... continued Data Mining Data mining is a class of database information analysis that looks for hidden patterns in a group of data that can be used to predict future behavior Used to replace or enhance human intelligence by scanning through massive storehouses of data to discover meaningful new correlations, patterns, and trends, by using pattern recognition technologies and advanced statistics 25

Structure and Components of BI... continued Business Performance Management (BPM) Based on the balanced scorecard methodology a framework for defining, implementing, and managing an enterprise s business strategy by linking objectives with factual measures Dashboards A visual presentation of critical data for executives to view. It allows executives to see hot spots in seconds and explore the situation 26

BI: Today and Tomorrow Recent industry analyst reports show that in the coming years, millions of people will use BI visual tools and analytics every day BI takes advantage of already developed and installed components of IT technologies, helping companies leverage their current IT investments and use valuable data stored in legacy and transactional systems Some Issues: Mining information from heterogeneous databases and global information systems Handling relational and complex types of data Efficiency and scalability of data mining algorithms 27