Hexaware E-book on Predictive Analytics



Similar documents
What is Data Mining? Data Mining (Knowledge discovery in database) Data mining: Basic steps. Mining tasks. Classification: YES, NO

A STUDY OF DATA MINING ACTIVITIES FOR MARKET RESEARCH

Enterprise Solutions. Data Warehouse & Business Intelligence Chapter-8

Session 10 : E-business models, Big Data, Data Mining, Cloud Computing

Data Mining Solutions for the Business Environment

IBM SPSS Modeler Premium

Data are everywhere. IBM projects that every day we generate 2.5 quintillion bytes of data. In relative terms, this means 90

Text Analytics Beginner s Guide. Extracting Meaning from Unstructured Data

Solve your toughest challenges with data mining

Advanced Analytics. The Way Forward for Businesses. Dr. Sujatha R Upadhyaya

not possible or was possible at a high cost for collecting the data.

DATA MINING AND WAREHOUSING CONCEPTS

Real World Application and Usage of IBM Advanced Analytics Technology

Introduction. A. Bellaachia Page: 1

How To Understand Business Intelligence

SPATIAL DATA CLASSIFICATION AND DATA MINING

Solve Your Toughest Challenges with Data Mining

Data Mining for Everyone

The Prophecy-Prototype of Prediction modeling tool

The Data Mining Process

TEXT ANALYTICS INTEGRATION

Data Search. Searching and Finding information in Unstructured and Structured Data Sources

Predictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD

Solve your toughest challenges with data mining

IBM SPSS Modeler Professional

Data Mining with MicroStrategy

2015 Workshops for Professors

The Future of Business Analytics is Now! 2013 IBM Corporation

Keywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance.

Data Mining Algorithms Part 1. Dejan Sarka

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract

SELLING PROJECTS ON THE MICROSOFT BUSINESS ANALYTICS PLATFORM

Introduction to Data Mining

Web Data Mining: A Case Study. Abstract. Introduction

How To Use Social Media To Improve Your Business

Discovering, Not Finding. Practical Data Mining for Practitioners: Level II. Advanced Data Mining for Researchers : Level III

Combining the Power of Predictive Analytics with IBM Cognos Business Intelligence

Breadboard BI. Unlocking ERP Data Using Open Source Tools By Christopher Lavigne

Megaputer Intelligence

TRENDS IN THE DEVELOPMENT OF BUSINESS INTELLIGENCE SYSTEMS

Sunnie Chung. Cleveland State University

Introduction to Data Mining and Machine Learning Techniques. Iza Moise, Evangelos Pournaras, Dirk Helbing

STAR WARS AND THE ART OF DATA SCIENCE

Voice of the Customer: How to Move Beyond Listening to Action Merging Text Analytics with Data Mining and Predictive Analytics

Delivering new insights and value to consumer products companies through big data

Microsoft Services Exceed your business with Microsoft SharePoint Server 2010

KNOWLEDGE BASE DATA MINING FOR BUSINESS INTELLIGENCE

Foundations of Business Intelligence: Databases and Information Management

Augmented Search for Web Applications. New frontier in big log data analysis and application intelligence

Reduce and manage operating costs and improve efficiency. Support better business decisions based on availability of real-time information

Data Mining is sometimes referred to as KDD and DM and KDD tend to be used as synonyms

Chapter 6. Foundations of Business Intelligence: Databases and Information Management

Business Intelligence & Product Analytics

Class 10. Data Mining and Artificial Intelligence. Data Mining. We are in the 21 st century So where are the robots?

Data Mining + Business Intelligence. Integration, Design and Implementation

Business Intelligence Solutions for Gaming and Hospitality

Maximizing Return and Minimizing Cost with the Decision Management Systems

Data are everywhere. IBM projects that every day we generate 2.5

A Survey on Web Research for Data Mining

Data Mining Techniques

Predictive Analytics for Government Chih-Feng Ku Solutions Manager, Business Analytics IBM Asia Pacific Business Analytics

Data Mining: Introduction. Lecture Notes for Chapter 1. Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler

Whitepaper. Power of Predictive Analytics. Published on: March 2010 Author: Sumant Sahoo

Anomaly and Fraud Detection with Oracle Data Mining 11g Release 2

III JORNADAS DE DATA MINING

Chapter 4 Getting Started with Business Intelligence

Predictive Analytics: Turn Information into Insights

BUSINESS INTELLIGENCE

IBM Social Media Analytics

Discover How a 360-Degree View of the Customer Boosts Productivity and Profits. eguide

Oracle Advanced Analytics Oracle R Enterprise & Oracle Data Mining

Mike Maxey. Senior Director Product Marketing Greenplum A Division of EMC. Copyright 2011 EMC Corporation. All rights reserved.

Business Intelligence; Building an Intelligent management RAGHAVENDRA R N

Adobe Insight, powered by Omniture

Know Your Buyer: A predictive approach to understand online buyers behavior By Sandip Pal Happiest Minds, Analytics Practice

Big Data. Fast Forward. Putting data to productive use

SAP Solution Brief SAP HANA. Transform Your Future with Better Business Insight Using Predictive Analytics

5.5 Copyright 2011 Pearson Education, Inc. publishing as Prentice Hall. Figure 5-2

Chapter ML:XI. XI. Cluster Analysis

Importance or the Role of Data Warehousing and Data Mining in Business Applications

Foundations of Business Intelligence: Databases and Information Management

Data Mining Applications in Higher Education

Data Warehousing and Data Mining in Business Applications

Transforming the Telecoms Business using Big Data and Analytics

Jet Enterprise Frequently Asked Questions Pg. 1 03/18/2011 JEFAQ - 02/13/ Copyright Jet Reports International, Inc.

Business Analytics and the Nexus of Information

MEDICAL DATA MINING. Timothy Hays, PhD. Health IT Strategy Executive Dynamics Research Corporation (DRC) December 13, 2012

Navigating Big Data business analytics

Harnessing the Power of Big Data for Real-Time IT: Sumo Logic Log Management and Analytics Service

How To Handle Big Data With A Data Scientist

Transcription:

Hexaware E-book on Predictive Analytics Business Intelligence & Analytics Actionable Intelligence Enabled Published on : Feb 7, 2012

Hexaware E-book on Predictive Analytics What is Data mining? Data mining, or knowledge discovery, is the computer-assisted process of finding hidden patterns in data. Data mining tools predict behaviors and future trends, allowing businesses to make proactive, knowledge-driven decisions. Hence it is also called predictive analytics. Data mining tools can answer business questions that traditionally were time consuming to resolve. They scour databases for hidden patterns, finding predictive information that experts may miss because it lies outside their expectations. When and where the data mining and predictive analytics could be useful? The amount of raw data stored in corporate databases is exploding. From trillions of point-of-sale transactions and credit card purchases to pixel-by-pixel images of galaxies, databases are now measured in gigabytes and terabytes. Raw data by itself, however, does not provide much information. In today's fiercely competitive business environment, companies need to rapidly turn these terabytes of raw data into significant insights into their customers and markets to guide their marketing, investment, and management strategies. In these scenarios data mining would help you unlock the hidden potential of your data and deliver actionable insights. Can you name some specific uses of data mining? Some Specific uses of data mining include: Market segmentation - Identify the common characteristics of customers who buy the same products from your company. Customer churn - Predict which customers are likely to leave your company and go to a competitor. Fraud detection - Identify which transactions are most likely to be fraudulent. Direct marketing - Identify which prospects should be included in a mailing list to obtain the highest response rate. Interactive marketing - Predict what each individual accessing a Web site is most likely interested in seeing. Market basket analysis - Understand what products or services are commonly purchased together; e.g., beer and diapers. Trend analysis - Reveal the difference between typical customers this month and last. Will sharing data to do data mining raise privacy related issues? What needs to be done in such scenarios? There is a way to deal with sensitive data like credit card numbers, insurance policy numbers and account numbers. Data need to be masked or recoded to maintain the privacy. 1

What are all steps involved in data mining? Data mining process involves predefined steps starting from Business case understanding or Problem understanding, Data understanding Data extraction Pre-processing Mining model building Testing and Evaluation Is there any maintenance to be done for the mining model once it is deployed? Yes, the mining model needs to be calibrated at least once in six months. The frequency varies based on the business need and the volume of data flow Calibration involves Checking the predicted output with the actual output Modifying the mining model if required Public Can data mining solution be offered on the cloud? Yes, Data mining solution can be offered on the cloud. Organizations can adopt Pay per use method without investing on the infrastructure required. Here the challenge is to upload huge amount of data on the cloud. Name some tools used for Data mining? a. Microsoft BI stack has SSAS as part of SQL Server 2008 b. There is an open source tool called R which offers data mining solution c. Rapid Miner is an another open source tool d. SAS is of course is a highly sophisticated tool with enormous computational power e. IBM has it s tool called SPSS f. Oracle s ODM Oracle Data Miner The above list is not exhaustive. What are all the industries in which Predictive Analytics is applicable? We have recently provided predictive analytics solutions for following Industries: Insurance Education Mining Logistics Health & Hygiene In short, predictive analytics can be deployed for diverse industries. 2

What are all the risks involved in using predictive analytics solution? Wrong understanding of business problems and data will result in a prediction model with complex statistical algorithms, but it will be of no use to the business. Wrong interpretation of the results would lead to wrong decisions. Poor data quality would result in poor predictions. Absence of maintenance of the mining model would make predictions obsolete. Building the predictive analytics solution with resources with less statistical knowledge will lead to less accurate models. What is Text mining? Text Mining is the process of deriving high quality information from unstructured text data. There are various techniques used to derive high quality information from textual data, such as computational linguistics, information retrieval, statistics, machine learning, etc. Various forms of text mining include categorization, classification, clustering, concept extraction, summarization, sentiment analysis, etc. Are the open source tools sufficient and robust in providing answers to tough business cases? Open source tools like R and Rapid miner provide excellent flexibility to build the model. Online R community constantly updates algorithms and industry specific solutions as packages to R after validating. So far there are around 3500 packages built in R. R s popularity has been increasing over the other predictive analytics tools. In a recent survey Kdnuggets.com reports that R has 24% of market share and R is the most sought after statistical programming language. Can you list some of the best practices in Data mining and Predictive analytics? Executive Support: Support from the decision makers and middle management would make a world of difference. Business problem specificity: Identification of correct business problem to apply predictive analytics is vital to the success of the mining model. Availability of historical data: Richer the data, the more robust will be the mining model. Good quality data: It is the most important factor for an accurate mining model. Pre-processing: While building the mining model, one of the main activities is pre-processing where the data is cleansed, sliced, diced and categorized to suit to the mining model. Good business knowledge and a sound data mining knowledge is required to do this as this is the base for the predictive model. Selection of statistical techniques: Experienced data mining resource can choose the correct statistical technique and can compare the accuracy of other techniques. Interpretation of output: It is extremely important to interpret the output in the correct way and link it back to the business problem stated initially. 3

4 Thank you for reading our E- Book, in case you have any queries please write back to us at corporatemarketing@hexaware.com If you want to keep up with the industry's latest trends, please visit our blog on BI http://blogs.hexaware.com/index/business-intelligence For more information on our Business Intelligence & Analytics services please visit http://hexaware.com/business-intelligence-analytics.htm About Hexaware Hexaware is a leading global provider of IT and BPO services. The company has achieved leadership position in domains such as Banking, Financial Services, Insurance, Transportation, Logistics and HR-IT solutions. Hexaware focuses on delivering business result leveraging technology solution and specializes in Business Intelligence & Analytics, Enterprise Applications, Independent Testing and Legacy Modernization. Hexaware has been providing business technology solutions for over 20 years and offers world class services delivery, technology leadership and skilled human capital.