DATA MINING USING PENTAHO / WEKA
1 DATA MINING USING PENTAHO / WEKA
Yannis Angelis
Channels & Information Exploitation Division
Application Delivery Sector
EFG Eurobank
2 Agenda
- BI in Financial Environments
- Pentaho Community Platform
- Weka Platform
- Integration with Pentaho Platform
- Data mining using Weka
3 A brief (BI) history
[Timeline figure. Category labels: Spreadsheets, Mainframe OLAP, Desktop OLAP, RDBMS support for DW, Open-source OLAP, Open-source BI suites. Items shown: APL, Express, Comshare, Essbase, Codd's 12 Rules of OLAP, Cognos PowerPlay, Business Objects, MicroStrategy, Microsoft OLAP Services, Informix MetaCube, Oracle partitions and bitmap indexes, Sybase IQ, MySQL, Mondrian, JPivot, JRubik, Palo, OpenI, BEE, Pentaho.]
4 Banking flavors of BI
- Data warehouse
- Data integration / cleansing (ETL)
- Reporting
  - Operational
  - Analytical
- Data Mining
  - Risk related areas
    - Fraud detection / prediction
    - Credit scoring / scorecards
  - Marketing related areas
    - Next best activity
    - Churn management
    - Anti-attrition
5 Eurobank specificities
- IBM DW implementation (blueprint & localized)
- Essbase & MS OLAP engines
- Cognos & MS Reporting Services (BO in some cases)
- Different DB flavors in data marts (MS SQL, Oracle, DB2)
- SPSS & SAS analytics
- Excel mania
- Open Source BI Solutions
6 Pentaho explained
Pentaho Community
- BI Platform
- Reporting (PRD)
- Mondrian (OLAP engine)
- Kettle (PDI - ETL)
- Weka (Data mining)
All built on the Java platform
7 Pentaho in action
8 What is Weka?
The Weka or woodhen (Gallirallus australis) is an endemic bird of New Zealand. (Source: Wikipedia)
9 Weka platform
- WEKA (Waikato Environment for Knowledge Analysis)
- Funded by the NZ government (for more than 10 years)
  - Develop an open-source state-of-the-art workbench of data mining tools
  - Explore fielded applications
  - Develop new fundamental methods
- Became part of Pentaho platform in 2006 (PDM: Pentaho Data Mining)
10 Weka Functionality & Architecture
- Comprehensive set of data pre-processing tools, learning algorithms and evaluation methods
  - data pre-processing tools
  - classification/regression algorithms
  - clustering algorithms
  - attribute/subset evaluators & search algorithms for feature selection
  - algorithms for finding association rules
- Graphical user interfaces (incl. data visualization)
- Environment for comparing learning algorithms
- Modular, object-oriented architecture
  - Packages for different types of algorithms (filters, classifiers, clusterers, associations, attribute selection etc.)
  - Sub-packages group components by functionality or purpose, e.g. classifiers.bayes, filters.unsupervised.attribute
  - Algorithms are Java Beans (implementing specific interface)
  - GUIs use introspection/reflection to dynamically generate editor dialogs at runtime
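The idea of generating editor dialogs by introspection can be illustrated with a small sketch. It is written in Python rather than Java and is only an analogy for the mechanism the slide describes: `TreeLearnerOptions`, its fields and `editor_rows` are hypothetical names, not part of Weka.

```python
from dataclasses import dataclass, fields


@dataclass
class TreeLearnerOptions:
    """Hypothetical learner settings (illustrative only, not a Weka class)."""
    confidence_factor: float = 0.25
    min_instances_per_leaf: int = 2
    use_pruning: bool = True


def editor_rows(options):
    """Inspect an options object at runtime and pick a widget per property."""
    widget_for = {bool: "checkbox", int: "integer field", float: "decimal field"}
    rows = []
    for f in fields(options):
        value = getattr(options, f.name)
        # Unknown property types fall back to a plain text field.
        widget = widget_for.get(type(value), "text field")
        rows.append((f.name, widget, value))
    return rows


rows = editor_rows(TreeLearnerOptions())
for name, widget, value in rows:
    print(f"{name}: {widget} (current value: {value})")
```

Because the dialog is derived from the object itself, a newly added algorithm with new properties would get an editor without any GUI code being written for it.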
11 Weka UI
12 Weka ESI (Extensibility Standards Interoperability)
- Extensibility
  - Plugin mechanisms allow WEKA to be extended without modifying the classes in the WEKA distribution
  - New algorithms can be implemented and added
- Standards & Interoperability
  - PMML Support (import / export)
  - LibSVM / SVM-Light data format support
13 Weka / Pentaho Integration
- Main point of integration is with Kettle (PDI) project
- Kettle can export data in ARFF format
  - High-volume, low memory consumption
- Kettle gets WEKA-specific transformation steps
  - WekaScoring: score data using a pre-constructed WEKA model (classification, regression or clustering) or PMML model as part of an ETL transformation
  - KnowledgeFlow: execute arbitrary Knowledge Flow processes as part of an ETL transformation
  - WekaForecasting: load or import a time series forecasting model (created in Weka's time series analysis and forecasting environment) as part of an ETL transformation
- Only WekaScoring can be used (is free and downloadable) with the community edition of Pentaho
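To make the ARFF export concrete, here is a minimal Python sketch of what an ARFF file contains: a relation name, one `@attribute` line per column, and comma-separated data rows. It is not Kettle's implementation; the function name `arff_lines` and the sample customer columns are invented for illustration. Rows are yielded one at a time, which mirrors the low-memory point on the slide.

```python
def arff_lines(relation, attributes, rows):
    """Yield the lines of an ARFF file one by one.

    attributes: list of (name, kind), where kind is the string "numeric"
    or a list of allowed nominal values.
    rows: any iterable of tuples, so data can be streamed instead of
    being held in memory.
    """
    yield f"@relation {relation}"
    for name, kind in attributes:
        if kind == "numeric":
            yield f"@attribute {name} numeric"
        else:
            yield "@attribute " + name + " {" + ",".join(kind) + "}"
    yield "@data"
    for row in rows:
        # ARFF uses "?" for a missing value.
        yield ",".join("?" if v is None else str(v) for v in row)


example = list(arff_lines(
    "customers",
    [("balance", "numeric"),
     ("sector", ["retail", "shipping"]),
     ("default", ["yes", "no"])],
    [(1200.5, "retail", "no"), (None, "shipping", "yes")],
))
print("\n".join(example))
```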
14 Data mining with Weka
- (One-of-the-many) Definition: Extraction of implicit, previously unknown, and potentially useful information from data
- Goal: improve marketing, sales, and customer support operations, risk assessment etc.
  - Who is likely to remain a loyal customer?
  - What products should be marketed to which prospects?
  - What determines whether a person will respond to a certain offer?
  - How can I detect potential fraud?
- Central idea: historical data contains information that will be useful in the future (patterns => generalizations)
- Data Mining employs a set of algorithms that automatically detect patterns and regularities in data
15 Eurobank's Case
- Problem: Prediction (Probability Score) of a Corporate Customer Delinquency (or default) in the next year
- Customer historical data used include:
  - Customer footings behavior (assets & liabilities)
  - Customer delinquencies (rates and time data)
  - Business Sector behavioral data
16 Implementation Thoughts
- Variable selection using the Information Value (IV) criterion:

  IV = \sum_i (G_i - B_i) \ln(G_i / B_i)

  Value of IV          Statistical strength
  less than 0.02       a very weak statistical relation
  [range missing]      a weak statistical relation
  [range missing]      an average statistical relation
  [range missing]      a strong statistical relation
  greater than 0.5     an extremely strong statistical relation

- Automatic binning of continuous data variables was used (Chi-merge).
- Manual corrections were made to address particularities in the data distribution of some variables (again using IV).
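A worked example may help with the IV criterion. The sketch below assumes the usual scorecard reading of the formula, which the slide does not spell out: for bin i of a variable, G_i is that bin's share of all "good" customers and B_i its share of all "bad" (delinquent) customers. The function name and the sample counts are made up for illustration.

```python
import math


def information_value(good_counts, bad_counts):
    """Compute IV = sum_i (G_i - B_i) * ln(G_i / B_i) over the bins.

    good_counts[i] and bad_counts[i] are the numbers of good and bad
    customers that fall into bin i of the variable being assessed.
    """
    if len(good_counts) != len(bad_counts):
        raise ValueError("good_counts and bad_counts must have the same length")
    total_good = sum(good_counts)
    total_bad = sum(bad_counts)
    iv = 0.0
    for good, bad in zip(good_counts, bad_counts):
        if good == 0 or bad == 0:
            # ln(G_i / B_i) is undefined for an empty cell; such bins
            # would have to be merged with a neighbour first.
            raise ValueError("every bin needs at least one good and one bad case")
        g = good / total_good
        b = bad / total_bad
        iv += (g - b) * math.log(g / b)
    return iv


# Two bins that separate goods from bads strongly.
strong_iv = information_value([80, 20], [20, 80])
# Two bins with identical good/bad distributions carry no information.
zero_iv = information_value([50, 50], [10, 10])
print(round(strong_iv, 4), zero_iv)
```

In the first call the variable lands well above the 0.5 threshold from the table; in the second, goods and bads are distributed identically across bins, so the IV is zero.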
17 Weka Case Screenshots
18 Visualizations
19 Weka Limitations
- Traditional algorithms need to have all data in (main) memory => big datasets are an issue
- Solution:
  - Incremental schemes
  - Stream algorithms: MOA (Massive Online Analysis)
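The difference between a batch learner and an incremental scheme can be sketched in a few lines of Python. The class below is a toy categorical naive Bayes that updates its counts one instance at a time, so the full dataset never has to sit in memory. It is an illustration of the idea only, not code from Weka or MOA, and the sample attribute values are invented.

```python
import math
from collections import defaultdict


class IncrementalNaiveBayes:
    """Categorical naive Bayes that learns from one instance at a time."""

    def __init__(self):
        self.total = 0
        self.class_counts = defaultdict(int)
        # (class label, attribute index, attribute value) -> count
        self.value_counts = defaultdict(int)
        # attribute index -> set of values seen so far
        self.seen_values = defaultdict(set)

    def update(self, features, label):
        """Fold a single labelled instance into the running counts."""
        self.total += 1
        self.class_counts[label] += 1
        for i, value in enumerate(features):
            self.value_counts[(label, i, value)] += 1
            self.seen_values[i].add(value)

    def predict(self, features):
        """Return the most probable label (None before any update)."""
        best_label, best_score = None, -math.inf
        for label, n in self.class_counts.items():
            score = math.log(n / self.total)
            for i, value in enumerate(features):
                k = len(self.seen_values[i]) or 1
                count = self.value_counts.get((label, i, value), 0)
                # Laplace smoothing keeps unseen values from zeroing the score.
                score += math.log((count + 1) / (n + k))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


model = IncrementalNaiveBayes()
stream = [
    (("late", "high"), "default"),
    (("late", "low"), "default"),
    (("on_time", "low"), "ok"),
    (("on_time", "high"), "ok"),
    (("on_time", "low"), "ok"),
]
for features, label in stream:  # could equally be rows read from a file
    model.update(features, label)

print(model.predict(("late", "high")), model.predict(("on_time", "low")))
```

Memory use here grows with the number of distinct attribute values, not with the number of instances processed.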
20 Conclusion
- Try it yourself
- WEKA is available at [link missing]
- Projects based on WEKA include applications for natural language processing, text mining etc.
