A THREE-TIERED WEB BASED EXPLORATION AND REPORTING TOOL FOR DATA MINING

Size: px
Start display at page:

Download "A THREE-TIERED WEB BASED EXPLORATION AND REPORTING TOOL FOR DATA MINING"

Transcription

1 A THREE-TIERED WEB BASED EXPLORATION AND REPORTING TOOL FOR DATA MINING Ahmet Selman BOZKIR Hacettepe University Computer Engineering Department, Ankara, Turkey Ebru Akcapinar SEZER Hacettepe University Computer Engineering Department, Ankara, Turkey ABSTRACT In recent years, many companies have begun to use data mining and decision support systems (DSS) for decision making activities. Although their use is increasing continuously, DSSs are generally built as desktop applications and designed for the use of data mining experts. The purposes of the present study are selected as design and implementation of a webbased data mining exploration and reporting tool namely ASMiner. ASMiner provides exploring and reporting on three data mining techniques (decision trees, clustering and association rules mining), by presenting a scalable and fully webbased thin client data mining tool for both decision makers and knowledge workers. KEYWORDS DSS, Decision-making, Web-based data mining 1. INTRODUCTION The data mining is a useful tool for decision-makers in predicting and planning the future. It is possible to say that the data mining methods may have a crucial importance among the existing approaches to solve forecasting problems encountered in all engineering areas, medical and applied sciences, etc. in near future. Web based technologies have been revolutionized the design, development and implementation stages of decision support systems (Ba & Kalakota, 1995; Bhargava & Power, 2001). Moreover, the Web environment is expanding as a very important DSS development and delivery platform (Shim et al., 2002). The key advantages of the web based tools when compared with the traditional batch-based or client-server oriented tools include ease of-use, universal access across information technology platforms, and single minute response and feedback based upon dynamic and real-time data (Heinrichs & Him, 2003). Development of a completely web based data mining exploration and reporting tool to save time during the exploration and reporting phases of data mining applications and to enable even typical users to be effective decision makers are the main purposes of the present study. For the purpose of the study, a tool namely ASMiner is developed. ASMiner employs Microsoft SQL Server Analysis Services behind the scene as the data mining engine and it currently supports three data mining techniques such as decision trees, clustering and association rules. 2. WEB BASED DATA MINING TOOL DEVELOPED In market, it is possible to find numerous numbers of data mining tools and applications requiring professional data mining background and practice. Owing to these requirements, the data mining solutions as the software packages are used by data mining experts, only. Moreover, most of all commercial data mining solutions are implemented with non web-based approaches. Furthermore, report production in many data

2 mining software still requires exhaustive and time-consuming processes. To cope with these difficulties, ASMiner considers the knowledge workers to help them in the process of becoming data miners and to achieve this, it presents easy to understand, user friendly and perspicuous user interfaces in exploring mining models created in Microsoft Analysis Services. In market, some databases such as Oracle, MS SQL and WEKA etc exist. However, when developing ASMiner, Microsoft Analysis Services is preferred owing to its cheapness and commonly usefulness. Microsoft Analysis Services has been the business intelligence component of Microsoft SQL Server software since In decision tree algorithm platform, Microsoft invented it is own decision tree algorithm namely Microsoft Decision Trees. This algorithm can handle both categorical and continuous variables as well as CART and CHAID. In addition, it supports entropy and Bayesian score as the splitting strategy and unlike the other famous algorithms, it offers no pruning phase. In Analysis Services, as soon as a decision tree model is created, a corresponding dependency network is also formed. In clustering models, Microsoft Analysis Services offers two types of clustering algorithms such as K-Means and EM (Expectation- Maximization) with scalable and non-scalable versions. On the other hand, well known Apriori algorithm is employed in association rules mining. ASMiner uses client connectivity interfaces of SQL Server in both OLTP and data mining aspect. ADOMD.NET and AMO has been used as the entry point to Analysis Services. ADOMD.NET is mainly focused on retrieving mining models meta-data. However, AMO provides management options on server objects in Analysis Services. Thus, model training/processing operations and model settings can only be made via AMO. Domain experts can load, create and manage data mining models on Analysis Services by using a reduced version of Visual Studio that shipped with Microsoft SQL Server. As soon as a domain expert creates a data mining model in Analysis Services, model is saved with its metadata and this metadata can be retrieved by ADOMD.NET. Cooperating with AMO and ADOMD.NET, ASMiner accesses data mining models metadata and composes appropriate viewers that users request. Figure 1. Modules and sub-components chart of ASMiner ASMiner is formed by five main modules such as authentication mechanism, decision tree subsystem, clustering subsystem; association rules subsystem and management tools (Fig. 1). Authentication subsystem authorizes every request and validate if the user has access right to requested page and operation. Decision tree, clustering and association rules mining subsystems have their specific type of mining model viewers. In these viewers, some third party open source charting and visualization components are either used or selfdeveloped in this study. ASMiner also has a management tool developed for various purposes. These characteristics of ASMiner are explained in the subsequent paragraphs. Decision tree module of ASMiner contains three types of tree viewer such as general tree viewer, discrete tree viewer and radial tree viewer. General tree viewer has a capacity to draw both regression trees and discrete decision trees. To increase speed and interactivity, Javascript client side scripting technology is utilized when drawing a tree. In tree design, Walker tree drawing algorithm is employed for production of perspicuous and aesthetic trees. Users can navigate on trees by expanding or closing the nodes by clicking appropriate buttons on nodes. Besides,

3 Visifire (Visifire, 2008) charting solutions are employed in the node histogram display. One of the other important features of general tree viewer (Fig. 2) is to have a drill-through support. Finally, drill-through data can be stored as CSV or Excel formats. Figure 2. General tree viewer of ASMiner Discrete tree viewer has some special properties specified for discrete decision trees. Additionally, a radial tree viewer is empirically implemented to provide an opportunity of viewing tree structure in a different point of view for users. The dependency network graphs are produced for the correlation exploration. ASMiner has two types of dependency network viewers. By using these graphs, users can navigate on the overall graph and explore the content. A sample dependency network graph displayed with ZGRViewer is presented in Fig. 3. Another dependency network graph viewer is based on Flash technology. In the lack of Java Runtime, this Flash based viewer is thought to give service to users. On the other hand, this viewer is capable of highlighting and showing the most nearest neighbors of selected nodes beside the features like zooming, rotating and unique coloring of nodes. Figure 3. The ZGRViewer powered dependency graph In order to complete the purpose of decision tree based decision making and to serve the opportunities of decision tree based prediction, ASMiner has a web-based online prediction tool. In fact, decision tree based prediction is no more than hoping on the decision nodes with appropriate directions. At the last step of this recursive process, the value of target variable (attribute) becomes clear or a distribution table is given at worst case. In the case of regression trees, the value of target variable is calculated by the formula of decision node. Two types of prediction queries such as batch and singleton exist in Analysis Services. However, only singleton querying is supported by ASMiner.

4 Online prediction can be repeated many times to obtain best decision because it is an iterative process. Fig. 4 shows the stages of ASMiner web-based prediction tool. In the first and second stages, decision maker selects the predictable variable(s) and attach them with a required member of predict function family. Pure predict() function results the value of target variable. On the other hand, predict-support() function returns the support value of the predicted target variable. In the third stage, to make a prediction, decision maker must enter the case that will be predicted. Thus, in this stage, the input variables are entered. In the last stage, results are taken on the fly and evaluated by decision maker. If needed, predictable or input variables may be changed with different combinations and overall scenario repeats itself until the decision maker is satisfied. (1)Select a predictable variable (2) Attach a suitable predict function (3) Input the independent variables of case (4) Get the prediction results in a table Figure 4. The stages in web-based prediction of ASMiner ASMiner clustering subsystem focuses on describing and introducing discovered clusters in different point of views. Majority of the viewers implemented in ASMiner clustering subsystem targets to inform the users about characteristics, statistical differences and discriminations of clusters. Furthermore, a distribution based cluster dominancy exploration method and viewer (Fig. 5) are empirically developed to gain insight on that which clusters are highly dominant or recessive at the intersection of values of discrete variables in two dimensional spaces. ASMiner has six different types of clustering viewer implemented such as value distribution, cluster distribution, general cluster profiles, specific cluster characteristics, cluster comparison and lastly cluster neighborhood + distribution viewers. Moreover, ASMiner presents two new viewers used for value-variable distribution and cluster distribution. By using these viewers, decision makers have the opportunity of having statistical insights of clusters. In the cluster properties viewer, the properties of a selected cluster are listed in decreasing support value. Figure 5. Cluster dominancy distribution viewer Association rules mining is one of the most important data mining tasks. For this reason, an association rules mining module is implemented to ASMiner and three viewers are designed for this module. Itemsets viewer, rules viewer and rules dependency network viewer constitute the association rules module of ASMiner. ASMiner includes a comprehensive rules viewer. Unlike the other implementations of Apriori algorithm, Analysis Services focuses on the Importance (namely lift) score for measuring the usefulness of the rule (Maclennan et al., 2008). Rule importance score ranges between -1 to 1. Due to its potential

5 advantages, ASMiner focuses on the ways of filtering and saving the important rules that decision makers require to report. Therefore, rule viewer is equipped with a minimum importance, minimum confidence and textual search controls. A web-based and flexible management system for administrators of system is developed for ASMiner. By using this tool, user, roles, active mining models and the relationships among them can be managed. With the help of Analysis Services AMO programming interfaces, model information can be retrieved and the updates are directly reflected. Additionally, system administrator can specifically allow or ban the user(s) to explore selected models. Finally, anyone as the user has right to access the model, he/she can be forbidden of training or making predictions over it. Up to now, data mining oriented decision making processes have taken too much time due to the barriers between decision makers and data mining experts. In addition, each change during the generation of reports requires alternation in data mining models. For this reason, it results in new loops between the decision makers and system administrators. However, this limitation may be decreased by operations to be performed by the decision makers. This shows that the current approach in data mining software packages assuming the target users as data mining expert. This approach is the fundamental barrier to the common use of data mining. As the examples, researchers from different disciplines, officers of banks and insurance companies, market managers and decision makers can use ASMiner easily. Decision tree based risk assessment for all incoming requests can be carried out by these persons without needing a data mining expert. 3. CONCLUSION In this study, a web-based DSS namely ASMiner was developed. It is designed and implemented to take full advantages of ultimate technologies in Internet and in DSS. In the designing stage, some viewers were designed inspiring form the original Analysis Services viewers. For this reason, ASMiner can be assessed as the web based version of Analysis Services. In addition, although Analysis Services presents some features for connecting to itself on HTTP platform, ASMiner provides a pure three-tiered web-based data mining platform. In addition, by considering AJAX based techniques and controls, the performance and user interaction capabilities were enhanced. Due to the characteristics of ASMiner, it is possible to say that it has some advantageous when compared with the other reporting and exploring tools used in practice. As the further recommendation, extending the management capabilities on the data mining models and enhancing the system for administrative usage is planned. In addition, another important point that is aimed to implement in the future, is that supporting batch queries against live data sources. Furthermore, implementing web-based naïve Bayesian model viewer and sequence clustering model viewer are the important milestones in the development roadmap of ASMiner. Additionally, ASMiner would be fully automated and more comprehensive web-based DSS for both decision makers and data mining experts. REFERENCES Ba, S. and Kalakota, A. B., Executable Documents DSS. Proc. 3rd International. Conference on DSS. Hong- Kong. Bhargava, H.K. and Power, D.J., Decision Support Systems and Web Technologies. AMCIS 2001 Proceedings. Heinrichs, J.H. and Him, J., Integrating Web Based Data Mining Tools with Business Models For Knowledge Management, Decision Support Systems, Vol. 35, No. 1, pp Maclennan, J. et al, Data Mining with SQL Server Wiley, Indiana Polis, USA. Shim, J.P. et al, Past, Present, and Future of Decision Support Technology. Decision Support Systems, Vol. 4, No. 2, pp Visifire, 2009, Available:

A new web based data mining exploration and reporting tool for decision makers

A new web based data mining exploration and reporting tool for decision makers ORIGINAL RESEARCH A new web based data mining exploration and reporting tool for decision makers Ahmet Selman Bozkir, Ebru Akcapinar Sezer Department of Computer Engineering, Hacettepe University. Ankara,

More information

Identification of User Patterns in Social Networks by Data Mining Techniques: Facebook Case

Identification of User Patterns in Social Networks by Data Mining Techniques: Facebook Case Identification of User Patterns in Social Networks by Data Mining Techniques: Facebook Case A. Selman Bozkır 1, S. Güzin Mazman 2, and Ebru Akçapınar Sezer 1 1 Hacettepe University, Department of Computer

More information

IT462 Lab 5: Clustering with MS SQL Server

IT462 Lab 5: Clustering with MS SQL Server IT462 Lab 5: Clustering with MS SQL Server This lab should give you the chance to practice some of the data mining techniques you've learned in class. Preliminaries: For this lab, you will use the SQL

More information

COURSE RECOMMENDER SYSTEM IN E-LEARNING

COURSE RECOMMENDER SYSTEM IN E-LEARNING International Journal of Computer Science and Communication Vol. 3, No. 1, January-June 2012, pp. 159-164 COURSE RECOMMENDER SYSTEM IN E-LEARNING Sunita B Aher 1, Lobo L.M.R.J. 2 1 M.E. (CSE)-II, Walchand

More information

from Larson Text By Susan Miertschin

from Larson Text By Susan Miertschin Decision Tree Data Mining Example from Larson Text By Susan Miertschin 1 Problem The Maximum Miniatures Marketing Department wants to do a targeted mailing gpromoting the Mythic World line of figurines.

More information

This white paper is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, AS TO THE INFORMATION IN THIS DOCUMENT.

This white paper is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, AS TO THE INFORMATION IN THIS DOCUMENT. Data Mining Tutorial Seth Paul Jamie MacLennan Zhaohui Tang Scott Oveson Microsoft Corporation June 2005 Abstract: Microsoft SQL Server 2005 provides an integrated environment for creating and working

More information

KnowledgeSEEKER Marketing Edition

KnowledgeSEEKER Marketing Edition KnowledgeSEEKER Marketing Edition Predictive Analytics for Marketing The Easiest to Use Marketing Analytics Tool KnowledgeSEEKER Marketing Edition is a predictive analytics tool designed for marketers

More information

Data Mining Solutions for the Business Environment

Data Mining Solutions for the Business Environment Database Systems Journal vol. IV, no. 4/2013 21 Data Mining Solutions for the Business Environment Ruxandra PETRE University of Economic Studies, Bucharest, Romania ruxandra_stefania.petre@yahoo.com Over

More information

ASSOCIATION RULE MINING ON WEB LOGS FOR EXTRACTING INTERESTING PATTERNS THROUGH WEKA TOOL

ASSOCIATION RULE MINING ON WEB LOGS FOR EXTRACTING INTERESTING PATTERNS THROUGH WEKA TOOL International Journal Of Advanced Technology In Engineering And Science Www.Ijates.Com Volume No 03, Special Issue No. 01, February 2015 ISSN (Online): 2348 7550 ASSOCIATION RULE MINING ON WEB LOGS FOR

More information

The Prophecy-Prototype of Prediction modeling tool

The Prophecy-Prototype of Prediction modeling tool The Prophecy-Prototype of Prediction modeling tool Ms. Ashwini Dalvi 1, Ms. Dhvni K.Shah 2, Ms. Rujul B.Desai 3, Ms. Shraddha M.Vora 4, Mr. Vaibhav G.Tailor 5 Department of Information Technology, Mumbai

More information

How To Use A Data Mining Tool

How To Use A Data Mining Tool Database Systems Journal vol. I, no. 2/2010 45 Commercially Available Data Mining Tools used in the Economic Environment Mihai ANDRONIE 1, Daniel CRIŞAN 2 1 Academy of Economic Studies, Bucharest, Romania

More information

Introduction Predictive Analytics Tools: Weka

Introduction Predictive Analytics Tools: Weka Introduction Predictive Analytics Tools: Weka Predictive Analytics Center of Excellence San Diego Supercomputer Center University of California, San Diego Tools Landscape Considerations Scale User Interface

More information

Tutorials for Project on Building a Business Analytic Model Using Data Mining Tool and Data Warehouse and OLAP Cubes IST 734

Tutorials for Project on Building a Business Analytic Model Using Data Mining Tool and Data Warehouse and OLAP Cubes IST 734 Cleveland State University Tutorials for Project on Building a Business Analytic Model Using Data Mining Tool and Data Warehouse and OLAP Cubes IST 734 SS Chung 14 Build a Data Mining Model using Data

More information

Delivering Business Intelligence With Microsoft SQL Server 2005 or 2008 HDT922 Five Days

Delivering Business Intelligence With Microsoft SQL Server 2005 or 2008 HDT922 Five Days or 2008 Five Days Prerequisites Students should have experience with any relational database management system as well as experience with data warehouses and star schemas. It would be helpful if students

More information

Data Mining. SPSS Clementine 12.0. 1. Clementine Overview. Spring 2010 Instructor: Dr. Masoud Yaghini. Clementine

Data Mining. SPSS Clementine 12.0. 1. Clementine Overview. Spring 2010 Instructor: Dr. Masoud Yaghini. Clementine Data Mining SPSS 12.0 1. Overview Spring 2010 Instructor: Dr. Masoud Yaghini Introduction Types of Models Interface Projects References Outline Introduction Introduction Three of the common data mining

More information

SQL Server 2014 BI. Lab 04. Enhancing an E-Commerce Web Application with Analysis Services Data Mining in SQL Server 2014. Jump to the Lab Overview

SQL Server 2014 BI. Lab 04. Enhancing an E-Commerce Web Application with Analysis Services Data Mining in SQL Server 2014. Jump to the Lab Overview SQL Server 2014 BI Lab 04 Enhancing an E-Commerce Web Application with Analysis Services Data Mining in SQL Server 2014 Jump to the Lab Overview Terms of Use 2014 Microsoft Corporation. All rights reserved.

More information

Some vendors have a big presence in a particular industry; some are geared toward data scientists, others toward business users.

Some vendors have a big presence in a particular industry; some are geared toward data scientists, others toward business users. Bonus Chapter Ten Major Predictive Analytics Vendors In This Chapter Angoss FICO IBM RapidMiner Revolution Analytics Salford Systems SAP SAS StatSoft, Inc. TIBCO This chapter highlights ten of the major

More information

STATISTICA. Financial Institutions. Case Study: Credit Scoring. and

STATISTICA. Financial Institutions. Case Study: Credit Scoring. and Financial Institutions and STATISTICA Case Study: Credit Scoring STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Web-based Analytics Table of Contents INTRODUCTION: WHAT

More information

A Proposed Data Mining Model to Enhance Counter- Criminal Systems with Application on National Security Crimes

A Proposed Data Mining Model to Enhance Counter- Criminal Systems with Application on National Security Crimes A Proposed Data Mining Model to Enhance Counter- Criminal Systems with Application on National Security Crimes Dr. Nevine Makram Labib Department of Computer and Information Systems Faculty of Management

More information

Outlines. Business Intelligence. What Is Business Intelligence? Data mining life cycle

Outlines. Business Intelligence. What Is Business Intelligence? Data mining life cycle Outlines Business Intelligence Lecture 15 Why integrate BI into your smart client application? Integrating Mining into your application Integrating into your application What Is Business Intelligence?

More information

Principles of Data Mining by Hand&Mannila&Smyth

Principles of Data Mining by Hand&Mannila&Smyth Principles of Data Mining by Hand&Mannila&Smyth Slides for Textbook Ari Visa,, Institute of Signal Processing Tampere University of Technology October 4, 2010 Data Mining: Concepts and Techniques 1 Differences

More information

SQL Server Administrator Introduction - 3 Days Objectives

SQL Server Administrator Introduction - 3 Days Objectives SQL Server Administrator Introduction - 3 Days INTRODUCTION TO MICROSOFT SQL SERVER Exploring the components of SQL Server Identifying SQL Server administration tasks INSTALLING SQL SERVER Identifying

More information

KnowledgeSEEKER POWERFUL SEGMENTATION, STRATEGY DESIGN AND VISUALIZATION SOFTWARE

KnowledgeSEEKER POWERFUL SEGMENTATION, STRATEGY DESIGN AND VISUALIZATION SOFTWARE POWERFUL SEGMENTATION, STRATEGY DESIGN AND VISUALIZATION SOFTWARE Most Effective Modeling Application Designed to Address Business Challenges Applying a predictive strategy to reach a desired business

More information

Improve Results with High- Performance Data Mining

Improve Results with High- Performance Data Mining Clementine 10.0 Specifications Improve Results with High- Performance Data Mining Data mining provides organizations with a clearer view of current conditions and deeper insight into future events. With

More information

XFlash A Web Application Design Framework with Model-Driven Methodology

XFlash A Web Application Design Framework with Model-Driven Methodology International Journal of u- and e- Service, Science and Technology 47 XFlash A Web Application Design Framework with Model-Driven Methodology Ronnie Cheung Hong Kong Polytechnic University, Hong Kong SAR,

More information

Zoomer: An Automated Web Application Change Localization Tool

Zoomer: An Automated Web Application Change Localization Tool Journal of Communication and Computer 9 (2012) 913-919 D DAVID PUBLISHING Zoomer: An Automated Web Application Change Localization Tool Wenhua Wang 1 and Yu Lei 2 1. Marin Software Company, San Francisco,

More information

DATA MINING TOOL FOR INTEGRATED COMPLAINT MANAGEMENT SYSTEM WEKA 3.6.7

DATA MINING TOOL FOR INTEGRATED COMPLAINT MANAGEMENT SYSTEM WEKA 3.6.7 DATA MINING TOOL FOR INTEGRATED COMPLAINT MANAGEMENT SYSTEM WEKA 3.6.7 UNDER THE GUIDANCE Dr. N.P. DHAVALE, DGM, INFINET Department SUBMITTED TO INSTITUTE FOR DEVELOPMENT AND RESEARCH IN BANKING TECHNOLOGY

More information

Microsoft Services Exceed your business with Microsoft SharePoint Server 2010

Microsoft Services Exceed your business with Microsoft SharePoint Server 2010 Microsoft Services Exceed your business with Microsoft SharePoint Server 2010 Business Intelligence Suite Alexandre Mendeiros, SQL Server Premier Field Engineer January 2012 Agenda Microsoft Business Intelligence

More information

Index Contents Page No. Introduction . Data Mining & Knowledge Discovery

Index Contents Page No. Introduction . Data Mining & Knowledge Discovery Index Contents Page No. 1. Introduction 1 1.1 Related Research 2 1.2 Objective of Research Work 3 1.3 Why Data Mining is Important 3 1.4 Research Methodology 4 1.5 Research Hypothesis 4 1.6 Scope 5 2.

More information

Data Mining with SQL Server Data Tools

Data Mining with SQL Server Data Tools Data Mining with SQL Server Data Tools Data mining tasks include classification (directed/supervised) models as well as (undirected/unsupervised) models of association analysis and clustering. 1 Data Mining

More information

<no narration for this slide>

<no narration for this slide> 1 2 The standard narration text is : After completing this lesson, you will be able to: < > SAP Visual Intelligence is our latest innovation

More information

CUSTOMER Presentation of SAP Predictive Analytics

CUSTOMER Presentation of SAP Predictive Analytics SAP Predictive Analytics 2.0 2015-02-09 CUSTOMER Presentation of SAP Predictive Analytics Content 1 SAP Predictive Analytics Overview....3 2 Deployment Configurations....4 3 SAP Predictive Analytics Desktop

More information

IBM SPSS Modeler 14.2 In-Database Mining Guide

IBM SPSS Modeler 14.2 In-Database Mining Guide IBM SPSS Modeler 14.2 In-Database Mining Guide Note: Before using this information and the product it supports, read the general information under Notices on p. 197. This edition applies to IBM SPSS Modeler

More information

Achieve Better Insight and Prediction with Data Mining

Achieve Better Insight and Prediction with Data Mining Clementine 11.1 Specifications Achieve Better Insight and Prediction with Data Mining Data mining provides organizations with a clearer view of current conditions and deeper insight into future events.

More information

Decision Support Optimization through Predictive Analytics - Leuven Statistical Day 2010

Decision Support Optimization through Predictive Analytics - Leuven Statistical Day 2010 Decision Support Optimization through Predictive Analytics - Leuven Statistical Day 2010 Ernst van Waning Senior Sales Engineer May 28, 2010 Agenda SPSS, an IBM Company SPSS Statistics User-driven product

More information

Overview. Background. Data Mining Analytics for Business Intelligence and Decision Support

Overview. Background. Data Mining Analytics for Business Intelligence and Decision Support Mining Analytics for Business Intelligence and Decision Support Chid Apte, PhD Manager, Abstraction Research Group IBM TJ Watson Research Center apte@us.ibm.com http://www.research.ibm.com/dar Overview

More information

What you can do:...3 Data Entry:...3 Drillhole Sample Data:...5 Cross Sections and Level Plans...8 3D Visualization...11

What you can do:...3 Data Entry:...3 Drillhole Sample Data:...5 Cross Sections and Level Plans...8 3D Visualization...11 What you can do:...3 Data Entry:...3 Drillhole Sample Data:...5 Cross Sections and Level Plans...8 3D Visualization...11 W elcome to North Face Software s software. With this software, you can accomplish

More information

IBM SPSS Modeler 15 In-Database Mining Guide

IBM SPSS Modeler 15 In-Database Mining Guide IBM SPSS Modeler 15 In-Database Mining Guide Note: Before using this information and the product it supports, read the general information under Notices on p. 217. This edition applies to IBM SPSS Modeler

More information

Vendor briefing Business Intelligence and Analytics Platforms Gartner 15 capabilities

Vendor briefing Business Intelligence and Analytics Platforms Gartner 15 capabilities Vendor briefing Business Intelligence and Analytics Platforms Gartner 15 capabilities April, 2013 gaddsoftware.com Table of content 1. Introduction... 3 2. Vendor briefings questions and answers... 3 2.1.

More information

131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10

131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10 1/10 131-1 Adding New Level in KDD to Make the Web Usage Mining More Efficient Mohammad Ala a AL_Hamami PHD Student, Lecturer m_ah_1@yahoocom Soukaena Hassan Hashem PHD Student, Lecturer soukaena_hassan@yahoocom

More information

Prerequisites. Course Outline

Prerequisites. Course Outline MS-55040: Data Mining, Predictive Analytics with Microsoft Analysis Services and Excel PowerPivot Description This three-day instructor-led course will introduce the students to the concepts of data mining,

More information

Fluency With Information Technology CSE100/IMT100

Fluency With Information Technology CSE100/IMT100 Fluency With Information Technology CSE100/IMT100 ),7 Larry Snyder & Mel Oyler, Instructors Ariel Kemp, Isaac Kunen, Gerome Miklau & Sean Squires, Teaching Assistants University of Washington, Autumn 1999

More information

Students who successfully complete the Health Science Informatics major will be able to:

Students who successfully complete the Health Science Informatics major will be able to: Health Science Informatics Program Requirements Hours: 72 hours Informatics Core Requirements - 31 hours INF 101 Seminar Introductory Informatics (1) INF 110 Foundations in Technology (3) INF 120 Principles

More information

An Overview of Knowledge Discovery Database and Data mining Techniques

An Overview of Knowledge Discovery Database and Data mining Techniques An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,

More information

SUGI 29 Systems Architecture. Paper 223-29

SUGI 29 Systems Architecture. Paper 223-29 Paper 223-29 SAS Add-In for Microsoft Office Leveraging SAS Throughout the Organization from Microsoft Office Jennifer Clegg, SAS Institute Inc., Cary, NC Stephen McDaniel, SAS Institute Inc., Cary, NC

More information

IBM Cognos TM1 Executive Viewer Fast self-service analytics

IBM Cognos TM1 Executive Viewer Fast self-service analytics Data Sheet IBM Cognos TM1 Executive Viewer Fast self-service analytics Overview IBM Cognos TM1 Executive Viewer provides business users with selfservice, real-time, Web-based access to information from

More information

Develop Predictive Models Using Your Business Expertise

Develop Predictive Models Using Your Business Expertise Clementine 8.5 Specifications Develop Predictive Models Using Your Business Expertise Clementine is an integrated data mining workbench, popular worldwide with data miners and business analysts alike.

More information

Make Better Decisions Through Predictive Intelligence

Make Better Decisions Through Predictive Intelligence IBM SPSS Modeler Professional Make Better Decisions Through Predictive Intelligence Highlights Easily access, prepare and model structured data with this intuitive, visual data mining workbench Rapidly

More information

Data Mining Algorithms Part 1. Dejan Sarka

Data Mining Algorithms Part 1. Dejan Sarka Data Mining Algorithms Part 1 Dejan Sarka Join the conversation on Twitter: @DevWeek #DW2015 Instructor Bio Dejan Sarka (dsarka@solidq.com) 30 years of experience SQL Server MVP, MCT, 13 books 7+ courses

More information

KEYWORD SEARCH OVER PROBABILISTIC RDF GRAPHS

KEYWORD SEARCH OVER PROBABILISTIC RDF GRAPHS ABSTRACT KEYWORD SEARCH OVER PROBABILISTIC RDF GRAPHS In many real applications, RDF (Resource Description Framework) has been widely used as a W3C standard to describe data in the Semantic Web. In practice,

More information

M15_BERE8380_12_SE_C15.7.qxd 2/21/11 3:59 PM Page 1. 15.7 Analytics and Data Mining 1

M15_BERE8380_12_SE_C15.7.qxd 2/21/11 3:59 PM Page 1. 15.7 Analytics and Data Mining 1 M15_BERE8380_12_SE_C15.7.qxd 2/21/11 3:59 PM Page 1 15.7 Analytics and Data Mining 15.7 Analytics and Data Mining 1 Section 1.5 noted that advances in computing processing during the past 40 years have

More information

Visualizing e-government Portal and Its Performance in WEBVS

Visualizing e-government Portal and Its Performance in WEBVS Visualizing e-government Portal and Its Performance in WEBVS Ho Si Meng, Simon Fong Department of Computer and Information Science University of Macau, Macau SAR ccfong@umac.mo Abstract An e-government

More information

Application of Data Warehouse and Data Mining. in Construction Management

Application of Data Warehouse and Data Mining. in Construction Management Application of Data Warehouse and Data Mining in Construction Management Jianping ZHANG 1 (zhangjp@tsinghua.edu.cn) Tianyi MA 1 (matianyi97@mails.tsinghua.edu.cn) Qiping SHEN 2 (bsqpshen@inet.polyu.edu.hk)

More information

IS 2927 Independent Study in Systems & Technology Applications of Information Technology. Adaptive Online Course Recommendation System Part II

IS 2927 Independent Study in Systems & Technology Applications of Information Technology. Adaptive Online Course Recommendation System Part II IS 2927 Independent Study in Systems & Technology Applications of Information Technology Adaptive Online Course Recommendation System Part II Li-Chen Mao - 1 - PROJECT OVERVIEW Course: IS 2927 Independent

More information

An Introduction to WEKA. As presented by PACE

An Introduction to WEKA. As presented by PACE An Introduction to WEKA As presented by PACE Download and Install WEKA Website: http://www.cs.waikato.ac.nz/~ml/weka/index.html 2 Content Intro and background Exploring WEKA Data Preparation Creating Models/

More information

PREDICTING STUDENTS PERFORMANCE USING ID3 AND C4.5 CLASSIFICATION ALGORITHMS

PREDICTING STUDENTS PERFORMANCE USING ID3 AND C4.5 CLASSIFICATION ALGORITHMS PREDICTING STUDENTS PERFORMANCE USING ID3 AND C4.5 CLASSIFICATION ALGORITHMS Kalpesh Adhatrao, Aditya Gaykar, Amiraj Dhawan, Rohit Jha and Vipul Honrao ABSTRACT Department of Computer Engineering, Fr.

More information

Data Mining: Concepts and Techniques. Jiawei Han. Micheline Kamber. Simon Fräser University К MORGAN KAUFMANN PUBLISHERS. AN IMPRINT OF Elsevier

Data Mining: Concepts and Techniques. Jiawei Han. Micheline Kamber. Simon Fräser University К MORGAN KAUFMANN PUBLISHERS. AN IMPRINT OF Elsevier Data Mining: Concepts and Techniques Jiawei Han Micheline Kamber Simon Fräser University К MORGAN KAUFMANN PUBLISHERS AN IMPRINT OF Elsevier Contents Foreword Preface xix vii Chapter I Introduction I I.

More information

Test Run Analysis Interpretation (AI) Made Easy with OpenLoad

Test Run Analysis Interpretation (AI) Made Easy with OpenLoad Test Run Analysis Interpretation (AI) Made Easy with OpenLoad OpenDemand Systems, Inc. Abstract / Executive Summary As Web applications and services become more complex, it becomes increasingly difficult

More information

Database Marketing, Business Intelligence and Knowledge Discovery

Database Marketing, Business Intelligence and Knowledge Discovery Database Marketing, Business Intelligence and Knowledge Discovery Note: Using material from Tan / Steinbach / Kumar (2005) Introduction to Data Mining,, Addison Wesley; and Cios / Pedrycz / Swiniarski

More information

PSG College of Technology, Coimbatore-641 004 Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS.

PSG College of Technology, Coimbatore-641 004 Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS. PSG College of Technology, Coimbatore-641 004 Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS Project Project Title Area of Abstract No Specialization 1. Software

More information

CHAPTER 1: INTRODUCTION TO THE COURSE

CHAPTER 1: INTRODUCTION TO THE COURSE Chapter 1: Introduction to the Course CHAPTER 1: INTRODUCTION TO THE COURSE Objectives Introduction The objectives are: Know the structure and scope of the course. This chapter provides an overview of

More information

Oracle Data Miner (Extension of SQL Developer 4.0)

Oracle Data Miner (Extension of SQL Developer 4.0) An Oracle White Paper October 2013 Oracle Data Miner (Extension of SQL Developer 4.0) Generate a PL/SQL script for workflow deployment Denny Wong Oracle Data Mining Technologies 10 Van de Graff Drive Burlington,

More information

The basic data mining algorithms introduced may be enhanced in a number of ways.

The basic data mining algorithms introduced may be enhanced in a number of ways. DATA MINING TECHNOLOGIES AND IMPLEMENTATIONS The basic data mining algorithms introduced may be enhanced in a number of ways. Data mining algorithms have traditionally assumed data is memory resident,

More information

Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com

Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com SPSS-SA Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com SPSS-SA Training Brochure 2009 TABLE OF CONTENTS 1 SPSS TRAINING COURSES FOCUSING

More information

Predicting Students Final GPA Using Decision Trees: A Case Study

Predicting Students Final GPA Using Decision Trees: A Case Study Predicting Students Final GPA Using Decision Trees: A Case Study Mashael A. Al-Barrak and Muna Al-Razgan Abstract Educational data mining is the process of applying data mining tools and techniques to

More information

How To Create A Visual Analytics Tool

How To Create A Visual Analytics Tool W H I T E P A P E R Visual Analytics for the Masses 1 State of Visual Analytics Visual analytics, in the field of business intelligence, is the integration of data visualization and interactive visual

More information

ANALYSIS OF WEBSITE USAGE WITH USER DETAILS USING DATA MINING PATTERN RECOGNITION

ANALYSIS OF WEBSITE USAGE WITH USER DETAILS USING DATA MINING PATTERN RECOGNITION ANALYSIS OF WEBSITE USAGE WITH USER DETAILS USING DATA MINING PATTERN RECOGNITION K.Vinodkumar 1, Kathiresan.V 2, Divya.K 3 1 MPhil scholar, RVS College of Arts and Science, Coimbatore, India. 2 HOD, Dr.SNS

More information

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R Practical Data Science with Azure Machine Learning, SQL Data Mining, and R Overview This 4-day class is the first of the two data science courses taught by Rafal Lukawiecki. Some of the topics will be

More information

Oracle9i Data Warehouse Review. Robert F. Edwards Dulcian, Inc.

Oracle9i Data Warehouse Review. Robert F. Edwards Dulcian, Inc. Oracle9i Data Warehouse Review Robert F. Edwards Dulcian, Inc. Agenda Oracle9i Server OLAP Server Analytical SQL Data Mining ETL Warehouse Builder 3i Oracle 9i Server Overview 9i Server = Data Warehouse

More information

EnterpriseLink Benefits

EnterpriseLink Benefits EnterpriseLink Benefits GGY AXIS 5001 Yonge Street Suite 1300 Toronto, ON M2N 6P6 Phone: 416-250-6777 Toll free: 1-877-GGY-AXIS Fax: 416-250-6776 Email: axis@ggy.com Web: www.ggy.com Table of Contents

More information

Introduction to Data Mining

Introduction to Data Mining Introduction to Data Mining Jay Urbain Credits: Nazli Goharian & David Grossman @ IIT Outline Introduction Data Pre-processing Data Mining Algorithms Naïve Bayes Decision Tree Neural Network Association

More information

Development of a Learning Content Management Systems

Development of a Learning Content Management Systems Development of a Learning Content Management Systems Lejla Abazi-Bexheti Abstract Change appears to be the only constant in the field of ICT and what was treated as advanced feature few years ago is today

More information

Application Tool for Experiments on SQL Server 2005 Transactions

Application Tool for Experiments on SQL Server 2005 Transactions Proceedings of the 5th WSEAS Int. Conf. on DATA NETWORKS, COMMUNICATIONS & COMPUTERS, Bucharest, Romania, October 16-17, 2006 30 Application Tool for Experiments on SQL Server 2005 Transactions ŞERBAN

More information

2015 Workshops for Professors

2015 Workshops for Professors SAS Education Grow with us Offered by the SAS Global Academic Program Supporting teaching, learning and research in higher education 2015 Workshops for Professors 1 Workshops for Professors As the market

More information

Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers

Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers 60 Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative

More information

Efficient Integration of Data Mining Techniques in Database Management Systems

Efficient Integration of Data Mining Techniques in Database Management Systems Efficient Integration of Data Mining Techniques in Database Management Systems Fadila Bentayeb Jérôme Darmont Cédric Udréa ERIC, University of Lyon 2 5 avenue Pierre Mendès-France 69676 Bron Cedex France

More information

TRENDS IN THE DEVELOPMENT OF BUSINESS INTELLIGENCE SYSTEMS

TRENDS IN THE DEVELOPMENT OF BUSINESS INTELLIGENCE SYSTEMS 9 8 TRENDS IN THE DEVELOPMENT OF BUSINESS INTELLIGENCE SYSTEMS Assist. Prof. Latinka Todoranova Econ Lit C 810 Information technology is a highly dynamic field of research. As part of it, business intelligence

More information

IBM SPSS Modeler Professional

IBM SPSS Modeler Professional IBM SPSS Modeler Professional Make better decisions through predictive intelligence Highlights Create more effective strategies by evaluating trends and likely outcomes. Easily access, prepare and model

More information

Augmented Search for Web Applications. New frontier in big log data analysis and application intelligence

Augmented Search for Web Applications. New frontier in big log data analysis and application intelligence Augmented Search for Web Applications New frontier in big log data analysis and application intelligence Business white paper May 2015 Web applications are the most common business applications today.

More information

Data Mining Analytics for Business Intelligence and Decision Support

Data Mining Analytics for Business Intelligence and Decision Support Data Mining Analytics for Business Intelligence and Decision Support Chid Apte, T.J. Watson Research Center, IBM Research Division Knowledge Discovery and Data Mining (KDD) techniques are used for analyzing

More information

Data Mining Extensions (DMX) Reference

Data Mining Extensions (DMX) Reference Data Mining Extensions (DMX) Reference SQL Server 2012 Books Online Summary: Data Mining Extensions (DMX) is a language that you can use to create and work with data mining models in Microsoft SQL Server

More information

Ezgi Dinçerden. Marmara University, Istanbul, Turkey

Ezgi Dinçerden. Marmara University, Istanbul, Turkey Economics World, Mar.-Apr. 2016, Vol. 4, No. 2, 60-65 doi: 10.17265/2328-7144/2016.02.002 D DAVID PUBLISHING The Effects of Business Intelligence on Strategic Management of Enterprises Ezgi Dinçerden Marmara

More information

THE NEXT GENERATION OF DATA ANALYSIS TOOLS Alexandros Karakos, Pericles Karakos

THE NEXT GENERATION OF DATA ANALYSIS TOOLS Alexandros Karakos, Pericles Karakos The XIII International Conference Applied Stochastic Models and Data Analysis (ASMDA-2009) June 30-July 3, 2009, Vilnius, LITHUANIA ISBN 978-9955-28-463-5 L. Sakalauskas, C. Skiadas and E. K. Zavadskas

More information

Management Decision Making. Hadi Hosseini CS 330 David R. Cheriton School of Computer Science University of Waterloo July 14, 2011

Management Decision Making. Hadi Hosseini CS 330 David R. Cheriton School of Computer Science University of Waterloo July 14, 2011 Management Decision Making Hadi Hosseini CS 330 David R. Cheriton School of Computer Science University of Waterloo July 14, 2011 Management decision making Decision making Spreadsheet exercise Data visualization,

More information

Identifying At-Risk Students Using Machine Learning Techniques: A Case Study with IS 100

Identifying At-Risk Students Using Machine Learning Techniques: A Case Study with IS 100 Identifying At-Risk Students Using Machine Learning Techniques: A Case Study with IS 100 Erkan Er Abstract In this paper, a model for predicting students performance levels is proposed which employs three

More information

LVQ Plug-In Algorithm for SQL Server

LVQ Plug-In Algorithm for SQL Server LVQ Plug-In Algorithm for SQL Server Licínia Pedro Monteiro Instituto Superior Técnico licinia.monteiro@tagus.ist.utl.pt I. Executive Summary In this Resume we describe a new functionality implemented

More information

Programmabilty. Programmability in Microsoft Dynamics AX 2009. Microsoft Dynamics AX 2009. White Paper

Programmabilty. Programmability in Microsoft Dynamics AX 2009. Microsoft Dynamics AX 2009. White Paper Programmabilty Microsoft Dynamics AX 2009 Programmability in Microsoft Dynamics AX 2009 White Paper December 2008 Contents Introduction... 4 Scenarios... 4 The Presentation Layer... 4 Business Intelligence

More information

Achieve Better Insight and Prediction with Data Mining

Achieve Better Insight and Prediction with Data Mining Clementine 12.0 Specifications Achieve Better Insight and Prediction with Data Mining Data mining provides organizations with a clearer view of current conditions and deeper insight into future events.

More information

SQL Server 2005 Features Comparison

SQL Server 2005 Features Comparison Page 1 of 10 Quick Links Home Worldwide Search Microsoft.com for: Go : Home Product Information How to Buy Editions Learning Downloads Support Partners Technologies Solutions Community Previous Versions

More information

The Pastel Business Intelligence Centre will revolutionise the way you view your accounting data

The Pastel Business Intelligence Centre will revolutionise the way you view your accounting data A The Pastel Business Intelligence Centre will revolutionise the way you view your accounting data It has been Pastel s objective to deliver technology that goes way beyond accounting. What this means

More information

Data Warehousing and Data Mining in Business Applications

Data Warehousing and Data Mining in Business Applications 133 Data Warehousing and Data Mining in Business Applications Eesha Goel CSE Deptt. GZS-PTU Campus, Bathinda. Abstract Information technology is now required in all aspect of our lives that helps in business

More information

The Analysis of Data Collected by Time and Attendance Systems

The Analysis of Data Collected by Time and Attendance Systems The Analysis of Data Collected by Time and Attendance Systems Tomasz Jędrzejewski 1, Bogdan Trawiński 1, Aleksander Zgrzywa 1 Abstract: Time and attendance software systems are tools for efficient management

More information

Financial Trading System using Combination of Textual and Numerical Data

Financial Trading System using Combination of Textual and Numerical Data Financial Trading System using Combination of Textual and Numerical Data Shital N. Dange Computer Science Department, Walchand Institute of Rajesh V. Argiddi Assistant Prof. Computer Science Department,

More information

TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM

TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM Thanh-Nghi Do College of Information Technology, Cantho University 1 Ly Tu Trong Street, Ninh Kieu District Cantho City, Vietnam

More information

Up Your R Game. James Taylor, Decision Management Solutions Bill Franks, Teradata

Up Your R Game. James Taylor, Decision Management Solutions Bill Franks, Teradata Up Your R Game James Taylor, Decision Management Solutions Bill Franks, Teradata Today s Speakers James Taylor Bill Franks CEO Chief Analytics Officer Decision Management Solutions Teradata 7/28/14 3 Polling

More information

OWB Users, Enter The New ODI World

OWB Users, Enter The New ODI World OWB Users, Enter The New ODI World Kulvinder Hari Oracle Introduction Oracle Data Integrator (ODI) is a best-of-breed data integration platform focused on fast bulk data movement and handling complex data

More information

A Framework for Developing the Web-based Data Integration Tool for Web-Oriented Data Warehousing

A Framework for Developing the Web-based Data Integration Tool for Web-Oriented Data Warehousing A Framework for Developing the Web-based Integration Tool for Web-Oriented Warehousing PATRAVADEE VONGSUMEDH School of Science and Technology Bangkok University Rama IV road, Klong-Toey, BKK, 10110, THAILAND

More information

Oracle Real Time Decisions

Oracle Real Time Decisions A Product Review James Taylor CEO CONTENTS Introducing Decision Management Systems Oracle Real Time Decisions Product Architecture Key Features Availability Conclusion Oracle Real Time Decisions (RTD)

More information

Knowledge Discovery from patents using KMX Text Analytics

Knowledge Discovery from patents using KMX Text Analytics Knowledge Discovery from patents using KMX Text Analytics Dr. Anton Heijs anton.heijs@treparel.com Treparel Abstract In this white paper we discuss how the KMX technology of Treparel can help searchers

More information