72. Ontology Driven Knowledge Discovery Process: a proposal to integrate Ontology Engineering and KDD
|
|
|
- Antony Miles
- 10 years ago
- Views:
Transcription
1 72. Ontology Driven Knowledge Discovery Process: a proposal to integrate Ontology Engineering and KDD Paulo Gottgtroy Auckland University of Technology [email protected] Abstract This paper is concerned with the integration of ontology engineering and the process of knowledge discovery in databases (KDD).It presents a hybrid life, Ontology Driven Knowledge Discovery process and methodology ODKD, which leverages both ontology engineering and KDD taking in consideration the best industry and research practices. A brief application of the life cycle is described at the end of the paper. Keywords: Ontology Engineering, Knowledge Engineering, Knowledge Discovery, Data Mining, knowledge discovery process Introduction Although a great advance has been reached independently by both the ontology engineering field and the second generation of KDD tools (the idea of a continuum process for making sense of data) in the mid 90s (Fayyad U et al. 1996), somewhat less traditional has been the investigation of the role of ontologies in incremental and cyclic approaches of knowledge discovery. This paper is concerned with the interaction between prior knowledge (by means of ontologies) and the process of knowledge discovery. It starts describing relevant topics to the integration of ontology engineering and knowledge discovery processes. The ontology driven knowledge discovery process is then presented along with its phases and its relation to the industry life cycle. Finally, the paper is summarized and research outcomes are presented. Related Research There are different relevant topics to the integration of ontologies and KDD processes in the literature such as the role of domain knowledge in KDD, ontology/kdd integration and KDD life cycle. Domain knowledge has been playing an important role in the knowledge engineering research since the initial development of expert systems. Most recently domain knowledge has gain importance again in the integration of ontology engineering and KDD processes: Domingos (Domingos P. 1999) suggests use of domain knowledge as the most promising approach for constraining knowledge discovery and for avoiding the well-known problem of data overfitting by the discovered models. Anand et al. (Anand et al, 1995) also identify the use of domain knowledge in KDD tasks: for description of attribute relationship rules, for hierarchical generalization trees and constraints. An example of the latter is the specification of degrees of confidence in the different sources of evidence. Phillips J. & Buchaman B. (Phillips J., et al 2001) propose an ontology guided methodology to gradually accumulate knowledge of databases in order gain domain knowledge in the iterative process of a KDD task. In spite of the increase investigation in the integration of domain knowledge, by means of ontologies, and KDD, most approaches concentrate only in the data mining phase of the 1
2 knowledge discovery process while the role of ontologies in other phases of the knowledge discovery process has been relegated. There are currently three approaches being investigated in the ontology and KDD integration emergent topic research: Onto4KDD, KDD4Onto, and one approach that integrates both previous approaches, named here Onto4KDD4Onto. Onto4KDD is defined as the application of ontologies in order to improve the KDD process. For example, domain ontologies can be used to improve the understanding of a problem and to support hypothesis-driven analysis and discovery approaches. In contrast, KDD4Onto approaches are focused on the application of mining techniques in order to automatically or semi-automatically acquire knowledge from data. Although some researchers are addressing Onto4KDD or KDD4Onto, rare is the research that encompasses both perspectives. This work is an attempt to integrate both approaches. It bridges the gap between ontological engineering and knowledge discovery in databases in order to improve both processes. There are also several knowledge discovery life cycles in the literature. Most of them reflect the background of their proponents, such as those originated in the database community, in the artificial intelligence community, in the decision support community, and in the information systems community (Gómez-Pérez, A. 2004). This research takes into consideration different aspects of these life cycles in order to develop a hybrid life cycle incorporating the best practices developed in the ontology engineering field as well as the best industry practice in the knowledge discovery scenario. To this end it adopts the Cross Industry Standard Process for Data Mining (CRISP-DM) - and implements some of the requirements for the next generation of KDD tools (CRISP-DM 2007). CRISP-DM is a comprehensive methodology and process model that defines in different levels a complete mapping for Knowledge discover in database task. Although called process for data mining, CRISP-DM is the industry standard for knowledge discovery tasks. It breaks down the life cycle of a process into six phases: business understanding, data understanding, data preparation, modelling, evaluation, and deployment. Ontology Driven Knowledge Discovery - ODKD The Ontology Driven Knowledge discovery ODKD is a methodology and process model that defines, in different levels, a mapping between ontology engineering and Knowledge discover in database (KDD) process. It defines a hybrid life cycle for knowledge discovery tasks composed of five phases. Each phase is divided in tasks related, directly or indirectly, to both ontology engineering and to CRISP-DM. The life cycle is also supported by a methodology and guidelines which support the implementation of the tasks. 2
3 Table 1. Ontology Driven Knowledge Discovery phases and tasks. As CRISP-DM, the ontology driven knowledge discovery process has a hierarchical breakdown into tasks table 1: ontology preparation, ontology analysis, instance preparation, modelling, and evaluation. These tasks are responsible for the execution of specific ontology engineering and data modelling processes as well as for enabling a parallel integration of both knowledge and ontology engineering processes. The first two phases, ontology preparation and ontology analysis are more related to the ontology engineering process of, assessment, selection and creation of an ontological model. The Instance preparation and Modelling phases are related to the data mining exercise. It is concerned with preparing data for a modelling task while selecting the most appropriated data mining technique to the problem. The last phase, Evaluation, is the most integrative phase where the results of a modelling exercise must be evaluated based on the previous knowledge and new knowledge is inserted in the ontological model. Ontology Preparation First and foremost, the prerequisite to a knowledge discovery exercise is data and business understanding. Without this understanding, no algorithm, regardless of sophistication, is going to provide a meaningful and useful result. Without this background a user/system will not be able to identify the problems he/she/it is trying to solve, prepare the data for mining, or correctly interpret the results. The methodology initiates with a requirement gathering exercise which involves several stakeholders such as subject matter experts and business users, followed by the selection of problem related data and ontology sources. After reaching a consensus around the problem and the source requirements the actions are subdivided into three main pipelines, domain understanding, data understanding, and ontology building see table 1. The domain understanding pipeline is composed of two sub-tasks: domain ontology selection and application ontology selection. The domain ontology selection aims to select knowledge 3
4 that cover a broader problem perspective, for instance, general biomedical ontologies when dealing with a medical problem, while the application ontology selection aims in selecting specific ontological models of a problem, for instance, ontologies related to a specific disease when classifying/diagnosing patients. The selection process is common for both domain and application ontologies. It involves several key steps, including assessing and selecting ontologies candidates (ontology investigation), knowledge quadrant analysis, and knowledge representation mapping plan. The data understanding phase starts with an initial data collection based on the results of data sessions previously executed in the requirement gathering exercise. As in the ontology pipeline, the phase then proceeds with activities to get familiar with the data, to identify data quality problems and to map the data to the ontologies selected. Ontology building is the last step of the ontology preparation phase. It consists of three main tasks: Ontology Integration, Ontology Merging/Alignment and Ontology Creation. Ontology integration is the incorporation of the ontologies identified in the business understanding tasks. The main objective is to reuse ontologies already developed as well as identified as available in the previous selection process in order to add a broader perspective to the problem domain. Ontology merge/alignment is a mapping of concepts and relations between two ontologies (Sowa, J.F. 2001). The main goal is to incorporate and compare existent ontologies to form a body of concepts and relationships able to represent the domain and specific problem. In spite of the current availability of large ontologies covering a very wide range of knowledge in several domains it is very likely that the ontologies acquired in the ontology merging/alignment won t be able to cover fully the necessary knowledge for a KDD task. The Ontology Creation is then concerned with the incorporation of problem specific knowledge and the creation of concepts not covered by the ontological model built in the previous task (merge/alignment). Ontology Analysis Ontology Analysis is related to the discovery of the first insights into the ontological model as well as to the investigation and/or checking of initial hypotheses created based on the disclosure of hidden information in the developed model. The ODKD process defines four tasks in this phase: Ontology Visualization, Ontology Query, Conceptual Matching and Ontology Population. The first two tasks are related to the exploration of the ontological model by means of visualisation, search and analytical process. The last two tasks are related to the construction of the knowledge base by the insertion of data from the available databases and the population of instances from the selected ontologies. Instance Preparation The instance preparation phase covers all activities to construct the final dataset that will be fed into the modelling tool(s) from the ontological model. Instance preparation tasks are likely to be executed multiple times depending on the model being target. Tasks include concept, instances and slot (or attribute in a database taxonomy) selection, transformation and cleaning of instances as well as exportation to modelling tools. It is composed of three main 4
5 tasks in this phase: Instance Cleaning, Instance Selection and Instance Export. Instance cleaning can be considered a special case of data cleaning, also called data cleansing or scrubbing. It deals with detecting and removing errors and inconsistencies from instances in order to improve the quality of features being selected for a modelling task. The Instance selection task brings, probably, the most important contribution to an ontology driven knowledge discovery process. Instead of concentrating the effort in analysing the data characteristics, this task concentrates its effort in giving meaning to data and reducing the features by understanding the domain. The instance export task is concerned with the translation of the ontological model into a format used by different data mining workbenches. The ontology driven knowledge discovery methodology then supports the reverse transformation of an ontological model into a dataset by exporting the knowledge base into a database format. Modelling In this phase, various modelling techniques are selected, different test sets are formed and then the models are applied. Typically, several techniques are applied for the same problem in order to validate the patterns found from different perspectives. This task is aligned with the data mining task of a KDD process.the ODKD methodology supports this phase by extracting knowledge from the ontological model and preparing a data set to the mining model selected. Evaluation This phase is concerned with the evaluation and acquisition of knowledge extracted by a data mining model. The knowledge acquired must be mapped and/or translated into the ontology and then evaluated before its final incorporation in the ontological model. This phase is divided in three tasks in the ODKD process: Knowledge Extraction, Knowledge Assessment and Ontology Learning. Knowledge Extraction is responsible for the incorporation of the knowledge discovered by the data mining into the ontological model. It might also be considered as an automatic knowledge acquisition. After the acquisition of the knowledge a new cycle of analysis begins to assess the knowledge incorporated. Depending on the requirements, this phase can be as simple as generating a report about the knowledge extracted or as complex as implementing a repeatable data mining process across which might involve the analysis of the current ontological model to validate the knowledge acquired or even acquire more domain knowledge to support the findings and/or go back to the ontology analysis to do extra analysis and select new test sets. Creation of the model and assessment are generally not the end of the project. Even if the purpose of the model is to increase knowledge of the data, the knowledge gained will need to be organized and presented in a way that the user can use it. In some sense, it might be compared to the deployment phase where the KDD team decides to agree with the knowledge assessed and uses its structure to update the ontological model. For example, the pattern founded might indicate that some molecular functions are responsible for triggering a disease. 5
6 This knowledge can then be used to support the decision of a medical team in the treatment of a disease. Conclusion and Contribution The Ontology Driven Knowledge Discovery Process is a conceptual proposal that alongside the Evolving Ontology model (Gottgtroy, P. 2006) are the basis for the development of a Framework able to enhance the process of knowledge discovery from data by adding a high level abstraction to the process which may allow a better reuse of previous knowledge as well as the creation and evaluation of new knowledge. The main outcome of this paper is a hybrid knowledge discovery process Table 1 - which defines a life cycle based on the principles of the most adopted KDD process in the industry. The process is divided into five phases which are composed of different tasks. The tasks have some methodological guidelines which indicate some of the best practices experimented in this research. These practices can be extended in accordance with the problem domain and KDD task. The process also presents guidelines based on both industry specific experiences and in the research developed. As a cyclic process it doesn t have a pre-defined and rigid order. To the contrary, the main objective is to create a reference life cycle which can be extended while keeping the most important tasks for the integration of ontology engineering and KDD. The application of this life cycle, the framework and its tools are presented in (Kasabov, N. et al 2007). The process and methodology presented in this paper has been also used in different research and industry projects. We believe that the integration of ontology engineering and KDD will play an important role in the semantic technologies adoption. This work then contributes to both research and industry applications by suggesting a hybrid methodology which describes best practices and leverages both ontology engineering and KDD processes. References Fayyad U et al. From Data Mining to Knowledge Discovery: an Overview, in Advances in Knowledge Discovery and Data Mining, AAAI/MIT Press, Domingos P., The Role of Occam's Razor in Knowledge Discovery, Data Mining and Knowledge Discovery, an International Journal, Kluwer Academic Publishers, Vol.3, (1999), Anand S. S., Bell D. A., Hughes J. G. The Role of Domain Knowledge in Data Mining, Proc. ACM CIKM 95, Baltimore MD USA, pp Phillips, J. and Buchanan, B. G. Ontology-guided knowledge discovery in databases. In Proceedings of the 1st international Conference on Knowledge Capture (Victoria, British Columbia, Canada, October 22-23, 2001). K-CAP '01. ACM Press, New York, NY, Gómez-Pérez, A., and Fernández-López, M., Corcho, O.: Ontological Engineering with examples from the areas of Knowledge Management, e-commerce and the Semantic Web. Springer-Verlag, Berlin Heidelberg New York (2004) CRISP-DM Sowa, J.F. (2001) Ontology, Metadata, and Semiotics. Retrievable from the internet 28/01/2006 at: Gottgtroy, P., Kasabov, N. & MacDonell, S. Evolving Ontologies for Intelligent Decision 6
7 Support Publisher: Elsevier in the series Capturing Intelligence - Fuzzy Logic and the Semantic Web (2006). Kasabov, N, Jain. V., Gottgtroy, P., Benevuska, L. & Joseph, F. (2007). Evolving Brain-Gene Ontology and Simulation System (BGOS): Towards Integrating Bioinformatics and Neuroinformatics Data, Information and Knowledge to Facilitate Discoveries. Special Issue of Neural Networks. In Press. 7
Database Marketing, Business Intelligence and Knowledge Discovery
Database Marketing, Business Intelligence and Knowledge Discovery Note: Using material from Tan / Steinbach / Kumar (2005) Introduction to Data Mining,, Addison Wesley; and Cios / Pedrycz / Swiniarski
Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department Information Sharing
Healthcare Measurement Analysis Using Data mining Techniques
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 03 Issue 07 July, 2014 Page No. 7058-7064 Healthcare Measurement Analysis Using Data mining Techniques 1 Dr.A.Shaik
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Chapter 5. Warehousing, Data Acquisition, Data. Visualization
Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization 5-1 Learning Objectives
Chapter ML:XI. XI. Cluster Analysis
Chapter ML:XI XI. Cluster Analysis Data Mining Overview Cluster Analysis Basics Hierarchical Cluster Analysis Iterative Cluster Analysis Density-Based Cluster Analysis Cluster Evaluation Constrained Cluster
Data Integration Alternatives Managing Value and Quality
Solutions for Customer Intelligence, Communications and Care. Data Integration Alternatives Managing Value and Quality Using a Governed Approach to Incorporating Data Quality Services Within the Data Integration
A Knowledge Management Framework Using Business Intelligence Solutions
www.ijcsi.org 102 A Knowledge Management Framework Using Business Intelligence Solutions Marwa Gadu 1 and Prof. Dr. Nashaat El-Khameesy 2 1 Computer and Information Systems Department, Sadat Academy For
Data Mining Solutions for the Business Environment
Database Systems Journal vol. IV, no. 4/2013 21 Data Mining Solutions for the Business Environment Ruxandra PETRE University of Economic Studies, Bucharest, Romania [email protected] Over
An Introduction to Advanced Analytics and Data Mining
An Introduction to Advanced Analytics and Data Mining Dr Barry Leventhal Henry Stewart Briefing on Marketing Analytics 19 th November 2010 Agenda What are Advanced Analytics and Data Mining? The toolkit
International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014
RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer
SPATIAL DATA CLASSIFICATION AND DATA MINING
, pp.-40-44. Available online at http://www. bioinfo. in/contents. php?id=42 SPATIAL DATA CLASSIFICATION AND DATA MINING RATHI J.B. * AND PATIL A.D. Department of Computer Science & Engineering, Jawaharlal
In this presentation, you will be introduced to data mining and the relationship with meaningful use.
In this presentation, you will be introduced to data mining and the relationship with meaningful use. Data mining refers to the art and science of intelligent data analysis. It is the application of machine
E-Learning Using Data Mining. Shimaa Abd Elkader Abd Elaal
E-Learning Using Data Mining Shimaa Abd Elkader Abd Elaal -10- E-learning using data mining Shimaa Abd Elkader Abd Elaal Abstract Educational Data Mining (EDM) is the process of converting raw data from
A Review of Data Mining Techniques
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 4, April 2014,
Data Integration Alternatives Managing Value and Quality
Solutions for Enabling Lifetime Customer Relationships Data Integration Alternatives Managing Value and Quality Using a Governed Approach to Incorporating Data Quality Services Within the Data Integration
Information Services for Smart Grids
Smart Grid and Renewable Energy, 2009, 8 12 Published Online September 2009 (http://www.scirp.org/journal/sgre/). ABSTRACT Interconnected and integrated electrical power systems, by their very dynamic
Using Semantic Data Mining for Classification Improvement and Knowledge Extraction
Using Semantic Data Mining for Classification Improvement and Knowledge Extraction Fernando Benites and Elena Sapozhnikova University of Konstanz, 78464 Konstanz, Germany. Abstract. The objective of this
An Overview of Knowledge Discovery Database and Data mining Techniques
An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,
Dynamic Data in terms of Data Mining Streams
International Journal of Computer Science and Software Engineering Volume 2, Number 1 (2015), pp. 1-6 International Research Publication House http://www.irphouse.com Dynamic Data in terms of Data Mining
Data Mining and KDD: A Shifting Mosaic. Joseph M. Firestone, Ph.D. White Paper No. Two. March 12, 1997
1 of 11 5/24/02 3:50 PM Data Mining and KDD: A Shifting Mosaic By Joseph M. Firestone, Ph.D. White Paper No. Two March 12, 1997 The Idea of Data Mining Data Mining is an idea based on a simple analogy.
Information Visualization WS 2013/14 11 Visual Analytics
1 11.1 Definitions and Motivation Lot of research and papers in this emerging field: Visual Analytics: Scope and Challenges of Keim et al. Illuminating the path of Thomas and Cook 2 11.1 Definitions and
Knowledgent White Paper Series. Developing an MDM Strategy WHITE PAPER. Key Components for Success
Developing an MDM Strategy Key Components for Success WHITE PAPER Table of Contents Introduction... 2 Process Considerations... 3 Architecture Considerations... 5 Conclusion... 9 About Knowledgent... 10
COURSE RECOMMENDER SYSTEM IN E-LEARNING
International Journal of Computer Science and Communication Vol. 3, No. 1, January-June 2012, pp. 159-164 COURSE RECOMMENDER SYSTEM IN E-LEARNING Sunita B Aher 1, Lobo L.M.R.J. 2 1 M.E. (CSE)-II, Walchand
WHITEPAPER. Creating and Deploying Predictive Strategies that Drive Customer Value in Marketing, Sales and Risk
WHITEPAPER Creating and Deploying Predictive Strategies that Drive Customer Value in Marketing, Sales and Risk Overview Angoss is helping its clients achieve significant revenue growth and measurable return
A Systemic Artificial Intelligence (AI) Approach to Difficult Text Analytics Tasks
A Systemic Artificial Intelligence (AI) Approach to Difficult Text Analytics Tasks Text Analytics World, Boston, 2013 Lars Hard, CTO Agenda Difficult text analytics tasks Feature extraction Bio-inspired
How To Use Neural Networks In Data Mining
International Journal of Electronics and Computer Science Engineering 1449 Available Online at www.ijecse.org ISSN- 2277-1956 Neural Networks in Data Mining Priyanka Gaur Department of Information and
Association rules for improving website effectiveness: case analysis
Association rules for improving website effectiveness: case analysis Maja Dimitrijević, The Higher Technical School of Professional Studies, Novi Sad, Serbia, [email protected] Tanja Krunić, The
Gerard Mc Nulty Systems Optimisation Ltd [email protected]/0876697867 BA.,B.A.I.,C.Eng.,F.I.E.I
Gerard Mc Nulty Systems Optimisation Ltd [email protected]/0876697867 BA.,B.A.I.,C.Eng.,F.I.E.I Data is Important because it: Helps in Corporate Aims Basis of Business Decisions Engineering Decisions Energy
Data Mart/Warehouse: Progress and Vision
Data Mart/Warehouse: Progress and Vision Institutional Research and Planning University Information Systems What is data warehousing? A data warehouse: is a single place that contains complete, accurate
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS Mrs. Jyoti Nawade 1, Dr. Balaji D 2, Mr. Pravin Nawade 3 1 Lecturer, JSPM S Bhivrabai Sawant Polytechnic, Pune (India) 2 Assistant
Why are Organizations Interested?
SAS Text Analytics Mary-Elizabeth ( M-E ) Eddlestone SAS Customer Loyalty [email protected] +1 (607) 256-7929 Why are Organizations Interested? Text Analytics 2009: User Perspectives on Solutions
Enhancing Quality of Data using Data Mining Method
JOURNAL OF COMPUTING, VOLUME 2, ISSUE 9, SEPTEMBER 2, ISSN 25-967 WWW.JOURNALOFCOMPUTING.ORG 9 Enhancing Quality of Data using Data Mining Method Fatemeh Ghorbanpour A., Mir M. Pedram, Kambiz Badie, Mohammad
Supply Chain Platform as a Service: a Cloud Perspective on Business Collaboration
Supply Chain Platform as a Service: a Cloud Perspective on Business Collaboration Guopeng Zhao 1, 2 and Zhiqi Shen 1 1 Nanyang Technological University, Singapore 639798 2 HP Labs Singapore, Singapore
Building Ontology Networks: How to Obtain a Particular Ontology Network Life Cycle?
See discussions, stats, and author profiles for this publication at: http://www.researchgate.net/publication/47901002 Building Ontology Networks: How to Obtain a Particular Ontology Network Life Cycle?
Data Mining Analytics for Business Intelligence and Decision Support
Data Mining Analytics for Business Intelligence and Decision Support Chid Apte, T.J. Watson Research Center, IBM Research Division Knowledge Discovery and Data Mining (KDD) techniques are used for analyzing
Machine Learning, Data Mining, and Knowledge Discovery: An Introduction
Machine Learning, Data Mining, and Knowledge Discovery: An Introduction AHPCRC Workshop - 8/17/10 - Dr. Martin Based on slides by Gregory Piatetsky-Shapiro from Kdnuggets http://www.kdnuggets.com/data_mining_course/
POLAR IT SERVICES. Business Intelligence Project Methodology
POLAR IT SERVICES Business Intelligence Project Methodology Table of Contents 1. Overview... 2 2. Visualize... 3 3. Planning and Architecture... 4 3.1 Define Requirements... 4 3.1.1 Define Attributes...
NEW TECHNIQUE TO DEAL WITH DYNAMIC DATA MINING IN THE DATABASE
www.arpapress.com/volumes/vol13issue3/ijrras_13_3_18.pdf NEW TECHNIQUE TO DEAL WITH DYNAMIC DATA MINING IN THE DATABASE Hebah H. O. Nasereddin Middle East University, P.O. Box: 144378, Code 11814, Amman-Jordan
I. INTRODUCTION NOESIS ONTOLOGIES SEMANTICS AND ANNOTATION
Noesis: A Semantic Search Engine and Resource Aggregator for Atmospheric Science Sunil Movva, Rahul Ramachandran, Xiang Li, Phani Cherukuri, Sara Graves Information Technology and Systems Center University
Data Discovery, Analytics, and the Enterprise Data Hub
Data Discovery, Analytics, and the Enterprise Data Hub Version: 101 Table of Contents Summary 3 Used Data and Limitations of Legacy Analytic Architecture 3 The Meaning of Data Discovery & Analytics 4 Machine
Enhanced Boosted Trees Technique for Customer Churn Prediction Model
IOSR Journal of Engineering (IOSRJEN) ISSN (e): 2250-3021, ISSN (p): 2278-8719 Vol. 04, Issue 03 (March. 2014), V5 PP 41-45 www.iosrjen.org Enhanced Boosted Trees Technique for Customer Churn Prediction
This software agent helps industry professionals review compliance case investigations, find resolutions, and improve decision making.
Lost in a sea of data? Facing an external audit? Or just wondering how you re going meet the challenges of the next regulatory law? When you need fast, dependable support and company-specific solutions
Santhosh John. International Journal of Information and Education Technology, Vol. 4, No. 4, August 2014
Development of an Educational Ontology for Java Programming (JLEO) with a Hybrid Methodology Derived from Conventional Software Engineering Process Models Santhosh John Abstract Semantic Web refers to
A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS
A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS Stacey Franklin Jones, D.Sc. ProTech Global Solutions Annapolis, MD Abstract The use of Social Media as a resource to characterize
Profile Based Personalized Web Search and Download Blocker
Profile Based Personalized Web Search and Download Blocker 1 K.Sheeba, 2 G.Kalaiarasi Dhanalakshmi Srinivasan College of Engineering and Technology, Mamallapuram, Chennai, Tamil nadu, India Email: 1 [email protected],
Introduction to Data Mining and Machine Learning Techniques. Iza Moise, Evangelos Pournaras, Dirk Helbing
Introduction to Data Mining and Machine Learning Techniques Iza Moise, Evangelos Pournaras, Dirk Helbing Iza Moise, Evangelos Pournaras, Dirk Helbing 1 Overview Main principles of data mining Definition
Anatomy of a Decision
[email protected] @BlueHillBoston 617.624.3600 Anatomy of a Decision BI Platform vs. Tool: Choosing Birst Over Tableau for Enterprise Business Intelligence Needs What You Need To Know The demand
Sanjeev Kumar. contribute
RESEARCH ISSUES IN DATAA MINING Sanjeev Kumar I.A.S.R.I., Library Avenue, Pusa, New Delhi-110012 [email protected] 1. Introduction The field of data mining and knowledgee discovery is emerging as a
DATA ANALYSIS USING BUSINESS INTELLIGENCE TOOL. A Thesis. Presented to the. Faculty of. San Diego State University. In Partial Fulfillment
DATA ANALYSIS USING BUSINESS INTELLIGENCE TOOL A Thesis Presented to the Faculty of San Diego State University In Partial Fulfillment of the Requirements for the Degree Master of Science in Computer Science
KNOWLEDGE GRID An Architecture for Distributed Knowledge Discovery
KNOWLEDGE GRID An Architecture for Distributed Knowledge Discovery Mario Cannataro 1 and Domenico Talia 2 1 ICAR-CNR 2 DEIS Via P. Bucci, Cubo 41-C University of Calabria 87036 Rende (CS) Via P. Bucci,
Subject Description Form
Subject Description Form Subject Code Subject Title COMP417 Data Warehousing and Data Mining Techniques in Business and Commerce Credit Value 3 Level 4 Pre-requisite / Co-requisite/ Exclusion Objectives
A Near Real-Time Personalization for ecommerce Platform Amit Rustagi [email protected]
A Near Real-Time Personalization for ecommerce Platform Amit Rustagi [email protected] Abstract. In today's competitive environment, you only have a few seconds to help site visitors understand that you
Salesforce Certified Data Architecture and Management Designer. Study Guide. Summer 16 TRAINING & CERTIFICATION
Salesforce Certified Data Architecture and Management Designer Study Guide Summer 16 Contents SECTION 1. PURPOSE OF THIS STUDY GUIDE... 2 SECTION 2. ABOUT THE SALESFORCE CERTIFIED DATA ARCHITECTURE AND
Data Isn't Everything
June 17, 2015 Innovate Forward Data Isn't Everything The Challenges of Big Data, Advanced Analytics, and Advance Computation Devices for Transportation Agencies. Using Data to Support Mission, Administration,
131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10
1/10 131-1 Adding New Level in KDD to Make the Web Usage Mining More Efficient Mohammad Ala a AL_Hamami PHD Student, Lecturer m_ah_1@yahoocom Soukaena Hassan Hashem PHD Student, Lecturer soukaena_hassan@yahoocom
SURVEY REPORT DATA SCIENCE SOCIETY 2014
SURVEY REPORT DATA SCIENCE SOCIETY 2014 TABLE OF CONTENTS Contents About the Initiative 1 Report Summary 2 Participants Info 3 Participants Expertise 6 Suggested Discussion Topics 7 Selected Responses
Oracle Real Time Decisions
A Product Review James Taylor CEO CONTENTS Introducing Decision Management Systems Oracle Real Time Decisions Product Architecture Key Features Availability Conclusion Oracle Real Time Decisions (RTD)
MULTI AGENT-BASED DISTRIBUTED DATA MINING
MULTI AGENT-BASED DISTRIBUTED DATA MINING REECHA B. PRAJAPATI 1, SUMITRA MENARIA 2 Department of Computer Science and Engineering, Parul Institute of Technology, Gujarat Technology University Abstract:
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree S.Florence 1, N.G.Bhuvaneswari Amma 2, G.Annapoorani 3, K.Malathi 4 PG Scholar, Indian Institute of Information Technology, Srirangam,
Introduction to Data Mining
Introduction to Data Mining José Hernández ndez-orallo Dpto.. de Systems Informáticos y Computación Universidad Politécnica de Valencia, Spain [email protected] Horsens, Denmark, 26th September 2005
Better planning and forecasting with IBM Predictive Analytics
IBM Software Business Analytics SPSS Predictive Analytics Better planning and forecasting with IBM Predictive Analytics Using IBM Cognos TM1 with IBM SPSS Predictive Analytics to build better plans and
Introduction. A. Bellaachia Page: 1
Introduction 1. Objectives... 3 2. What is Data Mining?... 4 3. Knowledge Discovery Process... 5 4. KD Process Example... 7 5. Typical Data Mining Architecture... 8 6. Database vs. Data Mining... 9 7.
Consumption of OData Services of Open Items Analytics Dashboard using SAP Predictive Analysis
Consumption of OData Services of Open Items Analytics Dashboard using SAP Predictive Analysis (Version 1.17) For validation Document version 0.1 7/7/2014 Contents What is SAP Predictive Analytics?... 3
[callout: no organization can afford to deny itself the power of business intelligence ]
Publication: Telephony Author: Douglas Hackney Headline: Applied Business Intelligence [callout: no organization can afford to deny itself the power of business intelligence ] [begin copy] 1 Business Intelligence
BUSINESS RULES AND GAP ANALYSIS
Leading the Evolution WHITE PAPER BUSINESS RULES AND GAP ANALYSIS Discovery and management of business rules avoids business disruptions WHITE PAPER BUSINESS RULES AND GAP ANALYSIS Business Situation More
FREQUENT PATTERN MINING FOR EFFICIENT LIBRARY MANAGEMENT
FREQUENT PATTERN MINING FOR EFFICIENT LIBRARY MANAGEMENT ANURADHA.T Assoc.prof, [email protected] SRI SAI KRISHNA.A [email protected] SATYATEJ.K [email protected] NAGA ANIL KUMAR.G
IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH
IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH Kalinka Mihaylova Kaloyanova St. Kliment Ohridski University of Sofia, Faculty of Mathematics and Informatics Sofia 1164, Bulgaria
Web-Based Genomic Information Integration with Gene Ontology
Web-Based Genomic Information Integration with Gene Ontology Kai Xu 1 IMAGEN group, National ICT Australia, Sydney, Australia, [email protected] Abstract. Despite the dramatic growth of online genomic
Federico Rajola. Customer Relationship. Management in the. Financial Industry. Organizational Processes and. Technology Innovation.
Federico Rajola Customer Relationship Management in the Financial Industry Organizational Processes and Technology Innovation Second edition ^ Springer Contents 1 Introduction 1 1.1 Identification and
Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object
Training Management System for Aircraft Engineering: indexing and retrieval of Corporate Learning Object Anne Monceaux 1, Joanna Guss 1 1 EADS-CCR, Centreda 1, 4 Avenue Didier Daurat 31700 Blagnac France
Master of Science in Health Information Technology Degree Curriculum
Master of Science in Health Information Technology Degree Curriculum Core courses: 8 courses Total Credit from Core Courses = 24 Core Courses Course Name HRS Pre-Req Choose MIS 525 or CIS 564: 1 MIS 525
SURVIVABILITY ANALYSIS OF PEDIATRIC LEUKAEMIC PATIENTS USING NEURAL NETWORK APPROACH
330 SURVIVABILITY ANALYSIS OF PEDIATRIC LEUKAEMIC PATIENTS USING NEURAL NETWORK APPROACH T. M. D.Saumya 1, T. Rupasinghe 2 and P. Abeysinghe 3 1 Department of Industrial Management, University of Kelaniya,
Decision Modeling for Dashboard Projects
Decision Modeling for Dashboard Projects How to Build a Decision Requirements Model that Drives Successful Dashboard Projects Gagan Saxena VP Consulting Decision modeling provides a formal framework to
KnowledgeSEEKER Marketing Edition
KnowledgeSEEKER Marketing Edition Predictive Analytics for Marketing The Easiest to Use Marketing Analytics Tool KnowledgeSEEKER Marketing Edition is a predictive analytics tool designed for marketers
Introduction to Data Mining
Introduction to Data Mining Jay Urbain Credits: Nazli Goharian & David Grossman @ IIT Outline Introduction Data Pre-processing Data Mining Algorithms Naïve Bayes Decision Tree Neural Network Association
Comparison of K-means and Backpropagation Data Mining Algorithms
Comparison of K-means and Backpropagation Data Mining Algorithms Nitu Mathuriya, Dr. Ashish Bansal Abstract Data mining has got more and more mature as a field of basic research in computer science and
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK A SURVEY ON BIG DATA ISSUES AMRINDER KAUR Assistant Professor, Department of Computer
CRISP-DM, which stands for Cross-Industry Standard Process for Data Mining, is an industry-proven way to guide your data mining efforts.
CRISP-DM, which stands for Cross-Industry Standard Process for Data Mining, is an industry-proven way to guide your data mining efforts. As a methodology, it includes descriptions of the typical phases
Operationalizing Data Governance through Data Policy Management
Operationalizing Data Governance through Data Policy Management Prepared for alido by: David Loshin nowledge Integrity, Inc. June, 2010 2010 nowledge Integrity, Inc. Page 1 Introduction The increasing
A Risk Management Approach to Data Preservation
A Risk Management Approach to Data Preservation Ricardo Vieira* ([email protected]) Digital Preservation Digital Preservation (DP) aims at maintaining valuable digital objects accessible over long periods
An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials
ehealth Beyond the Horizon Get IT There S.K. Andersen et al. (Eds.) IOS Press, 2008 2008 Organizing Committee of MIE 2008. All rights reserved. 3 An Ontology Based Method to Solve Query Identifier Heterogeneity
Mobile Phone APP Software Browsing Behavior using Clustering Analysis
Proceedings of the 2014 International Conference on Industrial Engineering and Operations Management Bali, Indonesia, January 7 9, 2014 Mobile Phone APP Software Browsing Behavior using Clustering Analysis
Information Security in Big Data using Encryption and Decryption
International Research Journal of Computer Science (IRJCS) ISSN: 2393-9842 Information Security in Big Data using Encryption and Decryption SHASHANK -PG Student II year MCA S.K.Saravanan, Assistant Professor
INTELLIGENT DEFECT ANALYSIS, FRAMEWORK FOR INTEGRATED DATA MANAGEMENT
INTELLIGENT DEFECT ANALYSIS, FRAMEWORK FOR INTEGRATED DATA MANAGEMENT Website: http://www.siglaz.com Abstract Spatial signature analysis (SSA) is one of the key technologies that semiconductor manufacturers
Effecting Data Quality Improvement through Data Virtualization
Effecting Data Quality Improvement through Data Virtualization Prepared for Composite Software by: David Loshin Knowledge Integrity, Inc. June, 2010 2010 Knowledge Integrity, Inc. Page 1 Introduction The
IMPROVING BUSINESS PROCESS MODELING USING RECOMMENDATION METHOD
Journal homepage: www.mjret.in ISSN:2348-6953 IMPROVING BUSINESS PROCESS MODELING USING RECOMMENDATION METHOD Deepak Ramchandara Lad 1, Soumitra S. Das 2 Computer Dept. 12 Dr. D. Y. Patil School of Engineering,(Affiliated
Towards Semantics-Enabled Distributed Infrastructure for Knowledge Acquisition
Towards Semantics-Enabled Distributed Infrastructure for Knowledge Acquisition Vasant Honavar 1 and Doina Caragea 2 1 Artificial Intelligence Research Laboratory, Department of Computer Science, Iowa State
A Meta-model of Business Interaction for Assisting Intelligent Workflow Systems
A Meta-model of Business Interaction for Assisting Intelligent Workflow Systems Areti Manataki and Yun-Heh Chen-Burger Centre for Intelligent Systems and their Applications, School of Informatics, The
ARTIFICIAL INTELLIGENCE METHODS IN EARLY MANUFACTURING TIME ESTIMATION
1 ARTIFICIAL INTELLIGENCE METHODS IN EARLY MANUFACTURING TIME ESTIMATION B. Mikó PhD, Z-Form Tool Manufacturing and Application Ltd H-1082. Budapest, Asztalos S. u 4. Tel: (1) 477 1016, e-mail: [email protected]
Data Validation with OWL Integrity Constraints
Data Validation with OWL Integrity Constraints (Extended Abstract) Evren Sirin Clark & Parsia, LLC, Washington, DC, USA [email protected] Abstract. Data validation is an important part of data integration
WHITEPAPER. Why Dependency Mapping is Critical for the Modern Data Center
WHITEPAPER Why Dependency Mapping is Critical for the Modern Data Center OVERVIEW The last decade has seen a profound shift in the way IT is delivered and consumed by organizations, triggered by new technologies
IRMAC SAS INFORMATION MANAGEMENT, TRANSFORMING AN ANALYTICS CULTURE. Copyright 2012, SAS Institute Inc. All rights reserved.
IRMAC SAS INFORMATION MANAGEMENT, TRANSFORMING AN ANALYTICS CULTURE ABOUT THE PRESENTER Marc has been with SAS for 10 years and leads the information management practice for canada. Marc s area of specialty
