A STUDY IN USER-CENTRIC DATA INTEGRATION
1 A STUDY IN USER-CENTRIC DATA INTEGRATION. Heiner Stuckenschmidt (1, 2), Jan Noessner (1), and Faraz Fallahi (3). (1) School of Business Informatics and Mathematics, University of Mannheim, Mannheim, Germany. (2) Institute for Enterprise Systems (InES), L 15, 1-6, Mannheim, Germany. (3) ontoprise GmbH, An der RaumFabrik 33a, Karlsruhe, Germany.
2 Motivation: Data integration maps different data sources to a consistent target structure. Diagram labels: Target Structure (Ontology), an encompassing, consistent view of the data; Data Integration Rules; Data Sources (direct extraction from different data sources). Jan Noessner - Lehrstuhl für künstliche Intelligenz, University of Mannheim
3 Outline: 1 Motivation; 2 Related Work; 3 User-Centric Mapping Assistant Approach; 4 Study Design and Datasets; 5 Experimental Results; 6 Conclusion and Future Work.
4 Related Work: Automatic data integration approaches are still error-prone and need to be supervised by human domain experts. The problem of data integration has been studied intensively on a technical level in different areas of computer science. Researchers have investigated the automatic identification of semantic relations between different datasets (Euzenat and Shvaiko, 2007). A prominent line of research investigates the use of ontologies (formal representations of the conceptual structure of an application domain) as a basis for both identifying and using semantic relations.
5 Related Work: Existing work in user-centric data integration has investigated rather simple scenarios. In a recent study, Gass and Maedche investigated the problem of data integration in the context of personal information management from a user-centric point of view (Gass and Maedche, 2011). The scenario addressed in their work, however, focuses on the integration of rather simple data schemas, in that case personal data, where the task is mainly to map properties describing a person (e.g. name or bank account number).
6 Related Work: Traditional user interfaces try to visualize integration rules. Most approaches are based on advanced visualization of the models to be integrated and the mappings created by the user (Granitzer et al., 2010).
7 Related Work: The drawbacks are the limits of visualization with respect to the number and complexity of integration rules. Visualizations quickly reach their limits if many integration rules exist (figure: presentation of data integration rules in AgreementMaker) or if very complex mapping rules exist, which are hard to ... (rule fragment as extracted: AND AND?_VAR0 <= 2.0) AND (?_SIID[< AND?_VAR1 >= 3.5)). A high level of expert knowledge is needed to interpret the consequences of the mapping rules.
8 Related Work: The need for user-centric data integration has been recognized. Recently, researchers in ontology and schema matching have recognized the need for user support in aligning complex conceptual models (Falconer, 2009; Falconer and Storey, 2007).
9 Related Work: The cognitive support model for data integration by Falconer and Noy (2011) emphasizes user interaction.
10 User-Centric Mapping Assistant Approach: Our modified cognitive support model is based on identifying wrong instances and asking questions in natural language. Steps shown in the diagram: user inspection and decision which concept to examine; the user identifies instances that have been classified incorrectly; a diagnostic algorithm generates the minimal number of user questions; the questions are presented to the user as natural-language sentences in a to-do list; the user answers the questions.
11 User-Centric Mapping Assistant Approach: Our interactive user interface enables users to investigate data on the instance level.
12 User-Centric Mapping Assistant Approach: In the Analysis and Decision Making phase, the user decides which concept to examine. (1) The user decides which concept to examine.
13 User-Centric Mapping Assistant Approach: In the Interaction phase, the user identifies incorrectly classified instances. (1) The user decides which concept to examine. (2) The user identifies instances that have been classified incorrectly.
14 User-Centric Mapping Assistant Approach: In the Analysis and Generation phase, the system generates the minimal number of user questions. (1) The user decides which concept to examine. (2) The user identifies instances that have been classified incorrectly. (3) A diagnostic algorithm generates the minimal number of user questions.
15 User-Centric Mapping Assistant Approach: In the Representation phase, the questions are presented to the user in natural language. (1) The user decides which concept to examine. (2) The user identifies instances that have been classified incorrectly. (3) A diagnostic algorithm generates the minimal number of user questions. (4) The questions are presented to the user as natural-language sentences in a to-do list. Example question: Is MX5_Mieta an instance of HighPerformanceCar?
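A minimal sketch of the presentation step (step 4), assuming each generated question is simply a pair of an instance and a concept. The slides do not describe the diagnostic algorithm or the internal data model, so the names `MembershipQuestion` and `build_todo_list` are hypothetical and only the sentence pattern is taken from the example question on the slide.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class MembershipQuestion:
    """Hypothetical container for one question: does `instance` belong to `concept`?"""
    instance: str
    concept: str

    def to_sentence(self) -> str:
        # Sentence pattern copied from the example question on the slide.
        return f"Is {self.instance} an instance of {self.concept}?"


def build_todo_list(questions: Iterable[MembershipQuestion]) -> List[str]:
    """Render questions as natural-language sentences, dropping exact repeats."""
    seen = set()
    todo = []
    for question in questions:
        if question not in seen:
            seen.add(question)
            todo.append(question.to_sentence())
    return todo


# Assumed input: questions as they might come out of a diagnostic step.
questions = [
    MembershipQuestion("MX5_Mieta", "HighPerformanceCar"),
    MembershipQuestion("MX5_Mieta", "HighPerformanceCar"),  # duplicate, asked once
]
todo = build_todo_list(questions)
for number, sentence in enumerate(todo, start=1):
    print(f"{number}. {sentence}")
```

Keeping the question as structured data and rendering it only at display time means the same question can be deduplicated or reordered before the user sees it.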
16 Outline: 1 Motivation; 2 Related Work; 3 User-Centric Mapping Assistant Approach; 4 Study Design and Datasets; 5 Experimental Results; 6 Conclusion and Future Work.
17 Study Design and Datasets: The source dataset is an instructional dataset from the web; the target schema was created manually. Source dataset: instructional dataset from the car-selling domain ( straccia/down load/teaching/si/2006/autos.owl). Target schema. The dataset contains: 324 data records (cars, car parts, etc.); 100 attributes (such as speed and fuel consumption); 91 concepts organized in a concept hierarchy. It is complex enough, yet small enough to be handled in a user study.
18 Study Design and Datasets: Ten integration rules were wrong and had to be identified by the subjects (dependent variable). There were two datasets containing 10 wrong integration rules each. Type 1, easy mistakes: Wheel, Engine. Type 2, complex mistakes: AirCondition, with the filter haszonenumber = 2 and hasautomatic = false, mapped to AutomaticOneZoneAirCondition. The subjects had to find as many wrong integration rules as possible. The dependent variable is the number of errors the subjects found in the respective dataset.
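To make the Type 2 example concrete, the following sketch models an integration rule as a source concept, an attribute filter, and a target concept, and applies the rule from the slide to two invented records. The `IntegrationRule` class, the record layout, and the record ids `ac_1` and `ac_2` are assumptions for illustration and not the rule language used in the study.

```python
from dataclasses import dataclass
from typing import Any, Dict, List

Record = Dict[str, Any]


@dataclass
class IntegrationRule:
    """Hypothetical rule: source concept + attribute filter -> target concept."""
    source_concept: str
    filters: Dict[str, Any]
    target_concept: str

    def matches(self, record: Record) -> bool:
        if record.get("concept") != self.source_concept:
            return False
        # Every filter attribute must equal the required value.
        return all(record.get(attr) == value for attr, value in self.filters.items())


def classify(records: List[Record], rule: IntegrationRule) -> List[str]:
    """Return the ids of all records the rule assigns to its target concept."""
    return [record["id"] for record in records if rule.matches(record)]


# The rule from the slide, given there as an example of a complex mistake.
wrong_rule = IntegrationRule(
    source_concept="AirCondition",
    filters={"haszonenumber": 2, "hasautomatic": False},
    target_concept="AutomaticOneZoneAirCondition",
)

# Invented records; attribute names follow the slide.
records = [
    {"id": "ac_1", "concept": "AirCondition", "haszonenumber": 1, "hasautomatic": True},
    {"id": "ac_2", "concept": "AirCondition", "haszonenumber": 2, "hasautomatic": False},
]

assigned = classify(records, wrong_rule)
print(wrong_rule.target_concept, "<-", assigned)
```

Here the rule assigns the manual two-zone unit `ac_2` to a concept whose name says automatic and one-zone. A wrongly classified instance of this kind is the symptom a subject can spot at the instance level without reading the rule itself.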
19 Study Design and Datasets: We compared the conventional approach with the MappingAssistant approach (independent variable).
23 Study Design and Datasets: To simulate background knowledge, the subjects were given an information sheet.
24 Study Design and Datasets: Both the order of tasks and the order of datasets were switched.
25 Study Design and Datasets: We performed the study with 22 subjects; each performed both tasks on both datasets. 6 female, 16 male; average age: 27.8 years (min = 21, max > 50); 54% of the subjects were students.
26 Experimental Results: Precision, Recall, and F-Measure.
\[ \text{Precision} = \frac{\text{number of errors correctly identified by a subject}}{\text{number of errors identified by a subject}} \]
\[ \text{Recall} = \frac{\text{number of errors correctly identified by a subject}}{\text{number of all existing errors } (10)} \]
\[ \text{F-Measure} = \frac{2 \cdot \text{Precision} \cdot \text{Recall}}{\text{Precision} + \text{Recall}} \]
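The three measures named on this slide can be computed per subject from two sets: the rules a subject reported as wrong and the rules that are actually wrong. The sketch below uses the standard definitions, with the F-measure as the harmonic mean of precision and recall. The function name `precision_recall_f` and the rule ids are hypothetical, and the reported set is made up for the arithmetic, not taken from the study.

```python
from typing import Iterable, Tuple


def precision_recall_f(identified: Iterable[str],
                       actual_errors: Iterable[str]) -> Tuple[float, float, float]:
    """Precision, recall, and F-measure for one subject's reported errors."""
    identified = set(identified)
    actual_errors = set(actual_errors)
    correct = identified & actual_errors  # errors correctly identified

    precision = len(correct) / len(identified) if identified else 0.0
    recall = len(correct) / len(actual_errors) if actual_errors else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f_measure = 2 * precision * recall / (precision + recall)
    return precision, recall, f_measure


# Ten wrong rules per dataset, as in the study; the ids are placeholders.
actual = {f"rule_{i}" for i in range(1, 11)}

# Invented subject: reports 8 rules, 6 of which are really wrong.
reported = {"rule_1", "rule_2", "rule_3", "rule_4", "rule_5", "rule_6",
            "rule_42", "rule_43"}

p, r, f = precision_recall_f(reported, actual)
print(f"precision={p:.2f} recall={r:.2f} f={f:.2f}")
```

With 6 correct out of 8 reported and 10 existing errors, precision is 6/8 = 0.75, recall is 6/10 = 0.60, and the F-measure is 2/3.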
27 Experimental Results: In the average performance of subjects, the recall was one third higher with the MappingAssistant approach.
28 Experimental Results: Comparing performance at the subject level, 91% of the subjects found more mistakes with the MappingAssistant approach.
29 Experimental Results: In the standard approach, subjects with low technical knowledge reached lower F-scores.
30 Experimental Results: In the MappingAssistant approach, the F-score reached is independent of the level of knowledge.
31 Experimental Results: User feedback is better for the MappingAssistant approach than for the standard approach. Figure panels: Task 1, Task 2.
32 Conclusion and Future Work: Conclusion. The goal of our research was to enable people with little or no technical knowledge to integrate their data. We presented a user-centric approach to data integration that is based on a cognitive support model. We presented the results of a user study demonstrating that our MappingAssistant approach empowers users to solve data integration problems more effectively and efficiently. In particular, we showed that users were able to find more errors in mapping rules in a given period of time. Further, we were able to show that, while conventional mapping technology requires a high level of expertise, the MappingAssistant approach significantly reduces the performance difference between experienced and inexperienced users.
34 Conclusion and Future Work: In future work we will focus on correcting the wrong integration rules. Diagram labels: select concept and mark wrong instance; updating the integration rule; feedback questions from the system to the user; selection of a correction suggestion; identification of the wrong integration rules; calculation of correction suggestions for the integration rule; selection of the integration rule and marking of wrong instances.
35 End. Thank you for your attention! If you have any questions, feel free to ask.
Dissertation (Ph.D. Thesis) An Incrementally Trainable Statistical Approach to Information Extraction Based on Token Classification and Rich Context Models Christian Siefkes Disputationen: 16th February
ANALYTICS IN BIG DATA ERA
ANALYTICS IN BIG DATA ERA ANALYTICS TECHNOLOGY AND ARCHITECTURE TO MANAGE VELOCITY AND VARIETY, DISCOVER RELATIONSHIPS AND CLASSIFY HUGE AMOUNT OF DATA MAURIZIO SALUSTI SAS Copyr i g ht 2012, SAS Ins titut
Intelligent interoperable application for employment exchange system using ontology
1 Webology, Volume 10, Number 2, December, 2013 Home Table of Contents Titles & Subject Index Authors Index Intelligent interoperable application for employment exchange system using ontology Kavidha Ayechetty
Graduate School of Informatics
Graduate School of Informatics Admissions Policy '( ) ' ' - Master's Degree Program Major Enrollment Capacity 40 40 Doctor's Degree Program Major Enrollment Capacity 8 1 M. Entrance examination for international
How To Be A Critical Thinker
Building Critical Thinking Skills in General Education and Career Programs Wayne County Community College District Presented By: Mary Mahoney Jo Ann Allen Nyquist College Definition of Critical Thinking
Converging Web-Data and Database Data: Big - and Small Data via Linked Data
DBKDA/WEB Panel 2014, Chamonix, 24.04.2014 DBKDA/WEB Panel 2014, Chamonix, 24.04.2014 Reutlingen University Converging Web-Data and Database Data: Big - and Small Data via Linked Data Moderation: Fritz
Design Strategies to Improve the Validity of Learning, Assessment, and Evaluation
Design Strategies to Improve the Validity of Learning, Assessment, and Evaluation Eva L. Baker UCLA/CRESST Segundo Congreso Latinoamericano de Medición y Evaluación Educacional (COLMEE) María Isabel Sheraton
Data Visualization An Outlook on Disruptive Techniques (Technical Insights)
Data Visualization An Outlook on Disruptive Techniques (Technical Insights) Comprehend Complex Data Sets through Visual Representations June 2014 Contents Section Slide Numbers Executive Summary 3 Research
Maschinelles Lernen mit MATLAB
Maschinelles Lernen mit MATLAB Jérémy Huard Applikationsingenieur The MathWorks GmbH 2015 The MathWorks, Inc. 1 Machine Learning is Everywhere Image Recognition Speech Recognition Stock Prediction Medical
Software Engineering of NLP-based Computer-assisted Coding Applications
Software Engineering of NLP-based Computer-assisted Coding Applications 1 Software Engineering of NLP-based Computer-assisted Coding Applications by Mark Morsch, MS; Carol Stoyla, BS, CLA; Ronald Sheffer,
Outline. Lecture 13: Web Usability. Top Ten Web Design Mistakes. Web Usability Principles Usability Evaluations
Lecture 13: Web Usability Outline Web Usability Principles Usability Evaluations Wendy Liu CSC309F Fall 2007 1 2 What Makes Web Application Development Hard? Target audience can be difficult to define
FUNDAMENTALS OF ARTIFICIAL INTELLIGENCE KNOWLEDGE REPRESENTATION AND NETWORKED SCHEMES
Riga Technical University Faculty of Computer Science and Information Technology Department of Systems Theory and Design FUNDAMENTALS OF ARTIFICIAL INTELLIGENCE Lecture 7 KNOWLEDGE REPRESENTATION AND NETWORKED
Blog Post Extraction Using Title Finding
Blog Post Extraction Using Title Finding Linhai Song 1, 2, Xueqi Cheng 1, Yan Guo 1, Bo Wu 1, 2, Yu Wang 1, 2 1 Institute of Computing Technology, Chinese Academy of Sciences, Beijing 2 Graduate School
Other Required Courses (14-18 hours)
1) IT Business Track Required Info Technology Courses (19 hours) 1,2&3 ITEC 2110 Digital Media 1,2&3 ITEC 3100 Intro to Networks 1,2&3 ITEC 3200 Intro to Databases 1 ITEC 3350 ECommerce 1,2&3 ITEC 3900
Big Data in Education
Big Data in Education Assessment of the New Educational Standards Markus Iseli, Deirdre Kerr, Hamid Mousavi Big Data in Education Technology is disrupting education, expanding the education ecosystem beyond
Master Thesis Proposal
Master Thesis Proposal Web Data Extraction of University Staff Competencies Edin Zildzo, 1125449 Supervisor: Ao.Univ.Prof.Dr. Jürgen Dorn Septemeber 11, 2014 1 Problem Statement Web data extraction is
A short guide to multiple choice and short answer exams
A short guide to multiple choice and short answer exams 1 A short guide to multiple choice and short answer exams www.intranet.birmingham.ac.uk/asc 2 A short guide to multiple choice and short answer exams
Extending Software Quality Models - A Sample In The Domain of Semantic Technologies
Extending Software Quality Models - A Sample In The Domain of Semantic Technologies Filip Radulovic Ontology Engineering Group Departamento de Inteligencia Artificial Facultad de Informática, Universidad
Linked Data Interface, Semantics and a T-Box Triple Store for Microsoft SharePoint
Linked Data Interface, Semantics and a T-Box Triple Store for Microsoft SharePoint Christian Fillies 1 and Frauke Weichhardt 1 1 Semtation GmbH, Geschw.-Scholl-Str. 38, 14771 Potsdam, Germany {cfillies,