Metrics-driven Framework for LOD Quality Assessment

Size: px
Start display at page:

Download "Metrics-driven Framework for LOD Quality Assessment"

Transcription

1 Metrics-driven Framework for LOD Quality Assessment Behshid Behkamal Ferdowsi University of Mashhad, Mashhad, Iran Abstract. The main objective of the Linked Open Data paradigm is to crystallize knowledge through the interlinking of already existing but dispersed data. The usefulness of the developed knowledge depends strongly on the quality of the aggregated and published data. Researchers have observed many challenges with the quality of Linked Open Data; therefore, our main objective in this thesis is to propose a metric-driven framework for evaluating the inherent quality dimensions of datasets before they are published as a viable part of the linked open data cloud. Keywords: #eswcphd2014behkamal 1 Introduction Linked Open Data (LOD) provides a distributed model for the semantic Web that allows any data provider to publish its publicly available data and meaningfully link them with other information sources over the Web. The main goal of the LOD initiative is to create knowledge by interlinking dispersed data. It is undeniable that the realization of this goal depends strongly on the quality of the published data. Therefore, quality evaluation is an important issue that must be addressed with the objective of helping data providers to evaluate their data before publishing as a dataset in the LOD cloud. In the area of data quality assessment, researchers have developed several frameworks, metrics and tools to evaluate data quality in general. For example, [1] describes subjective and objective assessments of data quality and presents three functional forms for developing objective data quality metrics. In [2], the authors have proposed a methodology for the assessment of organizational Information Quality (IQ), which consists of a questionnaire to measure IQ. In the area of the methodologies for data quality assessment, [3] provides a comparative description of existing methodologies and provides a comprehensive comparison of these methodologies. Also, the database community has developed a number of approaches such as user experience, expert judgment, sampling, parsing and cleansing techniques [4, 5] for measuring and enhancing data quality. While data quality is an important requirement for the successful organic growth of the LOD, only a very limited number of research initiatives exist, which focus on data quality for the Semantic Web and specifically for LOD. Based on our practical experience in publishing linked data [6], we have observed that many of the published

2 datasets suffer from quality issues such as syntax errors, redundant instances, and incorrect/incomplete attribute values. One of the better strategies to avoid such issues is to evaluate the quality of a dataset before it is published on the LOD cloud. This will help publishers to filter out low-quality data based on the quality assessment results, which in turn enables data consumers to make better and more informed decisions when using the shared datasets. 2 State of the art Here we primarily focus on data quality with respect to Semantic Web and LOD, but also briefly touch upon quality assessment frameworks as relevant to our work. 2.1 Information Quality (IQ) frameworks and quality models Many attempts have been made to compile and classify information quality criteria with different goals in mind. Naumann identifies three different kinds of classifications including goal-oriented models; semantic-oriented models; and processingoriented models [4]. Since we are going to extract inherent quality dimensions and customize them for LOD, we have systematically reviewed the semantics-oriented quality models and frameworks focusing on those models proposing inherent or intrinsic quality characteristics [7-10]. Given the models presented in [7] and [10] are proposed specifically for databases and data warehousing, they cannot be applied directly to our work. Only [9] investigates the quality dimensions in the context of our work which classifies the quality dimensions and criteria proposed by other researches in the LOD domain. Also, ISO 25012[8] is a general data quality model which defines quality dimensions from inherent and system dependent viewpoints. 2.2 Data Quality in the context of Semantic Web and Linked Data Despite its importance, data quality has only recently being receiving attention from the Semantic Web community. Most of related works in the context of quality assessment of LOD investigate the quality problems of the published datasets. For example, the authors of [11] have proposed a comprehensive approach that classifies quality problems of the published linked datasets and discuss common errors in RDF publishing, their consequences for applications, along with possible publisheroriented approaches to improve the quality of machine-readable and open data on the Web. In another work, Furber and Hepp propose an approach to evaluate the quality of datasets using SPARQL queries in order to identify quality problems. Using this approach, the authors identify quality problems of already available datasets such as Geonames and DBPedia [12]. There are also a number of works focusing on the quality evaluation of ontologies [13] that we have not investigated them, because our aim is the quality evaluation of datasets. Furthermore, some tools are developed for identifying common syntax errors in RDF documents in two groups of online validators, e.g. URIDebugger [14] and Va-

3 pour [15], and command line validatores, such as Jena Eyeball[16] and VRP [17]. Generally, all of these works primarily focus on data quality problems in published datasets, and none of them provides a solution for identifying the quality problems before the data is published. In this paper, we argue the importance of applying a quality model for assessing the quality of a given dataset before its publications a part of the linked open data cloud. 3 Problem Statement and Contribution Although data quality is an important issue for the successful organic growth of the Web of Data, there are only a very limited number of research initiatives that focus on data quality for the Semantic Web and specifically for the Web of Data. Based on our practical experience in publishing linked data [6], we have recognized that many of the published datasets suffer from quality deficiencies, most of which are related to inherent quality aspects of a dataset and not the context of other datasets. Thus one of the better strategies to avoid quality issues of the published datasets is to assess the quality of a dataset before release. This will help publishers to filter out low-quality data based on quality assessment results, which in turn enables data consumers to make better and more informed decisions when using shared datasets. Therefore, the objective of our work is to propose a metrics-driven framework that enables the automatic assessment of the quality of dataset before they are publicly published. For this purpose, we will explore the structural characteristics of data published in LOD cloud as well as the quality deficiencies of dataset itself. In other words, we will try to observe and clearly formulate a set of metrics that are quantitatively measurable for a given dataset. We will then try to find meaningful statistical correlations between the proposed metrics and inherent quality dimensions (that are not directly measurable) through empirical observational studies. Based on the correlations, we will create a framework that will be able to predict the inherent quality dimensions of datasets by only observing their measurable metrics. For this purpose, we have identified the characteristics of data published on the LOD cloud to extract the inherent quality dimensions and propose a set of metrics, which are quantitatively measurable for a given dataset. This way, we are able to assess inherent quality characteristics of datasets before publishing the data by observing the measured values of the relevant metrics. The novel contributions of our work can be summarized as follows: We clearly identify a set of important inherent quality characteristics for LOD datasets based on existing standard quality models and frameworks, e.g. ISO We systematically propose and validate a set of metrics for measuring the quality characteristics of datasets before they are published to the LOD cloud. We propose a quality model for LOD that considers the inherent data quality indicators of such data.

4 We introduce a novel approach for the assessment of the quality of datasets on LOD, which has its roots in measurement theory and software measurement techniques. 4 Research Methodology and Approach Our approach for data quality assessment involves the measurement of quality dimensions focusing specifically on inherent quality aspects of linked open datasets. To achieve this goal, we have applied following approach: 1. Exploratory analysis of the previous and current well-known models and frameworks on data quality and comparing dimensions and indictors of data quality presented in these models; 2. Selecting the most appropriate quality model and extracting a subset of quality dimensions that could be applied to inherent quality characteristics of LOD datasets; 3. Devising a set of metrics for assessment of selected inherent quality dimensions; 4. Theoretical validation of proposed metrics; 5. Proposing a quality models consist of selected quality dimensions and proposed metrics; 6. Implementing an automated tool for measuring the proposed metrics; 7. Empirical evaluation of the quality model by measuring the quality metrics of various dataset; 8. Developing a questionnaire for subjectively evaluation of the datasets used in the previous step; 9. Developing predictive statistical (machine learning)-based techniques to find a correlation between proposed metrics with inherent quality dimensions; 10. Applying final framework o predict the inherent quality of datasets by measuring the metrics. According to this approach, we have undertaken an exploratory analysis of the previous and current well-known models and frameworks on data quality focusing the models and Frameworks varying in their approach and application, but sharing a number of characteristics [7, 9, 10, 18-21]. We systematically review these data quality models and Frameworks focusing on those models proposing inherent or intrinsic quality [7-10]. Comparing existing dimensions and indictors of data quality presented in these models, we tried to identify the most appropriate quality dimensions that could be applied to inherent quality characteristics of LOD datasets. These inherent quality characteristics are namely completeness, semantic accuracy, syntactic accuracy, uniqueness, consistency and interlinking. In order to make the characteristics quantifiable, we define a set of metrics to measure the above six inherent quality characteristics. The employed approach for metric definition is Goal-Question-Metric(GQM) [22]. In GQM, the goals are gradually refined into several questions and each question is then refined into metrics. Also, one metric can be used to answer multiple questions. Considering the fact that only

5 few studies have been conducted which define quality metrics for LOD [5, 9, 23], we had to define the required metrics from scratch and prior work could not be reused for our purpose. We propose 32 metrics as measurement references for the inherent quality of linked open dataset to address six inherent quality dimensions. The main idea behind the design of these metrics has been comprehensiveness and simplicity. To achieve comprehensiveness, we have tried to cover as many quality deficiencies of a dataset as possible that can be identified at the time of publishing, which is the focus of our work. We have also considered as much structural characteristics of a dataset as possible. Therefore, the metrics are proposed in two main groups: quality-driven and structural. Quality-driven metric measures specific quality deficiency in a given dataset e.g. redundant instances in a dataset; while structural metrics represent a feature of any dataset presented in the RDF model, and is not related to the quality issues that might exist in those datasets, e.g. ratio of properties to classes. Furthermore, we have tried to define all of metrics in simple ratio scale. Taking into account of proposed metrics, it is understood that developing simple metrics is our secondary objective. After defining metrics, a hierarchical data quality model focusing on the inherent viewpoint is developed as shown in Figure 1. At the first level, it consists of six inherent quality dimensions, namely interlinking, uniqueness, consistency, syntactic accuracy, semantic accuracy and completeness. The quality metrics for assessing these quality dimensions are proposed at the second level of the model, some of which are used for measuring two quality dimensions. There are 32 metrics in this model, each of which are assigned by a number and defined in Table 2. Figure 1- The structure of LODQM 5 Intermediate Results An important outcome of this work is the evaluation of our assumption that a dataset within LOD can be automatically processed using metrics and evaluated based on statistical predictive models for measuring its quality before release. We expect that given the metrics values for a dataset, the developed predictive models are able to estimate/predict the values of inherent quality dimensions that are not directly measurable. Generally, the main outcomes that we have achieved are the following:

6 Identifying the inherent quality dimensions of LOD that can be assessed before publishing; Formulating a set of measurement-theoretic metrics for assessing the inherent quality of LOD; Developing a quality model for LOD by customizing ISO-25012; Achieving an innovative solution for quality assessment in the context of linked data in the early stage of publishing. In order to put proposed metrics into practice, we have implemented a tool that is able to automatically compute the metric values for any given input dataset. The code is implemented in the Java programming language (JDK 7 Update 25 x64) using Jena semantic web library. For better observation of metric behavior, different datasets from a variety of LOD domains are selected. Our codebase and selected datasets for this study are publicly accessible at [24, 25], respectively. Here, we have reported the results of our experiments over three datasets. The results of our observations over all of the datasets are reported in [26]. Table 1, presents the details of the selected datasets; and Table 2 summarizes the results of our experiments over them. Table 1- The details of the datasets used in our experiments Datasets No. of triples No. of instances No. of classes No. of properties DS1- FAO Water Areas 5, DS2- Water Economic Zones 25, DS3-Large Marine Ecosystems 6, Table 2 - The results of our experiments No. Metrics DS1 DS2 DS3 1 Missing properties values (Miss_Prp_Vlu) Out-of-range properties values (Out_Prp_Vlu) Misspelled property values (Msspl_Prp_Vlu) Undefined classes (Und_Cls) Membership of disjoint classes (Dsj_Cls) Inconsistent properties values (Inc_Prp_Vlu) Functional properties with inconsistent values (FP) Invalid usage of inverse-functional properties (IFP) Improper data types for the literals (Im_DT) Similar classes (Sml_Cls) Undefined properties (Und_Prp) Using disjoint properties (Dsj_Prp) Unused classes (Unusd_Cls) Unused properties (Unusd_Prp) Similar properties (Sml_Prp)

7 No. Metrics DS1 DS2 DS3 16 Using similar properties (Usg_Sml_Prp) Redundant triples (Rdn_Trp) Heterogeneity of data types (DT) Average missing properties (Avg_Miss_Prp_Vlu) Misusage of properties (Msusg_Prp) Misspelled classes (Msspl_Cls) Misspelled properties (Msspl_Prp) Ratio of properties to classes (Prp_Cls) Redundant instances (Rdn_Ins) Ratio of instances to classes (Ins_Cls) User-defined properties (User_Def_Prp) Misplaced classes/properties (Misplc_Cls_Prp) Object properties (Obj_Prp) Imported triples (Imp_Trp) External linking (Ext_Lnk) Connectivity of RDF graph (Gr_Cn) Intra-linking (Int_Lnk) In this step of our research, we are not able to address all of the research questions presented in Section 3. We can only answer RQ1 and RQ2. In response to RQ1, we have identified six inherent quality dimensions of linked open datasets that are presented in the first level of LODQM as shown in Figure 1. Regarding the second question (RQ2), a set of automatic measurable metrics is defined to measure six quality dimensions, as presented in Table 2. Currently, we are collecting the experts subjective perception about inherent quality dimensions to find relations between the measured values of the metrics and perceived quality. Thus, the other research questions, RQ3 and RQ4, can be addressed after completion of the work. 6 Evaluation Strategy In previous section, the results of empirical evaluation of the proposed metrics are presented. Here, we theoretically support our claim by validation of the metrics and evaluation of the quality model. Initially, the proposed metrics are validated from a measurement-theoretic perspective, and subsequently, the suitability of the proposed quality model will be discussed. Furthermore, we are going to subjectively evaluate our proposed model using expert' opinion as will be explained in the conclusion.

8 6.1 Theoretical Validation Generally, any kind of measure is a homomorphism from an empirical relational system to a numerical relational system[27]; therefore, it is imperative that measures be theoretically analyzed within the framework of measurement theory. There are two main groups of frameworks for the theoretical validation of metrics in the literature. The first group consists of frameworks directly based on measurement theory principles [28]; while the second group expresses the desirable properties of the numerical relational system that need to be satisfied by the metrics [29]. In this work, we have examined the properties of our metrics according to one of the most well-known frameworks in the latter group, namely Property-based measurement framework [29]. This framework provides five types of metrics including size, length, complexity, coupling and cohesion and offers a set of desirable properties for each of these types. Since, all of the proposed metrics are of the size type and according to [29], they are expected to exhibit three main properties, namely, non-negativity, null value and additivity. In other words, size cannot be negative (non-negativity), and it is expected to be null when a system does not contain any elements (null-value). Also, when modules of a system do not have any elements in common, we expect size to be additive (additivity).we have analyzed these three important properties for our proposed metrics and recognized that all of the metrics respect the properties required by the property-based measurement framework to form valid metric space. 6.2 Criteria Based Evaluation In this section, we evaluate our proposed quality model, LODQM, according to criteria in two dimensions of analytical criteria and practical criteria. These meta-criteria are presented in [30] to analyzes seven well-known conceptual frameworks on information quality. The analytical criteria require clear definitions of the terms used in a framework, a positioning of the framework within existing literature, and a consistent and systematic structure. The practical dimension consists of criteria which make the framework applicable, namely conciseness and the inclusion of tools that are based on the framework. To better evaluation of the our model based on these meta-criteria, we have answer the questions corresponding to meta-criteria which is proposed in [30]. Definitions: The exact definitions for all of the quality metrics and quality dimensions are presented in LODQM. Positioning: LODQM is clearly positioned within existing information quality literature in the context of LOD. Consistency: LODQM is divided into systematic dimensions that are collectively exhaustive. Since, our model has used GQM approach for metric development; there are common metrics for different quality dimensions. Conciseness: The quality model has six quality dimensions with 5-7 metrics for each of which. Tools: An automated tool is developed to measure the values of the proposed metrics for any input dataset in RDF format. Also, a questionnaire is developed

9 and will be applied to capture the experts' opinion in for subjectively evaluation of our model. According to above discussions, it is clear that the proposed quality model is a practical model for any datasets of LOD. 7 Conclusion The goal of this research is proposing a metrics-driven framework for predicting the quality of linked open datasets from an inherent point of view. To achieve this goal, we have followed an approach which is started by analysis of the well-known IQ frameworks, resulting in selection of six quality characteristics. Then, a set of metrics for assessing each of six quality dimensions are developed including quality-driven and structural. To put the proposed metrics into practice, we have implemented an automated tool and computed the metric values for various datasets from different domains of LOD. Finally, the suitability of the LODQM metrics is discussed. In the next phase of our work, we are going to investigate and analyze whether the proposed metrics can be good early indicators of inherent dimensions. Following our approach, we use questionnaire to receive experts subjective perception regarding inherent quality dimensions for all of the datasets used in this experiment to find relations between the measured values for the metrics and perceived quality by collecting the opinions of the experts in LOD domain. If the proposed metrics are shown to have meaningful correlation with the quality dimensions, then we are able to predict the inherent quality dimensions of any dataset once it is integrated into the LOD, by only observing the values of proposed metrics. The results will help publishers to filter out low-quality data, which in turn enables data consumers to make better and more informed decisions when using the shared datasets. Acknowledgement I would like to gratefully thank Prof. Mohsen Kahani for his brilliant guidance and Dr. Ebrahim Bagheri for his great advice during my work. References 1. Pipino, L.L., Lee, Y.W., Wang, R.Y.: Data quality assessment. Communications of the ACM 45, (2002) 2. Lee, Y.W., Strong, D.M., Kahn, B.K., Wang, R.Y.: AIMQ: a methodology for information quality assessment. Information & management 40, (2002) 3. Batini, C., Cappiello, C., Francalanci, C., Maurino, A.: Methodologies for data quality assessment and improvement. ACM Computing Surveys (CSUR), vol. 41, pp. 16 (2009) 4. Naumann, F., Rolker, C.: Assessment methods for information quality criteria. In: 5'th Conference on Information Quality pp (2000)

10 5. Batini, C., Scannapieca, M.: Data quality: concepts, methodologies and techniques. Springer (2006) 6. Behkamal, B., Kahani, M., Paydar, S., Dadkhah, M., Sekhavaty, E.: Publishing Persian linked data; challenges and lessons learned. In: 5th International Symposium on Telecommunications (IST), pp IEEE, (2010) 7. Helfert, M.: Managing and measuring data quality in data warehousing. In: World Multiconference on Systemics, Cybernetics and Informatics, pp (2001) 8. ISO: ISO/IEC Software engineering - Software product Quality Requirements and Evaluation (SQuaRE). Data quality model, (2008) 9. Zaveri, A., Rula, A., Maurino, A., Pietrobon, R., Lehmann, J., Auer, S., Hitzler, P.: Quality Assessment Methodologies for Linked Open Data. Submitted to Semantic Web Journal (2013) 10. Wang, R.Y., Strong, D.M., Guarascio, L.M.: Beyond accuracy: What data quality means to data consumers. Journal of Management Information Systems 12, 5-33 (1996) 11. Hogan, A., Harth, A., Passant, A., Decker, S., Polleres, A.: Weaving the pedantic web. In: 3rd International Workshop on Linked Data on the Web (LDOW2010). (2010) 12. Fürber, C., Hepp, M.: Using semantic web resources for data quality management. Knowledge Engineering and Management by the Masses, pp Springer (2010) 13. Brank, J., Grobelnik, M., Mladenić, D.: A survey of ontology evaluation techniques. (2005) 14. URI Debugger, Vapour online validator, Jena Eyeball: Command line validator, VRP: Command line validator, Dedeke, A.: A Conceptual Framework for Developing Quality Measures for Information Systems. In: 5th International Conference on Information Quality, pp (2000) 19. Naumann, F., Rolker, C.: Do Metadata Models meet IQ Requirements? In: Iternational Conference on Information Quality (IQ), pp (1999) 20. Su, Y., Jin, Z.: A Methodology for Information Quality Assessment in Data Warehousing. In: Communications, ICC'08. IEEE International Conference on, pp IEEE, (2008) 21. Moraga, C., Moraga, M., Caro, A., Calero, C.: Defining the intrinsic quality of web portal data. In: 8th International Conference on Web Information Systems and Technologies (WEBIST), pp (2012) 22. Caldiera, V.R.B.G., Rombach, H.D.: The goal question metric approach. Encyclopedia of software engineering 2, (1994) 23. Hartig, O.: Trustworthiness of data on the web. In: Proceedings of the STI Berlin & CSW PhD Workshop. Citeseer, (2008) 24. The code of metrics calculation tool Networked Ontology (NeOn) project, Behkamal, B., Kahani, M., Bagheri, E., Jeremic, Z.: A Metrics-Driven Approach for Quality Assessment of Linked Open Data. Accepted in Journal of Theoretical and Applied Electronic Commerce Research (2013) 27. Fenton, N.E., Pfleeger, S.L.: Software metrics: a rigorous and practical approach. PWS Publishing Co. (1998) 28. Poels, G., Dedene, G.: Distance-based software measurement: necessary and sufficient properties for software measures. Information and Software Technology 42, (2000)

11 29. Briand, L.C., Morasca, S., Basili, V.R.: Property-based software engineering measurement. Software Engineering, IEEE Transactions on 22, (1996) 30. Eppler, M.J., Wittig, D.: Conceptualizing information quality: A Review of Information Quality Frameworks from the Last Ten Years. In: 5th International Conference on Information Quality, pp (2000)

Quality Assessment for Linked Data: A Survey

Quality Assessment for Linked Data: A Survey Undefined 1 (2012) 1 5 1 IOS Press Quality Assessment for Linked Data: A Survey A Systematic Literature Review and Conceptual Framework Amrapali Zaveri** a,, Anisa Rula** b, Andrea Maurino b, Ricardo Pietrobon

More information

LinkZoo: A linked data platform for collaborative management of heterogeneous resources

LinkZoo: A linked data platform for collaborative management of heterogeneous resources LinkZoo: A linked data platform for collaborative management of heterogeneous resources Marios Meimaris, George Alexiou, George Papastefanatos Institute for the Management of Information Systems, Research

More information

Quality Assessment for Linked Data: A Survey

Quality Assessment for Linked Data: A Survey Semantic Web 1 (2012) 1 5 1 IOS Press Quality Assessment for Linked Data: A Survey A Systematic Literature Review and Conceptual Framework Editor(s): Pascal Hitzler, Wright State University, USA Solicited

More information

Quality Assessment Methodologies for Linked Open Data

Quality Assessment Methodologies for Linked Open Data Undefined 1 (2012) 1 5 1 IS Press Quality Assessment Methodologies for Linked pen Data A Systematic Literature Review and Conceptual Framework Amrapali Zaveri a, Anisa Rula b, Andrea Maurino b, Ricardo

More information

Root causes affecting data quality in CRM

Root causes affecting data quality in CRM MKWI 2010 Business Intelligence 1125 Root causes affecting data quality in CRM Chair of Business Informatics, Catholic University of Eichstaett-Ingolstadt 1 Introduction An important field of application

More information

Using Semantic Web Resources for Data Quality Management

Using Semantic Web Resources for Data Quality Management Using Semantic Web Resources for Data Quality Management Christian Fürber and Martin Hepp E-Business & Web Science Research Group, Werner-Heisenberg-Weg 39, 85577 Neubiberg, Germany christian@fuerber.com,

More information

META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING

META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING Ramesh Babu Palepu 1, Dr K V Sambasiva Rao 2 Dept of IT, Amrita Sai Institute of Science & Technology 1 MVR College of Engineering 2 asistithod@gmail.com

More information

DISCOVERING RESUME INFORMATION USING LINKED DATA

DISCOVERING RESUME INFORMATION USING LINKED DATA DISCOVERING RESUME INFORMATION USING LINKED DATA Ujjal Marjit 1, Kumar Sharma 2 and Utpal Biswas 3 1 C.I.R.M, University Kalyani, Kalyani (West Bengal) India sic@klyuniv.ac.in 2 Department of Computer

More information

City Data Pipeline. A System for Making Open Data Useful for Cities. stefan.bischof@tuwien.ac.at

City Data Pipeline. A System for Making Open Data Useful for Cities. stefan.bischof@tuwien.ac.at City Data Pipeline A System for Making Open Data Useful for Cities Stefan Bischof 1,2, Axel Polleres 1, and Simon Sperl 1 1 Siemens AG Österreich, Siemensstraße 90, 1211 Vienna, Austria {bischof.stefan,axel.polleres,simon.sperl}@siemens.com

More information

LDIF - Linked Data Integration Framework

LDIF - Linked Data Integration Framework LDIF - Linked Data Integration Framework Andreas Schultz 1, Andrea Matteini 2, Robert Isele 1, Christian Bizer 1, and Christian Becker 2 1. Web-based Systems Group, Freie Universität Berlin, Germany a.schultz@fu-berlin.de,

More information

daq, an Ontology for Dataset Quality Information

daq, an Ontology for Dataset Quality Information daq, an Ontology for Dataset Quality Information Jeremy Debattista University of Bonn / Fraunhofer IAIS, Germany name.surname@iaisextern.fraunhofer.de Christoph Lange University of Bonn / Fraunhofer IAIS,

More information

SPC BOARD (COMMISSIONE DI COORDINAMENTO SPC) AN OVERVIEW OF THE ITALIAN GUIDELINES FOR SEMANTIC INTEROPERABILITY THROUGH LINKED OPEN DATA

SPC BOARD (COMMISSIONE DI COORDINAMENTO SPC) AN OVERVIEW OF THE ITALIAN GUIDELINES FOR SEMANTIC INTEROPERABILITY THROUGH LINKED OPEN DATA SPC BOARD (COMMISSIONE DI COORDINAMENTO SPC) AN OVERVIEW OF THE ITALIAN GUIDELINES FOR SEMANTIC INTEROPERABILITY THROUGH LINKED OPEN DATA INDEX EXECUTIVE SUMMARY... 3 1. PREFACE... 5 1.1. Acronyms... 5

More information

Current Research Topic In Software Engineering

Current Research Topic In Software Engineering Current Research Topic In Software Engineering A PROJECT REPORT Submitted by MD. Mithun Ahamed Id: 13-96937-2 Under the guidance of DR. Dip Nandi in partial fulfillment for the award of the degre of Master

More information

Linked Open Data Infrastructure for Public Sector Information: Example from Serbia

Linked Open Data Infrastructure for Public Sector Information: Example from Serbia Proceedings of the I-SEMANTICS 2012 Posters & Demonstrations Track, pp. 26-30, 2012. Copyright 2012 for the individual papers by the papers' authors. Copying permitted only for private and academic purposes.

More information

Fraunhofer FOKUS. Fraunhofer Institute for Open Communication Systems Kaiserin-Augusta-Allee 31 10589 Berlin, Germany. www.fokus.fraunhofer.

Fraunhofer FOKUS. Fraunhofer Institute for Open Communication Systems Kaiserin-Augusta-Allee 31 10589 Berlin, Germany. www.fokus.fraunhofer. Fraunhofer Institute for Open Communication Systems Kaiserin-Augusta-Allee 31 10589 Berlin, Germany www.fokus.fraunhofer.de 1 Identification and Utilization of Components for a linked Open Data Platform

More information

Component visualization methods for large legacy software in C/C++

Component visualization methods for large legacy software in C/C++ Annales Mathematicae et Informaticae 44 (2015) pp. 23 33 http://ami.ektf.hu Component visualization methods for large legacy software in C/C++ Máté Cserép a, Dániel Krupp b a Eötvös Loránd University mcserep@caesar.elte.hu

More information

Review Protocol Agile Software Development

Review Protocol Agile Software Development Review Protocol Agile Software Development Tore Dybå 1. Background The concept of Agile Software Development has sparked a lot of interest in both industry and academia. Advocates of agile methods consider

More information

Data Quality Assessment

Data Quality Assessment Data Quality Assessment Leo L. Pipino, Yang W. Lee, and Richard Y. Wang How good is a company s data quality? Answering this question requires usable data quality metrics. Currently, most data quality

More information

Process-Family-Points

Process-Family-Points Process-Family-Points Sebastian Kiebusch 1, Bogdan Franczyk 1, and Andreas Speck 2 1 University of Leipzig, Faculty of Economics and Management, Information Systems Institute, Germany kiebusch@wifa.uni-leipzig.de,

More information

Risk Knowledge Capture in the Riskit Method

Risk Knowledge Capture in the Riskit Method Risk Knowledge Capture in the Riskit Method Jyrki Kontio and Victor R. Basili jyrki.kontio@ntc.nokia.com / basili@cs.umd.edu University of Maryland Department of Computer Science A.V.Williams Building

More information

A Knowledge Management Framework Using Business Intelligence Solutions

A Knowledge Management Framework Using Business Intelligence Solutions www.ijcsi.org 102 A Knowledge Management Framework Using Business Intelligence Solutions Marwa Gadu 1 and Prof. Dr. Nashaat El-Khameesy 2 1 Computer and Information Systems Department, Sadat Academy For

More information

Data-Gov Wiki: Towards Linked Government Data

Data-Gov Wiki: Towards Linked Government Data Data-Gov Wiki: Towards Linked Government Data Li Ding 1, Dominic DiFranzo 1, Sarah Magidson 2, Deborah L. McGuinness 1, and Jim Hendler 1 1 Tetherless World Constellation Rensselaer Polytechnic Institute

More information

Semantic Monitoring of Personal Web Activity to Support the Management of Trust and Privacy

Semantic Monitoring of Personal Web Activity to Support the Management of Trust and Privacy Semantic Monitoring of Personal Web Activity to Support the Management of Trust and Privacy Mathieu d Aquin, Salman Elahi, Enrico Motta Knowledge Media Institute, The Open University, Milton Keynes, UK

More information

TOWARD A FRAMEWORK FOR DATA QUALITY IN ELECTRONIC HEALTH RECORD

TOWARD A FRAMEWORK FOR DATA QUALITY IN ELECTRONIC HEALTH RECORD TOWARD A FRAMEWORK FOR DATA QUALITY IN ELECTRONIC HEALTH RECORD Omar Almutiry, Gary Wills and Richard Crowder School of Electronics and Computer Science, University of Southampton, Southampton, UK. {osa1a11,gbw,rmc}@ecs.soton.ac.uk

More information

Project Knowledge Management Based on Social Networks

Project Knowledge Management Based on Social Networks DOI: 10.7763/IPEDR. 2014. V70. 10 Project Knowledge Management Based on Social Networks Panos Fitsilis 1+, Vassilis Gerogiannis 1, and Leonidas Anthopoulos 1 1 Business Administration Dep., Technological

More information

DYNAMIC DATA QUALITY MANAGEMENT FRAMEWORK USING ISSUE TRACKING SYSTEMS

DYNAMIC DATA QUALITY MANAGEMENT FRAMEWORK USING ISSUE TRACKING SYSTEMS IADIS International Journal on Computer Science and Information Systems Vol. 10, No. 2, pp. 32-51 ISSN: 1646-3692 DYNAMIC DATA QUALITY MANAGEMENT FRAMEWORK USING ISSUE TRACKING Mortaza S. Bargh. Research

More information

Data Quality Managing successfully

Data Quality Managing successfully Consultants Intelligence Business White Paper Data Quality Managing successfully Data quality is essential. It is the key to acceptance of IT solutions. And: Poor data is expensive. Companies that have

More information

DOES THE EU INSURANCE MEDIATION DIRECTIVE HELP TO IMPROVE DATA QUALITY? A METRIC-BASED ANALYSIS

DOES THE EU INSURANCE MEDIATION DIRECTIVE HELP TO IMPROVE DATA QUALITY? A METRIC-BASED ANALYSIS DOES THE EU INSURANCE MEDIATION DIRECTIVE HELP TO IMPROVE DATA QUALITY? A METRIC-BASED ANALYSIS Heinrich, Bernd Kaiser, Marcus Klier, Mathias Abstract Most financial service companies view the requirements

More information

LiDDM: A Data Mining System for Linked Data

LiDDM: A Data Mining System for Linked Data LiDDM: A Data Mining System for Linked Data Venkata Narasimha Pavan Kappara Indian Institute of Information Technology Allahabad Allahabad, India kvnpavan@gmail.com Ryutaro Ichise National Institute of

More information

Linked Open Data A Way to Extract Knowledge from Global Datastores

Linked Open Data A Way to Extract Knowledge from Global Datastores Linked Open Data A Way to Extract Knowledge from Global Datastores Bebo White SLAC National Accelerator Laboratory HKU Expert Address 18 September 2014 Developments in science and information processing

More information

A Framework for Identifying and Managing Information Quality Metrics of Corporate Performance Management System

A Framework for Identifying and Managing Information Quality Metrics of Corporate Performance Management System Journal of Modern Accounting and Auditing, ISSN 1548-6583 February 2012, Vol. 8, No. 2, 185-194 D DAVID PUBLISHING A Framework for Identifying and Managing Information Quality Metrics of Corporate Performance

More information

Measuring Data Quality

Measuring Data Quality Departamento de Computación Facultad de Ciencias Exactas y Naturales Universidad de Buenos Aires INFORME TÉCNICO Measuring Data Quality Mónica Bobrowski, Martina Marré, Daniel Yankelevich Report n.: 99-002

More information

Ontology-Based Discovery of Workflow Activity Patterns

Ontology-Based Discovery of Workflow Activity Patterns Ontology-Based Discovery of Workflow Activity Patterns Diogo R. Ferreira 1, Susana Alves 1, Lucinéia H. Thom 2 1 IST Technical University of Lisbon, Portugal {diogo.ferreira,susana.alves}@ist.utl.pt 2

More information

Semantic Search in Portals using Ontologies

Semantic Search in Portals using Ontologies Semantic Search in Portals using Ontologies Wallace Anacleto Pinheiro Ana Maria de C. Moura Military Institute of Engineering - IME/RJ Department of Computer Engineering - Rio de Janeiro - Brazil [awallace,anamoura]@de9.ime.eb.br

More information

BENCHMARKING INFORMATION QUALITY PERFORMANCE IN ASSET INTENSIVE ORGANISATIONS IN THE UK

BENCHMARKING INFORMATION QUALITY PERFORMANCE IN ASSET INTENSIVE ORGANISATIONS IN THE UK BENCHMARKING INFORMATION QUALITY PERFORMANCE IN ASSET INTENSIVE ORGANISATIONS IN THE UK Philip Woodall, Ajith Kumar Parlikad and Lucas Lebrun Institute for Manufacturing, Department of Engineering, University

More information

From Databases to Natural Language: The Unusual Direction

From Databases to Natural Language: The Unusual Direction From Databases to Natural Language: The Unusual Direction Yannis Ioannidis Dept. of Informatics & Telecommunications, MaDgIK Lab University of Athens, Hellas (Greece) yannis@di.uoa.gr http://www.di.uoa.gr/

More information

Standards for Big Data in the Cloud

Standards for Big Data in the Cloud Standards for Big Data in the Cloud International Cloud Symposium 15/10/2013 Carola Carstens (Project Officer) DG CONNECT, Unit G3 Data Value Chain European Commission Outline 1) Data Value Chain Unit

More information

Lightweight Data Integration using the WebComposition Data Grid Service

Lightweight Data Integration using the WebComposition Data Grid Service Lightweight Data Integration using the WebComposition Data Grid Service Ralph Sommermeier 1, Andreas Heil 2, Martin Gaedke 1 1 Chemnitz University of Technology, Faculty of Computer Science, Distributed

More information

Collaborative Development of Knowledge Bases in Distributed Requirements Elicitation

Collaborative Development of Knowledge Bases in Distributed Requirements Elicitation Collaborative Development of Knowledge Bases in Distributed s Elicitation Steffen Lohmann 1, Thomas Riechert 2, Sören Auer 2, Jürgen Ziegler 1 1 University of Duisburg-Essen Department of Informatics and

More information

SEMANTIC-BASED AUTHORING OF TECHNICAL DOCUMENTATION

SEMANTIC-BASED AUTHORING OF TECHNICAL DOCUMENTATION SEMANTIC-BASED AUTHORING OF TECHNICAL DOCUMENTATION R Setchi, Cardiff University, UK, Setchi@cf.ac.uk N Lagos, Cardiff University, UK, LagosN@cf.ac.uk ABSTRACT Authoring of technical documentation is a

More information

SmartLink: a Web-based editor and search environment for Linked Services

SmartLink: a Web-based editor and search environment for Linked Services SmartLink: a Web-based editor and search environment for Linked Services Stefan Dietze, Hong Qing Yu, Carlos Pedrinaci, Dong Liu, John Domingue Knowledge Media Institute, The Open University, MK7 6AA,

More information

A Knowledge-Based Cohesion Metric for Object-Oriented Software

A Knowledge-Based Cohesion Metric for Object-Oriented Software A Knowledge-Based Cohesion Metric for Object-Oriented Software CARA STEIN 1 LETHA ETZKORN 2 SAMPSON GHOLSTON 3 PHILLIP FARRINGTON 3 JULIE FORTUNE 3 1 Edinboro University of Pa Department of Mathematics

More information

EXTENDED ANGEL: KNOWLEDGE-BASED APPROACH FOR LOC AND EFFORT ESTIMATION FOR MULTIMEDIA PROJECTS IN MEDICAL DOMAIN

EXTENDED ANGEL: KNOWLEDGE-BASED APPROACH FOR LOC AND EFFORT ESTIMATION FOR MULTIMEDIA PROJECTS IN MEDICAL DOMAIN EXTENDED ANGEL: KNOWLEDGE-BASED APPROACH FOR LOC AND EFFORT ESTIMATION FOR MULTIMEDIA PROJECTS IN MEDICAL DOMAIN Sridhar S Associate Professor, Department of Information Science and Technology, Anna University,

More information

Mining the Web of Linked Data with RapidMiner

Mining the Web of Linked Data with RapidMiner Mining the Web of Linked Data with RapidMiner Petar Ristoski, Christian Bizer, and Heiko Paulheim University of Mannheim, Germany Data and Web Science Group {petar.ristoski,heiko,chris}@informatik.uni-mannheim.de

More information

Software project cost estimation using AI techniques

Software project cost estimation using AI techniques Software project cost estimation using AI techniques Rodríguez Montequín, V.; Villanueva Balsera, J.; Alba González, C.; Martínez Huerta, G. Project Management Area University of Oviedo C/Independencia

More information

How To Manage Data Quality In The Semantic Web

How To Manage Data Quality In The Semantic Web Using SPARQL and SPIN for Data Quality Management on the Semantic Web Christian Fürber and Martin Hepp Universität der Bundeswehr München, E-Business & Web Science Research Group Werner-Heisenberg-Weg

More information

The FAO Geopolitical Ontology: a reference for country-based information

The FAO Geopolitical Ontology: a reference for country-based information The FAO Geopolitical Ontology: a reference for country-based information Editor(s): Name Surname, University, Country Solicited review(s): Name Surname, University, Country Open review(s): Name Surname,

More information

Linked Statistical Data Analysis

Linked Statistical Data Analysis Linked Statistical Data Analysis Sarven Capadisli 1, Sören Auer 2, Reinhard Riedl 3 1 Universität Leipzig, Institut für Informatik, AKSW, Leipzig, Germany, 2 University of Bonn and Fraunhofer IAIS, Bonn,

More information

Open Data Integration Using SPARQL and SPIN

Open Data Integration Using SPARQL and SPIN Open Data Integration Using SPARQL and SPIN A Case Study for the Tourism Domain Antonino Lo Bue, Alberto Machi ICAR-CNR Sezione di Palermo, Italy Research funded by Italian PON SmartCities Dicet-InMoto-Orchestra

More information

ISO/IEC 9126 in practice: what do we need to know?

ISO/IEC 9126 in practice: what do we need to know? ISO/IEC 9126 in practice: what do we need to know? P. Botella, X. Burgués, J.P. Carvallo, X. Franch, G. Grau, J. Marco, C. Quer Abstract ISO/IEC 9126 is currently one of the most widespread quality standards.

More information

Software Engineering Practices in Jordan

Software Engineering Practices in Jordan Software Engineering Practices in Jordan Nuha El-Khalili Faculty of Information Technology, University of Petra, Amman, Jordan nuhak@uop.edu.jo Dima Damen Faculty of Information Technology, University

More information

Protocol for the Systematic Literature Review on Web Development Resource Estimation

Protocol for the Systematic Literature Review on Web Development Resource Estimation Protocol for the Systematic Literature Review on Web Development Resource Estimation Author: Damir Azhar Supervisor: Associate Professor Emilia Mendes Table of Contents 1. Background... 4 2. Research Questions...

More information

Information Quality Assessment in Context of Business Intelligence System

Information Quality Assessment in Context of Business Intelligence System Information Quality Assessment in Context of Business Intelligence System Samuel Otero Schmidt Universidade de São Paulo, Brasil Maria Aparecida Gouvêa e-mail: magouvea@usp.br Universidade de São Paulo,

More information

A Systematic Review Process for Software Engineering

A Systematic Review Process for Software Engineering A Systematic Review Process for Software Engineering Paula Mian, Tayana Conte, Ana Natali, Jorge Biolchini and Guilherme Travassos COPPE / UFRJ Computer Science Department Cx. Postal 68.511, CEP 21945-970,

More information

The Role of Information Technology Studies in Software Product Quality Improvement

The Role of Information Technology Studies in Software Product Quality Improvement The Role of Information Technology Studies in Software Product Quality Improvement RUDITE CEVERE, Dr.sc.comp., Professor Faculty of Information Technologies SANDRA SPROGE, Dr.sc.ing., Head of Department

More information

How To Write A Drupal 5.5.2.2 Rdf Plugin For A Site Administrator To Write An Html Oracle Website In A Blog Post In A Flashdrupal.Org Blog Post

How To Write A Drupal 5.5.2.2 Rdf Plugin For A Site Administrator To Write An Html Oracle Website In A Blog Post In A Flashdrupal.Org Blog Post RDFa in Drupal: Bringing Cheese to the Web of Data Stéphane Corlosquet, Richard Cyganiak, Axel Polleres and Stefan Decker Digital Enterprise Research Institute National University of Ireland, Galway Galway,

More information

An Ontology Model for Organizing Information Resources Sharing on Personal Web

An Ontology Model for Organizing Information Resources Sharing on Personal Web An Ontology Model for Organizing Information Resources Sharing on Personal Web Istiadi 1, and Azhari SN 2 1 Department of Electrical Engineering, University of Widyagama Malang, Jalan Borobudur 35, Malang

More information

Characteristics of Computational Intelligence (Quantitative Approach)

Characteristics of Computational Intelligence (Quantitative Approach) Characteristics of Computational Intelligence (Quantitative Approach) Shiva Vafadar, Ahmad Abdollahzadeh Barfourosh Intelligent Systems Lab Computer Engineering and Information Faculty Amirkabir University

More information

Measurement Information Model

Measurement Information Model mcgarry02.qxd 9/7/01 1:27 PM Page 13 2 Information Model This chapter describes one of the fundamental measurement concepts of Practical Software, the Information Model. The Information Model provides

More information

SCOUT: A Framework for Personalized Context- Aware Mobile Applications

SCOUT: A Framework for Personalized Context- Aware Mobile Applications SCOUT: A Framework for Personalized Context- Aware Mobile Applications William Van Woensel 1, Sven Casteleyn 1,2, Olga De Troyer 1,2 1 Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussel, Belgium {William.Van.Woensel,

More information

New Generation of Social Networks Based on Semantic Web Technologies: the Importance of Social Data Portability

New Generation of Social Networks Based on Semantic Web Technologies: the Importance of Social Data Portability New Generation of Social Networks Based on Semantic Web Technologies: the Importance of Social Data Portability Liana Razmerita 1, Martynas Jusevičius 2, Rokas Firantas 2 Copenhagen Business School, Denmark

More information

Appendix B Data Quality Dimensions

Appendix B Data Quality Dimensions Appendix B Data Quality Dimensions Purpose Dimensions of data quality are fundamental to understanding how to improve data. This appendix summarizes, in chronological order of publication, three foundational

More information

discussion on Standardization Nov, 2009

discussion on Standardization Nov, 2009 discussion on data quality China National Institute of China National Institute of Standardization Nov, 2009 outline 1.Background 2.Current status of standards development 3.future work 1.1 In general

More information

Usability Metric Framework for Mobile Phone Application

Usability Metric Framework for Mobile Phone Application Usability Metric Framework for Mobile Phone Application Azham Hussain Informatics Research Institute University of Salford Greater Manchester M5 4WT United Kingdom Maria Kutar Informatics Research Institute

More information

Publishing Linked Data Requires More than Just Using a Tool

Publishing Linked Data Requires More than Just Using a Tool Publishing Linked Data Requires More than Just Using a Tool G. Atemezing 1, F. Gandon 2, G. Kepeklian 3, F. Scharffe 4, R. Troncy 1, B. Vatant 5, S. Villata 2 1 EURECOM, 2 Inria, 3 Atos Origin, 4 LIRMM,

More information

METRICS FOR MEASURING DATA QUALITY Foundations for an economic data quality management

METRICS FOR MEASURING DATA QUALITY Foundations for an economic data quality management METRICS FOR MEASURING DATA UALITY Foundations for an economic data quality management Bernd Heinrich, Marcus Kaiser, Mathias Klier Keywords: Abstract: Data uality, Data uality Management, Data uality Metrics

More information

A Semantic web approach for e-learning platforms

A Semantic web approach for e-learning platforms A Semantic web approach for e-learning platforms Miguel B. Alves 1 1 Laboratório de Sistemas de Informação, ESTG-IPVC 4900-348 Viana do Castelo. mba@estg.ipvc.pt Abstract. When lecturers publish contents

More information

Mahmoud Khraiwesh Faculty of Science and Information Technology Zarqa University Zarqa - Jordan mahmoud@zpu.edu.jo

Mahmoud Khraiwesh Faculty of Science and Information Technology Zarqa University Zarqa - Jordan mahmoud@zpu.edu.jo World of Computer Science and Information Technology Journal (WCSIT) ISSN: 2221-0741 Vol. 1, No. 2, 26-33, 2011 Validation Measures in CMMI Mahmoud Khraiwesh Faculty of Science and Information Technology

More information

Performance Evaluation of Reusable Software Components

Performance Evaluation of Reusable Software Components Performance Evaluation of Reusable Software Components Anupama Kaur 1, Himanshu Monga 2, Mnupreet Kaur 3 1 M.Tech Scholar, CSE Dept., Swami Vivekanand Institute of Engineering and Technology, Punjab, India

More information

Subjective Assessment of Data Quality considering their Interdependencies and Relevance according to the Type of Information Systems

Subjective Assessment of Data Quality considering their Interdependencies and Relevance according to the Type of Information Systems 389 Subjective Assessment of Data Quality considering their Interdependencies and Relevance according to the Type of Information Systems María del Pilar Angeles, Francisco Javier García-Ugalde Facultad

More information

Comparing Software Quality between Open Source and Closed Source Software Development. Jussi Heikkilä, Joona Hartikainen & Pramod Guruprasad

Comparing Software Quality between Open Source and Closed Source Software Development. Jussi Heikkilä, Joona Hartikainen & Pramod Guruprasad Comparing Software Quality between Open Source and Closed Source Software Development Jussi Heikkilä, Joona Hartikainen & Pramod Guruprasad Abstract 1. Introduction 2. Software quality 2.1 Definition 2.2

More information

SOFTWARE MEASUREMENT

SOFTWARE MEASUREMENT SOFTWARE MEASUREMENT SANDRO MORASCA Università dell'insubria Dipartimento di Scienze Chimiche, Fisiche e Matematiche Sede di Como Via Valleggio Como, I-00, Italy Email: morasca@uninsubria.it This article

More information

Knowledge Management System Framework and Agent-Based Tool to Support Collaborative Software Maintenance Environment

Knowledge Management System Framework and Agent-Based Tool to Support Collaborative Software Maintenance Environment Knowledge Management System Framework and Agent-Based Tool to Support Collaborative Software Maintenance Environment Mohd Zali Mohd Nor zali@itstar.com.my Rusli Abdullah rusli@fsktm.upm.edu.my hasan@fsktm.upm.edu.my

More information

María Elena Alvarado gnoss.com* elenaalvarado@gnoss.com Susana López-Sola gnoss.com* susanalopez@gnoss.com

María Elena Alvarado gnoss.com* elenaalvarado@gnoss.com Susana López-Sola gnoss.com* susanalopez@gnoss.com Linked Data based applications for Learning Analytics Research: faceted searches, enriched contexts, graph browsing and dynamic graphic visualisation of data Ricardo Alonso Maturana gnoss.com *Piqueras

More information

Enhancing DataQuality. Environments

Enhancing DataQuality. Environments Nothing is more likely to undermine the performance and business value of a data warehouse than inappropriate, misunderstood, or ignored data quality. Enhancing DataQuality in DataWarehouse Environments

More information

Automatic Timeline Construction For Computer Forensics Purposes

Automatic Timeline Construction For Computer Forensics Purposes Automatic Timeline Construction For Computer Forensics Purposes Yoan Chabot, Aurélie Bertaux, Christophe Nicolle and Tahar Kechadi CheckSem Team, Laboratoire Le2i, UMR CNRS 6306 Faculté des sciences Mirande,

More information

ONTOLOGY FOR MOBILE PHONE OPERATING SYSTEMS

ONTOLOGY FOR MOBILE PHONE OPERATING SYSTEMS ONTOLOGY FOR MOBILE PHONE OPERATING SYSTEMS Hasni Neji and Ridha Bouallegue Innov COM Lab, Higher School of Communications of Tunis, Sup Com University of Carthage, Tunis, Tunisia. Email: hasni.neji63@laposte.net;

More information

Using Provenance for Quality Assessment and Repair in Linked Open Data

Using Provenance for Quality Assessment and Repair in Linked Open Data Using Provenance for Quality Assessment and Repair in Linked Open Data Giorgos Flouris 1, Yannis Roussakis 1, María Poveda-Villalón 2, Pablo N. Mendes 3, and Irini Fundulaki 1,4 1 FORTH-ICS, Greece, 2

More information

Personalization of Web Search With Protected Privacy

Personalization of Web Search With Protected Privacy Personalization of Web Search With Protected Privacy S.S DIVYA, R.RUBINI,P.EZHIL Final year, Information Technology,KarpagaVinayaga College Engineering and Technology, Kanchipuram [D.t] Final year, Information

More information

AS software gradually becomes an important and necessary

AS software gradually becomes an important and necessary Software Testing Process Performance Improvement using Service-Based Testing Support Jithinan Sirathienchai, Peraphon Sophatsathit, and Decha Dechawatanapaisal I. INTRODUCTION AS software gradually becomes

More information

A Study on Software Metrics and Phase based Defect Removal Pattern Technique for Project Management

A Study on Software Metrics and Phase based Defect Removal Pattern Technique for Project Management International Journal of Soft Computing and Engineering (IJSCE) A Study on Software Metrics and Phase based Defect Removal Pattern Technique for Project Management Jayanthi.R, M Lilly Florence Abstract:

More information

The Ontology and Architecture for an Academic Social Network

The Ontology and Architecture for an Academic Social Network www.ijcsi.org 22 The Ontology and Architecture for an Academic Social Network Moharram Challenger Computer Engineering Department, Islamic Azad University Shabestar Branch, Shabestar, East Azerbaijan,

More information

Analysis of Software Process Metrics Using Data Mining Tool -A Rough Set Theory Approach

Analysis of Software Process Metrics Using Data Mining Tool -A Rough Set Theory Approach Analysis of Software Process Metrics Using Data Mining Tool -A Rough Set Theory Approach V.Jeyabalaraja, T.Edwin prabakaran Abstract In the software development industries tasks are optimized based on

More information

A Collaborative and Semantic Data Management Framework for Ubiquitous Computing Environment

A Collaborative and Semantic Data Management Framework for Ubiquitous Computing Environment A Collaborative and Semantic Data Management Framework for Ubiquitous Computing Environment Weisong Chen, Cho-Li Wang, and Francis C.M. Lau Department of Computer Science, The University of Hong Kong {wschen,

More information

A Review of Contemporary Data Quality Issues in Data Warehouse ETL Environment

A Review of Contemporary Data Quality Issues in Data Warehouse ETL Environment DOI: 10.15415/jotitt.2014.22021 A Review of Contemporary Data Quality Issues in Data Warehouse ETL Environment Rupali Gill 1, Jaiteg Singh 2 1 Assistant Professor, School of Computer Sciences, 2 Associate

More information

Data collection architecture for Big Data

Data collection architecture for Big Data Data collection architecture for Big Data a framework for a research agenda (Research in progress - ERP Sense Making of Big Data) Wout Hofman, May 2015, BDEI workshop 2 Big Data succes stories bias our

More information

Semantic Interoperability

Semantic Interoperability Ivan Herman Semantic Interoperability Olle Olsson Swedish W3C Office Swedish Institute of Computer Science (SICS) Stockholm Apr 27 2011 (2) Background Stockholm Apr 27, 2011 (2) Trends: from

More information

Semantic Stored Procedures Programming Environment and performance analysis

Semantic Stored Procedures Programming Environment and performance analysis Semantic Stored Procedures Programming Environment and performance analysis Marjan Efremov 1, Vladimir Zdraveski 2, Petar Ristoski 2, Dimitar Trajanov 2 1 Open Mind Solutions Skopje, bul. Kliment Ohridski

More information

Transactions on Information and Communications Technologies vol 11, 1995 WIT Press, www.witpress.com, ISSN 1743-3517

Transactions on Information and Communications Technologies vol 11, 1995 WIT Press, www.witpress.com, ISSN 1743-3517 Impact analysis of process change proposals* M. Host and C. Wohlin Department of Communication Systems, Lund University, PO Box 118, S-221 00 Lund, Sweden Abstract Before software processes are changed

More information

Data Quality Mining: Employing Classifiers for Assuring consistent Datasets

Data Quality Mining: Employing Classifiers for Assuring consistent Datasets Data Quality Mining: Employing Classifiers for Assuring consistent Datasets Fabian Grüning Carl von Ossietzky Universität Oldenburg, Germany, fabian.gruening@informatik.uni-oldenburg.de Abstract: Independent

More information

A HUMAN RESOURCE ONTOLOGY FOR RECRUITMENT PROCESS

A HUMAN RESOURCE ONTOLOGY FOR RECRUITMENT PROCESS A HUMAN RESOURCE ONTOLOGY FOR RECRUITMENT PROCESS Ionela MANIU Lucian Blaga University Sibiu, Romania Faculty of Sciences mocanionela@yahoo.com George MANIU Spiru Haret University Bucharest, Romania Faculty

More information

Quality Management of Software and Systems: Continuous Improvement Approaches

Quality Management of Software and Systems: Continuous Improvement Approaches Quality Management of Software and Systems: Continuous Improvement Approaches Contents Quality Improvement Paradigm (QIP) Experience Factory (EF) Goal Question Metric (GQM) GQM + Strategies TQM Definition

More information

GQM + Strategies in a Nutshell

GQM + Strategies in a Nutshell GQM + trategies in a Nutshell 2 Data is like garbage. You had better know what you are going to do with it before you collect it. Unknown author This chapter introduces the GQM + trategies approach for

More information

Reason-able View of Linked Data for Cultural Heritage

Reason-able View of Linked Data for Cultural Heritage Reason-able View of Linked Data for Cultural Heritage Mariana Damova 1, Dana Dannells 2 1 Ontotext, Tsarigradsko Chausse 135, Sofia 1784, Bulgaria 2 University of Gothenburg, Lennart Torstenssonsgatan

More information

THE NECESSARY SOFTWARE MEASUREMENT KNOWLEDGE IN SOFTWARE ENGINEERING EDUCATION FROM THE PRACTITIONERS POINT OF VIEW

THE NECESSARY SOFTWARE MEASUREMENT KNOWLEDGE IN SOFTWARE ENGINEERING EDUCATION FROM THE PRACTITIONERS POINT OF VIEW THE NECESSARY SOFTWARE MEASUREMENT KNOWLEDGE IN SOFTWARE ENGINEERING EDUCATION FROM THE PRACTITIONERS POINT OF VIEW Monica Villavicencio 1,2, Alain Abran 1 1 École de technologie supérieure, Montréal,

More information

An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials

An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials ehealth Beyond the Horizon Get IT There S.K. Andersen et al. (Eds.) IOS Press, 2008 2008 Organizing Committee of MIE 2008. All rights reserved. 3 An Ontology Based Method to Solve Query Identifier Heterogeneity

More information

Evaluating Data Warehousing Methodologies: Objectives and Criteria

Evaluating Data Warehousing Methodologies: Objectives and Criteria Evaluating Data Warehousing Methodologies: Objectives and Criteria by Dr. James Thomann and David L. Wells With each new technical discipline, Information Technology (IT) practitioners seek guidance for

More information

A DATA QUALITY MEASUREMENT INFORMATION MODEL

A DATA QUALITY MEASUREMENT INFORMATION MODEL A DATA QUALITY MEASUREMENT INFORMATION MODEL BASED ON ISO/IEC 15939 (Research-in-Progress) Ismael Caballero, Eugenio Verbo Department of Research & Development (Indra Software Factory, S.L.U.) Indra-UCLM

More information

A QUESTIONNAIRE-BASED DATA QUALITY METHODOLOGY

A QUESTIONNAIRE-BASED DATA QUALITY METHODOLOGY A QUESTIONNAIRE-BASED DATA QUALITY METHODOLOGY Reza Vaziri 1 and Mehran Mohsenzadeh 2 1 Department of Computer Science, Science Research Branch, Azad University of Iran, Tehran Iran rvaziri@iauctbacir

More information

A Metrics-Based Approach to Technical Documentation Quality

A Metrics-Based Approach to Technical Documentation Quality A Metrics-Based Approach to Technical Documentation Quality Anna Wingkvist, Morgan Ericsson, Rüdiger Lincke and Welf Löwe School of Computer Science, Physics, and Mathematics, Linnaeus University, Sweden

More information