A Preliminary Study of Comparative and Evaluative Questions for Business Intelligence

Size: px
Start display at page:

Download "A Preliminary Study of Comparative and Evaluative Questions for Business Intelligence"

Transcription

1 2009 Eighth International Symposium on Natural Language Processing A Preliminary Study of Comparative and Evaluative Questions for Business Intelligence Nathalie Rose T. Lim, Patrick Saint-Dizier, Brigitte Gay, and Rachel Edita Roxas answer is not directly lifted from the source text. Instead, natural language text is constructed from the results of the processing. New types of questions like comparative and evaluative questions are targeted for research as indicated in [1]. It is of interest to study comparative and evaluative expressions (in questions) because of the challenges and issues associated with processing them. These include the following aspects: 1) Multiple styles: Comparative expressions may be expressed in different ways. They can be bipredicational expressions, cross-class comparisons, degree comparisons, and these can appear in nouns, verbs, adjectives, adverbs, and even implicitly denoted. For this research, the focus will be on degree comparisons and explicit and implicit denotations from nouns (like maturity from reached maturity), verbs (like win), adjectives (like good strategy), and adverbs (like fast in evolve fast), since these types appear in actual questions raised in the domain being considered. Degree comparisons refer to the extent of applicability of a certain comparative expression or predicate. Samples of which are the predicates better and active. 2) Inferencing synonymous terms: Determining the synonyms and what these entail is an issue not specific to comparative and evaluative QA, but also to general QAs. However, in certain domains, terminologies have different or more specific semantic meaning due to the context. For example, a hub entails different things depending on the context (eg., transportation hub where hub is a location versus transaction hub where hub is a company). 3) Accessing semantic dimension: The semantic dimension being referred to here is the list of quantifiable measures, properties, and criteria that are associated with the comparative expression or predicate. For example, expensive is associated with the property of cost. 4) Determining ranges and limits for comparison: Values of identified properties or criteria to be used for comparison between objects can be taken from various source texts. However, evaluation (not comparison) of certain criteria are more complex if there is no set standard of measurement and is dependent on the object being evaluated. Using expensive as the example, the ranges of values for determining if a book is expensive is different with Abstract Comparative and evaluative question answering (QA) systems provide objective answers to questions that involve comparisons and evaluations based on a quantifiable set of criteria. As evaluations involve inferences and computations, answers are not lifted from source text. This entails the need for correct semantic interpretation of comparative expressions, converting them to quantifiable criteria before data can be obtained from source text, processing these information, and formulating natural language answers from the result of the processing. As business intelligence (BI) requires comparisons and interpretations of seemingly unrelated facts, a QA system for this domain would be beneficial. This paper presents a study of some comparative and evaluative questions that are raised in the domain of business intelligence. How these questions are processed is also discussed. I. INTRODUCTION C ONSIDER the following questions: Which European companies had the most alliances in year 2008? and Did Company X take more risk than Company Y in the past year?. The first question is an evaluative question. An evaluative question involves the computation or evaluation of at least one property or criteria. In this case, the criteria are explicitly stated in the question (i.e., most alliances which is equivalent to the number of alliances). However, in many cases, the properties involved are not explicitly stated, as in the case of the predicate take-risk in the second question. The predicate take-risk would have to be broken down to the properties such as number of transactions and the types of partners. Basis and constraints are defined by an expert in business intelligence. Evaluations and computations can be done for different objects for comparison purposes. And this comparative question is depicted in the second example question above. Thus, comparative and evaluative question-answering (QA) involves inferences in terminology, determining the properties involved for evaluation, and computation and comparison before an answer can be given. As such, the Manuscript received August 12, N. Rose T. Lim is affiliated with both De La Salle University in Manila, Philippines and Universite Paul Sabatier in Toulouse, France. She can be reached through phone: (632) ; fax: (632) ; nats.lim@delasalle.ph. P. Saint-Dizier is with IRIT, France. ( stdizier@irit.fr). B. Gay is with Groupe ESC Toulouse in France. ( b.gay@esctoulouse.fr). R. Edita Roxas is with De La Salle University Manila in the Philippines. ( rachel.roxas@delasalle.ph) /09/$ IEEE 35

2 determining if a house is expensive. The next section discusses the domain of business intelligence (BI) and types of questions that can be raised. II. BUSINESS INTELLIGENCE Business intelligence (BI) is an area in business and economy which aims at identifying trends in business and in any kind of strategic development (e.g., research themes, political orientations) from thousands of seemingly isolated facts. The globalization of markets for technology as well as fast innovation diffusion through complex networks of business relationships have created a major competitive challenge to corporate leaders. Companies and governments are experimenting with new approaches to the management of business relationships. Corporate strategies involving mergers, acquisitions, spin-offs, and a plethora of alliances are creating smaller, decentralized operational units within and across the boundaries of companies or countries. One role of BI is to help them (companies and governments) understand and master their position in global industries [2]. Software tools can be used to facilitate the analysis that they need to make. This entails that information is processed and structured to display extracted entities and semantic relationships among them. Therefore, there are at least two types of software tools that are necessary. One is a graphical tool that displays coarse-grained information, e.g., all commercial links between companies or countries. This is a kind of radiography of a situation over a certain period (generally, a year) with thousands of links between nodes representing companies. The graphs allow spatial analysis to identify business units or alliances which are larger than just companies. Evolution over a few years is often of much interest. A number of software are now able to handle this, among which is Tetralogie [3]. The other type of tool involves a more fine-grained analysis based on knowledge base constructed from news and other data. It includes determining requirements from the question, extracting the information from the knowledge base as per requirement, and processing these to derive the answer. These may be implemented through database queries (e.g., via SQL statements). However, this implies that the set of questions that can be raised are predetermined. Also, queries are far less natural and user-friendly than human language and these do not allow generation of cooperative responses. Thus, a QA system is better suited to the needs of the users in this domain. Though there are several QA systems, questions mainly focus on factoid, definitions, or lists. Comparative and evaluative questions are seldom tackled [4],[5]. For this study, the corpus is the set of news relevant to biotechnologies from years 2004 to The source information revolves around transactions between companies. Thus, answers to questions are mainly based on information extracted and processed from these news articles. Other needed information not present in the news is extracted or derived from other web sources. From a basic question to compare company transactions like: Which companies have the most number of transactions?, there could be variations and additional constraints added to it. The following subsections list the different classifications of comparisons that may be combined to form a single comparative or evaluative question. A. Spatial Scope Questions may include spatial qualification of the company or the transaction. Sample stem questions could be in (but not limited to) the form of: 1) Which companies in Asia 2) Which transactions in Europe 3) Which cities in 4) Which country 5) Which continent B. Categorial Scope Companies are categorized into public or private and products involved in the transactions fall under certain sectors. Sample stem questions could be in the form of: 1) Which <category> companies 2) Which in the <sector name> sector C. Temporal Scope In BI, the temporal aspect is crucial. It specifies the scope of the analysis to be done. Sample stem questions could be in the form of: 1) in <year>? 2) from <start> to <end>? <start> and <end> may be exact dates, but usually is indicated as the inclusive years. The <end> may also be specified as present. Finally, it is also possible that the temporal scope is implicit depending on the criteria. D. Directly Quantifiable Criteria Questions that involve computations before the answer can be discerned may involve a combination of different directly quantifiable criteria, like: 1) Number of transactions (may be all transactions in general or a specific transaction) 2) Number of partners 3) Amount involved in the transaction 4) Number of products E. Non-Directly Translatable Criteria Some adjectives may be used to encompass a series of criteria. These have different semantic meanings and interpretations depending on the domain (or even the expert). Some terms include: 1) active (as in active companies) 2) stable (as in stable partners) 3) risky (as in risky transactions) 4) innovative (as in innovative products) 5) fast (as in fast evolution)

3 In the next section, we show some related works and studies on the semantic meaning of comparatives, applications involving comparative expressions, and QA related to BI. In section 4, a discussion on how questions are processed is presented. The discussion includes details of how comparative and evaluative expressions are categorized and interpreted. Lastly, we conclude with issues that we have considered and the research directions we plan to take. III. RELATED WORKS Comparisons may be in relation to properties within the same object, degree of comparisons of the same property between different objects, or different properties of different objects [6]. The properties at stake in the comparison are embedded in the semantics of the words in the question, and possibly in the context that comes with the question. To date, there is obviously no widely available lexical resource containing an exhaustive list of comparative predicates, applied to precise terms, together with the properties involved. These can possibly be derived, to a limited extent, from existing resources like FrameNet [7] or from an ontology where relationships between concepts and terms can be mapped. However, this is tractable for very simple situations, and in most cases, identifying those properties is a major challenge. Friedman [8] presents a general approach to process comparative expressions by syntactically treating them to conform to a standard form containing the comparative operator and the clauses that are involved in the comparison. Another approach would be to automatically extract comparative relations in sentences via machine learning. In [9], the approach used is to determine whether the expression is non-equal gradable, equative, or superlative. By identifying the type of expression, the type of comparison may be determined from the semantics of the predicate and the properties of the objects through the pairability constraints. What is missing is the exploration on semantic and conceptual issues and their dependence to context, users, and domains. Olawsky [10] attempts to study the semantic context by generating a set of candidate interpretations of comparative expressions. Then, the user is prompted to choose among these to specify his intent. Some QA systems, like [11], can handle comparative expressions including cross-class comparisons on a range of different domains. However, these involve having a different backend knowledge representation system and the frontend QA system has to be customized before it can answer queries in the new domain. In addition, both of these systems only consider comparisons based on quantifiable predicates (i.e., those measurable by count, mass, or value). Also, predicates with non-directly translatable properties that are dependent on domain or context, to our knowledge, have not been explored. On the domain of BI, MUSING [12] aims to use the semantic web and combine with rule-based and statistical methods for knowledge acquisition and reasoning for providing financial analysis complying with Basel II requirements. The presentation in terms of input expected (whether these are natural language questions and whether these involve comparative expressions) and the output to be generated are undisclosed from the available documentation. IV. PROCESSING COMPARATIVE AND EVALUATIVE QUESTIONS General QA systems involve the processes of question analysis, information retrieval, answer determination, and response generation. For comparative and evaluative QA systems, the processes are redefined. The question analyzer must identify the comparative expressions in the question and decompose it into meaningful constituents, among which are those properties that will be evaluated. When predicates are decomposed into properties, then pertinent information can be extracted from sources (either already stored in database or additional information is mined from the web) and evaluation can be done in the answer determination phase. The properties and the evaluation criteria or rules are specified based on definitions given by an expert. Since the answer is not lifted from the source text, the response generator is in-charge of producing natural language text from the resulting computation and evaluation results. The succeeding subsections outline the processing of the source texts, the question, and the interpretation of a selected set of comparative and evaluative expressions. A. Processing Source Text We are considering the set of economic news in biotechnologies as our main source of information. Each news article is between 80 and 200 words long and is written in English. An excerpt of a news article (from is as follows: IDDI (INTERNATIONAL DRUG DEVELOPMENT INSTITUTE) AND CYTEL INC. TODAY REPORTED ENTERING INTO A STRATEGIC TECHNOLOGY COLLABORATION. THE COMPANIES ARE COOPERATING TO DEVELOP INTEGRATED SYSTEMS AND SERVICES FOR THE RANDOMIZATION OF TREATMENT ASSIGNMENTS FOR PATIENTS PARTICIPATING IN CLINICAL TRIALS As can be seen, sentences are long and verb forms may be quite complex and indirect. Each sentence is composed of a main predicate pred, which serves as a head, and arguments arg to the predicate are defined by their thematic roles t. The argument may be a string of words representing a noun phrase, a prepositional phrase, or a clause. ROL(s) = { t(arg, pred) t {agent, theme, patient, goal, temporal, location, abstract-pos, amount} } Moreover, there exist rhetorical relations rel between sentences s i and s j that comprise the news article. This is also used to identify which among the sentences contain relevant information.

4 REL = { rel(s i, s j ) rel { nucleus, elaboration(focus), justification, underspecified} Thus, for the given sample news article, the text is split into sentences. Let us call the first sentence S1 and the second sentence S2. Then these sentences are represented as: REL ={nucleus(s1, S2), elaboration[companies](s2, S1)} ROL(S1) = {agent( [IDDI (International Drug Development Institute) and Cytel Inc.], collaborate), temporal(today, collaborate), theme([strategic technology collaboration], collaborate) } ROL(S2) = {agent([iddi (International Drug Development Institute) and Cytel Inc.], develop), goal([integrated systems and services for patients], develop), abstract-pos([clinical trials], develop)} Notice that in S1 and S2 instead of the predicates reported (or entering ) and cooperating, the main predicate used are collaborate and develop, respectively. This is because we are only concerned with predicates that are relevant to the transactions being reported in the news. Thus, the semantic dependency is simplified to model only those needed for the conceptual representation of the news. From the semantic representation of each sentence in the news, information is extracted to fill in the typed-feature structure (which is the conceptual representation of the news). It contains the following information: News Source Date Link Transaction TransCategory TransType Date Company (1..10) ContractedItem Such that is a complex type containing the LocString, City, State, Country, and Continent. The date consists of the month, day, and year. TransCategory and TransType are transaction categories and its transaction subtype. There can be at most ten companies. Each of the company information and the contracted item are complex types defined as follows: Company ContractedItem Name Item Sector Role Indication NewEntity Stage SubsidiaryOf Worth Category Not all the information that is stored into the typed-feature structure is available from one news article. Some processing has to be done. A set of inferencing rules is developed to retrieve and store the required information. For example, the news date is indicated, but not the transaction date. In this case, the date of the news is inherited as the transaction date. On other cases, information from other web source is used. An example would be for the case of location. The unprocessed location string (LocString) actually refers to the location of the companies involved. Identifying which of the companies is located in the first location and which is located in the second can be taken from other news sources or other sources like company profile (possibly from B. Processing Questions Information from the question should be extracted for proper processing. We need to identify the type of question (question type), what we are looking for (question focus), and what the conditions are in our search (constraints). In our approach, we represent these into the following semantic representation: Q(<QUESTION TYPE>, <QUESTION FOCUS>,<BODY>) <Question Type> indicates the type of question (whether it is superlative or comparative) and its arguments. An example set of arguments for the superlative type of question would be the number of results (many or single) and search criteria. The <Question Focus> refers to what is expected as a result. The <Body> is the semantic dependency of the question defined by the main predicate and the thematic roles of its arguments. For the sample question Which companies take the most risks?, the semantic representation of the question will be the following: Q(SUPERLATIVE(MANY, HIGHEST), COMPANY, TAKE-RISK(AGENT: COMPANY)) This semantic representation is not enough to come up with the appropriate answer. We need other information to represent the basis for the evaluation. Thus, an operational representation of the question is constructed. An example format (in this case, for the superlative type of question) is: <SUPERLATIVE>(<VARIABLE>, <EVENT>, <RESPONSE>) Where <Superlative> could be highest or lowest depending on the search criteria in the semantic representation, the <Variable> is the basis of the search criteria, <Event> is the key concept determined from the semantic dependency in the question, and <Response> is the expected answer. In the above sample question, the operational representation will be: HIGHEST(RISK, TAKE-RISK(AGENT: COMPANY), COMPANY) Here, the <Event> is similar to the <Body> because takerisk is included in the identified key concepts that we can interpret. Other terms like Which companies like to make risky investments? are also mapped to the take-risk concept. To facilitate mapping of questions to the answers, we have a typed-feature representation for the question containing the following features:

5 Question Question Type Number of Results Search Criteria Question Focus Search Constraints Such that the <Question Type>, <Number of Results>, <Search Criteria>, and <Question Focus> are taken from the semantic representation, the <Search Constraints> is a complex type defined below. The <Duration> is the temporal scope of the search, while the <> and the <Transaction> are complex types, defined similar to that of the news. Search Contraints Duration DateStart DateEnd Transaction For criteria or properties that are already in the conceptual representation, these are used in the evaluation and/or comparison. For the sample question Which Asian companies have the most number of transactions in year 2008?. The company involved in the transaction should be located in Asia and the date (or year) should be Since these are search constraints indicated in the question, mapping the representation to the entries in the typed-feature representation of the news would provide a short-list of matching entries. Then with the most number of transactions, it is a matter of counting the occurrences of a certain company and comparing the values to determine the top companies. C. Complex Terms Other criteria that are non-directly quantifiable are referred to as complex terms. For these, the lexical knowledge is consulted to identify the term s interpretation into quantifiable properties. For the take-risk example, the lexical knowledge represents a company that takes risks as one which is active, has transactions every year, have alliances every year with new and unstable partners. Take-risk(c) := Active(c) TransEveryYear(c) CompAllyEveryYearAndAlwaysNewPart(c) HaveStablePartners(c) Again, the definition could consist of more key concepts or terms, which have to be evaluated first. Eventually, the key concept is broken down into values or quantifiable measure that can be extracted from the typed-feature structure of the news article. For example, the condition Active is also a key concept, defined to be a company that has above mean transactions in the duration of the search constraint. It is formally defined as: Active(c) := (c, n) CompanyTrans n NumTrans / NumTrans CompanyTrans = { (Company1.Name: c, n) n = Transact(c) } Transact(c) = { (Transaction.Company1.Name : c, Transaction.Company2.Name: c 2, Transaction.TransCategory:t, Transaction.Date.Year: y, Transaction.ContractedItem.Item:p) } NumTrans = { n (c, n) CompanyTrans} To process a comparative question like Does Company X take more risk than Company Y?, each of these entities will be tested based on the constraints. For a superlative question Which companies take the most risk?, all companies will be tested and computations will be done to generate the top entities. D. Interpretation of Comparative and Evaluative Expressions Aside from active and take risk, other comparative and evaluative expressions have been studied from questions that can be raised in BI. The expression is studied from the predicate, identifying its basic properties, then looking at the nouns that it can modify, re-evaluating the properties if there are additional constraints or different constraints. From the study, the predicates are categorized into uni-dimensional, multi-dimensional, polysemous, and underspecified. 1) Uni-dimensional predicates: Some predicates have only one sense or definition. For example, expensive. It is essentially involving a high cost. In this case, the cost is the quantifiable property that we can use to evaluate or compare entities. However, for some uni-dimensional predicates, like innovative, it is difficult to quantify. Innovative is defined as characterized by or introducing something new [13]. In this case, we can look at the effect instead, i.e., something innovative is in demand. Thus, determining if a product is innovative would depend on the number of entities having an interest in it. And an innovative company is one with an innovative product. In this particular domain, the expert formally defines this as: Innovative(c) := i, i [1, m-1] y:year (c 1, c, t, x, y, p i ) SellTransact(c,y) p i = p i+1 SellTransact(c,y) 0.7 x n CompanyTrans-Per-Year SellTransact(c, y) = { (Transaction.Company1.Name : c 1, Transaction.Company2.Name: c, Transaction.TransCategory:t, Transaction.TransType:x, Transaction.Date.Year : y, Transaction.ContractedItem.Item:p) t = buy t = alliance (x = [exclusive licensing] x = [nonexclusive licensing]) } CompanyTrans-Per-Year = { (Company1.Name: c, Transaction.Date.Year: y, n) n = Transact-Per- Year(c, y ) } 2) Multi-dimensional predicate: Taking the example of

6 take-risk, it entails different dimensions from being conservative. Different aspects would then have to be considered. In the case of BI, this could be in terms of the amount of investments, types of products invested in, the partners being taken, or the overall strategy that is being employed. 3) Polysemous predicate: Many predicates have different senses and meanings. Taking the example stable, it is defined in [13] to have three meanings, namely: firmly established, the second is steady in purpose, and the third is capability to resist motion. Being able to identify which among these senses may depend on the noun that it is associated with or with the domain in question. 4) Underspecified predicate and metonymy: Underspecification refers to a general criteria associated with the predicate, but will gain (more) context only when associated with the noun it modifies. Assuming that we consider only sense of stable as being steady in purpose, it is still underspecified because the properties associated to this meaning still depends on the context. Even within the domain of BI, the criteria for evaluating a stable company are different from a stable partner, even if the partner is also a company. This also leads to the issue of metonymy. The nouns associated to the predicate represent a class of objects that hold various properties. For example, a company can be quantified by the number of employees, the number of transactions, the types of transactions, the investments that it makes, and so on. By associating the predicate stable with company, determining which of these properties is to be used in the evaluation of steady in purpose is a challenge. In this case, the constraints are provided by an expert. A stable company is defined as one which is active, may not have alliances every year or have alliances every year but always with old partners. Stable(c) := Active(c) AllianceEveryYear(c) AllianceEveryYear(c) OnlyOncePartners(c) AllianceEveryYear(c) := y: YEAR (c, y, n) CompanyAlliance CompanyAlliance = { (Company1.Name: c, Transaction.Date.Year: y, n) n = Alliance-Per- Year(c, y ) } Alliance-Per-Year(c, y) = { (Transaction.Company1.Name : c, Transaction.Company1.Name : c2, Transaction.TransCategory:t, Transaction.Date.Year:y, Transaction.ContractedItem.Item: p ) t = alliance } OnlyOncePartners(c) := y1, y2 CompanyAlliance(c, c2, y1) CompanyAlliance(c, c2, y2 ) y1 = y2 On the other hand, a stable partner is one which has alliances every year. And, a company has stable partners when it has alliances every year and always with new partners and the partners have alliances every year. HaveStablePartners(c) := CompAllyEveryYearAndAlwaysNewPart(c) AllianceEveryYear(c2) CompAllyEveryYearAndAlwaysNewPart (c) := OnlyOncePartners(c) AllianceEveryYear(c,y) V. CONCLUSION AND DIRECTIONS Comparative and evaluative expressions in the domain of BI are complex because there are intricacies to the terminology used in BI where the criteria are predefined. As can be seen in the example predicates, expressions can be based on one criterion, can be based on multiple criteria, and/or can be underspecified. Currently, there are at least ten basic comparative and evaluative expressions in questions studied. Each of which have several variations considering aspects of polysemy, metonymy, underspecification, and multiple criteria. More comparative and evaluative expressions are yet to be explored in the context of BI. Expected form and style of answers from these questions will be taken into consideration. The research will also explore techniques to automatically determine the properties which are at stake in the evaluation and to automatically determine limits, ranges, and relative values of these properties from on-line sources, so that the technique can be portable to other domains. Evaluation will be carried out eventually. However, it is crucial first to identify evaluation metrics and processes and to which components the metrics and processes will be applied to, as the evaluation is not so straightforward. REFERENCES [1] J. Burger, et al., Issues, Tasks and Program Structures to Roadmap Research in Question & Answering, [Online]. Available: www-nlpir.nist.gov/projects/duc/papers/qa.roadmap-paper_v2.doc [2] B. Gay and B. Dousset, Innovation and Network Structural Dynamics: Study of the Alliance Network of a Major Sector of the Biotechnology Industry, Research Policy,34(10), , Management Journal, Special Issue, 213, , [3] Tetralogie. [Online]. Available: [4] M. Maybury, New Directions in Question Answering, The MIT Press, Menlo Park [5] D. Moldovan, et al., The Structure and Performance of an Open Domain Question Answering System, in Proceedings of the 38 th Meeting of the Association for Computational Linguistics (ACL), HongKong, [6] C. Kennedy, Comparatives, Semantics Of, in K. Allen (section editor) Lexical and Logical Semantics; Encyclopedia of Language and Linguistics, 2nd Edition, Elsevier, Oxford, [7] J. Ruppenhofer, et al. (2006). FrameNet II: Extended Theory and Practice. Available: [8] C. Friedman, A General Computational Treatment of the Comparative, in Proceedings of the 27 th Annual Meeting of the ACL,

7 1989. [Online]. Available: [9] N. Jindal and B. Liu, Mining Comparative Sentences and Relations, in Proceedings of the 21 st AAAI Conference on Artificial Intelligence, AAAI Press, USA, [10] D. Olawsky, The Lexical Semantics of Comparative Expressions in a Multi-level Semantic Processor, in Proceedings of the 27 th AnnualMeeting on ACL, USA, [11] B. Ballard, A General Computational Treatment of Comparatives for Natural Language Question Answering, in Proceedings of the 26 th Annual Meeting of the ACL, [Online]. Available: [12] MUSING Newsletter nr. 2 Spring [Online]. Available : spring-2009 [13] Merriam Webster Dictionary. [Online]. Available:

Numerical Data Integration for Cooperative Question-Answering

Numerical Data Integration for Cooperative Question-Answering Numerical Data Integration for Cooperative Question-Answering Véronique Moriceau Institut de Recherche en Informatique de Toulouse 118, route de Narbonne 31062 Toulouse cedex 09, France moriceau@irit.fr

More information

How the Computer Translates. Svetlana Sokolova President and CEO of PROMT, PhD.

How the Computer Translates. Svetlana Sokolova President and CEO of PROMT, PhD. Svetlana Sokolova President and CEO of PROMT, PhD. How the Computer Translates Machine translation is a special field of computer application where almost everyone believes that he/she is a specialist.

More information

International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 ISSN 2229-5518

International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 ISSN 2229-5518 International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 INTELLIGENT MULTIDIMENSIONAL DATABASE INTERFACE Mona Gharib Mohamed Reda Zahraa E. Mohamed Faculty of Science,

More information

Overview of the TACITUS Project

Overview of the TACITUS Project Overview of the TACITUS Project Jerry R. Hobbs Artificial Intelligence Center SRI International 1 Aims of the Project The specific aim of the TACITUS project is to develop interpretation processes for

More information

Natural Language to Relational Query by Using Parsing Compiler

Natural Language to Relational Query by Using Parsing Compiler Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 3, March 2015,

More information

Architecture of an Ontology-Based Domain- Specific Natural Language Question Answering System

Architecture of an Ontology-Based Domain- Specific Natural Language Question Answering System Architecture of an Ontology-Based Domain- Specific Natural Language Question Answering System Athira P. M., Sreeja M. and P. C. Reghuraj Department of Computer Science and Engineering, Government Engineering

More information

Presented to The Federal Big Data Working Group Meetup On 07 June 2014 By Chuck Rehberg, CTO Semantic Insights a Division of Trigent Software

Presented to The Federal Big Data Working Group Meetup On 07 June 2014 By Chuck Rehberg, CTO Semantic Insights a Division of Trigent Software Semantic Research using Natural Language Processing at Scale; A continued look behind the scenes of Semantic Insights Research Assistant and Research Librarian Presented to The Federal Big Data Working

More information

Universal. Event. Product. Computer. 1 warehouse.

Universal. Event. Product. Computer. 1 warehouse. Dynamic multi-dimensional models for text warehouses Maria Zamr Bleyberg, Karthik Ganesh Computing and Information Sciences Department Kansas State University, Manhattan, KS, 66506 Abstract In this paper,

More information

Overview of MT techniques. Malek Boualem (FT)

Overview of MT techniques. Malek Boualem (FT) Overview of MT techniques Malek Boualem (FT) This section presents an standard overview of general aspects related to machine translation with a description of different techniques: bilingual, transfer,

More information

Information Services for Smart Grids

Information Services for Smart Grids Smart Grid and Renewable Energy, 2009, 8 12 Published Online September 2009 (http://www.scirp.org/journal/sgre/). ABSTRACT Interconnected and integrated electrical power systems, by their very dynamic

More information

Natural Language Database Interface for the Community Based Monitoring System *

Natural Language Database Interface for the Community Based Monitoring System * Natural Language Database Interface for the Community Based Monitoring System * Krissanne Kaye Garcia, Ma. Angelica Lumain, Jose Antonio Wong, Jhovee Gerard Yap, Charibeth Cheng De La Salle University

More information

Domain Classification of Technical Terms Using the Web

Domain Classification of Technical Terms Using the Web Systems and Computers in Japan, Vol. 38, No. 14, 2007 Translated from Denshi Joho Tsushin Gakkai Ronbunshi, Vol. J89-D, No. 11, November 2006, pp. 2470 2482 Domain Classification of Technical Terms Using

More information

How To Write A Summary Of A Review

How To Write A Summary Of A Review PRODUCT REVIEW RANKING SUMMARIZATION N.P.Vadivukkarasi, Research Scholar, Department of Computer Science, Kongu Arts and Science College, Erode. Dr. B. Jayanthi M.C.A., M.Phil., Ph.D., Associate Professor,

More information

Processing data streams by relational analysis

Processing data streams by relational analysis Processing data streams by relational analysis Ilhème Ghalamallah Institut de Recherche en Informatique de Toulouse, IRIT-SIG Plan Introduction Tetralogie Proposition X-Plor Conclusion 1 In the business

More information

Domain Knowledge Extracting in a Chinese Natural Language Interface to Databases: NChiql

Domain Knowledge Extracting in a Chinese Natural Language Interface to Databases: NChiql Domain Knowledge Extracting in a Chinese Natural Language Interface to Databases: NChiql Xiaofeng Meng 1,2, Yong Zhou 1, and Shan Wang 1 1 College of Information, Renmin University of China, Beijing 100872

More information

Transaction-Typed Points TTPoints

Transaction-Typed Points TTPoints Transaction-Typed Points TTPoints version: 1.0 Technical Report RA-8/2011 Mirosław Ochodek Institute of Computing Science Poznan University of Technology Project operated within the Foundation for Polish

More information

CINTIL-PropBank. CINTIL-PropBank Sub-corpus id Sentences Tokens Domain Sentences for regression atsts 779 5,654 Test

CINTIL-PropBank. CINTIL-PropBank Sub-corpus id Sentences Tokens Domain Sentences for regression atsts 779 5,654 Test CINTIL-PropBank I. Basic Information 1.1. Corpus information The CINTIL-PropBank (Branco et al., 2012) is a set of sentences annotated with their constituency structure and semantic role tags, composed

More information

Distributed Database for Environmental Data Integration

Distributed Database for Environmental Data Integration Distributed Database for Environmental Data Integration A. Amato', V. Di Lecce2, and V. Piuri 3 II Engineering Faculty of Politecnico di Bari - Italy 2 DIASS, Politecnico di Bari, Italy 3Dept Information

More information

FACULTY OF COMPUTER SCIENCE AND INFORMATION TECHNOLOGY AUTUMN 2016 BACHELOR COURSES

FACULTY OF COMPUTER SCIENCE AND INFORMATION TECHNOLOGY AUTUMN 2016 BACHELOR COURSES FACULTY OF COMPUTER SCIENCE AND INFORMATION TECHNOLOGY Please note! This is a preliminary list of courses for the study year 2016/2017. Changes may occur! AUTUMN 2016 BACHELOR COURSES DIP217 Applied Software

More information

ONTOLOGIES A short tutorial with references to YAGO Cosmina CROITORU

ONTOLOGIES A short tutorial with references to YAGO Cosmina CROITORU ONTOLOGIES p. 1/40 ONTOLOGIES A short tutorial with references to YAGO Cosmina CROITORU Unlocking the Secrets of the Past: Text Mining for Historical Documents Blockseminar, 21.2.-11.3.2011 ONTOLOGIES

More information

DATA QUALITY DATA BASE QUALITY INFORMATION SYSTEM QUALITY

DATA QUALITY DATA BASE QUALITY INFORMATION SYSTEM QUALITY DATA QUALITY DATA BASE QUALITY INFORMATION SYSTEM QUALITY The content of those documents are the exclusive property of REVER. The aim of those documents is to provide information and should, in no case,

More information

Interactive Dynamic Information Extraction

Interactive Dynamic Information Extraction Interactive Dynamic Information Extraction Kathrin Eichler, Holmer Hemsen, Markus Löckelt, Günter Neumann, and Norbert Reithinger Deutsches Forschungszentrum für Künstliche Intelligenz - DFKI, 66123 Saarbrücken

More information

Strategic Online Advertising: Modeling Internet User Behavior with

Strategic Online Advertising: Modeling Internet User Behavior with 2 Strategic Online Advertising: Modeling Internet User Behavior with Patrick Johnston, Nicholas Kristoff, Heather McGinness, Phuong Vu, Nathaniel Wong, Jason Wright with William T. Scherer and Matthew

More information

Semantic Search in Portals using Ontologies

Semantic Search in Portals using Ontologies Semantic Search in Portals using Ontologies Wallace Anacleto Pinheiro Ana Maria de C. Moura Military Institute of Engineering - IME/RJ Department of Computer Engineering - Rio de Janeiro - Brazil [awallace,anamoura]@de9.ime.eb.br

More information

Business Definitions for Data Management Professionals

Business Definitions for Data Management Professionals Realising the value of your information TM Powered by Intraversed Business Definitions for Data Management Professionals Intralign User Guide Excerpt Copyright Intraversed Pty Ltd, 2010, 2014 W-DE-2015-0004

More information

I. INTRODUCTION NOESIS ONTOLOGIES SEMANTICS AND ANNOTATION

I. INTRODUCTION NOESIS ONTOLOGIES SEMANTICS AND ANNOTATION Noesis: A Semantic Search Engine and Resource Aggregator for Atmospheric Science Sunil Movva, Rahul Ramachandran, Xiang Li, Phani Cherukuri, Sara Graves Information Technology and Systems Center University

More information

ORGANIZATIONAL KNOWLEDGE MAPPING BASED ON LIBRARY INFORMATION SYSTEM

ORGANIZATIONAL KNOWLEDGE MAPPING BASED ON LIBRARY INFORMATION SYSTEM ORGANIZATIONAL KNOWLEDGE MAPPING BASED ON LIBRARY INFORMATION SYSTEM IRANDOC CASE STUDY Ammar Jalalimanesh a,*, Elaheh Homayounvala a a Information engineering department, Iranian Research Institute for

More information

Reverse Engineering of Relational Databases to Ontologies: An Approach Based on an Analysis of HTML Forms

Reverse Engineering of Relational Databases to Ontologies: An Approach Based on an Analysis of HTML Forms Reverse Engineering of Relational Databases to Ontologies: An Approach Based on an Analysis of HTML Forms Irina Astrova 1, Bela Stantic 2 1 Tallinn University of Technology, Ehitajate tee 5, 19086 Tallinn,

More information

A Framework of Personalized Intelligent Document and Information Management System

A Framework of Personalized Intelligent Document and Information Management System A Framework of Personalized Intelligent and Information Management System Xien Fan Department of Computer Science, College of Staten Island, City University of New York, Staten Island, NY 10314, USA Fang

More information

Business Intelligence and Decision Support Systems

Business Intelligence and Decision Support Systems Chapter 12 Business Intelligence and Decision Support Systems Information Technology For Management 7 th Edition Turban & Volonino Based on lecture slides by L. Beaubien, Providence College John Wiley

More information

Generating SQL Queries Using Natural Language Syntactic Dependencies and Metadata

Generating SQL Queries Using Natural Language Syntactic Dependencies and Metadata Generating SQL Queries Using Natural Language Syntactic Dependencies and Metadata Alessandra Giordani and Alessandro Moschitti Department of Computer Science and Engineering University of Trento Via Sommarive

More information

Paraphrasing controlled English texts

Paraphrasing controlled English texts Paraphrasing controlled English texts Kaarel Kaljurand Institute of Computational Linguistics, University of Zurich kaljurand@gmail.com Abstract. We discuss paraphrasing controlled English texts, by defining

More information

Supporting Software Development Process Using Evolution Analysis : a Brief Survey

Supporting Software Development Process Using Evolution Analysis : a Brief Survey Supporting Software Development Process Using Evolution Analysis : a Brief Survey Samaneh Bayat Department of Computing Science, University of Alberta, Edmonton, Canada samaneh@ualberta.ca Abstract During

More information

Building a Question Classifier for a TREC-Style Question Answering System

Building a Question Classifier for a TREC-Style Question Answering System Building a Question Classifier for a TREC-Style Question Answering System Richard May & Ari Steinberg Topic: Question Classification We define Question Classification (QC) here to be the task that, given

More information

3 Paraphrase Acquisition. 3.1 Overview. 2 Prior Work

3 Paraphrase Acquisition. 3.1 Overview. 2 Prior Work Unsupervised Paraphrase Acquisition via Relation Discovery Takaaki Hasegawa Cyberspace Laboratories Nippon Telegraph and Telephone Corporation 1-1 Hikarinooka, Yokosuka, Kanagawa 239-0847, Japan hasegawa.takaaki@lab.ntt.co.jp

More information

NATURAL LANGUAGE QUERY PROCESSING USING PROBABILISTIC CONTEXT FREE GRAMMAR

NATURAL LANGUAGE QUERY PROCESSING USING PROBABILISTIC CONTEXT FREE GRAMMAR NATURAL LANGUAGE QUERY PROCESSING USING PROBABILISTIC CONTEXT FREE GRAMMAR Arati K. Deshpande 1 and Prakash. R. Devale 2 1 Student and 2 Professor & Head, Department of Information Technology, Bharati

More information

Report on the Dagstuhl Seminar Data Quality on the Web

Report on the Dagstuhl Seminar Data Quality on the Web Report on the Dagstuhl Seminar Data Quality on the Web Michael Gertz M. Tamer Özsu Gunter Saake Kai-Uwe Sattler U of California at Davis, U.S.A. U of Waterloo, Canada U of Magdeburg, Germany TU Ilmenau,

More information

» A Hardware & Software Overview. Eli M. Dow <emdow@us.ibm.com:>

» A Hardware & Software Overview. Eli M. Dow <emdow@us.ibm.com:> » A Hardware & Software Overview Eli M. Dow Overview:» Hardware» Software» Questions 2011 IBM Corporation Early implementations of Watson ran on a single processor where it took 2 hours

More information

Semantic analysis of text and speech

Semantic analysis of text and speech Semantic analysis of text and speech SGN-9206 Signal processing graduate seminar II, Fall 2007 Anssi Klapuri Institute of Signal Processing, Tampere University of Technology, Finland Outline What is semantic

More information

Multilingual and Localization Support for Ontologies

Multilingual and Localization Support for Ontologies Multilingual and Localization Support for Ontologies Mauricio Espinoza, Asunción Gómez-Pérez and Elena Montiel-Ponsoda UPM, Laboratorio de Inteligencia Artificial, 28660 Boadilla del Monte, Spain {jespinoza,

More information

POSBIOTM-NER: A Machine Learning Approach for. Bio-Named Entity Recognition

POSBIOTM-NER: A Machine Learning Approach for. Bio-Named Entity Recognition POSBIOTM-NER: A Machine Learning Approach for Bio-Named Entity Recognition Yu Song, Eunji Yi, Eunju Kim, Gary Geunbae Lee, Department of CSE, POSTECH, Pohang, Korea 790-784 Soo-Jun Park Bioinformatics

More information

Modern Systems Analysis and Design

Modern Systems Analysis and Design Modern Systems Analysis and Design Prof. David Gadish Structuring System Data Requirements Learning Objectives Concisely define each of the following key data modeling terms: entity type, attribute, multivalued

More information

Clustering Technique in Data Mining for Text Documents

Clustering Technique in Data Mining for Text Documents Clustering Technique in Data Mining for Text Documents Ms.J.Sathya Priya Assistant Professor Dept Of Information Technology. Velammal Engineering College. Chennai. Ms.S.Priyadharshini Assistant Professor

More information

A Workbench for Prototyping XML Data Exchange (extended abstract)

A Workbench for Prototyping XML Data Exchange (extended abstract) A Workbench for Prototyping XML Data Exchange (extended abstract) Renzo Orsini and Augusto Celentano Università Ca Foscari di Venezia, Dipartimento di Informatica via Torino 155, 30172 Mestre (VE), Italy

More information

How To Find Influence Between Two Concepts In A Network

How To Find Influence Between Two Concepts In A Network 2014 UKSim-AMSS 16th International Conference on Computer Modelling and Simulation Influence Discovery in Semantic Networks: An Initial Approach Marcello Trovati and Ovidiu Bagdasar School of Computing

More information

Data Quality in Information Integration and Business Intelligence

Data Quality in Information Integration and Business Intelligence Data Quality in Information Integration and Business Intelligence Leopoldo Bertossi Carleton University School of Computer Science Ottawa, Canada : Faculty Fellow of the IBM Center for Advanced Studies

More information

Ontologies for Enterprise Integration

Ontologies for Enterprise Integration Ontologies for Enterprise Integration Mark S. Fox and Michael Gruninger Department of Industrial Engineering,University of Toronto, 4 Taddle Creek Road, Toronto, Ontario M5S 1A4 tel:1-416-978-6823 fax:1-416-971-1373

More information

Text Analytics. A business guide

Text Analytics. A business guide Text Analytics A business guide February 2014 Contents 3 The Business Value of Text Analytics 4 What is Text Analytics? 6 Text Analytics Methods 8 Unstructured Meets Structured Data 9 Business Application

More information

Transformation of Free-text Electronic Health Records for Efficient Information Retrieval and Support of Knowledge Discovery

Transformation of Free-text Electronic Health Records for Efficient Information Retrieval and Support of Knowledge Discovery Transformation of Free-text Electronic Health Records for Efficient Information Retrieval and Support of Knowledge Discovery Jan Paralic, Peter Smatana Technical University of Kosice, Slovakia Center for

More information

Cross-Lingual Concern Analysis from Multilingual Weblog Articles

Cross-Lingual Concern Analysis from Multilingual Weblog Articles Cross-Lingual Concern Analysis from Multilingual Weblog Articles Tomohiro Fukuhara RACE (Research into Artifacts), The University of Tokyo 5-1-5 Kashiwanoha, Kashiwa, Chiba JAPAN http://www.race.u-tokyo.ac.jp/~fukuhara/

More information

POLAR IT SERVICES. Business Intelligence Project Methodology

POLAR IT SERVICES. Business Intelligence Project Methodology POLAR IT SERVICES Business Intelligence Project Methodology Table of Contents 1. Overview... 2 2. Visualize... 3 3. Planning and Architecture... 4 3.1 Define Requirements... 4 3.1.1 Define Attributes...

More information

Customer Intentions Analysis of Twitter Based on Semantic Patterns

Customer Intentions Analysis of Twitter Based on Semantic Patterns Customer Intentions Analysis of Twitter Based on Semantic Patterns Mohamed Hamroun mohamed.hamrounn@gmail.com Mohamed Salah Gouider ms.gouider@yahoo.fr Lamjed Ben Said lamjed.bensaid@isg.rnu.tn ABSTRACT

More information

2QWRORJ\LQWHJUDWLRQLQDPXOWLOLQJXDOHUHWDLOV\VWHP

2QWRORJ\LQWHJUDWLRQLQDPXOWLOLQJXDOHUHWDLOV\VWHP 2QWRORJ\LQWHJUDWLRQLQDPXOWLOLQJXDOHUHWDLOV\VWHP 0DULD7HUHVD3$=,(1=$L$UPDQGR67(//$72L0LFKHOH9,1',*1,L $OH[DQGURV9$/$5$.26LL9DQJHOLV.$5.$/(76,6LL (i) Department of Computer Science, Systems and Management,

More information

How To Develop Software

How To Develop Software Software Engineering Prof. N.L. Sarda Computer Science & Engineering Indian Institute of Technology, Bombay Lecture-4 Overview of Phases (Part - II) We studied the problem definition phase, with which

More information

THE INTELLIGENT BUSINESS INTELLIGENCE SOLUTIONS

THE INTELLIGENT BUSINESS INTELLIGENCE SOLUTIONS THE INTELLIGENT BUSINESS INTELLIGENCE SOLUTIONS ADRIAN COJOCARIU, CRISTINA OFELIA STANCIU TIBISCUS UNIVERSITY OF TIMIŞOARA, FACULTY OF ECONOMIC SCIENCE, DALIEI STR, 1/A, TIMIŞOARA, 300558, ROMANIA ofelia.stanciu@gmail.com,

More information

TOOL OF THE INTELLIGENCE ECONOMIC: RECOGNITION FUNCTION OF REVIEWS CRITICS. Extraction and linguistic analysis of sentiments

TOOL OF THE INTELLIGENCE ECONOMIC: RECOGNITION FUNCTION OF REVIEWS CRITICS. Extraction and linguistic analysis of sentiments TOOL OF THE INTELLIGENCE ECONOMIC: RECOGNITION FUNCTION OF REVIEWS CRITICS. Extraction and linguistic analysis of sentiments Grzegorz Dziczkowski, Katarzyna Wegrzyn-Wolska Ecole Superieur d Ingenieurs

More information

An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials

An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials ehealth Beyond the Horizon Get IT There S.K. Andersen et al. (Eds.) IOS Press, 2008 2008 Organizing Committee of MIE 2008. All rights reserved. 3 An Ontology Based Method to Solve Query Identifier Heterogeneity

More information

The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge

The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge White Paper October 2002 I. Translation and Localization New Challenges Businesses are beginning to encounter

More information

Structure of the talk. The semantics of event nominalisation. Event nominalisations and verbal arguments 2

Structure of the talk. The semantics of event nominalisation. Event nominalisations and verbal arguments 2 Structure of the talk Sebastian Bücking 1 and Markus Egg 2 1 Universität Tübingen sebastian.buecking@uni-tuebingen.de 2 Rijksuniversiteit Groningen egg@let.rug.nl 12 December 2008 two challenges for a

More information

Single Level Drill Down Interactive Visualization Technique for Descriptive Data Mining Results

Single Level Drill Down Interactive Visualization Technique for Descriptive Data Mining Results , pp.33-40 http://dx.doi.org/10.14257/ijgdc.2014.7.4.04 Single Level Drill Down Interactive Visualization Technique for Descriptive Data Mining Results Muzammil Khan, Fida Hussain and Imran Khan Department

More information

2015 Workshops for Professors

2015 Workshops for Professors SAS Education Grow with us Offered by the SAS Global Academic Program Supporting teaching, learning and research in higher education 2015 Workshops for Professors 1 Workshops for Professors As the market

More information

Mining the Software Change Repository of a Legacy Telephony System

Mining the Software Change Repository of a Legacy Telephony System Mining the Software Change Repository of a Legacy Telephony System Jelber Sayyad Shirabad, Timothy C. Lethbridge, Stan Matwin School of Information Technology and Engineering University of Ottawa, Ottawa,

More information

A Review of Data Mining Techniques

A Review of Data Mining Techniques Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 4, April 2014,

More information

Busting 7 Myths about Master Data Management

Busting 7 Myths about Master Data Management Knowledge Integrity Incorporated Busting 7 Myths about Master Data Management Prepared by: David Loshin Knowledge Integrity, Inc. August, 2011 Sponsored by: 2011 Knowledge Integrity, Inc. 1 (301) 754-6350

More information

Database Marketing, Business Intelligence and Knowledge Discovery

Database Marketing, Business Intelligence and Knowledge Discovery Database Marketing, Business Intelligence and Knowledge Discovery Note: Using material from Tan / Steinbach / Kumar (2005) Introduction to Data Mining,, Addison Wesley; and Cios / Pedrycz / Swiniarski

More information

CHAPTER 1 INTRODUCTION

CHAPTER 1 INTRODUCTION 1 CHAPTER 1 INTRODUCTION Exploration is a process of discovery. In the database exploration process, an analyst executes a sequence of transformations over a collection of data structures to discover useful

More information

Sentiment Analysis on Big Data

Sentiment Analysis on Big Data SPAN White Paper!? Sentiment Analysis on Big Data Machine Learning Approach Several sources on the web provide deep insight about people s opinions on the products and services of various companies. Social

More information

Web-Based Genomic Information Integration with Gene Ontology

Web-Based Genomic Information Integration with Gene Ontology Web-Based Genomic Information Integration with Gene Ontology Kai Xu 1 IMAGEN group, National ICT Australia, Sydney, Australia, kai.xu@nicta.com.au Abstract. Despite the dramatic growth of online genomic

More information

VisionWaves : Delivering next generation BI by combining BI and PM in an Intelligent Performance Management Framework

VisionWaves : Delivering next generation BI by combining BI and PM in an Intelligent Performance Management Framework VisionWaves : Delivering next generation BI by combining BI and PM in an Intelligent Performance Management Framework VisionWaves Bergweg 173 3707 AC Zeist T 030 6981010 F 030 6914967 2010 VisionWaves

More information

Ontology quality and fitness: A survey of so6ware support

Ontology quality and fitness: A survey of so6ware support Ontology quality and fitness: A survey of so6ware support Ontology Summit February 14, 2013 Michael Denny msdenny@mitre.org Survey consideraion: CasIng evaluaion factors as capabiliies At this juncture,

More information

Conceptual Schema Approach to Natural Language Database Access

Conceptual Schema Approach to Natural Language Database Access Conceptual Schema Approach to Natural Language Database Access In-Su Kang, Seung-Hoon Na, Jong-Hyeok Lee Div. of Electrical and Computer Engineering Pohang University of Science and Technology (POSTECH)

More information

Using NLP and Ontologies for Notary Document Management Systems

Using NLP and Ontologies for Notary Document Management Systems Outline Using NLP and Ontologies for Notary Document Management Systems Flora Amato, Antonino Mazzeo, Antonio Penta and Antonio Picariello Dipartimento di Informatica e Sistemistica Universitá di Napoli

More information

Selection of Optimal Discount of Retail Assortments with Data Mining Approach

Selection of Optimal Discount of Retail Assortments with Data Mining Approach Available online at www.interscience.in Selection of Optimal Discount of Retail Assortments with Data Mining Approach Padmalatha Eddla, Ravinder Reddy, Mamatha Computer Science Department,CBIT, Gandipet,Hyderabad,A.P,India.

More information

Mathematics Cognitive Domains Framework: TIMSS 2003 Developmental Project Fourth and Eighth Grades

Mathematics Cognitive Domains Framework: TIMSS 2003 Developmental Project Fourth and Eighth Grades Appendix A Mathematics Cognitive Domains Framework: TIMSS 2003 Developmental Project Fourth and Eighth Grades To respond correctly to TIMSS test items, students need to be familiar with the mathematics

More information

CSC 342 Semester I: 1425-1426H (2004-2005 G)

CSC 342 Semester I: 1425-1426H (2004-2005 G) CSC 342 Semester I: 1425-1426H (2004-2005 G) Software Engineering Systems Analysis: Requirements Structuring Context & DFDs. Instructor: Dr. Ghazy Assassa Software Engineering CSC 342/Dr. Ghazy Assassa

More information

The compositional semantics of same

The compositional semantics of same The compositional semantics of same Mike Solomon Amherst College Abstract Barker (2007) proposes the first strictly compositional semantic analysis of internal same. I show that Barker s analysis fails

More information

The Book of Grammar Lesson Six. Mr. McBride AP Language and Composition

The Book of Grammar Lesson Six. Mr. McBride AP Language and Composition The Book of Grammar Lesson Six Mr. McBride AP Language and Composition Table of Contents Lesson One: Prepositions and Prepositional Phrases Lesson Two: The Function of Nouns in a Sentence Lesson Three:

More information

TRENDS IN THE DEVELOPMENT OF BUSINESS INTELLIGENCE SYSTEMS

TRENDS IN THE DEVELOPMENT OF BUSINESS INTELLIGENCE SYSTEMS 9 8 TRENDS IN THE DEVELOPMENT OF BUSINESS INTELLIGENCE SYSTEMS Assist. Prof. Latinka Todoranova Econ Lit C 810 Information technology is a highly dynamic field of research. As part of it, business intelligence

More information

DATA QUALITY AND SCALE IN CONTEXT OF EUROPEAN SPATIAL DATA HARMONISATION

DATA QUALITY AND SCALE IN CONTEXT OF EUROPEAN SPATIAL DATA HARMONISATION DATA QUALITY AND SCALE IN CONTEXT OF EUROPEAN SPATIAL DATA HARMONISATION Katalin Tóth, Vanda Nunes de Lima European Commission Joint Research Centre, Ispra, Italy ABSTRACT The proposal for the INSPIRE

More information

Implementation of hybrid software architecture for Artificial Intelligence System

Implementation of hybrid software architecture for Artificial Intelligence System IJCSNS International Journal of Computer Science and Network Security, VOL.7 No.1, January 2007 35 Implementation of hybrid software architecture for Artificial Intelligence System B.Vinayagasundaram and

More information

ADVANCED GEOGRAPHIC INFORMATION SYSTEMS Vol. II - Using Ontologies for Geographic Information Intergration Frederico Torres Fonseca

ADVANCED GEOGRAPHIC INFORMATION SYSTEMS Vol. II - Using Ontologies for Geographic Information Intergration Frederico Torres Fonseca USING ONTOLOGIES FOR GEOGRAPHIC INFORMATION INTEGRATION Frederico Torres Fonseca The Pennsylvania State University, USA Keywords: ontologies, GIS, geographic information integration, interoperability Contents

More information

Chapter 6. Data-Flow Diagrams

Chapter 6. Data-Flow Diagrams Chapter 6. Data-Flow Diagrams Table of Contents Objectives... 1 Introduction to data-flow diagrams... 2 What are data-flow diagrams?... 2 An example data-flow diagram... 2 The benefits of data-flow diagrams...

More information

Clustering Connectionist and Statistical Language Processing

Clustering Connectionist and Statistical Language Processing Clustering Connectionist and Statistical Language Processing Frank Keller keller@coli.uni-sb.de Computerlinguistik Universität des Saarlandes Clustering p.1/21 Overview clustering vs. classification supervised

More information

Draft Martin Doerr ICS-FORTH, Heraklion, Crete Oct 4, 2001

Draft Martin Doerr ICS-FORTH, Heraklion, Crete Oct 4, 2001 A comparison of the OpenGIS TM Abstract Specification with the CIDOC CRM 3.2 Draft Martin Doerr ICS-FORTH, Heraklion, Crete Oct 4, 2001 1 Introduction This Mapping has the purpose to identify, if the OpenGIS

More information

A + dvancer College Readiness Online Alignment to Florida PERT

A + dvancer College Readiness Online Alignment to Florida PERT A + dvancer College Readiness Online Alignment to Florida PERT Area Objective ID Topic Subject Activity Mathematics Math MPRC1 Equations: Solve linear in one variable College Readiness-Arithmetic Solving

More information

SPATIAL DATA CLASSIFICATION AND DATA MINING

SPATIAL DATA CLASSIFICATION AND DATA MINING , pp.-40-44. Available online at http://www. bioinfo. in/contents. php?id=42 SPATIAL DATA CLASSIFICATION AND DATA MINING RATHI J.B. * AND PATIL A.D. Department of Computer Science & Engineering, Jawaharlal

More information

Performance Evaluation Techniques for an Automatic Question Answering System

Performance Evaluation Techniques for an Automatic Question Answering System Performance Evaluation Techniques for an Automatic Question Answering System Tilani Gunawardena, Nishara Pathirana, Medhavi Lokuhetti, Roshan Ragel, and Sampath Deegalla Abstract Automatic question answering

More information

Analyzing survey text: a brief overview

Analyzing survey text: a brief overview IBM SPSS Text Analytics for Surveys Analyzing survey text: a brief overview Learn how gives you greater insight Contents 1 Introduction 2 The role of text in survey research 2 Approaches to text mining

More information

Developing a Theory-Based Ontology for Best Practices Knowledge Bases

Developing a Theory-Based Ontology for Best Practices Knowledge Bases Developing a Theory-Based Ontology for Best Practices Knowledge Bases Daniel E. O Leary University of Southern California 3660 Trousdale Parkway Los Angeles, CA 90089-0441 oleary@usc.edu Abstract Knowledge

More information

Making Business Intelligence Easy. Whitepaper Measuring data quality for successful Master Data Management

Making Business Intelligence Easy. Whitepaper Measuring data quality for successful Master Data Management Making Business Intelligence Easy Whitepaper Measuring data quality for successful Master Data Management Contents Overview... 3 What is Master Data Management?... 3 Master Data Modeling Approaches...

More information

Modelling and Implementing a Knowledge Base for Checking Medical Invoices with DLV

Modelling and Implementing a Knowledge Base for Checking Medical Invoices with DLV Modelling and Implementing a Knowledge Base for Checking Medical Invoices with DLV Christoph Beierle 1, Oliver Dusso 1, Gabriele Kern-Isberner 2 1 Dept. of Computer Science, FernUniversität in Hagen, 58084

More information

Requirements Ontology and Multi representation Strategy for Database Schema Evolution 1

Requirements Ontology and Multi representation Strategy for Database Schema Evolution 1 Requirements Ontology and Multi representation Strategy for Database Schema Evolution 1 Hassina Bounif, Stefano Spaccapietra, Rachel Pottinger Database Laboratory, EPFL, School of Computer and Communication

More information

Université du Québec à Montréal. Financial Services Logical Data Model for Social Economy based on Universal Data Models. Project

Université du Québec à Montréal. Financial Services Logical Data Model for Social Economy based on Universal Data Models. Project Université du Québec à Montréal Financial Services Logical Data Model for Social Economy based on Universal Data Models Project In partial fulfillment of the requirements for the degree of Master in Software

More information

A Platform for Supporting Data Analytics on Twitter: Challenges and Objectives 1

A Platform for Supporting Data Analytics on Twitter: Challenges and Objectives 1 A Platform for Supporting Data Analytics on Twitter: Challenges and Objectives 1 Yannis Stavrakas Vassilis Plachouras IMIS / RC ATHENA Athens, Greece {yannis, vplachouras}@imis.athena-innovation.gr Abstract.

More information

Measurement and Metrics Fundamentals. SE 350 Software Process & Product Quality

Measurement and Metrics Fundamentals. SE 350 Software Process & Product Quality Measurement and Metrics Fundamentals Lecture Objectives Provide some basic concepts of metrics Quality attribute metrics and measurements Reliability, validity, error Correlation and causation Discuss

More information

2. MOTIVATING SCENARIOS 1. INTRODUCTION

2. MOTIVATING SCENARIOS 1. INTRODUCTION Multiple Dimensions of Concern in Software Testing Stanley M. Sutton, Jr. EC Cubed, Inc. 15 River Road, Suite 310 Wilton, Connecticut 06897 ssutton@eccubed.com 1. INTRODUCTION Software testing is an area

More information

Semantic Transformation of Web Services

Semantic Transformation of Web Services Semantic Transformation of Web Services David Bell, Sergio de Cesare, and Mark Lycett Brunel University, Uxbridge, Middlesex UB8 3PH, United Kingdom {david.bell, sergio.decesare, mark.lycett}@brunel.ac.uk

More information

SQLMutation: A tool to generate mutants of SQL database queries

SQLMutation: A tool to generate mutants of SQL database queries SQLMutation: A tool to generate mutants of SQL database queries Javier Tuya, Mª José Suárez-Cabal, Claudio de la Riva University of Oviedo (SPAIN) {tuya cabal claudio} @ uniovi.es Abstract We present a

More information

From Databases to Natural Language: The Unusual Direction

From Databases to Natural Language: The Unusual Direction From Databases to Natural Language: The Unusual Direction Yannis Ioannidis Dept. of Informatics & Telecommunications, MaDgIK Lab University of Athens, Hellas (Greece) yannis@di.uoa.gr http://www.di.uoa.gr/

More information

INF5820 Natural Language Processing - NLP. H2009 Jan Tore Lønning jtl@ifi.uio.no

INF5820 Natural Language Processing - NLP. H2009 Jan Tore Lønning jtl@ifi.uio.no INF5820 Natural Language Processing - NLP H2009 Jan Tore Lønning jtl@ifi.uio.no Semantic Role Labeling INF5830 Lecture 13 Nov 4, 2009 Today Some words about semantics Thematic/semantic roles PropBank &

More information