Natural Language Processing and Information Systems



Similar documents
Visualization of Large and Unstructured Data Sets

Testing of Component-Based Systems and Software Quality

PRIMIUM Process Innovation for Enterprise Software

IT-Incident Management & IT-Forensics

Information Systems Technology and its Applications

How To Write A Paper On The Social Semantic Web

Information Systems Technology and its Applications, 4 th. International Conference

OMER Object-oriented Modeling of Embedded Real-Time Systems

German Conference on Bioinformatics 2004

18th IEEE Conference on Business Informatics Call for Papers

Tax investigations and dispute resolution your contacts

International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November ISSN

Foreign Taxes Paid and Foreign Source Income INTECH Global Income Managed Volatility Fund

MANDATORY PROVIDENT FUND SCHEMES AUTHORITY

ARCS 2004 Organic and Pervasive Computing

Converging Web-Data and Database Data: Big - and Small Data via Linked Data

Prof. Dr. D. W. Cunningham, Berliner Strasse 35A, Cottbus, Germany

UNIVERSITY TOP 50 BY SUBJECTS a) Arts and Humanities Universities

Collaboration between Business Schools and Enterprises. Professor Chris Styles Associate Dean, Executive Education

Hong Kong s Health Spending 1989 to 2033

Grant Agreement N Updated Project Web Portal. Eric Schmieders (UniDue) Deliverable #PO-SoE Version: 1.0.

The worst is over for online brokerage

GS1 Industry Engagement Call to Action

Honorary Fellow of the Amsterdam School of Communication Research (ASCoR), University of Amsterdam, The Netherlands

ADVANCED GEOGRAPHIC INFORMATION SYSTEMS Vol. II - Using Ontologies for Geographic Information Intergration Frederico Torres Fonseca

International Organization for Standardization TC 215 Health Informatics. Audrey Dickerson, RN MS ISO/TC 215 Secretary

GLOBAL B2C E-COMMERCE DELIVERY 2015

Thermo Scientific Compound Discoverer Software. A New Generation. of integrated solutions for small molecule structure ID

World Consumer Income and Expenditure Patterns

HL7 AROUND THE WORLD

GLOBAL EDUCATION PROGRAM (GEP)

Wat verwacht de hybride consument van de verschillende distributiesystemen? Jan Verlinden Insurance Leader Belgium Capgemini

TOWARDS PUBLIC PROCUREMENT KEY PERFORMANCE INDICATORS. Paulo Magina Public Sector Integrity Division

An Ontology Based Method to Solve Query Identifier Heterogeneity in Post- Genomic Clinical Trials

Application of ontologies for the integration of network monitoring platforms

GLOBAL EDUCATION PROGRAM

SUPPLEMENTAL INFORMATION 4.10 STUDENTS PARTICIPATING OF INTERNATIONAL ACADEMIC EXCHANGE PROGRAMS

SunGard Best Practice Guide

PhD in Strategic Management, College of Management, Georgia Institute of Technology, 2008

Full professor and 6 assistant professors of IT A new school in IT Cameroon

E-Seminar. Financial Management Internet Business Solution Seminar

Updating the QIAcube operating software

Global Real Estate Outlook

ELPUB Digital Library v2.0. Application of semantic web technologies

Business Plan Calls Tariff The Choice for Business Telecoms

Enterprise Modelling. and Information Systems Architectures (EMISA 2013)

Invited Talk: THE MISREPRESENTATION OF DIGITAL TEENS AS TROLLS: CONSIDERING POLITICAL, NEWS AND FEMINIST AGENDAS"

10th International Trade Fair for Cine Equipment and Technology September 2014 MOC Munich, Germany

Politics & International Relations. ERASMUS and International Exchanges

MERCER S COMPENSATION ANALYSIS AND REVIEW SYSTEM AN ONLINE TOOL DESIGNED TO TAKE THE WORK OUT OF YOUR COMPENSATION REVIEW PROCESS

LuxeMbOurG Trading CenTre LisT Annex 1 to the s pecial terms and conditions for securities transactions Valid as from 1 January 2011

LuxeMbOurG Trading CenT re LisT Annex 1 to the special terms and conditions for securities transactions Valid as from 1 september 2011

An Overview of the Applications of Natural Language to Information Systems

The Lawson Customer Relationship Management Solution

USAGE OF METRICS AND ANALYTICS IN EMEA MOVING UP THE MATURITY CURVE

New Frontiers of Automated Content Analysis in the Social Sciences

Global AML Resource Map Over 2000 AML professionals

Guide. Axis Webinar User Guide

How To Use Networked Ontology In E Health

Morningstar is shareholders in

Delegation in human resource management

9th International Conference on Terminology and Artificial Intelligence

Retirement Readiness. OECD/IOPS GLOBAL FORUM ON PRIVATE PENSIONS - Sydney Nov 2-3

Proceedings of the International Workshop on Semantic Technologies meet Recommender Systems & Big Data SeRSy 2012

Guide. Axis Webinar. User guide

STATE OF GLOBAL E-COMMERCE REPORT (Preview) February 2013

Triple-play subscriptions to rocket to 400 mil.

Bournemouth University Global Partnerships

Global Pricing Study 2011: "Weak pricing cuts profits by 25%" Short summary

BT Connect Networks that think

European Master In Nuclear Fusion Science and Engineering Physics

ASAP implementation approach for SAP ERP implementation has five major phases as shown in below picture. Fit and Gap Analysis (FGA) is very critical

Schedule R Teleconferencing Service

E-Seminar. E-Commerce Internet Business Solution Seminar

WELCOME! Introduction. Celebrating. &PrimeRevenue. PrimeRevenue Hong Kong PrimeRevenue, Inc.

How does a venture capitalist appraise investment opportunities?

National Counties Building Society. A guide to our Buy to Let Mortgage lending criteria

Comparative tables. CPSS Red Book statistical update 427

LUXEMBOURG TRADING CENTRE LIST

How To Control A Record System

Westpac Travelling Scholarship University of Otago Business School Established in 1953 by the Trustees of the Dunedin Savings Bank (now Westpac).

Transcription:

Antje Düsterhöft, Bernhard Thalheim (Eds.) Natural Language Processing and Information Systems 8th International Conference on Applications of Natural Language to Information Systems June 2003 in Burg (Spreewald), Germany Gesellschaft für Informatik 2003

Lecture Notes in Informatics (LNI) - Proceedings Series of the Gesellschaft für Informatik (GI) Volume P-29 ISSN 1617-5468 ISBN 3-88579-358-X Volume Editors Prof. Dr. Antje Düsterhöft HS-Wismar FB Elektrotechnik und Informatik Postfach 1210 D-23952 Wismar E-Mail: duest@et.hs-wismar.de Prof. Dr. Bernhard Thalheim Brandenburgische Technische Universität Cottbus Institut für Informatik Postfach 101344 D-03013 Cottbus E-Mail: thalheim@informatik.tu-cottbus.de Series Editorial Board Heinrich C. Mayr, Universität Klagenfurt, Austria (Chairman, mayr@ifit.uni-klu.ac.at) Jörg Becker, Universität Münster, Germany Ulrich Furbach, Universität Koblenz, Germany Axel Lehmann, Universität der Bundeswehr München, Germany Peter Liggesmeyer, Universität Potsdam, Germany Ernst W. Mayr, Technische Universität München, Germany Heinrich Müller, Universität Dortmund, Germany Heinrich Reinermann, Hochschule für Verwaltungswissenschaften Speyer, Germany Karl-Heinz Rödiger, Universität Bremen, Germany Sigrid Schubert, Universität Dortmund, Germany Dissertations Dorothea Wagner, Universität Konstanz, Germany Seminars Reinhard Wilhelm, Universität des Saarlandes, Germany Gesellschaft für Informatik, Bonn 2003 printed by Köllen Druck+Verlag GmbH, Bonn

Preface Since 1995 the NLDB conference has aimed at bringing together researchers, industrial and potential users interested in various applications of NATURAL LANGUAGE in the DATABASE and INFORMATION SYSTEMS field. The integration of databases and natural language has been an utopia for a long time. Nowadays, this is an accessible convergent point on which a lot of researchers are focusing, mainly due to the large progress of research in natural language and to the development of new technologies which allow the storage of real semantic electronic dictionaries. Each aspect of an information system life cycle may be improved by natural language techniques: database design (specification, validation, conflicts resolution), database query languages and application programming that use new software engineering research (e.g. natural language program specifications). Furthermore, natural language based query languages and user interfaces facilitate the access to software systems for anyone and allow for new paradigms in the usage of computerized services. As information systems are now evolving into the communication area, the term databases should be considered in the broader sense of information and communication systems. The NLDB'2003 contributions are a balanced mix of full paper reports and extended abstracts from research and application giving a broad insight into the state of the art concerning problems and solutions within the context of natural language processing and information systems. The selected papers are assigned to the major topics: natural language for conceptual modelling, information retrieval and information extraction, linguistic resources for dialogue modelling, natural language for database querying, referencing and categorization, as well as building ontologies for web applications. We thank all authors for their interesting papers and we also take pleasure in thanking those who have contributed to the realization of the conference and of these proceedings, especially Karla Kersten, Aleksander Binemann-Zdanowicz and Thomas Kobienia. Wismar and Cottbus, in June 2003 Antje Düsterhöft Bernhard Thalheim

Programme Committee: Diego Mollá Aliod, Macquarie University, Australia Kenji Araki, Hokkaido University, Japan Alfs T. Berztiss, University of Pittsburgh, USA Mokrane Bouzeghoub, PRiSM, Université de Versailles, France Hans Burg, Ordina Alignment Consulting, The Netherlands Key-Sun Choi, NHK Science and Technology Research Lab., Japan Gary A Coen, Boeing, USA Isabelle Comyn-Wattiau, CEDRIC/CNAM, France Walter Daelemans, University of Antwerp, Belgium Antje Düsterhöft, University of Wismar, Germany Günther Fliedl, Universität Klagenfurt, Austria Alexander Gelbukh, Instituto Politecnico Nacional, Mexico Rafael Muñoz Guillena, Universidad de Alicante, Spain Jon Atle Gulla, Borvegian Institute of Technology, Norway Helmut Horacek, Universität Saarbrücken, Germany Paul Johannesson, Stockholm University, Sweden Zoubida Kedad, PRiSM, Université de Versailles, France Christian Kop, Universität Klagenfurt, Austria Winfried Lenders, Universität Bonn, Germany Jana Lewerenz, sd&m Düsseldorf, Germany Robert Luk, Hong Kong Polytechnic University, Hong Kong Heinrich C. Mayr, Universität Klagenfurt, Austria Paul McFetridge, Simon Fraser University, Canada Elisabeth Metais, CEDRIC/CNAM, France Farid Meziane, Salford University, UK Ana Maria Moreno, Universidad Politecnica de Madrid, Spain Kazunori Muraki, NEC Custum Tecnica Ltd., Japan Jian-Yun Nie, Université de Montréal, Canada Odile Piton, Université Paris I Panthéon-Sorbonne, France Reind van de Riet, Vrije Universiteit Amsterdam, The Netherlands Hae-Chang Rim, Korea University, Korea Hongchi Shi, University of Missouri-Columbia, USA Ishizaki Shun, Keio University, Japan Vijay Sugumaran, Oakland University Rochester, USA Veda Storey, Georgia State University, USA Lua Km Teng, National University of Singapore, Singapore Bernhard Thalheim, University of Cottbus, Germany Babis Theodoulidis, University of Surrey, UK Benkt Wangler, University of Skövde, Sweden Hans Weigand, Tilburg University, The Netherlands Werner Winiwarter, University of Vienna, Austria Christian Winkler, Universität Klagenfurt,Austria Mustafa Yaseen, Amman University of Applied Sciences, Jordan Additional Reviewers: Per Backlund, University of Skövde, Sweden Eva Söderström, University of Skövde, Sweden Organizing Committee: chair: Vojtech Vestenický, BTU Cottbus Aleksander Binemann-Zdanowicz, BTU Cottbus Antje Düsterhöft, University of Applied Sciences Wismar Carola Kadow, University of Applied Sciences Wismar Karla Kersten, BTU Cottbus Thomas Kobienia, BTU Cottbus

Contents Invited Paper K.-R. Fellbaum Speech Input and Output Technology State of the Art and Selected Applications 7 Conference Papers S. Armstrong, A. Clark, G. Coray, M. Georgescul, V. Pallotta, A. Popescu-Belis, D. Portabella, M. Rajman, M. Starlander Natural Language Queries on Natural Language Data: a Database of Meeting Dialogues 14 I. A. Bolshakov, A. Gelbukh On Detection of Malapropisms by Multistage Collocation Testing 28 V. Boonjing, C. Hsu Natural Language Interaction Using a Scalable Reference Dictionary 42 A. Burton-Jones, V. C. Storey, V. Sugumaran, P. Ahluwalia Assessing the Effectiveness of the DAML Ontologies for the Semantic Web 56 R. Camps, J. Daudé Improving the Efficacy of Approximate Searching by Personal-Name 70 P. Cimiano Ontology-Driven Discourse Analysis in GenIE 77 G. Fliedl, C. Kop, H. C. Mayr From Scenarios to KCPM Dynamic Schemas: Aspects of Automatic Mapping 91 G. Gardarin, H. Kou, K. Zetourni, X. Meng, H. Wang SEWISE : An Ontology-based Web Information Search Engine 106 A. Gelbukh, M. Alexandrov, A. Bourek, P. Makagonov Selection of Representative Documents for Clusters in a Document Collection 120 E. Kapetanios, D. Baer, P. Groenewoud Simplifying Syntactic and Semantic Parsing of NL Based Queries in Advanced Application Domains 127 H. Kou, G. Gardarin, K. Zeitouni Approaches to Feature Selection for Document Categorization 141

K. C. Lan, K. S. Ho Accessing Financial News Using Dialogues 155 M. Martinovic, G. Sampath, R. Wagner, S. Briening A Model of USENET Newsgroups Dynamics: Implementation and Results 168 F. Meziane, M. Khairudin Kasiran Extracting Unstructured Information from the WWW to Support Merchant Existence in ecommerce 175 B. Navarro, M. Palomar, P. Martýnez-Barco A General Proposal to Multilingual Information Access Based on Syntactic-Semantic Patterns 186 O. Piton, T. Grass, D. Maurel Linguistic Resource for NLP: Ask for Die Drei Musketiere and meet Les Trois Mousquetaires 200 G. Ramakrishnanan, P. Bhattacharyya Text Representation with WordNet Synsets using Soft Sense Disambiguation 214 I. Renz, A. Ficzay, H. Hitzler Keyword Extraction for Text Characterization 228 N. Stratica, L. Kosseim, B. C. Desai NLIDB Templates for Semantic Parsing 235 K. Thirunarayan, A. Berkovich, S. Grace, D. Sokol Information Extraction for Reorganizing Specifications 242