Processing: current projects and research at the IXA Group
|
|
|
- Carol Wheeler
- 10 years ago
- Views:
Transcription
1 Natural Language Processing: current projects and research at the IXA Group IXA Research Group on NLP University of the Basque Country Xabier Artola Zubillaga
2 Motivation A language that seeks to survive in the modern information society requires language technology products. Most of the working applications are only available for the "big" languages. "Minority" languages have to make a great effort to face this challenge. Multilinguae, 22/11/00 IXA Research Group on NLP (UPV/EHU) 2
3 How to face the challenge? Open proposal for the development of language technology. Steps to take: from necessary infrastructure to useful LE applications. Based on the twelve year-long experience of the IXA Research Group in the field of natural language processing applied to Basque. Multilinguae, 22/11/00 IXA Research Group on NLP (UPV/EHU) 3
4 IXA Research Group on NLP (UPV/EHU) (I) Main research fields: NLP, computational linguistics, language engineering. Goal: to collaborate on laying foundations for research; the development of language processing software. Application language: mainly Basque. Multilinguae, 22/11/00 IXA Research Group on NLP (UPV/EHU) 4
5 IXA Research Group on NLP (UPV/EHU) (II) 1986/1987: 4-5 university lecturers (CS) 2000/2001: ~30 members 13 lecturers (11 doctorates, senior researchers) 13 PhD students (research grants) A few research assistants assigned to projects Interdisciplinary team: computer scientists & linguists Collaboration on linguistic aspects: UZEI, Elhuyar,... FOR MORE INFO... Multilinguae, 22/11/00 IXA Research Group on NLP (UPV/EHU) 5
6 IXA Research Group on NLP (UPV/EHU) (III) Relationships with other universities in Euskal Herria; Madrid; Toulouse; Barcelona; Maryland, Las Cruces (USA); Sydney (Australia); Massey (New Zealand); Rome (Italy); Helsinki (Finland);... And companies: Hizkia, Jalgi, Egunkaria, Microsoft, Xerox, LingSoft, LexiQuest,... Funding: local government, University of the Basque Country, Spanish Government,... Multilinguae, 22/11/00 IXA Research Group on NLP (UPV/EHU) 6
7 What is the language industry? A growing number of people use computer systems in their everyday life. Remote information retrieval Document writing and correction Many of these systems involve the use and processing of language. Consultation of dictionaries and encyclopedias Second language learning Translation of documents Electronic messaging Automatic phone services Multilinguae, 22/11/00 IXA Research Group on NLP (UPV/EHU) 7
8 Some terminology on Natural Language Processing (I) NLP deals with the automatic processing of both spoken and written text: communicate with/through computers by means of every day language. Computational linguistics or computer-oriented linguistics: formalisation of linguistic knowledge for computer processing. Multilinguae, 22/11/00 IXA Research Group on NLP (UPV/EHU) 8
9 Some terminology on Natural Language Processing (II) Language engineering: production of computer systems which can recognise, understand, interpret and generate human language in all its forms. Typical products of LE are language software systems (lingware) such as lemmatisers, phrase recognisers, word sense disambiguation programs, translation aids, etc. All this is usually gathered under the heading of Human Language Technology. Multilinguae, 22/11/00 IXA Research Group on NLP (UPV/EHU) 9
10 Underlying philosophy (I) Use, share, and reuse: theories, formalisms, and methodologies techniques and expertise technology Build our own linguistic resources in order to develop: general and specific tools applications and end-user products Multilinguae, 22/11/00 IXA Research Group on NLP (UPV/EHU) 10
11 Underlying philosophy (II) As an example: Several OCR programs claim to have Basque among the languages they are set up for. No one includes specific language information (dictionary, bi-grams or tri-grams info., etc.). In some of them, the use of (r acute) and similar obsolete features is the only reason for that claim!!!. Result These programs don't work with Basque texts as properly as they do with other languages. Multilinguae, 22/11/00 IXA Research Group on NLP (UPV/EHU) 11
12 Strategic priorities: from basic research to application development Research & development End-user applications Language tools Basic & applied research Linguistic foundations Linguistic resources Multilinguae, 22/11/00 IXA Research Group on NLP (UPV/EHU) 12
13 Linguistic foundations & resources, tools and applications Linguistic foundations and resources: necessary infrastructure for the automatic processing of a language. Tools: mainly intended for application developers. Applications: commercial or non-commercial, for non-specialised end-users. Multilinguae, 22/11/00 IXA Research Group on NLP (UPV/EHU) 13
14 Phase I: laying foundations MRD's Comp. description of morphology Basic Lexical Database Raw corpus (written texts & speech recordings) Phonetics Lexicon Morphology Syntax Semantics Multilinguae, 22/11/00 IXA Research Group on NLP (UPV/EHU) 14
15 Phase II: first basic tools and applications Xuxen: spelling checker/corrector Lemmatiser/Tagger Morphological analyser Statistical tools for the treatment of corpora MRD's Comp. description of morphology Enriched Lexical Database Morphologically annotated corpus Phonetics Lexicon Morphology Syntax Semantics Multilinguae, 22/11/00 IXA Research Group on NLP (UPV/EHU) 15
16 Phase III: more advanced tools and applications Basic CALL Electronic dictionaries Web crawler Grammar checker Environment for linguistic tools integration Xuxen: spelling checker/corrector Lemmatiser/Tagger Surface syntax Morphological analyser analyser Statistical tools for the treatment of corpora WSD MRD's Comp. description of morphology Comp. grammar Lexical Database Morphologically and syntactically annotated corpus Lexicalsemantic KB Phonetics Lexicon Morphology Syntax Semantics Multilinguae, 22/11/00 IXA Research Group on NLP (UPV/EHU) 16
17 Phase IV: multilinguality and general use applications NL generation, translation aids, dialog systems,... Information retrieval and extraction Advanced CALL Electronic dictionaries Web crawler Grammar checker Environment for linguistic tools integration Xuxen: spelling checker/corrector Lemmatiser/Tagger Syntax Morphological analyser analyser WSD Statistical tools for the treatment of corpora MRD's Comp. description Comp. Multilingual of morphology grammar lexicalsemantic KB Lexical Database Morphol., synt., and semantically annotated multilingual corpus Phonetics Lexicon Morphology Syntax Semantics Multilinguae, 22/11/00 IXA Research Group on NLP (UPV/EHU) 17
18 What not to do (I) Do not start developing applications before linguistic foundations are created. Follow, in general, the sequence stated above: foundations, tools, and applications. When a new system must be built, do not create ad hoc linguistic resources. Design these resources to be easily extended for full coverage and make them reusable by any other tool or application. Multilinguae, 22/11/00 IXA Research Group on NLP (UPV/EHU) 18
19 What not to do (II): example Basque is a language with a very rich morphology. We decided not to begin with advanced applications (machine translation,...) but rather to develop a broad foundation based on lexicon and morphology. Now those foundations have become the base for present and future developments. Multilinguae, 22/11/00 IXA Research Group on NLP (UPV/EHU) 19
20 Reusability is a must: example (I) Structured electronic dictionaries Lemmatiser Surface syntax parser Lexical Database Machine- Readable Dictionaries Multilinguae, 22/11/00 IXA Research Group on NLP (UPV/EHU) 20
21 Reusability is a must: example (II) Translation aids Structured electronic dictionaries Word-sense disambiguation Morphological analyser Surface syntax parser Lexical Database Corpus Multilinguae, 22/11/00 IXA Research Group on NLP (UPV/EHU) 21
22 What not to do (III) When you complete a new resource or tool do not keep it to yourself many researchers in the world are investigating on English, but only a few on each minority language we will not become rich (market criteria do not usually apply) Results should be public and shared for research purposes. Multilinguae, 22/11/00 IXA Research Group on NLP (UPV/EHU) 22
23 Conclusions Long-term strategy for research and development of language engineering. Based on the experience of the IXA Group on the automatic processing of Basque. Every foundation, tool, and application developed in the previous phases is of great importance to face new problems and challenges. The development of a sound language industry should be the result of a coordinated effort, involving research groups, institutions and industry. Multilinguae, 22/11/00 IXA Research Group on NLP (UPV/EHU) 23
Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg
Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg March 1, 2007 The catalogue is organized into sections of (1) obligatory modules ( Basismodule ) that
Kybots, knowledge yielding robots German Rigau IXA group, UPV/EHU http://ixa.si.ehu.es
KYOTO () Intelligent Content and Semantics Knowledge Yielding Ontologies for Transition-Based Organization http://www.kyoto-project.eu/ Kybots, knowledge yielding robots German Rigau IXA group, UPV/EHU
MEDAR Mediterranean Arabic Language and Speech Technology An intermediate report on the MEDAR Survey of actors, projects, products
MEDAR Mediterranean Arabic Language and Speech Technology An intermediate report on the MEDAR Survey of actors, projects, products Khalid Choukri Evaluation and Language resources Distribution Agency;
Special Topics in Computer Science
Special Topics in Computer Science NLP in a Nutshell CS492B Spring Semester 2009 Jong C. Park Computer Science Department Korea Advanced Institute of Science and Technology INTRODUCTION Jong C. Park, CS
Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast
Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast Hassan Sawaf Science Applications International Corporation (SAIC) 7990
IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper
IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper CAST-2015 provides an opportunity for researchers, academicians, scientists and
CS 6740 / INFO 6300. Ad-hoc IR. Graduate-level introduction to technologies for the computational treatment of information in humanlanguage
CS 6740 / INFO 6300 Advanced d Language Technologies Graduate-level introduction to technologies for the computational treatment of information in humanlanguage form, covering natural-language processing
MULTIFUNCTIONAL DICTIONARIES
In: A. Zampolli, A. Capelli (eds., 1984): The possibilities and limits of the computer in producing and publishing dictionaries. Linguistica Computationale III, Pisa: Giardini, 279-288 MULTIFUNCTIONAL
Master of Arts in Linguistics Syllabus
Master of Arts in Linguistics Syllabus Applicants shall hold a Bachelor s degree with Honours of this University or another qualification of equivalent standard from this University or from another university
Comprendium Translator System Overview
Comprendium System Overview May 2004 Table of Contents 1. INTRODUCTION...3 2. WHAT IS MACHINE TRANSLATION?...3 3. THE COMPRENDIUM MACHINE TRANSLATION TECHNOLOGY...4 3.1 THE BEST MT TECHNOLOGY IN THE MARKET...4
Customizing an English-Korean Machine Translation System for Patent Translation *
Customizing an English-Korean Machine Translation System for Patent Translation * Sung-Kwon Choi, Young-Gil Kim Natural Language Processing Team, Electronics and Telecommunications Research Institute,
Collecting Polish German Parallel Corpora in the Internet
Proceedings of the International Multiconference on ISSN 1896 7094 Computer Science and Information Technology, pp. 285 292 2007 PIPS Collecting Polish German Parallel Corpora in the Internet Monika Rosińska
Research Portfolio. Beáta B. Megyesi January 8, 2007
Research Portfolio Beáta B. Megyesi January 8, 2007 Research Activities Research activities focus on mainly four areas: Natural language processing During the last ten years, since I started my academic
AnHitz, development and integration of language, speech and visual technologies for Basque
AnHitz, development and integration of language, speech and visual technologies for Basque Kutz Arrieta VICOMTech [email protected] Igor Leturia Elhuyar Foundation [email protected] Urtza Iturraspe
ONLINE TRANSLATION SERVICES FOR THE LAO LANGUAGE
ONLINE TRANSLATION SERVICES FOR THE LAO LANGUAGE Vincent BERMENT [email protected] GETA-CLIPS (IMAG) BP 53 38041 Grenoble Cedex 9, France http://www-clips.imag.fr/geta/ INALCO 2, rue de Lille 75343
Word Completion and Prediction in Hebrew
Experiments with Language Models for בס"ד Word Completion and Prediction in Hebrew 1 Yaakov HaCohen-Kerner, Asaf Applebaum, Jacob Bitterman Department of Computer Science Jerusalem College of Technology
Survey Results: Requirements and Use Cases for Linguistic Linked Data
Survey Results: Requirements and Use Cases for Linguistic Linked Data 1 Introduction This survey was conducted by the FP7 Project LIDER (http://www.lider-project.eu/) as input into the W3C Community Group
A Mixed Trigrams Approach for Context Sensitive Spell Checking
A Mixed Trigrams Approach for Context Sensitive Spell Checking Davide Fossati and Barbara Di Eugenio Department of Computer Science University of Illinois at Chicago Chicago, IL, USA [email protected], [email protected]
Overview of MT techniques. Malek Boualem (FT)
Overview of MT techniques Malek Boualem (FT) This section presents an standard overview of general aspects related to machine translation with a description of different techniques: bilingual, transfer,
Opentrad: bringing to the market open source based Machine Translators
Opentrad: bringing to the market open source based Machine Translators Ibon Aizpurua Ugarte 1, Gema Ramírez Sánchez 2, Jose Ramom Pichel 3, Josu Waliño 4 1 Eleka Ingeniaritza Linguistikoa, S.L. 2 Prompsit
How the Computer Translates. Svetlana Sokolova President and CEO of PROMT, PhD.
Svetlana Sokolova President and CEO of PROMT, PhD. How the Computer Translates Machine translation is a special field of computer application where almost everyone believes that he/she is a specialist.
Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives
Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives Ramona Enache and Adam Slaski Department of Computer Science and Engineering Chalmers University of Technology and
Semantic annotation of requirements for automatic UML class diagram generation
www.ijcsi.org 259 Semantic annotation of requirements for automatic UML class diagram generation Soumaya Amdouni 1, Wahiba Ben Abdessalem Karaa 2 and Sondes Bouabid 3 1 University of tunis High Institute
Modern foreign languages
Modern foreign languages Programme of study for key stage 3 and attainment targets (This is an extract from The National Curriculum 2007) Crown copyright 2007 Qualifications and Curriculum Authority 2007
Report on the embedding and evaluation of the second MT pilot
Report on the embedding and evaluation of the second MT pilot quality translation by deep language engineering approaches DELIVERABLE D3.10 VERSION 1.6 2015-11-02 P2 QTLeap Machine translation is a computational
Hybrid Strategies. for better products and shorter time-to-market
Hybrid Strategies for better products and shorter time-to-market Background Manufacturer of language technology software & services Spin-off of the research center of Germany/Heidelberg Founded in 1999,
Text-To-Speech Technologies for Mobile Telephony Services
Text-To-Speech Technologies for Mobile Telephony Services Paulseph-John Farrugia Department of Computer Science and AI, University of Malta Abstract. Text-To-Speech (TTS) systems aim to transform arbitrary
Automatic assignment of Wikipedia encyclopedic entries to WordNet synsets
Automatic assignment of Wikipedia encyclopedic entries to WordNet synsets Maria Ruiz-Casado, Enrique Alfonseca and Pablo Castells Computer Science Dep., Universidad Autonoma de Madrid, 28049 Madrid, Spain
Strategic plan for the development of research in Language Technology at the University of Gothenburg
Strategic plan for the development of research in Language Technology at the University of Gothenburg A long range plan with particular reference to the current research period 2009 12 Amended version,
Generating SQL Queries Using Natural Language Syntactic Dependencies and Metadata
Generating SQL Queries Using Natural Language Syntactic Dependencies and Metadata Alessandra Giordani and Alessandro Moschitti Department of Computer Science and Engineering University of Trento Via Sommarive
MASTER OF PHILOSOPHY IN ENGLISH AND APPLIED LINGUISTICS
University of Cambridge: Programme Specifications Every effort has been made to ensure the accuracy of the information in this programme specification. Programme specifications are produced and then reviewed
LGPLLR : an open source license for NLP (Natural Language Processing) Sébastien Paumier. Université Paris-Est Marne-la-Vallée
LGPLLR : an open source license for NLP (Natural Language Processing) Sébastien Paumier Université Paris-Est Marne-la-Vallée [email protected] Penguin from http://tux.crystalxp.net/ 1 Linguistic data
Tibetan-Chinese Bilingual Sentences Alignment Method based on Multiple Features
, pp.273-280 http://dx.doi.org/10.14257/ijdta.2015.8.4.27 Tibetan-Chinese Bilingual Sentences Alignment Method based on Multiple Features Lirong Qiu School of Information Engineering, MinzuUniversity of
Hybrid Machine Translation Guided by a Rule Based System
Hybrid Machine Translation Guided by a Rule Based System Cristina España-Bonet, Gorka Labaka, Arantza Díaz de Ilarraza, Lluís Màrquez Kepa Sarasola Universitat Politècnica de Catalunya University of the
Terminology Extraction from Log Files
Terminology Extraction from Log Files Hassan Saneifar 1,2, Stéphane Bonniol 2, Anne Laurent 1, Pascal Poncelet 1, and Mathieu Roche 1 1 LIRMM - Université Montpellier 2 - CNRS 161 rue Ada, 34392 Montpellier
Search and Data Mining: Techniques. Text Mining Anya Yarygina Boris Novikov
Search and Data Mining: Techniques Text Mining Anya Yarygina Boris Novikov Introduction Generally used to denote any system that analyzes large quantities of natural language text and detects lexical or
PoS-tagging Italian texts with CORISTagger
PoS-tagging Italian texts with CORISTagger Fabio Tamburini DSLO, University of Bologna, Italy [email protected] Abstract. This paper presents an evolution of CORISTagger [1], an high-performance
A stream computing approach towards scalable NLP
A stream computing approach towards scalable NLP Xabier Artola, Zuhaitz Beloki, Aitor Soroa IXA group. University of the Basque Country. LREC, Reykjavík 2014 Table of contents 1
LINGSTAT: AN INTERACTIVE, MACHINE-AIDED TRANSLATION SYSTEM*
LINGSTAT: AN INTERACTIVE, MACHINE-AIDED TRANSLATION SYSTEM* Jonathan Yamron, James Baker, Paul Bamberg, Haakon Chevalier, Taiko Dietzel, John Elder, Frank Kampmann, Mark Mandel, Linda Manganaro, Todd Margolis,
209 THE STRUCTURE AND USE OF ENGLISH.
209 THE STRUCTURE AND USE OF ENGLISH. (3) A general survey of the history, structure, and use of the English language. Topics investigated include: the history of the English language; elements of the
LEXUS: a web based lexicon tool
LEXUS: a web based lexicon tool Jacquelijn Ringersma Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands Content Max Planck Institute Archive of linguistic resources Tool support (archiving
Glossary of translation tool types
Glossary of translation tool types Tool type Description French equivalent Active terminology recognition tools Bilingual concordancers Active terminology recognition (ATR) tools automatically analyze
Automatic Identification of Arabic Language Varieties and Dialects in Social Media
Automatic Identification of Arabic Language Varieties and Dialects in Social Media Fatiha Sadat University of Quebec in Montreal, 201 President Kennedy, Montreal, QC, Canada [email protected] Farnazeh
National Masters School in Language Technology
National Masters School in Language Technology GSLT May 19, 2009 Introduction Sweden is a relatively small country and competence in language technology is spread over a number of academic institutions
THE BACHELOR S DEGREE IN SPANISH
Academic regulations for THE BACHELOR S DEGREE IN SPANISH THE FACULTY OF HUMANITIES THE UNIVERSITY OF AARHUS 2007 1 Framework conditions Heading Title Prepared by Effective date Prescribed points Text
Extracting translation relations for humanreadable dictionaries from bilingual text
Extracting translation relations for humanreadable dictionaries from bilingual text Overview 1. Company 2. Translate pro 12.1 and AutoLearn 3. Translation workflow 4. Extraction method 5. Extended
How To Complete The Danish Masters Program In Lct
European Masters Program in Language and Communication Technologies (LCT) Modules Handbook for Prospective Students European Masters Program in LCT - Modules Handbook Page ii Chapter 1 Study Program The
PROMT Technologies for Translation and Big Data
PROMT Technologies for Translation and Big Data Overview and Use Cases Julia Epiphantseva PROMT About PROMT EXPIRIENCED Founded in 1991. One of the world leading machine translation provider DIVERSIFIED
The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge
The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge White Paper October 2002 I. Translation and Localization New Challenges Businesses are beginning to encounter
An Artificial Intelligence approach to Arabic and Islamic content on the internet
An Artificial Intelligence approach to Arabic and Islamic content on the internet Eric Atwell, Claire Brierley, Kais Dukes, Majdi Sawalha, Abdul-Baquee Sharaf I-AIBS Institute for Artificial intelligence
Computer-Based Text- and Data Analysis Technologies and Applications. Mark Cieliebak 9.6.2015
Computer-Based Text- and Data Analysis Technologies and Applications Mark Cieliebak 9.6.2015 Data Scientist analyze Data Library use 2 About Me Mark Cieliebak + Software Engineer & Data Scientist + PhD
Why major in linguistics (and what does a linguist do)?
Why major in linguistics (and what does a linguist do)? Written by Monica Macaulay and Kristen Syrett What is linguistics? If you are considering a linguistics major, you probably already know at least
A Machine Translation System Between a Pair of Closely Related Languages
A Machine Translation System Between a Pair of Closely Related Languages Kemal Altintas 1,3 1 Dept. of Computer Engineering Bilkent University Ankara, Turkey email:[email protected] Abstract Machine translation
Automated Multilingual Text Analysis in the Europe Media Monitor (EMM) Ralf Steinberger. European Commission Joint Research Centre (JRC)
Automated Multilingual Text Analysis in the Europe Media Monitor (EMM) Ralf Steinberger European Commission Joint Research Centre (JRC) https://ec.europa.eu/jrc/en/research-topic/internet-surveillance-systems
The Knowledge Sharing Infrastructure KSI. Steven Krauwer
The Knowledge Sharing Infrastructure KSI Steven Krauwer 1 Why a KSI? Building or using a complex installation requires specialized skills and expertise. CLARIN is no exception. CLARIN is populated with
Learning Translation Rules from Bilingual English Filipino Corpus
Proceedings of PACLIC 19, the 19 th Asia-Pacific Conference on Language, Information and Computation. Learning Translation s from Bilingual English Filipino Corpus Michelle Wendy Tan, Raymond Joseph Ang,
How RAI's Hyper Media News aggregation system keeps staff on top of the news
How RAI's Hyper Media News aggregation system keeps staff on top of the news 13 th Libre Software Meeting Media, Radio, Television and Professional Graphics Geneva - Switzerland, 10 th July 2012 Maurizio
Turkish Radiology Dictation System
Turkish Radiology Dictation System Ebru Arısoy, Levent M. Arslan Boaziçi University, Electrical and Electronic Engineering Department, 34342, Bebek, stanbul, Turkey [email protected], [email protected]
Study Plan. Bachelor s in. Faculty of Foreign Languages University of Jordan
Study Plan Bachelor s in Spanish and English Faculty of Foreign Languages University of Jordan 2009/2010 Department of European Languages Faculty of Foreign Languages University of Jordan Degree: B.A.
UNIVERSITY OF JORDAN ADMISSION AND REGISTRATION UNIT COURSE DESCRIPTION
Course Description B.A Degree Spanish and English Language and Literature 2203103 Spanish Language for Beginners (1) (3 credit hours) Prerequisite : none In combination with Spanish for Beginners (2),
Master of Arts Program in Linguistics for Communication Department of Linguistics Faculty of Liberal Arts Thammasat University
Master of Arts Program in Linguistics for Communication Department of Linguistics Faculty of Liberal Arts Thammasat University 1. Academic Program Master of Arts Program in Linguistics for Communication
Language and Computation
Language and Computation week 13, Thursday, April 24 Tamás Biró Yale University [email protected] http://www.birot.hu/courses/2014-lc/ Tamás Biró, Yale U., Language and Computation p. 1 Practical matters
What Is Linguistics? December 1992 Center for Applied Linguistics
What Is Linguistics? December 1992 Center for Applied Linguistics Linguistics is the study of language. Knowledge of linguistics, however, is different from knowledge of a language. Just as a person is
Symbiosis of Evolutionary Techniques and Statistical Natural Language Processing
1 Symbiosis of Evolutionary Techniques and Statistical Natural Language Processing Lourdes Araujo Dpto. Sistemas Informáticos y Programación, Univ. Complutense, Madrid 28040, SPAIN (email: [email protected])
Free Text Phrase Encoding and Information Extraction from Medical Notes. Jennifer Shu
Free Text Phrase Encoding and Information Extraction from Medical Notes by Jennifer Shu Submitted to the Department of Electrical Engineering and Computer Science in partial fulfillment of the requirements
Exploiting Comparable Corpora and Bilingual Dictionaries. the Cross Language Text Categorization
Exploiting Comparable Corpora and Bilingual Dictionaries for Cross-Language Text Categorization Alfio Gliozzo and Carlo Strapparava ITC-Irst via Sommarive, I-38050, Trento, ITALY {gliozzo,strappa}@itc.it
Doctoral Consortium 2013 Dept. Lenguajes y Sistemas Informáticos UNED
Doctoral Consortium 2013 Dept. Lenguajes y Sistemas Informáticos UNED 17 19 June 2013 Monday 17 June Salón de Actos, Facultad de Psicología, UNED 15.00-16.30: Invited talk Eneko Agirre (Euskal Herriko
Introduction to formal semantics -
Introduction to formal semantics - Introduction to formal semantics 1 / 25 structure Motivation - Philosophy paradox antinomy division in object und Meta language Semiotics syntax semantics Pragmatics
Teaching Formal Methods for Computational Linguistics at Uppsala University
Teaching Formal Methods for Computational Linguistics at Uppsala University Roussanka Loukanova Computational Linguistics Dept. of Linguistics and Philology, Uppsala University P.O. Box 635, 751 26 Uppsala,
Computer Assisted Language Learning (CALL): Room for CompLing? Scott, Stella, Stacia
Computer Assisted Language Learning (CALL): Room for CompLing? Scott, Stella, Stacia Outline I What is CALL? (scott) II Popular language learning sites (stella) Livemocha.com (stacia) III IV Specific sites
Study Plan for Master of Arts in Applied Linguistics
Study Plan for Master of Arts in Applied Linguistics Master of Arts in Applied Linguistics is awarded by the Faculty of Graduate Studies at Jordan University of Science and Technology (JUST) upon the fulfillment
Terminology Extraction from Log Files
Terminology Extraction from Log Files Hassan Saneifar, Stéphane Bonniol, Anne Laurent, Pascal Poncelet, Mathieu Roche To cite this version: Hassan Saneifar, Stéphane Bonniol, Anne Laurent, Pascal Poncelet,
CS 533: Natural Language. Word Prediction
CS 533: Natural Language Processing Lecture 03 N-Gram Models and Algorithms CS 533: Natural Language Processing Lecture 01 1 Word Prediction Suppose you read the following sequence of words: Sue swallowed
Identifying Focus, Techniques and Domain of Scientific Papers
Identifying Focus, Techniques and Domain of Scientific Papers Sonal Gupta Department of Computer Science Stanford University Stanford, CA 94305 [email protected] Christopher D. Manning Department of
The PALAVRAS parser and its Linguateca applications - a mutually productive relationship
The PALAVRAS parser and its Linguateca applications - a mutually productive relationship Eckhard Bick University of Southern Denmark [email protected] Outline Flow chart Linguateca Palavras History
The history of machine translation in a nutshell
The history of machine translation in a nutshell John Hutchins [Web: http://ourworld.compuserve.com/homepages/wjhutchins] [Latest revision: November 2005] 1. Before the computer 2. The pioneers, 1947-1954
ON GETTING THE MOST OUT OF INTERNET RESOURCES TO RAISE TRANSLATION QUALITY OF PROFESSIONAL DOCUMENTATION
General and Professional Education 3/2013 pp. 21-27 ISSN 2084-1469 ON GETTING THE MOST OUT OF INTERNET RESOURCES TO RAISE TRANSLATION QUALITY OF PROFESSIONAL DOCUMENTATION Svetlana Sheremetyeva Department
DanNet From Dictionary to Wordnet
DanNet From Dictionary to Wordnet Jörg Asmussen Society for Danish Language and Literature, DSL, Copenhagen Bolette Sandford Pedersen Centre for Language Technology, CST, University of Copenhagen Lars
M LTO Multilingual On-Line Translation
O non multa, sed multum M LTO Multilingual On-Line Translation MOLTO Consortium FP7-247914 Project summary MOLTO s goal is to develop a set of tools for translating texts between multiple languages in
An Overview of Applied Linguistics
An Overview of Applied Linguistics Edited by: Norbert Schmitt Abeer Alharbi What is Linguistics? It is a scientific study of a language It s goal is To describe the varieties of languages and explain the
International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 ISSN 2229-5518
International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 INTELLIGENT MULTIDIMENSIONAL DATABASE INTERFACE Mona Gharib Mohamed Reda Zahraa E. Mohamed Faculty of Science,
Semantic analysis of text and speech
Semantic analysis of text and speech SGN-9206 Signal processing graduate seminar II, Fall 2007 Anssi Klapuri Institute of Signal Processing, Tampere University of Technology, Finland Outline What is semantic
NATURAL LANGUAGE QUERY PROCESSING USING SEMANTIC GRAMMAR
NATURAL LANGUAGE QUERY PROCESSING USING SEMANTIC GRAMMAR 1 Gauri Rao, 2 Chanchal Agarwal, 3 Snehal Chaudhry, 4 Nikita Kulkarni,, 5 Dr. S.H. Patil 1 Lecturer department o f Computer Engineering BVUCOE,
TRANSLATION OF TELUGU-MARATHI AND VICE- VERSA USING RULE BASED MACHINE TRANSLATION
TRANSLATION OF TELUGU-MARATHI AND VICE- VERSA USING RULE BASED MACHINE TRANSLATION Dr. Siddhartha Ghosh 1, Sujata Thamke 2 and Kalyani U.R.S 3 1 Head of the Department of Computer Science & Engineering,
