Kevin Tang and Andrew Nevins


Abstract
[Abstract text not recoverable from the source; surviving terms: -ar(e), -er(e), -ir(e).]

Keywords:
[Not recoverable from the source.]

1 Introduction
[Body text not recoverable from the source; surviving example forms: dig, ouç.]
Verb Vocabulary Size
Productivity of -ar/-er/-ir (-are/-ere/-ire)

2 Data Sources
2.1 English: CLMET3.0, Old Bailey
2.1.1 CLMET
2.2 Portuguese: Corpus do Português, Colonia, Tycho Brahe
2.3 Italian: Google Italian Ngram, DiaCoris
2.4 Spanish: Google Spanish Ngram, IMPACT-es

3 Methods: Verb Vocabulary Size
3.1 Simulations by Random Sampling
3.2 Epoching
3.3 Lemma estimation
[Surviving example form: burnt; surviving endings: -ar(e), -ir(e), -er(e).]

4 Analyses: Verb Vocabulary Size
4.1 Simulation results: English, CLMET
4.2 Simulation results: Portuguese, Colonia
4.3 Simulation results: Italian, Google Ngram
4.4 Simulation results: Spanish, Google Ngram
4.5 Interim Summary

5 Methods: Productivity of -ar -er/-ir
5.1 Simulations by Random Sampling
5.2 Productivity Estimation
Two estimates are named in the source:
- the ratio -ar/(-er + -ir);
- Yang's Productivity Estimate, stated in terms of M, N and the quantity N/ln(N).
[The relation between M and N/ln(N), and the definitions of M and N, are not recoverable from the source.]
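The two quantities named under Productivity Estimation can be illustrated with a minimal sketch. It rests on assumptions the source does not confirm: N is taken to be the number of verb lemmas a pattern could apply to, M the number of lemmas that do not follow it, and the comparison used is the standard statement of Yang's Tolerance Principle (a pattern is tolerated as productive when its exceptions do not exceed N/ln(N)). The function names and the counts in the usage lines are hypothetical and are not taken from the paper.

```python
import math


def tolerance_threshold(n_items: int) -> float:
    """Return N / ln(N), the threshold quantity named in the source.

    Assumption: N counts the items (here, verb lemmas) the pattern
    could in principle apply to.
    """
    if n_items < 2:
        raise ValueError("N must be at least 2 so that ln(N) > 0")
    return n_items / math.log(n_items)


def is_tolerated(n_items: int, n_exceptions: int) -> bool:
    """Tolerance Principle check: exceptions <= N / ln(N).

    Assumption: M in the source corresponds to n_exceptions.
    """
    return n_exceptions <= tolerance_threshold(n_items)


def ar_ratio(n_ar: int, n_er: int, n_ir: int) -> float:
    """Ratio -ar / (-er + -ir) over lemma counts per conjugation class."""
    denominator = n_er + n_ir
    if denominator == 0:
        raise ValueError("need at least one -er or -ir lemma")
    return n_ar / denominator


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    print(round(tolerance_threshold(100), 2))
    print(is_tolerated(100, 21), is_tolerated(100, 22))
    print(ar_ratio(300, 60, 40))
```

Under these assumptions the ratio grows as -ar gains lemmas relative to the other two classes, while the threshold check asks whether the non-conforming lemmas remain few enough for the pattern to count as productive.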

[Formula residue from 5.2: a relative measure over M and N for -ar, and the ratio -ar/(-er + -ir); the formulas themselves are not recoverable from the source.]

6 Analyses: Productivity of
6.1 Simulation results: Portuguese, Corpus do Português
6.2 Simulation results: Portuguese, Colonia
6.3 Simulation results: Italian, Google Ngram
6.4 Simulation results: Italian, DiaCoris
6.5 Simulation results: Spanish, Google Ngram
6.6 Simulation results: Spanish, IMPACT-es

7 Relationship between Verb vocabulary size and Productivity
[Body text not recoverable from the source; surviving symbols: r, p.]

8 Statistical evaluation of the changepoint of verb vocabulary growth

9 Artefact considerations
9.1 Corpus representativeness
9.2 Tagging accuracy and consistency

10 Conclusion
[Body text not recoverable from the source; surviving terms: -ar, -er, -ir, -ar/(-ir + -er).]



More information

Using Web Search for Machine Translation Nicolas Wehmeier BSc Computing and German 2003/2004

Using Web Search for Machine Translation Nicolas Wehmeier BSc Computing and German 2003/2004 Using Web Search for Machine Translation Nicolas Wehmeier BSc Computing and German 2003/2004 The candidate confirms that the work submitted is their own and the appropriate credit has been given where

More information

that differ from that of a basic online search:

that differ from that of a basic online search: Searching Online Databases: A Brief Tutorial Searching an online databaseutilizes methods that differ from that of a basic online search: Controlled vocabulary Indexed terms or Keywords Subject Headings

More information

Sense-Tagging Verbs in English and Chinese. Hoa Trang Dang

Sense-Tagging Verbs in English and Chinese. Hoa Trang Dang Sense-Tagging Verbs in English and Chinese Hoa Trang Dang Department of Computer and Information Sciences University of Pennsylvania htd@linc.cis.upenn.edu October 30, 2003 Outline English sense-tagging

More information

SEO Workshop Keyword and Competitor Research and On Page Optimisation

SEO Workshop Keyword and Competitor Research and On Page Optimisation SEO Workshop Keyword and Competitor Research and On Page Optimisation Marketing & Public Relations Department University of Newcastle April 2014 SEO Workshop Contents 2 What is SEO? STEP 1: Define Purpose

More information

ANALEC: a New Tool for the Dynamic Annotation of Textual Data

ANALEC: a New Tool for the Dynamic Annotation of Textual Data ANALEC: a New Tool for the Dynamic Annotation of Textual Data Frédéric Landragin, Thierry Poibeau and Bernard Victorri LATTICE-CNRS École Normale Supérieure & Université Paris 3-Sorbonne Nouvelle 1 rue

More information

Trend Micro Incorporated. Windows 7 (Unspecified. Tested on 64 bit) Windows Vista (Unspecified. Tested on 64 bit) Windows XP (32/64 bit)

Trend Micro Incorporated. Windows 7 (Unspecified. Tested on 64 bit) Windows Vista (Unspecified. Tested on 64 bit) Windows XP (32/64 bit) NAME Trend Micro Online Guardian for Families Company Trend Micro Incorporated Version 1.5.0.5041 Type of product Devices supported Operating systems Client Computer Windows 7 (Unspecified. Tested on 64

More information

a Chinese-to-Spanish rule-based machine translation

a Chinese-to-Spanish rule-based machine translation Chinese-to-Spanish rule-based machine translation system Jordi Centelles 1 and Marta R. Costa-jussà 2 1 Centre de Tecnologies i Aplicacions del llenguatge i la Parla (TALP), Universitat Politècnica de

More information

Phase 2 of the D4 Project. Helmut Schmid and Sabine Schulte im Walde

Phase 2 of the D4 Project. Helmut Schmid and Sabine Schulte im Walde Statistical Verb-Clustering Model soft clustering: Verbs may belong to several clusters trained on verb-argument tuples clusters together verbs with similar subcategorization and selectional restriction

More information

PELLISSIPPI STATE COMMUNITY COLLEGE MASTER SYLLABUS BEGINNING SPANISH I SPAN 1010. Laboratory Hours: 0.0 Date Revised: Summer 10

PELLISSIPPI STATE COMMUNITY COLLEGE MASTER SYLLABUS BEGINNING SPANISH I SPAN 1010. Laboratory Hours: 0.0 Date Revised: Summer 10 PELLISSIPPI STATE COMMUNITY COLLEGE MASTER SYLLABUS BEGINNING SPANISH I SPAN 1010 Class Hours: 3.0 Credit Hours: 3.0 Laboratory Hours: 0.0 Date Revised: Summer 10 Catalog Course Description: Introduction

More information

The multilayer sentiment analysis model based on Random forest Wei Liu1, Jie Zhang2

The multilayer sentiment analysis model based on Random forest Wei Liu1, Jie Zhang2 2nd International Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2016) The multilayer sentiment analysis model based on Random forest Wei Liu1, Jie Zhang2 1 School of

More information

PyCantonese: Cantonese linguistic research in the age of big data

PyCantonese: Cantonese linguistic research in the age of big data PyCantonese: Cantonese linguistic research in the age of big data Jackson L. Lee University of Chicago http://jacksonllee.com Childhood Bilingualism Research Center, CUHK September 15, 2015 Grammar versus

More information

Department for Energy Development and Independence Shaoceng Wei, Yang Luo, Aron Patrick DRAFT. Summer 2011. Electricity Price Prediction Equation

Department for Energy Development and Independence Shaoceng Wei, Yang Luo, Aron Patrick DRAFT. Summer 2011. Electricity Price Prediction Equation Department for Energy Development and Independence Shaoceng Wei, Yang Luo, Aron DRAFT Summer 2011 1 2 3 4 Purpose of this project Electricity The purpose of this project is to analyze the relationships

More information

F-SECURE INTERNET SECURITY 2012

F-SECURE INTERNET SECURITY 2012 NAME F-SECURE INTERNET SECURITY 2012 Company F-Secure Corporation Version 1.62 Type of product Devices supported Operating systems Client Computer Windows 7 (all editions) Windows Vista Windows XP Home,

More information

Portuguese Corpus-Based Learning Using ETL

Portuguese Corpus-Based Learning Using ETL Portuguese Corpus-Based Learning Using ETL Ruy Luiz Milidiú 1, Cícero Nogueira dos Santos 1 and 1,2 1 Departamento de Informática, Pontifícia Universidade Católica PUC-Rio Rua Marquês de São Vicente, 225,

More information

COURSE OBJECTIVES SPAN 100/101 ELEMENTARY SPANISH LISTENING. SPEAKING/FUNCTIONAl KNOWLEDGE

COURSE OBJECTIVES SPAN 100/101 ELEMENTARY SPANISH LISTENING. SPEAKING/FUNCTIONAl KNOWLEDGE SPAN 100/101 ELEMENTARY SPANISH COURSE OBJECTIVES This Spanish course pays equal attention to developing all four language skills (listening, speaking, reading, and writing), with a special emphasis on

More information