English To Urdu Translation System Tafseer Ahmed Sadaf Alvi University of Karachi, 2002
|
|
- Duane Stafford
- 7 years ago
- Views:
Transcription
1 English To Urdu Translation System Tafseer Ahmed Sadaf Alvi University of Karachi, 2002 Abstract This paper describes a system for English to Urdu Machine Translation. English and Urdu are different languages in terms of structure, word order in phrases and phrase order in clauses and sentences. Also words in Urdu change their forms with respect to gender, number and other attributes of sentences. This paper identify many such issues and suggest solutions of those. To design and implement Translation System, Grammar of English language is developed, and sentences are parsed using Bottom Up Chart Parsing Algorithm. Also Transformational Engine from Parse Tree of English Sentence to Urdu Sentence is developed. 1. Introduction Machine translation systems are need of today. The term 'machine translation' (MT) refers to computerized systems responsible for the production of translations with or without human assistance.[3] There is huge amount of textual data present in digital form and amount of this is increasing day by day. Most of these data is in English Language, but English is not the language of whole world. A large percentage of world population either cannot read English or has little expertise of this language. So Translators are needed from English to other languages. Urdu is the national language of Pakistan. It is also spoken in India and some other South Asian countries. Urdu belongs to Aryan family of languages[2]. It has accepted influence of Hindi, Arabic, Persian and Turkish languages. Machine Translation is considered as one of the most difficult problems of Computer Science. The paper identifies the major problems in translation from English to Urdu Translation, and suggests their solutions. It also gives the description of English to Urdu Translation System. 2. English and Urdu: Comparitive Study English and Urdu are languages which are different in many respects. The major difference between English and Urdu are: 2.1 SVO vs SOV Language English is classified as a Subject-Verb-Object language because Subject, Verb and Object phrases in a sentence are present in this order. An example of English sentence is I write an essay Subject Verb Object In contrast, Urdu is a Subject-Object-Verb(SOV) language. An example of Urdu sentence is main(i) aik mazmoon(an essay) likhta hon(write) Subject Object Verb 2.2 Word Order in Phrase Another differenece in structure of English and Urdu is that order of words in certain phrases is different in both languages. In Prepositional Phrase of English, Head word is not the last word of the phrase. But in Urdu, Head word is the last word of the phrase. For example, in English Book of Urdu Noun(Head) Preposition Noun But in Urdu, the phrase will be written as: Urdu ki(of) kitab(book) Noun Preposition Noun(Head)
2 2.3 Many Forms of Word In every language, a word is used in many inflected forms depending upon features of other words and phrases of the sentence. For example, verb in English is present in five forms (base, 1st, 2nd, 3rd and 4th forms). If a sentence is of Present Indefinite tense then 1st form of verb will be used. But in sentences of continous tense, 4th form of verb is used. But Urdu has more inflected forms than English. In English, adjective never changes their form with respect to the noun they are modifying. For example, In Green Book, Green Books, Green Pen, Green Pens the same word "Green" is used irrespective of the number and gender of the Noun. But Urdu has different inflectional forms of adjectives for different number and genders of the head Noun. In Urdu, these phrases will be written as Hari(Green) Kitab(Book), Hari(Green) Kitabain(Books), Hara(Green) Qalam(Pen), Haray(Green) Qalam(Pens) Similarly, prepositions and verbs have inflected forms in Urdu. 2.4 Noun and Verbs have Gender in Urdu English common nouns and verbs do not have genders associated with them. But in Urdu, every noun and verb has a gender associated with it. For example, Book is used as feminine and Pen is used as masculine. This is the cause of many forms of adjectives and prepositions used with different nouns (as in Example of 2.3). Similarly verbs have gender. In some sentences, inflectional form of the verb depends on its gender. aadmi(man) ne aurat(woman) se bat ki(talked). aurat(woman) ne aadmi(man) se bat ki(talked). 2.5 Types of Tense English and Urdu do not have one to one correspondence in Tense Types. Hence it is not necessary that Urdu has equivalent of every tense of English grammar. For example, Urdu has more than one versions of Past Indefinite Tense like Absolute Past, Near Past and Distant Past, [2] and one can translate English sentence to any one of these Urdu tenses. He bought a book can be translated as us ne kitab kharidi hay. us ne kitab kharidi thi. us ne kitab kharidi. 3 Translator System The System is divided into three major parts: The (1) Lexical Module converts input text into Token with features, the (2) Syntactical Module generates Parse Structure from the tokens obtained from Lexical Module and the (3)Transformational Module that transforms English Parse Structure (generated by Syntactical Module) to Urdu Sentence. 3.1 Lexical Module The function of lexical module is to generate tokens with features corresponding to each word. This module is divided into two sub-modules. The (1) Preprocessor Sub-module takes word as text and breaks it into words. It replaces short forms of the words to the original words. For example, don't is replaced by do not. It also attaches some flags with each showing that it is a Proper Noun, Possessive Noun etc (2) Token Generator sub module generates tokens with features corresponding to each word supplied by Preprocessor Module. Features are used to store attributes about a word. Usually word's Part of Speech (Category), Root (Base or Root Form), and properties like number of noun, degree of adjective and person of pronoun are parts of feature structure. For English to Urdu Translation, a special purpose feature set is designed which have attributes to
3 deal with all identified issues of translation. Table 1 shows list of features and their purpose. Table 1: Feature of English Word Features Description Category Part of Speech of the word. Its permissible values are Noun, Pronoun, Determiner, Adjectives, Verb, Adverb, Preposition, Conjunction and Punctuations. SubCategory Sub Category of the above feature. For example, category Pronoun has two subcategories: Subjective and Objective. Verb has two subcategories: Auxiliary and Main Verb. Sense Semantic Category of Noun. Permissible values are Human, Animate and Inanimate. Form When used with Pronoun, it represents Person (1,2,3). With Verb, it represents Verb Form (0,1,2,3,4) and with Adjective, it represents Degree (1,2,3). Number It shows that Noun or Pronoun is either Singular or Plural. Gender Gender of word (Noun, Pronoun, Verb) in Urdu. Subject This feature is associated with Verb. It has the urdu preposition that can be used Preposition after Subject of the sentence. (For Example ne, ko) Object This feature is associated with Verb. It has the urdu preposition that can be used Preposition after Object of the sentence. (For Example ko, se) Masculine This feature is associated with Noun, Adjectives and Prepositions. It has Singular Form inflected form which will be used with Masculine Singular Form. Feminine This feature is associated with Noun, Adjectives and Prepositions. It has Singular Form inflected form which will be used with Feminine Singular Form. Masculine This feature is associated with Noun, Adjectives and Prepositions. It has Plural Form inflected form which will be used with Masculine Plural Form. Feminine This feature is associated with Noun, Adjectives and Prepositions. It has Plural Form inflected form which will be used with Feminine Plural Form. Object Count This feature is associated with verb. It has number of allowable objects after the verb in a sentence. For Intransitive Verbs, the value is 0; for verbs allowing double objects, the value is 2. Urdu Meaning The meaning of word in Urdu (In Root Form) Irregular Urdu In Transformational Module, inflected forms of Urdu words will be derived by Form rules. But for words with irregular forms, a table will be consulted whose index is present at this feature. Some features of above table are worth explanation. As discussed in 2.4, Urdu nouns and verbs have gender; it is why Gender of word in Urdu is stored as a feature. Similarly as discussed in 2.3, Urdu Adjectives, Noun Modifiers and Prepositions may have inflected forms depending upon Gender and Number of the Noun Head. But there is no well defined rule(s) that a word e.g. Adjective will have inflection form or not. For Example, Adjective hara (Green) has inflected forms, but dana (wise) does not have inflected forms. Because of this reason, inflected forms of all adjectives, prepositions and noun are stored in Singular Masculine, Singular Feminine, Plural Masculine, and Plural Feminine Form Features. Subject Preposition and Object Prepositions features are associated with verbs. In some forms of sentences (for example, Past Indefinite Tense), a preposition (usually ne) is used after Subject and a preposition (usually ko) is used after Object. For Example, "I told you" will be written in Urdu as Main(I) ne tum(you) ko bataya(told) Subject Subject Preposition Object Object Preposition Verb
4 The Urdu Meaning of verbs is stored in un-inflected form. A rule based system in Transformation Module will inflect the verbs. The Token Generator SubModule searches the word (generated by Preprocessor SubModule) in database of words and generate one or more Token corresponding to each word (Because a word e.g. "fly" can be categorized as noun and verb both). It also uses flags to detect proper nouns and search proper nouns in a separate database of Proper Nouns. Lexical Modules also group words as sentences. When it gets a sentence separator in list of generated tokens, it ends the previous sentence and start new one. 3.2 Syntactical Module The Syntactical Module parses the sentence(s) (as collection of word tokens) coming from lexical Module. The token collection to be parsed is two-dimensional. First dimension is Word Position in the sentence and second dimension is for different tokens corresponding to the same word. Context Free Grammar for English language is developed. For Parsing, different techniques are studied. Among them, Bottom Up Chart Parsing [2] is selected for implementation. One reason of this selection is that it is more efficient than other methods. But the real advantage of this technique is that it can be used to get partial parses of the sentences. It is fact that no grammar (despite how carefully it is designed) can be used to parse all English sentences. A Bottom Up Chart Parser can give partial parses. These partial parses can be translated to give some sense of sentence, if complete parse of the sentence is not possible. These Partial Parse Structures consists of sequence of non-overlapping word tokens. The algorithm is that: If no complete parse is obtained then choose Parse Structure of maximum length (no. of tokens) from all partial parses. Then recursively find parse structure with maximum length in remaining parts of the sentence (the tokens that are not in part of the maximum length parse obtained in previous step). Finally give these partial parses as output in sequence of their order in sentence. Some Productions of Grammar developed for English are: <Sentence> <PreNP><NP><VP><PostVP> <NP> <PreNoun><Noun><PostNoun><NPList> many productions of this grammar are Empty Productions. For Example. <PreNP> null <PostVP> null As empty productions are causing problems in Chart Parsing Algorithm, all Empty Productions are eliminated and new production rules are formed. For Example <Sentence> <PreNP><NP><VP><PostVP> <Sentence> <NP><VP><PostVP> <Sentence> <PreNP><NP><VP> <Sentence> <NP><VP> 3.3 Transformational Module Transformational Module takes Parse Structure as input and generates Urdu Translation of the Sentence. As discussed in Section 2 that Urdu and English are not same in Sentence or Phrase Structure. Hence Phrases are to be re-arranged in Sentence and Words are to be re-arranged in a phrase. Form of words is also to be changed depending upon other words in the phrase or other phrases. To solve these problems, the Parse Structure (Parse Tree) obtained by Syntactical Module is traversed by depth first approach. Every node has a structure attached with it which has "Collection of Urdu Translated Word Tokens" of the node and other optional attributes. These structures are named as Syntactical Tokens. When a node fills all attributes of its Syntactical Structure, it pushes it on the stack. When all children of a node are traversed, the parent node pop syntactical tokens of its children and synthesize its (parent's) syntactical token using them. This
5 method is repeated until whole tree is traversed. The sequence of Urdu Translated Words obtained at root node of the tree is output as Translation of English sentence into Urdu. This translation has correct word order and forms of Urdu. An Example of Syntactical Tokens is: Attibute Description Gender Gender of the Head Noun of the phrase Number Number of Head Noun of the phrase Pronoun Person If head is a pronoun then store its person, for proper nouns use 3. Head Indices Token Indices of Head Noun(s) (Many head nouns will be present if Noun Phrase has "and" for conjunction of nouns. Adjective Token Indices of Adjectives modifying the Head Noun. Indices Sense Semantic Category of Head Noun(s). Tokens Collection of Word Tokens having Urdu Translated Words. Above is the Syntactical Token of Noun Phrase(NP). It has Gender, Number, Pronoun Person and Sense so that its parent node (e.g. Sentence) can select proper form of Verb using these attributes. Head Indices and Adjective Indices are used for any required change, in the word form of Head Noun(s) and Adjective(s), by parent node after attribute Tokens is filled and this syntactical token was pushed on stack. For example, if a Preposition follows the Noun Phrase in an Urdu sentence then the form of Masculine Singular Adjective should be changed to Masculine Plural. But the fact, that either a preposition will be present after this Noun Phrase or not, can not easily determined at the synthesis of Noun Phrase Syntactical Token. Hence Adjective(s) are assigned the forms depending upon the Gender and Number of the Head Noun(s), and the Syntactical Token is pushed on the stack. If the parent node finds preposition after this noun, it will change the form of adjective using adjective indices. For Example, Noun Phrase for "cold water" will be thanda(cold) pani(water) But when this Noun Phrase becomes part of Prepositional Phrase "vessel of cold water", the adjective thanda changes to thanday. thanday(cold) pani(water) ka(of) bartan(vessel) Above example also shows the re-arrangement of Words in Prepositional Phrase. Components of English grammar rule: <PP> preposition <NP> are re-arranged in Urdu as <PP> <NP> preposition As discussed in Section 2.4 and 2.5, Verb Translation (Tense Translation) is not an easy task. The correct Urdu form of verb to be used in a sentence depends upon the number and gender of the subject and object and tense of English sentence. A rule based system transforms Urdu Root word (of verb) into Correct Urdu Form. For example, reads changes its form depending upon the subject in following sentences. woh(he) partha hay(reads) woh(she) parhti hay(reads) Translation of past indefinite tense having transitive verb (of Urdu) has many issues. For example, this type of sentences requires Subject Preposition and Object Preposition (discussed in Section 3.1). The sentences "She writes a letter.", "She wrote a letter" and "She wrote a story" will be translated as
6 woh(she) khat(letter) likhti hay(writes) Verb From is changed depending upon subject us(she) ne khat(letter) likha(wrote). Verb From is changed depending upon the object. Subject Preposition ne is introduced and pronoun woh is changed to us. us(she) ne kahani(story) likhi(wrote). Verb From is changed depending upon the object. Subject Preposition ne is introduced and pronoun woh is changed to us. Similarly Object Preposition can be introduced depending on the semantic of the object. 3.4 Ambiguity Resolution As discussed in 3.2, the chart parser gives all possible parses of the sentence. If no complete parse is obtained then we use an algorithm to translate on basis of partial parses. But in many cases, more than one complete parses of the sentence are generated by the Syntactical Module. The System must choose only one of these parses. For choosing the best parse, system uses some heuristics. It gives negative weights to some rules. For example, there are rules <MainVerb> MainVerb <MainVerb> AuxiliaryVerb Second rule will have negative weight because Auxiliary Verbs are rarely used as Main Verbs. Similarly there is a rule that Adjective (Pre Noun Phrase) can be used as Noun Phrase. This rule also has negative weight and only can be used when no other rule for Noun Phrase is applied. In case of multiple parses, it is rejected because that parse using it has negative weight. Hence in Ambiguity Resolution, (any) negative weights to all parses are compared and the parse with highest value (lowest number of negative weight rules used) is selected to have best translation. 4 Conclusion In this paper, structure and working of English to Urdu Translation System are discussed. Many problems of the translation are solved. The software implementation of it is successful. Most of the translated sentences have correct word order and form. The problems are mainly due to absence of grammar rule (production) for that sentence in grammar rule database. The system can be enhanced by introducing a context sensitive system for preposition translation as translation of English preposition to Urdu is context sensitive. It is planned for next phase of research. Statistical Techniques will also be introduced in next phase to make system more efficient and accurate. The system will be starting point for access of information present in English for Urdu readers of Pakistan and all over the world. References 1. Allen, James Natural Language Understanding 2. Siddiqui, Abul Lais 1971,. Jamaul Qawaid(Comprehensive Grammar) 3. Hutchins, W.John 1995 Machine Translation: A Brief History
Ling 201 Syntax 1. Jirka Hana April 10, 2006
Overview of topics What is Syntax? Word Classes What to remember and understand: Ling 201 Syntax 1 Jirka Hana April 10, 2006 Syntax, difference between syntax and semantics, open/closed class words, all
More informationLivingston Public Schools Scope and Sequence K 6 Grammar and Mechanics
Grade and Unit Timeframe Grammar Mechanics K Unit 1 6 weeks Oral grammar naming words K Unit 2 6 weeks Oral grammar Capitalization of a Name action words K Unit 3 6 weeks Oral grammar sentences Sentence
More informationParaphrasing controlled English texts
Paraphrasing controlled English texts Kaarel Kaljurand Institute of Computational Linguistics, University of Zurich kaljurand@gmail.com Abstract. We discuss paraphrasing controlled English texts, by defining
More informationCALICO Journal, Volume 9 Number 1 9
PARSING, ERROR DIAGNOSTICS AND INSTRUCTION IN A FRENCH TUTOR GILLES LABRIE AND L.P.S. SINGH Abstract: This paper describes the strategy used in Miniprof, a program designed to provide "intelligent' instruction
More information10th Grade Language. Goal ISAT% Objective Description (with content limits) Vocabulary Words
Standard 3: Writing Process 3.1: Prewrite 58-69% 10.LA.3.1.2 Generate a main idea or thesis appropriate to a type of writing. (753.02.b) Items may include a specified purpose, audience, and writing outline.
More informationEAP 1161 1660 Grammar Competencies Levels 1 6
EAP 1161 1660 Grammar Competencies Levels 1 6 Grammar Committee Representatives: Marcia Captan, Maria Fallon, Ira Fernandez, Myra Redman, Geraldine Walker Developmental Editor: Cynthia M. Schuemann Approved:
More informationIndex. 344 Grammar and Language Workbook, Grade 8
Index Index 343 Index A A, an (usage), 8, 123 A, an, the (articles), 8, 123 diagraming, 205 Abbreviations, correct use of, 18 19, 273 Abstract nouns, defined, 4, 63 Accept, except, 12, 227 Action verbs,
More informationGMAT.cz www.gmat.cz info@gmat.cz. GMAT.cz KET (Key English Test) Preparating Course Syllabus
Lesson Overview of Lesson Plan Numbers 1&2 Introduction to Cambridge KET Handing Over of GMAT.cz KET General Preparation Package Introduce Methodology for Vocabulary Log Introduce Methodology for Grammar
More informationCorrelation: ELLIS. English language Learning and Instruction System. and the TOEFL. Test Of English as a Foreign Language
Correlation: English language Learning and Instruction System and the TOEFL Test Of English as a Foreign Language Structure (Grammar) A major aspect of the ability to succeed on the TOEFL examination is
More informationLevel 1 Teacher s Manual
TABLE OF CONTENTS Lesson Study Skills Unit Page 1 STUDY SKILLS. Introduce study skills. Use a Quigley story to discuss study skills. 1 2 STUDY SKILLS. Introduce getting organized. Use a Quigley story to
More informationAlbert Pye and Ravensmere Schools Grammar Curriculum
Albert Pye and Ravensmere Schools Grammar Curriculum Introduction The aim of our schools own grammar curriculum is to ensure that all relevant grammar content is introduced within the primary years in
More informationWriting Common Core KEY WORDS
Writing Common Core KEY WORDS An educator's guide to words frequently used in the Common Core State Standards, organized by grade level in order to show the progression of writing Common Core vocabulary
More informationOutline of today s lecture
Outline of today s lecture Generative grammar Simple context free grammars Probabilistic CFGs Formalism power requirements Parsing Modelling syntactic structure of phrases and sentences. Why is it useful?
More informationNatural Language Database Interface for the Community Based Monitoring System *
Natural Language Database Interface for the Community Based Monitoring System * Krissanne Kaye Garcia, Ma. Angelica Lumain, Jose Antonio Wong, Jhovee Gerard Yap, Charibeth Cheng De La Salle University
More informationLANGUAGE! 4 th Edition, Levels A C, correlated to the South Carolina College and Career Readiness Standards, Grades 3 5
Page 1 of 57 Grade 3 Reading Literary Text Principles of Reading (P) Standard 1: Demonstrate understanding of the organization and basic features of print. Standard 2: Demonstrate understanding of spoken
More informationLearning Translation Rules from Bilingual English Filipino Corpus
Proceedings of PACLIC 19, the 19 th Asia-Pacific Conference on Language, Information and Computation. Learning Translation s from Bilingual English Filipino Corpus Michelle Wendy Tan, Raymond Joseph Ang,
More informationYear 3 Grammar Guide. For Children and Parents MARCHWOOD JUNIOR SCHOOL
MARCHWOOD JUNIOR SCHOOL Year 3 Grammar Guide For Children and Parents A guide to the key grammar skills and understanding that your child will be learning this year with examples and practice questions
More informationSyntax: Phrases. 1. The phrase
Syntax: Phrases Sentences can be divided into phrases. A phrase is a group of words forming a unit and united around a head, the most important part of the phrase. The head can be a noun NP, a verb VP,
More informationTHERE ARE SEVERAL KINDS OF PRONOUNS:
PRONOUNS WHAT IS A PRONOUN? A Pronoun is a word used in place of a noun or of more than one noun. Example: The high school graduate accepted the diploma proudly. She had worked hard for it. The pronoun
More informationEnglish Appendix 2: Vocabulary, grammar and punctuation
English Appendix 2: Vocabulary, grammar and punctuation The grammar of our first language is learnt naturally and implicitly through interactions with other speakers and from reading. Explicit knowledge
More informationSEPTEMBER Unit 1 Page Learning Goals 1 Short a 2 b 3-5 blends 6-7 c as in cat 8-11 t 12-13 p
The McRuffy Kindergarten Reading/Phonics year- long program is divided into 4 units and 180 pages of reading/phonics instruction. Pages and learning goals covered are noted below: SEPTEMBER Unit 1 1 Short
More informationEnglish. Universidad Virtual. Curso de sensibilización a la PAEP (Prueba de Admisión a Estudios de Posgrado) Parts of Speech. Nouns.
English Parts of speech Parts of Speech There are eight parts of speech. Here are some of their highlights. Nouns Pronouns Adjectives Articles Verbs Adverbs Prepositions Conjunctions Click on any of the
More informationWritten Language Curriculum Planning Manual 3LIT3390
Written Language Curriculum Planning Manual 3LIT3390 TABLE OF CONTENTS Language Usage Curriculum... 1 Language Usage I... 2 Language Usage II... 4 Language Usage III... 6 Language Usage IV... 8 Language
More informationThird Grade Language Arts Learning Targets - Common Core
Third Grade Language Arts Learning Targets - Common Core Strand Standard Statement Learning Target Reading: 1 I can ask and answer questions, using the text for support, to show my understanding. RL 1-1
More informationThis image cannot currently be displayed. Course Catalog. Language Arts 600. 2016 Glynlyon, Inc.
This image cannot currently be displayed. Course Catalog Language Arts 600 2016 Glynlyon, Inc. Table of Contents COURSE OVERVIEW... 1 UNIT 1: ELEMENTS OF GRAMMAR... 3 UNIT 2: GRAMMAR USAGE... 3 UNIT 3:
More informationGrade 4 Writing Assessment. Eligible Texas Essential Knowledge and Skills
Grade 4 Writing Assessment Eligible Texas Essential Knowledge and Skills STAAR Grade 4 Writing Assessment Reporting Category 1: Composition The student will demonstrate an ability to compose a variety
More informationMESLEKİ İNGİLİZCE I / VOCATIONAL ENGLISH I
MESLEKİ İNGİLİZCE I / VOCATIONAL ENGLISH I VOCATIONAL ENGLISH I / 2 credits 3 rd * Reviewing Basic English Grammar (word order, nouns, adjectives, pronouns, verbs, prepositions etc.) * Learning common
More informationSPANISH Kindergarten
SPANISH Kindergarten Use Junior SYMTALK workbook Recognize 80+ Vocabulary words Recognize basic greetings and courtesies. Identify colors and numbers 1-10 Develop reading skills using pictures to identify
More informationParts of Speech. Skills Team, University of Hull
Parts of Speech Skills Team, University of Hull Language comes before grammar, which is only an attempt to describe a language. Knowing the grammar of a language does not mean you can speak or write it
More informationKnowledge. Subject Knowledge Audit - Spanish Meta-linguistic challenges full some none
Knowledge Subject Knowledge Audit - Spanish Meta-linguistic challenges full some none Draw up a list of useful Proper nouns in Spanish eg la Península Ibérica, el Reino Unido, la Comunidad Europea, Don
More informationSyntactic Theory on Swedish
Syntactic Theory on Swedish Mats Uddenfeldt Pernilla Näsfors June 13, 2003 Report for Introductory course in NLP Department of Linguistics Uppsala University Sweden Abstract Using the grammar presented
More information2016-2017 Curriculum Catalog
2016-2017 Curriculum Catalog 2016 Glynlyon, Inc. Table of Contents LANGUAGE ARTS 600 COURSE OVERVIEW... 1 UNIT 1: ELEMENTS OF GRAMMAR... 3 UNIT 2: GRAMMAR USAGE... 3 UNIT 3: READING SKILLS... 4 UNIT 4:
More informationONLINE ENGLISH LANGUAGE RESOURCES
ONLINE ENGLISH LANGUAGE RESOURCES Developed and updated by C. Samuel for students taking courses at the English and French Language Centre, Faculty of Arts (Links live as at November 2, 2009) Dictionaries
More informationHow the Computer Translates. Svetlana Sokolova President and CEO of PROMT, PhD.
Svetlana Sokolova President and CEO of PROMT, PhD. How the Computer Translates Machine translation is a special field of computer application where almost everyone believes that he/she is a specialist.
More informationPupil SPAG Card 1. Terminology for pupils. I Can Date Word
Pupil SPAG Card 1 1 I know about regular plural noun endings s or es and what they mean (for example, dog, dogs; wish, wishes) 2 I know the regular endings that can be added to verbs (e.g. helping, helped,
More informationNational Quali cations SPECIMEN ONLY
N5 SQ40/N5/02 FOR OFFICIAL USE National Quali cations SPECIMEN ONLY Mark Urdu Writing Date Not applicable Duration 1 hour and 30 minutes *SQ40N502* Fill in these boxes and read what is printed below. Full
More informationGet Ready for IELTS Writing. About Get Ready for IELTS Writing. Part 1: Language development. Part 2: Skills development. Part 3: Exam practice
About Collins Get Ready for IELTS series has been designed to help learners at a pre-intermediate level (equivalent to band 3 or 4) to acquire the skills they need to achieve a higher score. It is easy
More informationEnglish Language Proficiency Standards: At A Glance February 19, 2014
English Language Proficiency Standards: At A Glance February 19, 2014 These English Language Proficiency (ELP) Standards were collaboratively developed with CCSSO, West Ed, Stanford University Understanding
More informationMODULE 15 Diagram the organizational structure of your company.
Student name: Date: MODULE 15 Diagram the organizational structure of your company. Objectives: A. Diagram the organizational chart for your place of business. B. Determine the importance of organization
More informationLesson Plan. Date(s)... M Tu W Th F
Grade...Class(es)... Lesson 12.1 Adjectives SE/TWE pp. 451 452 Objectives: To identify predicate adjectives and adjectives that precede nouns; to use adjectives correctly to describe nouns and pronouns
More informationHow To Translate English To Yoruba Language To Yoranuva
International Journal of Language and Linguistics 2015; 3(3): 154-159 Published online May 11, 2015 (http://www.sciencepublishinggroup.com/j/ijll) doi: 10.11648/j.ijll.20150303.17 ISSN: 2330-0205 (Print);
More informationArtificial Intelligence Exam DT2001 / DT2006 Ordinarie tentamen
Artificial Intelligence Exam DT2001 / DT2006 Ordinarie tentamen Date: 2010-01-11 Time: 08:15-11:15 Teacher: Mathias Broxvall Phone: 301438 Aids: Calculator and/or a Swedish-English dictionary Points: The
More informationPresented to The Federal Big Data Working Group Meetup On 07 June 2014 By Chuck Rehberg, CTO Semantic Insights a Division of Trigent Software
Semantic Research using Natural Language Processing at Scale; A continued look behind the scenes of Semantic Insights Research Assistant and Research Librarian Presented to The Federal Big Data Working
More informationParsing Technology and its role in Legacy Modernization. A Metaware White Paper
Parsing Technology and its role in Legacy Modernization A Metaware White Paper 1 INTRODUCTION In the two last decades there has been an explosion of interest in software tools that can automate key tasks
More informationSubordinating Ideas Using Phrases It All Started with Sputnik
NATIONAL MATH + SCIENCE INITIATIVE English Subordinating Ideas Using Phrases It All Started with Sputnik Grade 9-10 OBJECTIVES Students will demonstrate understanding of how different types of phrases
More informationTERMS. Parts of Speech
TERMS Parts of Speech Noun: a word that names a person, place, thing, quality, or idea (examples: Maggie, Alabama, clarinet, satisfaction, socialism). Pronoun: a word used in place of a noun (examples:
More information12 FIRST QUARTER. Class Assignments
August 7- Go over senior dates. Go over school rules. 12 FIRST QUARTER Class Assignments August 8- Overview of the course. Go over class syllabus. Handout textbooks. August 11- Part 2 Chapter 1 Parts of
More informationOnline Tutoring System For Essay Writing
Online Tutoring System For Essay Writing 2 Online Tutoring System for Essay Writing Unit 4 Infinitive Phrases Review Units 1 and 2 introduced some of the building blocks of sentences, including noun phrases
More informationFRENCH AS A SECOND LANGUAGE TRAINING
FRENCH AS A SECOND LANGUAGE TRAINING Beginner 1 This course is intended for people who have never studied French or people who have taken French in the past but have either forgotten most of it or have
More informationNo Evidence. 8.9 f X
Section I. Correlation with the 2010 English Standards of Learning and Curriculum Framework- Grade 8 Writing Summary Adequate Rating Limited No Evidence Section I. Correlation with the 2010 English Standards
More informationSYNTAX: THE ANALYSIS OF SENTENCE STRUCTURE
SYNTAX: THE ANALYSIS OF SENTENCE STRUCTURE OBJECTIVES the game is to say something new with old words RALPH WALDO EMERSON, Journals (1849) In this chapter, you will learn: how we categorize words how words
More informationLevel 4 Teacher s Manual
TABLE OF CONTENTS Lesson Unit 1 Page 1 PRETEST. Discuss student and parent letters. Motivate students to be the best they can be. (Skills tested: Patterns 1-4, 1 adjective and adverb definitions, capitalization/punctuation,
More informationParent Help Booklet. Level 3
Parent Help Booklet Level 3 If you would like additional information, please feel free to contact us. SHURLEY INSTRUCTIONAL MATERIALS, INC. 366 SIM Drive, Cabot, AR 72023 Toll Free: 800-566-2966 www.shurley.com
More informationApplication of Natural Language Interface to a Machine Translation Problem
Application of Natural Language Interface to a Machine Translation Problem Heidi M. Johnson Yukiko Sekine John S. White Martin Marietta Corporation Gil C. Kim Korean Advanced Institute of Science and Technology
More informationA Beginner s Guide To English Grammar
A Beginner s Guide To English Grammar Noncredit ESL Glendale Community College Concept by: Deborah Robiglio Created by: Edwin Fallahi, Rocio Fernandez, Glenda Gartman, Robert Mott, and Deborah Robiglio
More informationLearning the Question & Answer Flows
Learning the Question & Answer Flows These exercises are designed to help you learn how the Question and Answer Flows are constructed in the Parent Help Booklet. In the Question and Answer Flow, a series
More informationNATURAL LANGUAGE TO SQL CONVERSION SYSTEM
International Journal of Computer Science Engineering and Information Technology Research (IJCSEITR) ISSN 2249-6831 Vol. 3, Issue 2, Jun 2013, 161-166 TJPRC Pvt. Ltd. NATURAL LANGUAGE TO SQL CONVERSION
More informationAccording to the Argentine writer Jorge Luis Borges, in the Celestial Emporium of Benevolent Knowledge, animals are divided
Categories Categories According to the Argentine writer Jorge Luis Borges, in the Celestial Emporium of Benevolent Knowledge, animals are divided into 1 2 Categories those that belong to the Emperor embalmed
More informationPoints of Interference in Learning English as a Second Language
Points of Interference in Learning English as a Second Language Tone Spanish: In both English and Spanish there are four tone levels, but Spanish speaker use only the three lower pitch tones, except when
More informationOverview. Map of the Arabic Languagge. Some tips. Need to over-learn to reach a stage of unconscious competence. Recap Parts of Speech
Map of the Arabic Languagge Recap Parts of Speech Overview in Understanding the Qur an Start-up Initial growth Rapid growth Continuous growth Some tips Don t take furious notes Videos will be downloadable
More informationCOURSE OBJECTIVES SPAN 100/101 ELEMENTARY SPANISH LISTENING. SPEAKING/FUNCTIONAl KNOWLEDGE
SPAN 100/101 ELEMENTARY SPANISH COURSE OBJECTIVES This Spanish course pays equal attention to developing all four language skills (listening, speaking, reading, and writing), with a special emphasis on
More informationThis image cannot currently be displayed. Course Catalog. Language Arts 400. 2016 Glynlyon, Inc.
This image cannot currently be displayed. Course Catalog Language Arts 400 2016 Glynlyon, Inc. Table of Contents COURSE OVERVIEW... 1 UNIT 1: READING AND WRITING... 3 UNIT 2: READING FOR MEANING... 3 UNIT
More informationPREP-009 COURSE SYLLABUS FOR WRITTEN COMMUNICATIONS
Coffeyville Community College PREP-009 COURSE SYLLABUS FOR WRITTEN COMMUNICATIONS Ryan Butcher Instructor COURSE NUMBER: PREP-009 COURSE TITLE: Written Communications CREDIT HOURS: 3 INSTRUCTOR: OFFICE
More informationLESSON THIRTEEN STRUCTURAL AMBIGUITY. Structural ambiguity is also referred to as syntactic ambiguity or grammatical ambiguity.
LESSON THIRTEEN STRUCTURAL AMBIGUITY Structural ambiguity is also referred to as syntactic ambiguity or grammatical ambiguity. Structural or syntactic ambiguity, occurs when a phrase, clause or sentence
More informationStatistical Machine Translation
Statistical Machine Translation Some of the content of this lecture is taken from previous lectures and presentations given by Philipp Koehn and Andy Way. Dr. Jennifer Foster National Centre for Language
More informationExtraction of Legal Definitions from a Japanese Statutory Corpus Toward Construction of a Legal Term Ontology
Extraction of Legal Definitions from a Japanese Statutory Corpus Toward Construction of a Legal Term Ontology Makoto Nakamura, Yasuhiro Ogawa, Katsuhiko Toyama Japan Legal Information Institute, Graduate
More informationC o p yr i g ht 2015, S A S I nstitute Inc. A l l r i g hts r eser v ed. INTRODUCTION TO SAS TEXT MINER
INTRODUCTION TO SAS TEXT MINER TODAY S AGENDA INTRODUCTION TO SAS TEXT MINER Define data mining Overview of SAS Enterprise Miner Describe text analytics and define text data mining Text Mining Process
More informationTesting Data-Driven Learning Algorithms for PoS Tagging of Icelandic
Testing Data-Driven Learning Algorithms for PoS Tagging of Icelandic by Sigrún Helgadóttir Abstract This paper gives the results of an experiment concerned with training three different taggers on tagged
More informationHow To Write A Dissertation
FORMAT GUIDELINES FOR DOCTORAL DISSERTATIONS Northwestern University The Graduate School Last revised 1/23/2015 Formatting questions not addressed in this document should be directed to Student Services,
More informationFinal Exam Grammar Review. 5. Explain the difference between a proper noun and a common noun.
Final Exam Grammar Review Nouns 1. Definition of a noun: person, place, thing, or idea 2. Give four examples of nouns: 1. teacher 2. lesson 3. classroom 4. hope 3. Definition of compound noun: two nouns
More informationCompound Sentences and Coordination
Compound Sentences and Coordination Mary Westervelt Reference: Ann Hogue (2003) The Essentials of English: A Writer s Handbook. New York, Pearson Education, Inc. When two sentences are combined in a way
More informationEnglish Grammar Passive Voice and Other Items
English Grammar Passive Voice and Other Items In this unit we will finish our look at English grammar. Please be aware that you will have only covered the essential basic grammar that is commonly taught
More informationMorphology. Morphology is the study of word formation, of the structure of words. 1. some words can be divided into parts which still have meaning
Morphology Morphology is the study of word formation, of the structure of words. Some observations about words and their structure: 1. some words can be divided into parts which still have meaning 2. many
More informationTransformational Generative Grammar for Various Types of Bengali Sentences
UT tudies, Vol. 12, No. 1, 2010; P:99-105 Transformational Generative Grammar for Various Types of Bengali entences Mohammad Reza elim 1 and Muhammed Zafar Iqbal 1 1 Department of Computer cience and Engineering,
More informationUseful classroom language for Elementary students. (Fluorescent) light
Useful classroom language for Elementary students Classroom objects it is useful to know the words for Stationery Board pens (= board markers) Rubber (= eraser) Automatic pencil Lever arch file Sellotape
More informationKindergarten Common Core State Standards: English Language Arts
Kindergarten Common Core State Standards: English Language Arts Reading: Foundational Print Concepts RF.K.1. Demonstrate understanding of the organization and basic features of print. o Follow words from
More informationTowards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives
Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives Ramona Enache and Adam Slaski Department of Computer Science and Engineering Chalmers University of Technology and
More informationOpen Domain Information Extraction. Günter Neumann, DFKI, 2012
Open Domain Information Extraction Günter Neumann, DFKI, 2012 Improving TextRunner Wu and Weld (2010) Open Information Extraction using Wikipedia, ACL 2010 Fader et al. (2011) Identifying Relations for
More informationNouns may show possession or ownership. Use an apostrophe with a noun to show something belongs to someone or to something.
Nouns Section 1.4 Possessive Nouns Nouns may show possession or ownership. Use an apostrophe with a noun to show something belongs to someone or to something. Jane s dress is red. The table s legs were
More informationThe New Forest Small School
The New Forest Small School Spanish For Children Aged 11 to 16 OCR GCSE in Spanish J732 AIMS AND OBJECTIVES To provide: A meaningful and enjoyable educational experience Known and achievable but challenging
More informationFIFTH GRADE IMAGINE IT! BLUE BAND UNIT OVERVIEW UNIT 1: Heritage
UNIT 1: Heritage Unit 1: Lesson 1 Writing a list Compound words Common/ proper nouns Paragraph structure Listen to directions Verbs/verb phrases Direct/indirect objects Unit 1: Lesson 2 Prefixes tele-
More informationCohesive writing 1. Conjunction: linking words What is cohesive writing?
Cohesive writing 1. Conjunction: linking words What is cohesive writing? Cohesive writing is writing which holds together well. It is easy to follow because it uses language effectively to guide the reader.
More informationUnderstanding Clauses and How to Connect Them to Avoid Fragments, Comma Splices, and Fused Sentences A Grammar Help Handout by Abbie Potter Henry
Independent Clauses An independent clause (IC) contains at least one subject and one verb and can stand by itself as a simple sentence. Here are examples of independent clauses. Because these sentences
More informationHow To Understand A Sentence In A Syntactic Analysis
AN AUGMENTED STATE TRANSITION NETWORK ANALYSIS PROCEDURE Daniel G. Bobrow Bolt, Beranek and Newman, Inc. Cambridge, Massachusetts Bruce Eraser Language Research Foundation Cambridge, Massachusetts Summary
More informationELAGSEKRI7: With prompting and support, describe the relationship between illustrations and the text (how the illustrations support the text).
READING LITERARY (RL) Key Ideas and Details ELAGSEKRL1: With prompting and support, ask and answer questions about key details in a text. ELAGSEKRL2: With prompting and support, retell familiar stories,
More informationA Writer s Reference, Seventh Edition Diana Hacker Nancy Sommers
A Writer s Reference, Seventh Edition Diana Hacker Nancy Sommers What s new on the companion Web site? hackerhandbooks.com/writersref The companion Web site for A Writer s Reference, Seventh Edition, now
More informationIndiana Department of Education
GRADE 1 READING Guiding Principle: Students read a wide range of fiction, nonfiction, classic, and contemporary works, to build an understanding of texts, of themselves, and of the cultures of the United
More informationKINDGERGARTEN. Listen to a story for a particular reason
KINDGERGARTEN READING FOUNDATIONAL SKILLS Print Concepts Follow words from left to right in a text Follow words from top to bottom in a text Know when to turn the page in a book Show spaces between words
More informationGESE Initial steps. Guide for teachers, Grades 1 3. GESE Grade 1 Introduction
GESE Initial steps Guide for teachers, Grades 1 3 GESE Grade 1 Introduction cover photos: left and right Martin Dalton, middle Speak! Learning Centre Contents Contents What is Trinity College London?...3
More informationSymbiosis of Evolutionary Techniques and Statistical Natural Language Processing
1 Symbiosis of Evolutionary Techniques and Statistical Natural Language Processing Lourdes Araujo Dpto. Sistemas Informáticos y Programación, Univ. Complutense, Madrid 28040, SPAIN (email: lurdes@sip.ucm.es)
More informationFrench Curriculum Grades 4-8
French Curriculum Grades 4-8 French Grade Four: 1. Students will be introduced, recognize, and recite the French alphabet. 2. Students will recognize, recite and respond to simple French greetings. 3.
More informationStudent Guide for Usage of Criterion
Student Guide for Usage of Criterion Criterion is an Online Writing Evaluation service offered by ETS. It is a computer-based scoring program designed to help you think about your writing process and communicate
More informationSyntactic and Semantic Differences between Nominal Relative Clauses and Dependent wh-interrogative Clauses
Theory and Practice in English Studies 3 (2005): Proceedings from the Eighth Conference of British, American and Canadian Studies. Brno: Masarykova univerzita Syntactic and Semantic Differences between
More informationPOS Tagsets and POS Tagging. Definition. Tokenization. Tagset Design. Automatic POS Tagging Bigram tagging. Maximum Likelihood Estimation 1 / 23
POS Def. Part of Speech POS POS L645 POS = Assigning word class information to words Dept. of Linguistics, Indiana University Fall 2009 ex: the man bought a book determiner noun verb determiner noun 1
More informationBuilding a Question Classifier for a TREC-Style Question Answering System
Building a Question Classifier for a TREC-Style Question Answering System Richard May & Ari Steinberg Topic: Question Classification We define Question Classification (QC) here to be the task that, given
More informationEnglish Grammar Checker
International l Journal of Computer Sciences and Engineering Open Access Review Paper Volume-4, Issue-3 E-ISSN: 2347-2693 English Grammar Checker Pratik Ghosalkar 1*, Sarvesh Malagi 2, Vatsal Nagda 3,
More informationRhode Island College
Rhode Island College M.Ed. In TESL Program Language Group Specific Informational Reports Produced by Graduate Students in the M.Ed. In TESL Program In the Feinstein School of Education and Human Development
More informationQuality Assurance at NEMT, Inc.
Quality Assurance at NEMT, Inc. Quality Assurance Policy NEMT prides itself on the excellence of quality within every level of the company. We strongly believe in the benefits of continued education and
More informationTeacher training worksheets- Classroom language Pictionary miming definitions game Worksheet 1- General school vocab version
Teacher training worksheets- Classroom language Pictionary miming definitions game Worksheet 1- General school vocab version Whiteboard Work in pairs Desk Board pen Permanent marker Felt tip pen Colouring
More informationHoughton Mifflin Harcourt StoryTown Grade 1. correlated to the. Common Core State Standards Initiative English Language Arts (2010) Grade 1
Houghton Mifflin Harcourt StoryTown Grade 1 correlated to the Common Core State Standards Initiative English Language Arts (2010) Grade 1 Reading: Literature Key Ideas and details RL.1.1 Ask and answer
More information