Discovery ANALYSIS. Page46. Publication History Received: 23 August 2015 Accepted: 20 September 2015 Published: 12 October 2015

Size: px
Start display at page:

Download "Discovery ANALYSIS. Page46. Publication History Received: 23 August 2015 Accepted: 20 September 2015 Published: 12 October 2015"

Transcription

1 Discovery ANALYSIS The International Daily journal ISSN EISSN Discovery Publication. All Rights Reserved Summarization and Hybrid Machine Translation System for English to Marathi: A Research Effort in Information Retriveal System (H-Machine Translation) Publication History Received: 23 August 2015 Accepted: 20 September 2015 Published: 12 October 2015 Citation Pramod Salunkhe, Mrunal Bewoor, Suhas Patil, Shashank Joshi, Aniket Kadam. Summarization and Hybrid Machine Translation System for English to Marathi: A Research Effort in Information Retriveal System (H-Machine Translation). Discovery, 2015, 43(197), Page46

2 Summarization and Hybrid Machine Translation System for English to Marathi: A Research Effort in Information Retriveal System (H-Machine Translation) Pramod Salunkhe Mrunal Bewoor Suhas Patil Shashank Joshi Aniket Kadam BVDUCOEP BVDUCOEP BVDUCOEP BVDUCOEP PDEA COEM Pune,India Pune,India Pune,India Pune,India Pune,India [email protected] [email protected] [email protected] [email protected] [email protected] Abstract Research issue in information retrieval is retrieving accurate subjective vital information present in one language to user costumed language. Language has been a barrier in information retrieval. The main objective of research effort is to eliminate this gap and present vital foreseen or unseen information to client. A cross lingual machine translation is proposed with summarization as translation as core parts of software product. A person collecting information on a topic may come across text in different languages and would always need summarized and translated information. The proposed system is twin modular PART A summarizes a document extracting important text and Modular PART B Translates it to required language. As Marathi been my local language this system does summarization and then English Assertive and interrogative translation to Marathi. The core methodology in accurate translation is mapping of rules generated from OPEN-NLP package to handcrafted Marathi rules. A parser with lexical analysis methodology search in English Lexicon dataset, if instituted morphological in rank potted. Proposed system is research work in open NLP rule based implementation with translation of source to target language in with summarized text. Language syntactical structure variation and human s natural language. Literature examination reveals that every translation mechanism has some pros and cones and no single technique is better so a hybrid approach is been proposed in translation system. This research is an extended work and ongoing system generated results are been presented Keywords Hybrid Machine Translation,Cross-language Translation,Machine Translation,Open-NLP,Rule-based language spoken and used by people more Than 0.8 billion individuals has been derived from Sanskrit. Word in order is major problem in translation of spring language to objective language. Marathi is mostly spoken language in State of Maharashtra.The structure of language is twin documented from left side to right end, from top end to Bottom end of document. Marathi terms are derived from Sanskrit Nava derived from Navin,month in English Maas derived from Machine. Individuals from different culture and language base are not able to easy communicate where a translation system would facilitate to complete the gap. This research work is directed to first summarize than translate which is useful to Marathi scholar in study of some research work of English writer. In any research work there are numerous issues and problems to address this research work is formulating building hybrid Machine Translation system T= [context {assertive sentence, interrogative sentences}] with summarization to illuminate irrelevant paragraph from document. As research work first script is progressing this is first research article addressing literature Examination and small introduction to proposed work. II. RELATED WORK AND LITERATURE Part A of system is summarization module and part B is translation Many Machine translation(mt) scheme athwart the sphere cover previously are constructed for mainly regularly worn natural tongues that English, Hindi,Chinese, Japanese, Russian etc many more Indian mother tongue languages fig.1 portrays the presented appliance translation schemes and assorted advance used in constructing those schemes. I. INTRODUCTION Machine enabled transformation is core research in Natural Language (NL) for removing language as obstacle in communication and information access with help of bi-lingual machine translation. Research work in Machine translation has been done from English to Hindi, English to Urdu to another language like telgu many native languages and foreign languages like Arabic, Chinese and Spanish. The research problem to address is to community of Marathi language, Fig1. Translation scheme [3] Page47

3 1. Literature Survey on Direct Machine Translation Scheme A) Amongst Indian regional Languages: Anusaaraka scheme Rajeev Sangal at IIT Kanpur started Anusaaraka the aim of project is work in cross lingual translation among Indian languages with spring languages as Marathi Punjabi Bengali Telugu Kannada with translation aimed language Hindi [2]. Section of Indian government for Indian languages technology development (TDIL) department of I.T and in collaboration with satyam I.T firm the scheme is elaborated not for particular domain but translating small children stories.information conservation is main purpose of scheme. Scheme is dependent on spring language grammar structure with output in Hindi language but not perfect grammatically accurate. In for considering Kannada 80% terms in dictionary produced under scheme have 30,000 core terms match to single Hindi term to identify Kannada terms. Scheme manly focuses on inter language access not accuracy Panini and Grammar is been implemented in scheme. B)Punjabi Language to Hindi Translation Scheme G.S.Lehal & G.S. Josan [1] and have urbanized a scheme that is stand on straight term-to-term translation scheme. This scheme encompass of components such as pre-dispensation, term-to-term conversion by means of Punjabi and Hindi word list morphological breakdown, word intellect disambiguation, transliteration and station dispensation. Correctness of version bent by this scheme is 90.67%. Term fault tempo is 2.33% and SER is 24.28%. B) English to Hindi Translation Scheme Patil N and et al [8] urbanized a scheme bottom on relocate foot conversion loom, which employ dissimilar grammatical regulations of spring and aim tongues and a bilingual lexicon for conversion. The translation unit consists of predispensation, English hierarchy initiator, station dispensation of English graph, creation of Hindi graph, Post-dispensation of Hindi graph and produce yield the field of the scheme was climate telling. 3) Machine Translation Scheme: InterTounge A) ANGLABHARTI R M K Sinha, Jain R, Jain A [3][4] urbanized a mechanism support translation scheme intended for converting English to Indian tongues. It is urbanized by means of pretendintertongues loom. The inter-tongue loom complete it probable to employ similar scheme for transform English to additional than solitary Indian tongue and have get rid of require of rising divide translation scheme for English to every Indian Tongue. Investigation of English as spring language is completed only formerly and it fashion midway makeup PLIL (Pseudo Lingua in to Indian tongues). TPLIL is therefore rehabilitated to every Indian Tongues throughout a procedure of text-creation. The effort for PLIL cohort is 70% and transcript generation 30%. Merely through an added 30% effort, novel English to Indian language conversion scheme was built. The effort has be complete whereby do 90% translation chore and outstanding 10% is absent for person post-suppression The field of this machine translation Scheme been in public fitness. C)Hindi-Punjabi conversion scheme on mesh (www) Lehal G S with colleagues [6] urbanized the comprehensive description of Hindi-Punjabi machine conversion scheme on mesh.the scheme has numerous amenities like website conversion, mailbox conversion etc. 2) Transfer-Support Machine Conversion Schemes A) MANTRA (2000) &Mantra MT (1997) Mantra [5] is English to Hindi conversion scheme constructed by Bharati in data conservation scheme. The wording obtainable in solitary Indian tongue is prepared reachable in an additional Indian tongue amid assist of this Scheme. It utilizes XTAG stand wonderful tagger and beam craving examiner urbanized at Pennsylvania for working the examination of effort English copy. It allocates weight on guy and mechanism in fresh techniques. The schemes manufacture numerous yields matching to specified effort. Output support on mainly thorough Examination of English effort passage, incorporates a complete parser and bilingual vocabulary. Parsing scheme is stand on XTAG (Bandyopadhyas-excellent tagger with parser) with slight alteration for job at tender. A client might read yield shaped once full examination, but whilst he determine that scheme have noticeably vanished mistaken or unsuccessful to manufacture the production, he can forever lever to easy output. B) AnglaHindi (2003) AnglaHindi is a copied of AnglaBharti scheme urbanized by R M K Sinha et all for Indian Tongues, that is interlingual regulation-based English to Hindi mechanism- aid conversion Scheme It employ every component of AnglaBharti[4] and in addition employ abstracted instance-foot for translating regularly encounter act noun expression and verb expressions. The correctness of scheme translation is 90%. 4) Hybrid Machine Translation Scheme Anubharti is urbanized using a hybridized [9][8] instance-base mechanism translation loom i.e. a mixture of instance-foot, mass-foot loom and various straightforward grammatical examination. Instance-bottom loom track human-education method for hoard data from past practices and to be worn in future. Anubharti, the customary EBMT (Gupta et all 2003) loom has be customized to reduce the condition of a big instance-bottom the alteration in customary EBMT is accomplish by oversimplify the ingredient and substitute them with inattentive shape from underdone instance. The concept is achieve by recognize syntactic collections. Corresponding of the contribution verdict with inattentive instances is complete stand on syntactic group and semantic labels of spring tongue arrangement. The planning s of together AnglaBharti and AnuBharti, encompass undergone a Page48

4 substantial modify starting their preliminary conceptual design these schemes were forename as AnglaBharti-II and AnuBharti-II in that order. AnglaBharti-II uses a widespread instance-bottom for hybridization in addition a underdone instance-base and the AnuBharti-II creates employ of Hindi as spring tongue for translation to any additional language. The overview of instance-base is reliant ahead the goal language. III. MOTIVATION The motivation of research comes from practical issues faced by person carrying out literature study or collecting information on particular topic person. Language is factor in information access gap.in order to overcome this gap in retrieving appropriate information need s a complete product solution in cross lingual information retrieval.summarization and translation are core modules that facilitate information presentation in each user prefer language. Marathi being a local language with Divergence in pronunciation and very little work is accrued my research work takes motivation from it. is been developed. The OPENNLP package consists of Sentence detect ( ) Tokenization ( ) Parser ( ) Chunk ( ) as in built functions. Dataset of Marathi rules is been developed in mapping to English rules generated by OPENNLP Package. This proposed Six Phase architecture is core design that facilitates accurate information presentation from diverse language to Marathi language.the system is web deployed and web based which extract information from URL s and web pages and various online content.the output os summarization is input to translation module the translated output is cross lingual information retrieved. Architecture of Cross-lingual information retrieval system The architecture of system consists of six phases PHASE1 is summarization module which takes in pdf,doc,txt and web pages information from web. PHASAE2: consists of core summarization module which summarizes based on key word and percentage or centroid based method as user selects PHASE3: This module updates dataset with word and their meaning in Marathi with rules mapping from one language to other. PHASE 4: local and web dataset built information repository which stores data for faster access. PHASE5: this is core Translation module where a selector selects the translation scheme based on user input context example or rule based or in all a combination of all. PHASE 6: WORDNET implementation. Fig1: research Motivation IV. PROPOSED SYSTEM:CORE METHODOLOGY The core module in accurate information presentation is translation hence core methodology employ s mapping one to one rule in English to appropriate rule in Marathi language which are handcrafted in study details of structure and parsing study of OPEN NLP.Summarization methodology is simple and centroid based for topical or subject summarization [11][12] if in research implement context in translation Core methodologyy in part B of system is as shown in fig2. Core Technique is lexical analyzer to detect morphological structure which is matched to English lexicon and then applied to English grammar rule further mapping of English word is been done with Marathi a set of rules are been written to generate exact Marathi sentence translated.the research writer in Marathi studding the English document would be presented with impomation in Marathi and can further carry his work without any information gap. The inbuilt OPENNLP packages are been used to programme the system. A dictionary set is been generated to store in proper nouns pronouns and verbs adverbs.a healthy dataset consists of words terms and phrases Fig 2.Architecture of cross- lingual information retrieval system Page49

5 The core methodology of execution is presented as following in ascending order as: ALGORTHMIC PROCEDURE: SIX PHASE 1. INPUT FROM SUMMARIZATION ( KEYWORD BASED, % BASED AND CENTROID BASED) 2. PROCESS OF TRANSLATION implementation makes the system fully automated in translation.the system is in progress and this are preliminary results. {ADDING PRODUCTION RULES, TOKENIZATION, POS TAGGING, CHUNKING, PARSING ( OPEN NLP FUNCTION) 3. SEARCH THE TOKEN MAPPING ONE TO ONE OF RULE IN ENGLISH TO MARATHI MATCHING RULE 4. SEARCH RULE FROM DATABASE (Not found) 5. CREATE NEW RULE IN WITH WRITING RULES OF OPENNLP (UPDATE DATABASE) 6. IMPLEMENTATION OF WORDNET FOR ACCURACY A. MODULE A SUMMARIZATION 1) Creation of training dataset for Marathi words alphabets. 2) Centroid based methodology is selected for Marathi document summarization 3) Keyword based and percentage summarization applied 4) Generalized summarizations to context based summarize. B. MODULE B H-SCHEME 1) Rule based translation: rule R1 R2 R3. 2) Context based translation. 3) Example based tarnslation 3) Combination of all :Hybrid Translation. V. PRELIMINARY IMPLEMENTATION AND REULTS Base modular design has been implemented with PART A Summarizing doc, pdf docx, text files along with web pages as inputs. Part A summarizes keyword based and percentage based which is been further extended to Centroid based extractive summarization. Extractive summary of PART A is input to PART B current state of work this is rule based translation system which checks for mapping of rules from English to Marathi equivalent rules from dataset. If rule not found it is been constructed and added to dataset. The architecture of Information retrieval system is six phase which facilitates cross lingual information retrieval A selector selects rule based,context based,or example based translation mechanism or in all a hybrid mechanism.wordnet \ Page50

6 {F1} Relevant Sentences but not retrieved =08 Recall=8/ Comparative study for Motivation We consider Google translator to compare the system for given input paragraph consisting of 4 sentences and check the system performance in.the comparison is instance based and solely for motivation purpose we do not wish to compare system with Google as Google has large development system and its not correct to evaluate any other system with it. Writing rule for Marathi equivalent following tabular chart is been referred in writing mapped rules.the rule set on left side correspond to English language while rules on right side correspond to Marathi equivalent rules. This research work is in progress and hence the results presented are preliminary only. Performance Parameters for evaluation of system are been Precision and Recall at current work under implementation we only incorporate this Two comparison Parameters. A data set of 10 documents and 50 sentences have been tested for evaluating system. Precision=correctly retrieved information/total dataset VI. RESEARCH STUDY OUTCOMES AND EXPECTED RESEARCH OUTCOMES:IN PROCESS EXPERIMENTAL RESULTS Information extraction that bears quality has language barriers due to which user remain unknown of good quality information or literature of person which is barrier even for research in history domain or sociology and hence a system is required that supports such research person in summarizing and translating information. This research would build in software system that facilitates research scholars. A software system capable to summarize and translate documents web based application comparable in terms of certain parameters to Google translator. Assertive sentence affirmative sentence generalized translation system to hybrids system Recall= total retrieved inform from dataset/total dataset information. Precision= {TE} /TE+FP Relevant Sentences {TE} =08; Retrieved Sentences {TE+FP} =10 then Precision=8/ Recall= {TP} / {TP}+{F1} Acknowledgment It has been Immense Guidance and direction showing of Prof.M.S.Bewoor that has given me motivation in research work and Dr.D.M.Thakore sir guidance is valuable to us. Guidance of Dr.S.H patil has been valuable in writing this research article.the core Methodology Technique and architecture of Information Retriveal system was formulated Page51

7 by Aniket Kadam. I Express great Thanks to him. The research article writing ppt of Shashank Joshi was valuable in completing research arcticle. I epress thank to him also.at end a thanks to all authors whose articles are been cited in paper. References [1] Bonnie J. Dorr, Pamela W. Jordan, John W. Benoit,A Survey of Current Paradigms in Machine Translation, LAMP TR-027, Dec [2] [3] [4] TRANSLATION OF TELUGU-MARATHI AND VICE-VERSA USING RULE BASED MACHINE TRANSLATION, Dr. Siddhartha Ghosh1, Sujata Thamke2 and Kalyani U.R.S,International journal of information retrieval [5] Subjective and Objective Evaluation of English to Urdu Machine Translation, Vaishali Gupta, Nisheeth Joshi, Iti Mathur, IEEE-ACM [6] Identifying Devnagri characters Karnik, R.R. IEEE Paper [7] Min Zang, Hongfei Jiang, 2008, Grammar comparison study for Translation Equivalence Modeling and Statistical Machine Translation. In the Proceeding of the 22nd International Conference of Computational Linguistics pages [8] Sangal, Rajeev,Dipti Misra Sharma, Lakshmi Bai, Karunesh Arora, Developing Indian languages corpora: Standards and practice, November. [9] Abhijeet R. Joshi, M. Sasikumar, Constructive approach toteachinflectionnmarathilanguage, te/.../pdf.../catiml1.pdf [10] Mining text data C.C Agarwal CX Zhai books.google.com [digital vesion] [11] A survey of text summarization techniques A Nenkova k MCkeown springer 2012 [12] Multi-document multilingual summarization corpus preparation, Arabic, English, Greek, Chinese, Romanian by Lei Li Corina Forascu Proceedings of the MultiLing 2013 Workshop on Multilingual Multi-document Summarization [13] [online ]. [14] Page52

TRANSLATION OF TELUGU-MARATHI AND VICE- VERSA USING RULE BASED MACHINE TRANSLATION

TRANSLATION OF TELUGU-MARATHI AND VICE- VERSA USING RULE BASED MACHINE TRANSLATION TRANSLATION OF TELUGU-MARATHI AND VICE- VERSA USING RULE BASED MACHINE TRANSLATION Dr. Siddhartha Ghosh 1, Sujata Thamke 2 and Kalyani U.R.S 3 1 Head of the Department of Computer Science & Engineering,

More information

Architecture of an Ontology-Based Domain- Specific Natural Language Question Answering System

Architecture of an Ontology-Based Domain- Specific Natural Language Question Answering System Architecture of an Ontology-Based Domain- Specific Natural Language Question Answering System Athira P. M., Sreeja M. and P. C. Reghuraj Department of Computer Science and Engineering, Government Engineering

More information

BILINGUAL TRANSLATION SYSTEM

BILINGUAL TRANSLATION SYSTEM BILINGUAL TRANSLATION SYSTEM (FOR ENGLISH AND TAMIL) Dr. S. Saraswathi Associate Professor M. Anusiya P. Kanivadhana S. Sathiya Abstract--- The project aims in developing Bilingual Translation System for

More information

Hybrid Machine Translation For English to Marathi: A Research Evaluation In Machine Translation

Hybrid Machine Translation For English to Marathi: A Research Evaluation In Machine Translation International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT) - 2016 Hybrid Machine Translation For English to Marathi: A Research Evaluation In Machine Translation Pramod Salunkhe

More information

Identifying Focus, Techniques and Domain of Scientific Papers

Identifying Focus, Techniques and Domain of Scientific Papers Identifying Focus, Techniques and Domain of Scientific Papers Sonal Gupta Department of Computer Science Stanford University Stanford, CA 94305 [email protected] Christopher D. Manning Department of

More information

Machine Translation Approaches and Survey for Indian Languages

Machine Translation Approaches and Survey for Indian Languages Computational Linguistics and Chinese Language Processing Vol. 18, No. 1, March 2013, pp. 47-78 47 The Association for Computational Linguistics and Chinese Language Processing Machine Translation Approaches

More information

Modern foreign languages

Modern foreign languages Modern foreign languages Programme of study for key stage 3 and attainment targets (This is an extract from The National Curriculum 2007) Crown copyright 2007 Qualifications and Curriculum Authority 2007

More information

Semantic annotation of requirements for automatic UML class diagram generation

Semantic annotation of requirements for automatic UML class diagram generation www.ijcsi.org 259 Semantic annotation of requirements for automatic UML class diagram generation Soumaya Amdouni 1, Wahiba Ben Abdessalem Karaa 2 and Sondes Bouabid 3 1 University of tunis High Institute

More information

Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words

Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words , pp.290-295 http://dx.doi.org/10.14257/astl.2015.111.55 Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words Irfan

More information

Comprendium Translator System Overview

Comprendium Translator System Overview Comprendium System Overview May 2004 Table of Contents 1. INTRODUCTION...3 2. WHAT IS MACHINE TRANSLATION?...3 3. THE COMPRENDIUM MACHINE TRANSLATION TECHNOLOGY...4 3.1 THE BEST MT TECHNOLOGY IN THE MARKET...4

More information

Learning Translation Rules from Bilingual English Filipino Corpus

Learning Translation Rules from Bilingual English Filipino Corpus Proceedings of PACLIC 19, the 19 th Asia-Pacific Conference on Language, Information and Computation. Learning Translation s from Bilingual English Filipino Corpus Michelle Wendy Tan, Raymond Joseph Ang,

More information

Natural Language to Relational Query by Using Parsing Compiler

Natural Language to Relational Query by Using Parsing Compiler Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 3, March 2015,

More information

International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 ISSN 2229-5518

International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 ISSN 2229-5518 International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 INTELLIGENT MULTIDIMENSIONAL DATABASE INTERFACE Mona Gharib Mohamed Reda Zahraa E. Mohamed Faculty of Science,

More information

Derby Translations. Translation for all languages. Active Knowledge. Team of Experts. Quality Is Our Priority. Competitive Prices.

Derby Translations. Translation for all languages. Active Knowledge. Team of Experts. Quality Is Our Priority. Competitive Prices. Active Knowledge Team of Experts Quality Is Our Priority Competitive Prices 100% Performance Fast Delivery Derby Translations is one of the UAE leading translation and language services providers, has

More information

CINDOR Conceptual Interlingua Document Retrieval: TREC-8 Evaluation.

CINDOR Conceptual Interlingua Document Retrieval: TREC-8 Evaluation. CINDOR Conceptual Interlingua Document Retrieval: TREC-8 Evaluation. Miguel Ruiz, Anne Diekema, Páraic Sheridan MNIS-TextWise Labs Dey Centennial Plaza 401 South Salina Street Syracuse, NY 13202 Abstract:

More information

Tibetan-Chinese Bilingual Sentences Alignment Method based on Multiple Features

Tibetan-Chinese Bilingual Sentences Alignment Method based on Multiple Features , pp.273-280 http://dx.doi.org/10.14257/ijdta.2015.8.4.27 Tibetan-Chinese Bilingual Sentences Alignment Method based on Multiple Features Lirong Qiu School of Information Engineering, MinzuUniversity of

More information

An Efficient Database Design for IndoWordNet Development Using Hybrid Approach

An Efficient Database Design for IndoWordNet Development Using Hybrid Approach An Efficient Database Design for IndoWordNet Development Using Hybrid Approach Venkatesh P rabhu 2 Shilpa Desai 1 Hanumant Redkar 1 N eha P rabhugaonkar 1 Apur va N agvenkar 1 Ramdas Karmali 1 (1) GOA

More information

Customizing an English-Korean Machine Translation System for Patent Translation *

Customizing an English-Korean Machine Translation System for Patent Translation * Customizing an English-Korean Machine Translation System for Patent Translation * Sung-Kwon Choi, Young-Gil Kim Natural Language Processing Team, Electronics and Telecommunications Research Institute,

More information

The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge

The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge The SYSTRAN Linguistics Platform: A Software Solution to Manage Multilingual Corporate Knowledge White Paper October 2002 I. Translation and Localization New Challenges Businesses are beginning to encounter

More information

Sentiment analysis on news articles using Natural Language Processing and Machine Learning Approach.

Sentiment analysis on news articles using Natural Language Processing and Machine Learning Approach. Sentiment analysis on news articles using Natural Language Processing and Machine Learning Approach. Pranali Chilekar 1, Swati Ubale 2, Pragati Sonkambale 3, Reema Panarkar 4, Gopal Upadhye 5 1 2 3 4 5

More information

Open Domain Information Extraction. Günter Neumann, DFKI, 2012

Open Domain Information Extraction. Günter Neumann, DFKI, 2012 Open Domain Information Extraction Günter Neumann, DFKI, 2012 Improving TextRunner Wu and Weld (2010) Open Information Extraction using Wikipedia, ACL 2010 Fader et al. (2011) Identifying Relations for

More information

A GrAF-compliant Indonesian Speech Recognition Web Service on the Language Grid for Transcription Crowdsourcing

A GrAF-compliant Indonesian Speech Recognition Web Service on the Language Grid for Transcription Crowdsourcing A GrAF-compliant Indonesian Speech Recognition Web Service on the Language Grid for Transcription Crowdsourcing LAW VI JEJU 2012 Bayu Distiawan Trisedya & Ruli Manurung Faculty of Computer Science Universitas

More information

An Overview of Applied Linguistics

An Overview of Applied Linguistics An Overview of Applied Linguistics Edited by: Norbert Schmitt Abeer Alharbi What is Linguistics? It is a scientific study of a language It s goal is To describe the varieties of languages and explain the

More information

PROMT Technologies for Translation and Big Data

PROMT Technologies for Translation and Big Data PROMT Technologies for Translation and Big Data Overview and Use Cases Julia Epiphantseva PROMT About PROMT EXPIRIENCED Founded in 1991. One of the world leading machine translation provider DIVERSIFIED

More information

NATURAL LANGUAGE QUERY PROCESSING USING PROBABILISTIC CONTEXT FREE GRAMMAR

NATURAL LANGUAGE QUERY PROCESSING USING PROBABILISTIC CONTEXT FREE GRAMMAR NATURAL LANGUAGE QUERY PROCESSING USING PROBABILISTIC CONTEXT FREE GRAMMAR Arati K. Deshpande 1 and Prakash. R. Devale 2 1 Student and 2 Professor & Head, Department of Information Technology, Bharati

More information

Improving statistical POS tagging using Linguistic feature for Hindi and Telugu

Improving statistical POS tagging using Linguistic feature for Hindi and Telugu Improving statistical POS tagging using Linguistic feature for Hindi and Telugu by Phani Gadde, Meher Vijay Yeleti in ICON-2008: International Conference on Natural Language Processing (ICON-2008) Report

More information

Latin Syllabus S2 - S7

Latin Syllabus S2 - S7 European Schools Office of the Secretary-General Pedagogical Development Unit Ref.: 2014-01-D-35-en-2 Orig.: FR Latin Syllabus S2 - S7 APPROVED BY THE JOINT TEACHING COMMITTEE ON 13 AND 14 FEBRUARY 2014

More information

Natural Language Database Interface for the Community Based Monitoring System *

Natural Language Database Interface for the Community Based Monitoring System * Natural Language Database Interface for the Community Based Monitoring System * Krissanne Kaye Garcia, Ma. Angelica Lumain, Jose Antonio Wong, Jhovee Gerard Yap, Charibeth Cheng De La Salle University

More information

Collecting Polish German Parallel Corpora in the Internet

Collecting Polish German Parallel Corpora in the Internet Proceedings of the International Multiconference on ISSN 1896 7094 Computer Science and Information Technology, pp. 285 292 2007 PIPS Collecting Polish German Parallel Corpora in the Internet Monika Rosińska

More information

Cross-Lingual Concern Analysis from Multilingual Weblog Articles

Cross-Lingual Concern Analysis from Multilingual Weblog Articles Cross-Lingual Concern Analysis from Multilingual Weblog Articles Tomohiro Fukuhara RACE (Research into Artifacts), The University of Tokyo 5-1-5 Kashiwanoha, Kashiwa, Chiba JAPAN http://www.race.u-tokyo.ac.jp/~fukuhara/

More information

Extraction of Legal Definitions from a Japanese Statutory Corpus Toward Construction of a Legal Term Ontology

Extraction of Legal Definitions from a Japanese Statutory Corpus Toward Construction of a Legal Term Ontology Extraction of Legal Definitions from a Japanese Statutory Corpus Toward Construction of a Legal Term Ontology Makoto Nakamura, Yasuhiro Ogawa, Katsuhiko Toyama Japan Legal Information Institute, Graduate

More information

An Iterative approach to extract dictionaries from Wikipedia for under-resourced languages

An Iterative approach to extract dictionaries from Wikipedia for under-resourced languages An Iterative approach to extract dictionaries from Wikipedia for under-resourced languages Rohit Bharadwaj G SIEL, LTRC IIIT Hyd [email protected] Niket Tandon Databases and Information Systems

More information

ONLINE RESUME PARSING SYSTEM USING TEXT ANALYTICS

ONLINE RESUME PARSING SYSTEM USING TEXT ANALYTICS ONLINE RESUME PARSING SYSTEM USING TEXT ANALYTICS Divyanshu Chandola 1, Aditya Garg 2, Ankit Maurya 3, Amit Kushwaha 4 1 Student, Department of Information Technology, ABES Engineering College, Uttar Pradesh,

More information

Translution Price List GBP

Translution Price List GBP Translution Price List GBP TABLE OF CONTENTS Services AD HOC MACHINE TRANSLATION... LIGHT POST EDITED TRANSLATION... PROFESSIONAL TRANSLATION... 3 TRANSLATE, EDIT, REVIEW TRANSLATION (TWICE TRANSLATED)...3

More information

Parsing Technology and its role in Legacy Modernization. A Metaware White Paper

Parsing Technology and its role in Legacy Modernization. A Metaware White Paper Parsing Technology and its role in Legacy Modernization A Metaware White Paper 1 INTRODUCTION In the two last decades there has been an explosion of interest in software tools that can automate key tasks

More information

Computer Assisted Language Learning (CALL): Room for CompLing? Scott, Stella, Stacia

Computer Assisted Language Learning (CALL): Room for CompLing? Scott, Stella, Stacia Computer Assisted Language Learning (CALL): Room for CompLing? Scott, Stella, Stacia Outline I What is CALL? (scott) II Popular language learning sites (stella) Livemocha.com (stacia) III IV Specific sites

More information

31 Case Studies: Java Natural Language Tools Available on the Web

31 Case Studies: Java Natural Language Tools Available on the Web 31 Case Studies: Java Natural Language Tools Available on the Web Chapter Objectives Chapter Contents This chapter provides a number of sources for open source and free atural language understanding software

More information

Multi language e Discovery Three Critical Steps for Litigating in a Global Economy

Multi language e Discovery Three Critical Steps for Litigating in a Global Economy Multi language e Discovery Three Critical Steps for Litigating in a Global Economy 2 3 5 6 7 Introduction e Discovery has become a pressure point in many boardrooms. Companies with international operations

More information

Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast

Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast Hassan Sawaf Science Applications International Corporation (SAIC) 7990

More information

Building a Question Classifier for a TREC-Style Question Answering System

Building a Question Classifier for a TREC-Style Question Answering System Building a Question Classifier for a TREC-Style Question Answering System Richard May & Ari Steinberg Topic: Question Classification We define Question Classification (QC) here to be the task that, given

More information

A Survey of Online Tools Used in English-Thai and Thai-English Translation by Thai Students

A Survey of Online Tools Used in English-Thai and Thai-English Translation by Thai Students 69 A Survey of Online Tools Used in English-Thai and Thai-English Translation by Thai Students Sarathorn Munpru, Srinakharinwirot University, Thailand Pornpol Wuttikrikunlaya, Srinakharinwirot University,

More information

Knowledge Discovery using Text Mining: A Programmable Implementation on Information Extraction and Categorization

Knowledge Discovery using Text Mining: A Programmable Implementation on Information Extraction and Categorization Knowledge Discovery using Text Mining: A Programmable Implementation on Information Extraction and Categorization Atika Mustafa, Ali Akbar, and Ahmer Sultan National University of Computer and Emerging

More information

Create A Language. 2015 Global Educator Award Winner: sharon mcadam. lo b a l e d u cato r awa r d w i n n e r. Global Connections 1: Global Society

Create A Language. 2015 Global Educator Award Winner: sharon mcadam. lo b a l e d u cato r awa r d w i n n e r. Global Connections 1: Global Society g lo b a l e d u cato r awa r d w i n n e r Global Connections 1: Global Society 2015 Unit 1: My Culture, My Family Create A Language 2015 Global Educator Award Winner: sharon mcadam PROFESSIONAL DEVELOPMENT

More information

Bisecting K-Means for Clustering Web Log data

Bisecting K-Means for Clustering Web Log data Bisecting K-Means for Clustering Web Log data Ruchika R. Patil Department of Computer Technology YCCE Nagpur, India Amreen Khan Department of Computer Technology YCCE Nagpur, India ABSTRACT Web usage mining

More information

How To Translate English To Yoruba Language To Yoranuva

How To Translate English To Yoruba Language To Yoranuva International Journal of Language and Linguistics 2015; 3(3): 154-159 Published online May 11, 2015 (http://www.sciencepublishinggroup.com/j/ijll) doi: 10.11648/j.ijll.20150303.17 ISSN: 2330-0205 (Print);

More information

Overview of MT techniques. Malek Boualem (FT)

Overview of MT techniques. Malek Boualem (FT) Overview of MT techniques Malek Boualem (FT) This section presents an standard overview of general aspects related to machine translation with a description of different techniques: bilingual, transfer,

More information

Skills for Effective Business Communication: Efficiency, Collaboration, and Success

Skills for Effective Business Communication: Efficiency, Collaboration, and Success Skills for Effective Business Communication: Efficiency, Collaboration, and Success Michael Shorenstein Center for Communication Kennedy School of Government Harvard University September 30, 2014 I: Introduction

More information

The Development of Multimedia-Multilingual Document Storage, Retrieval and Delivery System for E-Organization (STREDEO PROJECT)

The Development of Multimedia-Multilingual Document Storage, Retrieval and Delivery System for E-Organization (STREDEO PROJECT) The Development of Multimedia-Multilingual Storage, Retrieval and Delivery for E-Organization (STREDEO PROJECT) Asanee Kawtrakul, Kajornsak Julavittayanukool, Mukda Suktarachan, Patcharee Varasrai, Nathavit

More information

CIRGIRDISCO at RepLab2014 Reputation Dimension Task: Using Wikipedia Graph Structure for Classifying the Reputation Dimension of a Tweet

CIRGIRDISCO at RepLab2014 Reputation Dimension Task: Using Wikipedia Graph Structure for Classifying the Reputation Dimension of a Tweet CIRGIRDISCO at RepLab2014 Reputation Dimension Task: Using Wikipedia Graph Structure for Classifying the Reputation Dimension of a Tweet Muhammad Atif Qureshi 1,2, Arjumand Younus 1,2, Colm O Riordan 1,

More information

Comparative Analysis on the Armenian and Korean Languages

Comparative Analysis on the Armenian and Korean Languages Comparative Analysis on the Armenian and Korean Languages Syuzanna Mejlumyan Yerevan State Linguistic University Abstract It has been five years since the Korean language has been taught at Yerevan State

More information

A Case Study of Question Answering in Automatic Tourism Service Packaging

A Case Study of Question Answering in Automatic Tourism Service Packaging BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 13, Special Issue Sofia 2013 Print ISSN: 1311-9702; Online ISSN: 1314-4081 DOI: 10.2478/cait-2013-0045 A Case Study of Question

More information

NATURAL LANGUAGE QUERY PROCESSING USING SEMANTIC GRAMMAR

NATURAL LANGUAGE QUERY PROCESSING USING SEMANTIC GRAMMAR NATURAL LANGUAGE QUERY PROCESSING USING SEMANTIC GRAMMAR 1 Gauri Rao, 2 Chanchal Agarwal, 3 Snehal Chaudhry, 4 Nikita Kulkarni,, 5 Dr. S.H. Patil 1 Lecturer department o f Computer Engineering BVUCOE,

More information

AN APPROACH TO WORD SENSE DISAMBIGUATION COMBINING MODIFIED LESK AND BAG-OF-WORDS

AN APPROACH TO WORD SENSE DISAMBIGUATION COMBINING MODIFIED LESK AND BAG-OF-WORDS AN APPROACH TO WORD SENSE DISAMBIGUATION COMBINING MODIFIED LESK AND BAG-OF-WORDS Alok Ranjan Pal 1, 3, Anirban Kundu 2, 3, Abhay Singh 1, Raj Shekhar 1, Kunal Sinha 1 1 College of Engineering and Management,

More information

Automatic Mining of Internet Translation Reference Knowledge Based on Multiple Search Engines

Automatic Mining of Internet Translation Reference Knowledge Based on Multiple Search Engines , 22-24 October, 2014, San Francisco, USA Automatic Mining of Internet Translation Reference Knowledge Based on Multiple Search Engines Baosheng Yin, Wei Wang, Ruixue Lu, Yang Yang Abstract With the increasing

More information

Semantic analysis of text and speech

Semantic analysis of text and speech Semantic analysis of text and speech SGN-9206 Signal processing graduate seminar II, Fall 2007 Anssi Klapuri Institute of Signal Processing, Tampere University of Technology, Finland Outline What is semantic

More information

USABILITY OF A FILIPINO LANGUAGE TOOLS WEBSITE

USABILITY OF A FILIPINO LANGUAGE TOOLS WEBSITE USABILITY OF A FILIPINO LANGUAGE TOOLS WEBSITE Ria A. Sagum, MCS Department of Computer Science, College of Computer and Information Sciences Polytechnic University of the Philippines, Manila, Philippines

More information

Arabic in first non-english web addresses

Arabic in first non-english web addresses www.breaking News English.com Ready-to-use ESL/EFL Lessons by Sean Banville 1,000 IDEAS & ACTIVITIES FOR LANGUAGE TEACHERS The Breaking News English.com Resource Book http://www.breakingnewsenglish.com/book.html

More information

LINGSTAT: AN INTERACTIVE, MACHINE-AIDED TRANSLATION SYSTEM*

LINGSTAT: AN INTERACTIVE, MACHINE-AIDED TRANSLATION SYSTEM* LINGSTAT: AN INTERACTIVE, MACHINE-AIDED TRANSLATION SYSTEM* Jonathan Yamron, James Baker, Paul Bamberg, Haakon Chevalier, Taiko Dietzel, John Elder, Frank Kampmann, Mark Mandel, Linda Manganaro, Todd Margolis,

More information

How To Write A Summary Of A Review

How To Write A Summary Of A Review PRODUCT REVIEW RANKING SUMMARIZATION N.P.Vadivukkarasi, Research Scholar, Department of Computer Science, Kongu Arts and Science College, Erode. Dr. B. Jayanthi M.C.A., M.Phil., Ph.D., Associate Professor,

More information

Effective Data Retrieval Mechanism Using AML within the Web Based Join Framework

Effective Data Retrieval Mechanism Using AML within the Web Based Join Framework Effective Data Retrieval Mechanism Using AML within the Web Based Join Framework Usha Nandini D 1, Anish Gracias J 2 1 [email protected] 2 [email protected] Abstract A vast amount of assorted

More information

Visionet IT Modernization Empowering Change

Visionet IT Modernization Empowering Change Visionet IT Modernization A Visionet Systems White Paper September 2009 Visionet Systems Inc. 3 Cedar Brook Dr. Cranbury, NJ 08512 Tel: 609 360-0501 Table of Contents 1 Executive Summary... 4 2 Introduction...

More information

Language Interface for an XML. Constructing a Generic Natural. Database. Rohit Paravastu

Language Interface for an XML. Constructing a Generic Natural. Database. Rohit Paravastu Constructing a Generic Natural Language Interface for an XML Database Rohit Paravastu Motivation Ability to communicate with a database in natural language regarded as the ultimate goal for DB query interfaces

More information

Software Design Document (SDD) Template

Software Design Document (SDD) Template (SDD) Template Software design is a process by which the software requirements are translated into a representation of software components, interfaces, and data necessary for the implementation phase.

More information

ISA OR NOT ISA: THE INTERLINGUAL DILEMMA FOR MACHINE TRANSLATION

ISA OR NOT ISA: THE INTERLINGUAL DILEMMA FOR MACHINE TRANSLATION ISA OR NOT ISA: THE INTERLINGUAL DILEMMA FOR MACHINE TRANSLATION FLORENCE REEDER The MITRE Corporation 1 / George Mason University [email protected] ABSTRACT Developing a system that accurately produces

More information

Study Plan. Bachelor s in. Faculty of Foreign Languages University of Jordan

Study Plan. Bachelor s in. Faculty of Foreign Languages University of Jordan Study Plan Bachelor s in Spanish and English Faculty of Foreign Languages University of Jordan 2009/2010 Department of European Languages Faculty of Foreign Languages University of Jordan Degree: B.A.

More information

Automated Multilingual Text Analysis in the Europe Media Monitor (EMM) Ralf Steinberger. European Commission Joint Research Centre (JRC)

Automated Multilingual Text Analysis in the Europe Media Monitor (EMM) Ralf Steinberger. European Commission Joint Research Centre (JRC) Automated Multilingual Text Analysis in the Europe Media Monitor (EMM) Ralf Steinberger European Commission Joint Research Centre (JRC) https://ec.europa.eu/jrc/en/research-topic/internet-surveillance-systems

More information

A Comparative Study on Sentiment Classification and Ranking on Product Reviews

A Comparative Study on Sentiment Classification and Ranking on Product Reviews A Comparative Study on Sentiment Classification and Ranking on Product Reviews C.EMELDA Research Scholar, PG and Research Department of Computer Science, Nehru Memorial College, Putthanampatti, Bharathidasan

More information

NATURAL LANGUAGE TO SQL CONVERSION SYSTEM

NATURAL LANGUAGE TO SQL CONVERSION SYSTEM International Journal of Computer Science Engineering and Information Technology Research (IJCSEITR) ISSN 2249-6831 Vol. 3, Issue 2, Jun 2013, 161-166 TJPRC Pvt. Ltd. NATURAL LANGUAGE TO SQL CONVERSION

More information

Robustness of a Spoken Dialogue Interface for a Personal Assistant

Robustness of a Spoken Dialogue Interface for a Personal Assistant Robustness of a Spoken Dialogue Interface for a Personal Assistant Anna Wong, Anh Nguyen and Wayne Wobcke School of Computer Science and Engineering University of New South Wales Sydney NSW 22, Australia

More information

Survey Results: Requirements and Use Cases for Linguistic Linked Data

Survey Results: Requirements and Use Cases for Linguistic Linked Data Survey Results: Requirements and Use Cases for Linguistic Linked Data 1 Introduction This survey was conducted by the FP7 Project LIDER (http://www.lider-project.eu/) as input into the W3C Community Group

More information

How the Computer Translates. Svetlana Sokolova President and CEO of PROMT, PhD.

How the Computer Translates. Svetlana Sokolova President and CEO of PROMT, PhD. Svetlana Sokolova President and CEO of PROMT, PhD. How the Computer Translates Machine translation is a special field of computer application where almost everyone believes that he/she is a specialist.

More information

PoS-tagging Italian texts with CORISTagger

PoS-tagging Italian texts with CORISTagger PoS-tagging Italian texts with CORISTagger Fabio Tamburini DSLO, University of Bologna, Italy [email protected] Abstract. This paper presents an evolution of CORISTagger [1], an high-performance

More information

TEXT TO SPEECH SYSTEM FOR KONKANI ( GOAN ) LANGUAGE

TEXT TO SPEECH SYSTEM FOR KONKANI ( GOAN ) LANGUAGE TEXT TO SPEECH SYSTEM FOR KONKANI ( GOAN ) LANGUAGE Sangam P. Borkar M.E. (Electronics)Dissertation Guided by Prof. S. P. Patil Head of Electronics Department Rajarambapu Institute of Technology Sakharale,

More information

MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts

MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts Julio Villena-Román 1,3, Sara Lana-Serrano 2,3 1 Universidad Carlos III de Madrid 2 Universidad Politécnica de Madrid 3 DAEDALUS

More information

Word Completion and Prediction in Hebrew

Word Completion and Prediction in Hebrew Experiments with Language Models for בס"ד Word Completion and Prediction in Hebrew 1 Yaakov HaCohen-Kerner, Asaf Applebaum, Jacob Bitterman Department of Computer Science Jerusalem College of Technology

More information

PULLING OUT OPINION TARGETS AND OPINION WORDS FROM REVIEWS BASED ON THE WORD ALIGNMENT MODEL AND USING TOPICAL WORD TRIGGER MODEL

PULLING OUT OPINION TARGETS AND OPINION WORDS FROM REVIEWS BASED ON THE WORD ALIGNMENT MODEL AND USING TOPICAL WORD TRIGGER MODEL Journal homepage: www.mjret.in ISSN:2348-6953 PULLING OUT OPINION TARGETS AND OPINION WORDS FROM REVIEWS BASED ON THE WORD ALIGNMENT MODEL AND USING TOPICAL WORD TRIGGER MODEL Utkarsha Vibhute, Prof. Soumitra

More information

Automated Extraction of Security Policies from Natural-Language Software Documents

Automated Extraction of Security Policies from Natural-Language Software Documents Automated Extraction of Security Policies from Natural-Language Software Documents Xusheng Xiao 1 Amit Paradkar 2 Suresh Thummalapenta 3 Tao Xie 1 1 Dept. of Computer Science, North Carolina State University,

More information

Author : Dr. Y. P. Deshpande [ Amolakchand Mahavidyalaya, Yavatmal ]

Author : Dr. Y. P. Deshpande [ Amolakchand Mahavidyalaya, Yavatmal ] Article : Translation as a device for decoding a text Author : Dr. Y. P. Deshpande [ Amolakchand Mahavidyalaya, Yavatmal ] Abstract The paper defines Translation from linguistic and literary points of

More information

What s in a Lexicon. The Lexicon. Lexicon vs. Dictionary. What kind of Information should a Lexicon contain?

What s in a Lexicon. The Lexicon. Lexicon vs. Dictionary. What kind of Information should a Lexicon contain? What s in a Lexicon What kind of Information should a Lexicon contain? The Lexicon Miriam Butt November 2002 Semantic: information about lexical meaning and relations (thematic roles, selectional restrictions,

More information

A Survey on Product Aspect Ranking

A Survey on Product Aspect Ranking A Survey on Product Aspect Ranking Charushila Patil 1, Prof. P. M. Chawan 2, Priyamvada Chauhan 3, Sonali Wankhede 4 M. Tech Student, Department of Computer Engineering and IT, VJTI College, Mumbai, Maharashtra,

More information

Natural Language Web Interface for Database (NLWIDB)

Natural Language Web Interface for Database (NLWIDB) Rukshan Alexander (1), Prashanthi Rukshan (2) and Sinnathamby Mahesan (3) Natural Language Web Interface for Database (NLWIDB) (1) Faculty of Business Studies, Vavuniya Campus, University of Jaffna, Park

More information

Wikipedia and Web document based Query Translation and Expansion for Cross-language IR

Wikipedia and Web document based Query Translation and Expansion for Cross-language IR Wikipedia and Web document based Query Translation and Expansion for Cross-language IR Ling-Xiang Tang 1, Andrew Trotman 2, Shlomo Geva 1, Yue Xu 1 1Faculty of Science and Technology, Queensland University

More information

English to Devanagari Translation for UI Labels of Commercial Web based Interactive Applications

English to Devanagari Translation for UI Labels of Commercial Web based Interactive Applications English to Devanagari Translation for UI Labels of Commercial Web based Interactive Applications M. L. Dhore Vishwakarma Institute of Technology, Pune, India. S. K. Dixit Walchand Institute of Technology,

More information

National Language (Tamil) Support in Oracle An Oracle White paper / November 2004

National Language (Tamil) Support in Oracle An Oracle White paper / November 2004 National Language (Tamil) Support in Oracle An Oracle White paper / November 2004 Vasundhara V* & Nagarajan M & * [email protected]; & [email protected]) Oracle

More information

Search and Information Retrieval

Search and Information Retrieval Search and Information Retrieval Search on the Web 1 is a daily activity for many people throughout the world Search and communication are most popular uses of the computer Applications involving search

More information

Domain Classification of Technical Terms Using the Web

Domain Classification of Technical Terms Using the Web Systems and Computers in Japan, Vol. 38, No. 14, 2007 Translated from Denshi Joho Tsushin Gakkai Ronbunshi, Vol. J89-D, No. 11, November 2006, pp. 2470 2482 Domain Classification of Technical Terms Using

More information

SPANISH Kindergarten

SPANISH Kindergarten SPANISH Kindergarten Use Junior SYMTALK workbook Recognize 80+ Vocabulary words Recognize basic greetings and courtesies. Identify colors and numbers 1-10 Develop reading skills using pictures to identify

More information

Flattening Enterprise Knowledge

Flattening Enterprise Knowledge Flattening Enterprise Knowledge Do you Control Your Content or Does Your Content Control You? 1 Executive Summary: Enterprise Content Management (ECM) is a common buzz term and every IT manager knows it

More information

Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information

Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information Satoshi Sekine Computer Science Department New York University [email protected] Kapil Dalwani Computer Science Department

More information

An Approach towards Automation of Requirements Analysis

An Approach towards Automation of Requirements Analysis An Approach towards Automation of Requirements Analysis Vinay S, Shridhar Aithal, Prashanth Desai Abstract-Application of Natural Language processing to requirements gathering to facilitate automation

More information

TREC 2003 Question Answering Track at CAS-ICT

TREC 2003 Question Answering Track at CAS-ICT TREC 2003 Question Answering Track at CAS-ICT Yi Chang, Hongbo Xu, Shuo Bai Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China [email protected] http://www.ict.ac.cn/

More information

Foreign Languages FOREIGN LANGUAGES. 2016-17 Sacramento City College Catalog. Degree: AA-T Spanish for Transfer

Foreign Languages FOREIGN LANGUAGES. 2016-17 Sacramento City College Catalog. Degree: AA-T Spanish for Transfer Foreign Languages Degree: AA-T Spanish for Transfer Division of Humanities and Fine Arts Chris Iwata, Dean Performing Arts Center 137 916-558-2551 Arabic ARABIC Chinese Cantonese-CANT French FREN Greek

More information

Points of Interference in Learning English as a Second Language

Points of Interference in Learning English as a Second Language Points of Interference in Learning English as a Second Language Tone Spanish: In both English and Spanish there are four tone levels, but Spanish speaker use only the three lower pitch tones, except when

More information