IndoWordNet Dictionary: An Online Multilingual Dictionary using IndoWordNet
Hanumant Redkar, Sandhya Singh, Nilesh Joshi, Anupam Ghosh, Pushpak Bhattacharyya

Abstract

India is a country with diverse culture, language and heritage, and is consequently very rich in languages and their dialects. In such a multilingual society, a multilingual dictionary is a necessity and one of the major resources supporting a language. Dictionaries exist for many Indian languages, but very few cover multiple languages. WordNet is one of the most prominent lexical resources in Natural Language Processing and Information Retrieval, and IndoWordNet is an integrated multilingual WordNet for Indian languages. Researchers use these resources to experiment with and resolve issues of multilinguality through computation. However, WordNet is rarely used by non-researchers or the general public. This paper presents an online interface, IndoWordNet Dictionary, aimed at non-researchers as well as researchers. It renders multilingual WordNet information for 19 Indian languages in a dictionary format, in multiple views: sense based, thesaurus based, word usage based and language based. English WordNet information is also rendered through this interface. The IndoWordNet Dictionary will help language researchers as well as the general public find the meanings of a word in multiple Indian languages.

1 Introduction

Language is a constituent element of civilization. Diversity is a primary aspect of a country like India, and it has led to varied languages and dialects. The numerous languages of India belong to different language families: Indo-Aryan, Dravidian, Sino-Tibetan, Tibeto-Burman and Austro-Asiatic. The major ones are Indo-Aryan, spoken by most Indians, and Dravidian, spoken by southern Indians.
The Eighth Schedule of the Indian Constitution lists 22 languages, which are referred to as scheduled languages and given recognition, status and official encouragement.

A dictionary is a resource dealing with the individual words of a language along with their orthography, pronunciation, usage, synonyms, derivation, history, etymology, etc., arranged in an order convenient for reference. The criteria used for classifying this resource include density of entries, number of languages involved, nature of entries, degree of concentration on strictly lexical data, axis of time, arrangement of entries, purpose, prospective user, etc. Some of the common types of dictionaries are:

- Encyclopedia: A single or multi-volume publication that contains accumulated and authoritative knowledge on a subject, arranged alphabetically, e.g. the Britannica encyclopedia.
- Thesaurus: A dictionary that lists words in groups of synonyms and related concepts.
- Etymological Dictionary: A dictionary that discusses the etymology or origin of the words listed. It is the product of research in historical linguistics.
- Dialect Dictionary: A dictionary that deals with the non-standard words of a particular geographical region or social group.
- Specialized Dictionary: A dictionary that covers a relatively restricted set of phenomena.
- Bilingual or Multilingual Dictionary: A linguistic dictionary in two or more languages.
- Reverse Dictionary: A dictionary that maps a concept, idea or definition to words.
- Learner's Dictionary: A dictionary meant for foreign students or tourists to learn the usage of words in a language.
- Phonetic Dictionary: A dictionary that helps in searching for words by the way they sound.
- Visual Dictionary: A dictionary that uses pictures to illustrate the meaning of words.

WordNet is a lexical resource composed of synsets and semantic relations. Synsets are sets of synonyms. They are linked by semantic relations such as hypernymy and meronymy, and by lexical relations such as antonymy and gradation (Miller et al., 1990; Fellbaum, 1998).
IndoWordNet is a linked structure of WordNets of 19 different Indian languages from the Indo-Aryan, Dravidian and Sino-Tibetan families (Bhattacharyya, 2010). Other popular multilingual WordNets are EuroWordNet, a linked WordNet for European languages (Vossen, 1999), and BalkaNet, a linked WordNet for Balkan languages (Christodoulakis, 2002). The most innovative aspect of WordNets is that lexical information is organized in terms of meaning; that is, a synset contains words of the same part-of-speech which have approximately the same meaning. Thus, synonymy functions as the essential principle in the construction of WordNets (Vincze et al., 2008). This feature of WordNet is the most important one for dictionary construction.

IndoWordNet is used in Natural Language Processing tasks such as Machine Translation, Information Retrieval and Information Extraction. However, little has been done to use this resource beyond research labs. In this paper, we present an interface, IndoWordNet Dictionary (IWN Dictionary), in the form of a multilingual online dictionary which uses IndoWordNet as a resource. The primary focus of this interface is to provide synset information in a systematic and classified manner, rendered in multiple views.

The rest of the paper is organized as follows: Section 2 justifies the need for the IndoWordNet Dictionary. Section 3 details the IndoWordNet Dictionary and its components, followed by its design and layout. Section 4 gives the features of the dictionary. Section 5 lists its limitations. Finally, the conclusion, scope and enhancements to the interface are presented.

2 Need for IndoWordNet Dictionary

Our work on developing the IWN Dictionary interface is motivated by various available online resources.
To name some: langtolang.com, which includes cross-lingual references across 47 non-Indian languages; wordreference.com, which includes 17 non-Indian languages; and logosdictionary.org and xobodo.org, which cover multiple languages including some Indian languages. However, all these resources render no more than two languages at a time.

A further survey reveals that Mohanty et al. (2008) developed a tool for the multilingual dictionary development process, to create and link synset based lexical resources for machine translation. The aim was to simplify the process of synset creation and to link synsets with different Indian language WordNets. The tool was mainly used by lexicographers involved in creating various Indian language WordNets. Sinha et al. (2006) designed a browsable bilingual interface for viewing WordNet information in two languages, Hindi and Marathi. The input to this browser is a search string in either of the two languages and the output is the search result for both languages. The primary use of this interface is to help users get the semantic information of the search string in both Hindi and Marathi. Sarma et al. (2012) built a multilingual dictionary covering three languages, viz., Assamese, Bodo and Hindi. That dictionary interface allows searching over one language pair at a time, either Hindi-Assamese or Hindi-Bodo.

All the interfaces mentioned above display meanings, together with the linguistic information available in the WordNet, in at most two languages at a time. Hence, we have developed a web based interface to render multilingual WordNet information in a dictionary format. This interface is developed with the general public in mind, apart from researchers. The following points justify the need for an online multilingual dictionary today:

- To know or understand other languages.
- To find the meaning of a word in multiple languages on a single platform.
- To understand other languages with the help of a pivot language (one of the languages the user already knows).
- To understand synonymous words in the input language as well as in other languages.
- To understand additional information such as part-of-speech, gender, etc. in the input language as well as in other languages.
- To understand the semantic variations across languages.
- To fulfil the social need of bridging the communication gap.
- To understand the script of various languages.
- To understand a local language when relocated to that area.
- To improve one's own vocabulary.
- To address social and educational needs.

3 IndoWordNet Multilingual Dictionary

3.1 What is IndoWordNet Dictionary?

IndoWordNet Dictionary, or IWN Dictionary, is an online interface to render multilingual IndoWordNet information in a dictionary format. It allows the user to view the results in multiple formats as needed, and in multiple languages simultaneously. The look and feel of the IWN Dictionary is kept the same as that of a traditional dictionary, keeping user adaptability in mind. So far, it renders WordNet information of 19 Indian languages: Assamese, Bodo, Bengali, Gujarati, Hindi, Kannada, Kashmiri, Konkani, Maithili, Malayalam, Manipuri, Marathi, Nepali, Odia, Punjabi, Sanskrit, Tamil, Telugu and Urdu. The WordNet information is also rendered in English. The English WordNet information is taken from the Princeton University website and imported into the IndoWordNet database structure (Prabhu et al., 2012) using the DB Import tool developed under the Indradhanush WordNet Project. Work is ongoing to include non-Indian languages and to store them in the World WordNet Database Structure proposed by Redkar et al. (2015).

3.2 Modules of IWN Dictionary

The IWN Dictionary has two major modules: the IWN Dictionary Search Module and the IWN Dictionary Display Module. They are explained in detail below. Figure 1 shows the block diagram of the IWN Dictionary.

Figure 1 Block diagram of IWN Dictionary.

3.2.1 IWN Dictionary Search Module

The main function of the IWN Dictionary Search module is to process the input word into a form which can be used to search the database and retrieve relevant information. Its function can be split into three sub-modules: analyzing a word, database lookup and multilingual data extraction. Each of these sub-modules is described in detail below.

Word Analyzer: The Word Analyzer analyses and processes the input word. It checks whether the word is in its root form. If it is, the word is passed directly to the IWN Database Lookup module; otherwise it is processed to find the closest possible word in the database. The concept of a human mediated lemmatizer is adopted from the work of Bhattacharyya et al. (2014) to find the closest possible words of an input word. Here, a trie data structure is created out of the vocabulary of the input language and navigated to find the match between the input word and an entry in the trie. Accordingly, the closest possible word(s) are populated and given to the next module for the database lookup.

IWN Database Lookup: In the IWN Database Lookup module, the word received from the Word Analyzer is searched in the IWN database, and the corresponding synset ids of all the senses of the word are sent to the Multilingual Extractor engine for further processing. If the word is not found in the database, control is sent back to the Word Analyzer module to look for other similar or closest words.

Multilingual Extractor: The basic task of the Multilingual Extractor module is to take the input synset ids and extract synset information for all the languages available in the IWN database. This extracted synset information, i.e. the multilingual senses of an input word, is then sent to the IWN Dictionary Display module for rendering in the dictionary format.

3.2.2 IWN Dictionary Display Module

The extracted multilingual senses of a given word are given to the IWN Dictionary Display module for rendering.
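The Word Analyzer's closest-word lookup described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the matching rule (follow the input as far as the trie allows, then list vocabulary words under the longest matched prefix) is an assumption, as are the class and method names and the toy English vocabulary. The actual human mediated lemmatizer of Bhattacharyya et al. (2014) may rank and filter candidates differently.

```python
class TrieNode:
    def __init__(self):
        self.children = {}    # character -> TrieNode
        self.is_word = False  # True if a vocabulary word ends at this node


class Trie:
    """Hypothetical vocabulary trie for closest-word lookup."""

    def __init__(self, vocabulary):
        self.root = TrieNode()
        for word in vocabulary:
            self.insert(word)

    def insert(self, word):
        node = self.root
        for ch in word:
            node = node.children.setdefault(ch, TrieNode())
        node.is_word = True

    def closest_words(self, query, limit=5):
        """Return [query] if it is a vocabulary word; otherwise return up to
        `limit` vocabulary words that share the longest matched prefix."""
        node = self.root
        prefix = []
        for ch in query:
            if ch not in node.children:
                break
            node = node.children[ch]
            prefix.append(ch)
        else:
            # The whole query was matched character by character.
            if node.is_word:
                return [query]
        if not prefix:
            return []  # nothing in the vocabulary starts like the query
        # Depth-first walk below the matched prefix, in alphabetical order.
        results = []
        stack = [(node, "".join(prefix))]
        while stack and len(results) < limit:
            current, text = stack.pop()
            if current.is_word:
                results.append(text)
            for ch in sorted(current.children, reverse=True):
                stack.append((current.children[ch], text + ch))
        return results


trie = Trie(["play", "plate", "plan", "boy"])
print(trie.closest_words("playing"))  # inflected form falls back to a root
print(trie.closest_words("plax"))     # unknown word yields candidates
```

In this sketch an exact match is returned as is, which corresponds to passing a root-form word straight to the database lookup; anything else produces a candidate list that a user could choose from.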
The Display module is an important part of the IWN Dictionary: it renders and ranks information based on user responses and statistics. Here, the IWN Click-Based-Rendering Algorithm is used to rank the senses most frequently used through this interface. The basic task of this algorithm is to record and maintain the number of user clicks on a sense being viewed and to rank the sense accordingly in the output interface. The same algorithm is applied to all the views mentioned below. The multilingual IndoWordNet data can be rendered using different views:

- Sense Based view: All the meanings of an input word are displayed with respect to the senses available in the IWN database. Figure 2 shows the sense based view of the IWN Dictionary.
- Thesaurus Based view: Synonymous words in each language are rendered. The user can click on any of the words in the list to see other senses of that word, which are also displayed in the thesaurus based view. Figure 3 shows the thesaurus based view of the IWN Dictionary.
- Word Usage Based view: The usage of an input word is rendered with respect to the languages, by showing the examples of a synset. Figure 4 shows the word usage based view of the IWN Dictionary.
- Language Based view: The meanings of a word are rendered with respect to the languages. For each sense of a word, the meaning in all the languages is rendered in a horizontal tabbed format. Figure 5 shows the language based view of the IWN Dictionary.

Figure 2 IndoWordNet Dictionary Interface showing multiple languages with sense based view.

Figure 3 IndoWordNet Dictionary Interface showing multiple languages with thesaurus based view.

Apart from these views, transliteration information is displayed in each of them. The transliteration is shown when a user hovers over any of the content in the dictionary view. For now, we transliterate into Roman script using the Indic NLP Library developed by Kunchukuttan et al. (2015). However, we are in the process of implementing transliteration into all the Indian languages, so that anyone can read another language in their chosen transliteration language.

3.3 Design and Layout of IWN Dictionary

The IWN Dictionary is designed with simplicity, usability and, more importantly, user friendliness in mind. The frontend is designed and developed with PHP, CSS, JavaScript, etc., and a MySQL database is used at the backend. The IWN Dictionary data is retrieved from the IndoWordNet database using the IndoWordNet APIs (Prabhugaonkar et al., 2012).

Figures 2, 3, 4 and 5 show sample screenshots of the IWN Multilingual Dictionary Interface with different views. In all these views, the initial information is always shown in the source language, rendered on the horizontal tab. The user can click on the plus or minus button to expand or collapse the details in each tab.
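The click-based ranking performed by the Display module (Section 3.2) can be illustrated with a small sketch. The paper only states that clicks on a sense are recorded and used to rank senses, so everything else here is an assumption: the class name `ClickRanker`, the in-memory counter (the real system presumably persists counts in its database), the synset ids, and the tie-breaking rule of keeping the original order.

```python
from collections import Counter


class ClickRanker:
    """Hypothetical click counter: orders the senses of a word by how often
    users have clicked on each sense."""

    def __init__(self):
        # (word, synset_id) -> number of recorded clicks
        self.clicks = Counter()

    def record_click(self, word, synset_id):
        self.clicks[(word, synset_id)] += 1

    def rank(self, word, synset_ids):
        # sorted() is stable, so senses with equal click counts keep the
        # order in which they were supplied (e.g. database order).
        return sorted(synset_ids, key=lambda sid: -self.clicks[(word, sid)])


ranker = ClickRanker()
ranker.record_click("home", 103)
ranker.record_click("home", 103)
ranker.record_click("home", 102)
print(ranker.rank("home", [101, 102, 103]))
```

Keeping the counts keyed by word and synset id lets the same sense be ranked differently for different input words; whether the deployed interface does this is not stated in the paper.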
The user can also check or uncheck the checkbox next to each language in the sense based, thesaurus based and word usage based views to see information in the checked languages.

In Figure 3, the source language is Hindi and the target languages are Bengali, Kannada, Malayalam, Nepali and Punjabi. Here, the thesaurus based view is enabled; hence, the IWN Dictionary renders synonymous words in the form of a thesaurus. On clicking the hyperlink "more", a user can see the meaning of this set of words. The user can also click on any of the words in the list, which leads to the multilingual senses of that particular word.

In Figure 4, the source language is Hindi and the target languages are English, Gujarati, Hindi, Kannada, Nepali and Odia. Here, the word usage based view is enabled; hence, the IWN Dictionary renders the usage of a word by showing the example sentences in a synset.
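The way the selected view and the checked languages determine what is shown can be sketched as below. This is an illustrative model only: the nested dictionary layout, the field names, the function names and the sample entries are all made up for this sketch and are not taken from the IndoWordNet database or its APIs.

```python
# Hypothetical stand-in for the synset information handed to the display
# module: synset id -> language -> fields. Sample values are illustrative.
SENSES = {
    "syn-1": {
        "english": {
            "gloss": "a place where a person lives",
            "synonyms": ["home", "house"],
            "examples": ["She went home early."],
        },
        "hindi": {
            "gloss": "रहने का स्थान",
            "synonyms": ["घर", "मकान"],
            "examples": ["वह घर गया।"],
        },
    },
}

# Which field each view shows (assumed mapping based on the view descriptions).
FIELD_BY_VIEW = {"sense": "gloss", "thesaurus": "synonyms", "usage": "examples"}


def render(senses, view, languages):
    """Sense, thesaurus and usage views: for each sense, keep only the
    checked languages and the field that the chosen view displays."""
    field = FIELD_BY_VIEW[view]
    return {
        synset_id: {
            lang: by_language[lang][field]
            for lang in languages
            if lang in by_language  # a language may lack this synset
        }
        for synset_id, by_language in senses.items()
    }


def render_language_view(senses, languages):
    """Language based view: one tab per language, listing every sense."""
    return {
        lang: {
            synset_id: by_language[lang]["gloss"]
            for synset_id, by_language in senses.items()
            if lang in by_language
        }
        for lang in languages
    }


print(render(SENSES, "thesaurus", ["hindi", "english"]))
print(render_language_view(SENSES, ["hindi"]))
```

Skipping a language that has no entry for a synset mirrors the situation where a WordNet for one language does not yet contain a given sense.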
Figure 4 IndoWordNet Dictionary Interface showing multiple languages with word usage based view.

Figure 5 IndoWordNet Dictionary Interface showing multiple languages with language based view.

In Figure 5, the source language is Hindi, and the information in the target languages is initially rendered in Hindi. However, a user can click on any of the horizontal bars to see all senses in each language. Here, the language based view is enabled, so all senses of an input word are rendered in each of the horizontal language tabs.

The procedure to use this IWN multilingual dictionary is as follows:
- User selects an input language.
- User types in a word. To input a word, the user can enable phonetic transliteration or use the onscreen keyboard.
- User selects the Dictionary View; by default, the sense based view is enabled and information is rendered sense wise.
- User clicks on the GO button. The meanings of the input word are then displayed in a horizontal tabbed format.
- User can toggle between the different views by clicking on the sense based, thesaurus based, language based and word usage based radio buttons.
- User can click on "more" to get additional information under each of these views.
- The transliterated information is provided under each of these views. The user has to click on the "view transliteration" checkbox for this purpose; it is available under the hyperlink "more". However, by default the user sees the transliteration of each language in English on mouse hover.

4 Features of IWN Dictionary

The salient features of the IndoWordNet Dictionary are as follows:
- Renders information in multiple languages at a time.
- Offers different views to display the information: sense based, thesaurus based, word usage based and language based.
- Provides a transliteration feature in different languages, which helps in reading effectively.
- Can assist language translation tasks where multilinguality is the primary concern.
- Can be referred to for educational and social needs.
- Assists with automatic word suggestion.
- Assists with similar or closest word suggestion.
- Makes usage analysis and statistics available, such as the most frequently searched word with respect to the selected language, the input languages searched, etc.

5 Limitations of IWN Dictionary

- It is a concept based dictionary, i.e., for all the synonymous words it shows only one gloss or concept definition.
- It is based entirely on WordNet as a backend resource, which is under ongoing development, so issues or errors in WordNet data are propagated here. Such issues include word spellings as found in the corpus used for creating the WordNet.
- Since the development of WordNets in all the mentioned languages is not at the same pace, a meaning may be unavailable in a language where it does not yet exist. The availability of words and their meanings depends on how much WordNet data is available.
- Features such as pronunciation, sound, pictures, domain, origin of a word, etc. are not incorporated because they are unavailable in IndoWordNet.

6 Conclusion

In this paper, we have presented an online interface, viz. the IWN Dictionary. The IWN Dictionary interface renders IndoWordNet information in a systematic and classified manner. There are various views, namely sense based, thesaurus based, word usage based and language based, to view the IndoWordNet information. A transliteration feature is also provided in this interface for readability. Rendering is done based on user response and usage.

7 Future Scope and Enhancements

In future, we plan to expand this interface by including a morph analyzer to process inflected input words. The ontological details along with semantic and lexical relations can be incorporated in the dictionary interface. Further, there can be a feature to link the IndoWordNet Dictionary to other available dictionaries, along with foreign language WordNets, using the World WordNet Database Structure. Features such as picture, pronunciation, domain, source of a word, glossary, etc. can be implemented in future versions. The transliteration feature to read effectively in one's native language can also be implemented in this interface.

Acknowledgments

We graciously thank all the members of the CFILT lab, IIT Bombay, for providing the necessary help and guidance needed for the development of the IndoWordNet Dictionary.
Further, we sincerely thank the members of the IndoWordNet community and the users of the IndoWordNet resource.

References

Anoop Kunchukuttan, Ratish Puduppully and Pushpak Bhattacharyya. 2015. Brahmi-Net: A transliteration and script conversion system for languages of the Indian subcontinent. Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2015) - Human Language Technologies: System Demonstrations.

Christiane Fellbaum (ed.). 1998. WordNet: An Electronic Lexical Database. The MIT Press.

Christodoulakis, Dimitris N. 2002. BalkaNet: A Multilingual Semantic Network for Balkan Languages. EUROPRIX Summer School, Salzburg, Austria, September.

George Miller, R., Fellbaum, C., Gross, D., Miller, K. J. 1990. Introduction to WordNet: An on-line lexical database. International Journal of Lexicography, OUP. (pp. 3.4: ).

Hanumant Redkar, Sudha Bhingardive, Diptesh Kanojia and Pushpak Bhattacharyya. 2015. World WordNet Database Structure: An Efficient Schema for Storing Information of WordNets of the World. In Twenty-Ninth AAAI Conference on Artificial Intelligence, Austin, Texas.

Manish Sinha, Mahesh Reddy and Pushpak Bhattacharyya. 2006. An Approach towards Construction and Application of Multilingual Indo-WordNet. 3rd Global Wordnet Conference (GWC 06), Jeju Island, Korea, January.

Neha R Prabhugaonkar, Apurva S Nagvenkar and Ramdas N Karmali. 2012. IndoWordNet Application Programming Interfaces. COLING 2012, Mumbai, India.

Pushpak Bhattacharyya. 2010. IndoWordNet. In the Proceedings of Lexical Resources Engineering Conference (LREC), Malta.

Pushpak Bhattacharyya, Ankit Bahuguna, Lavita Talukdar and Bornali Phukan. 2014. Facilitating Multi-Lingual Sense Annotation: Human Mediated Lemmatizer. In Global WordNet Conference.

Piek Vossen (ed.). 1999. EuroWordNet: A Multilingual Database with Lexical Semantic Networks for European Languages. Kluwer Academic Publishers, Dordrecht.

Rajat Mohanty, Pushpak Bhattacharyya, Prabhakar Pande, Shraddha Kalele, Mitesh Khapra and Aditya Sharma. 2008. Synset based multilingual dictionary: Insights, applications and challenges. Global WordNet Conference, Hungary, January.

Shikhar Sarma, Kr Sarma Dibyajyoti, Biswajit Brahma, Mayashree Mahanta, Himadri Bharali and Utpal Saikia. 2012. Building Multilingual Lexical Resources Using Wordnets: Structure, Design and Implementation. 24th International Conference on Computational Linguistics.

Venkatesh Prabhu, Shilpa Desai, Hanumant Redkar, Neha Prabhugaonkar, Apurva Nagvenkar and Ramdas Karmali. 2012. An Efficient Database Design for IndoWordNet Development Using Hybrid Approach. COLING 2012, Mumbai, India. p 229.

Veronika Vincze, György Szarvas and János Csirik. 2008. Why are wordnets important. Proc. of 2nd ECC, Malta, WSEAS Press.
Getting Off to a Good Start: Best Practices for Terminology
Getting Off to a Good Start: Best Practices for Terminology Technologies for term bases, term extraction and term checks Angelika Zerfass Tools in the Terminology Life Cycle Extraction
Bitrix Site Manager 4.1. User Guide
Bitrix Site Manager 4.1 User Guide 2 Contents REGISTRATION AND AUTHORISATION...3 SITE SECTIONS...5 Creating a section...6 Changing the section properties...8 SITE PAGES...9 Creating a page...10 Editing
A Demonstration of Hierarchical Clustering
Recitation Supplement: Hierarchical Clustering and Principal Component Analysis in SAS November 18, 2002 The Methods In addition to K-means clustering, SAS provides several other types of unsupervised
City of De Pere. Halogen How To Guide
City of De Pere Halogen How To Guide Page1 (revised 12/14/2015) Halogen Performance Management website address: https://global.hgncloud.com/cityofdepere/welcome.jsp The following steps take place to complete
Performance Testing Web 2.0
Performance Testing Web 2.0 David Chadwick Rational Testing Evangelist Dawn Peters Systems Engineer, IBM Rational 2009 IBM Corporation WEB 2.0 What is it? 2 Web
Modifier with Visual Basic for Applications (VBA) in Microsoft Dynamics GP 9.0
Modifier with Visual Basic for Applications (VBA) in Microsoft Dynamics GP 9.0 8520: Modifier with VBA in Microsoft Dynamics GP 9.0 (3 Days) About this Course The three-day Microsoft Dynamics GP Modifier
Title: Chinese Characters and Top Ontology in EuroWordNet
Title: Chinese Characters and Top Ontology in EuroWordNet Paper by: Shun Sylvia Wong & Karel Pala Presentation By: Patrick Baker Introduction WordNet, Cyc, HowNet, and EuroWordNet each use a hierarchical
Student Learning Tools in Edgenuity
Student Learning Tools in Edgenuity Table of Contents Student Learning Tools... 2 The Lesson Support Pane... 2 enotes... 3 Glossary... 4 Transcript... 4 Video Captions... 5 Video Transcripts... 5 General
Course Scheduling Support System
Course Scheduling Support System Roy Levow, Jawad Khan, and Sam Hsu Department of Computer Science and Engineering, Florida Atlantic University Boca Raton, FL 33431 {levow, jkhan, samh}@fau.edu Abstract
International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 ISSN 2229-5518
International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 INTELLIGENT MULTIDIMENSIONAL DATABASE INTERFACE Mona Gharib Mohamed Reda Zahraa E. Mohamed Faculty of Science,
6. Finding other resources
6. Finding other resources In this chapter Reference sources Other libraries Interlibrary loan and intrasystem loan World Wide Web In this chapter you will learn to locate additional facts and figures
Getting Started with ConceptDraw MINDMAP v8 for Windows
Getting Started with ConceptDraw MINDMAP v8 for Windows CS Odessa corp. Contents GETTING STARTED WITH ConceptDraw MINDMAP...3 STARTUP DIALOG...5 CREATING MIND MAPS...6 ADDING DETAILS...8 TOPIC TYPES...11
ABOUT THIS DOCUMENT ABOUT CHARTS/COMMON TERMINOLOGY
A. Introduction B. Common Terminology C. Introduction to Chart Types D. Creating a Chart in FileMaker E. About Quick Charts 1. Quick Chart Behavior When Based on Sort Order F. Chart Examples 1. Charting
Power Tools for Pivotal Tracker
Power Tools for Pivotal Tracker Pivotal Labs Dezmon Fernandez Victoria Kay Eric Dattore June 16th, 2015 Power Tools for Pivotal Tracker 1 Client Description Pivotal Labs is an agile software development
NLUI Server User's Guide
By Vadim Berman Monday, 19 March 2012 Overview NLUI (Natural Language User Interface) Server is designed to run scripted applications driven by natural language interaction. Just like a web server application
Veronika VINCZE, PhD. PERSONAL DATA Date of birth: 1 July 1981 Nationality: Hungarian
Veronika VINCZE, PhD CONTACT INFORMATION Hungarian Academy of Sciences Research Group on Artificial Intelligence Tisza Lajos krt. 103., 6720 Szeged, Hungary Phone: +36 62 54 41 40 Mobile: +36 70 22 99
What's New in Delphi for PHP 2.0. New and Enhanced IDE Features. Form Designer
What's New in Delphi for PHP 2.0 Delphi for PHP 2.0 provides a big step forward in functionality and performance. IDE enhancements in the new version enable you to do more and get your work done faster.
Bisecting K-Means for Clustering Web Log data
Bisecting K-Means for Clustering Web Log data Ruchika R. Patil Department of Computer Technology YCCE Nagpur, India Amreen Khan Department of Computer Technology YCCE Nagpur, India ABSTRACT Web usage mining
TRIDINDIA IT TRANSLATION SERVICES PRIVATE LIMITED
TRIDINDIA IT TRANSLATION SERVICES PRIVATE LIMITED As we understand your business is mostly about words, we not only translate words, we transform business in the world of words. Established in 2002 with
Copyright 2013 Splunk Inc. Introducing Splunk 6
Copyright 2013 Splunk Inc. Introducing Splunk 6 Safe Harbor Statement During the course of this presentation, we may make forward looking statements regarding future events or the expected performance
Adobe Conversion Settings in Word. Section 508: Why comply?
It's the right thing to do: Adobe Conversion Settings in Word Section 508: Why comply? 11,400,000 people have visual conditions not correctible by glasses. 6,400,000 new cases of eye disease occur each
Android Phone Controlled Robot Using Bluetooth
International Journal of Electronic and Electrical Engineering. ISSN 0974-2174, Volume 7, Number 5 (2014), pp. 443-448 International Research Publication House http://www.irphouse.com Android Phone Controlled
9/4/2012. Objectives Microsoft Word 2010 - Illustrated. Unit B: Editing Documents. Objectives (continued) Cutting and Pasting Text
Objectives Microsoft Word 2010 - Illustrated Unit B: Editing Documents Cut and paste text Copy and paste text Use the Office Clipboard Find and replace text 2 Objectives Check spelling and grammar Research
Google Analytics for Robust Website Analytics. Deepika Verma, Depanwita Seal, Atul Pandey
1 Google Analytics for Robust Website Analytics Deepika Verma, Depanwita Seal, Atul Pandey 2 Table of Contents I. INTRODUCTION...3 II. Method for obtaining data for web analysis...3 III. Types of metrics
IBM BPM V8.5 Standard Consistent Document Management
IBM Software An IBM Proof of Technology IBM BPM V8.5 Standard Consistent Document Management Lab Exercises Version 1.0 Author: Sebastian Carbajales An IBM Proof of Technology Catalog Number Copyright IBM
SMSFR: SMS-Based FAQ Retrieval System
SMSFR: SMS-Based FAQ Retrieval System Partha Pakray, 1 Santanu Pal, 1 Soujanya Poria, 1 Sivaji Bandyopadhyay, 1 Alexander Gelbukh 2 1 Computer Science and Engineering Department, Jadavpur University, Kolkata,
Word 2007 Unit B: Editing Documents
Word 2007 Unit B: Editing Documents TRUE/FALSE 1. You can select text and then drag it to a new location using the mouse. 2. The last item copied from a document is stored on the system Clipboard. 3. The
ONLINE ENGLISH LANGUAGE RESOURCES
ONLINE ENGLISH LANGUAGE RESOURCES Developed and updated by C. Samuel for students taking courses at the English and French Language Centre, Faculty of Arts (Links live as at November 2, 2009) Dictionaries
Natural Language to Relational Query by Using Parsing Compiler
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 3, March 2015,
How To Use Query Console
Query Console User Guide 1 MarkLogic 8 February, 2015 Last Revised: 8.0-1, February, 2015 Copyright 2015 MarkLogic Corporation. All rights reserved. Table of Contents Table of Contents Query Console User
CSCI110 Exercise 4: Database - MySQL
CSCI110 Exercise 4: Database - MySQL The exercise This exercise is to be completed in the laboratory and your completed work is to be shown to the laboratory tutor. The work should be done in week-8 but
Data Driven Automation Testing Framework
International Journal of Emerging Engineering Research and Technology Volume 2, Issue 7, October 2014, PP 51-56 ISSN 2349-4395 (Print) & ISSN 2349-4409 (Online) Data Driven Automation Testing Framework
Automatic Text Analysis Using Drupal
Automatic Text Analysis Using Drupal By Herman Chai Computer Engineering California Polytechnic State University, San Luis Obispo Advised by Dr. Foaad Khosmood June 14, 2013 Abstract Natural language processing
Chapter-1 : Introduction 1 CHAPTER - 1. Introduction
Chapter-1 : Introduction 1 CHAPTER - 1 Introduction This thesis presents design of a new Model of the Meta-Search Engine for getting optimized search results. The focus is on new dimension of internet
Microsoft Advertising adcenter Campaign Analytics Getting Started Guide
Microsoft Advertising adcenter Campaign Analytics Getting Started Guide Contents Introduction... 3 What is Microsoft Advertising adcenter Campaign Analytics?... 3 Useful terms... 3 Overview... 4 Get Started...
(Refer Slide Time: 2:03)
Control Engineering Prof. Madan Gopal Department of Electrical Engineering Indian Institute of Technology, Delhi Lecture - 11 Models of Industrial Control Devices and Systems (Contd.) Last time we were
A Software Tool for Thesauri Management, Browsing and Supporting Advanced Searches
J. Nogueras-Iso, J.A. Bañares, J. Lacasta, J. Zarazaga-Soria 105 A Software Tool for Thesauri Management, Browsing and Supporting Advanced Searches J. Nogueras-Iso, J.A. Bañares, J. Lacasta, J. Zarazaga-Soria
The Language Grid The Language Grid combines users' language resources and machine translators to produce high-quality translation that is customized
The Language Grid The Language Grid combines users' language resources and machine translators to produce high-quality translation that is customized to each field. The Language Grid, a software that provides
Author : Dr. Y. P. Deshpande [ Amolakchand Mahavidyalaya, Yavatmal ]
Article : Translation as a device for decoding a text Author : Dr. Y. P. Deshpande [ Amolakchand Mahavidyalaya, Yavatmal ] Abstract The paper defines Translation from linguistic and literary points of
Toad for Data Analysts, Tips n Tricks
Toad for Data Analysts, Tips n Tricks or Things Everyone Should Know about TDA Just what is Toad for Data Analysts? Toad is a brand at Quest. We have several tools that have been built explicitly for developers
Natural Language Web Interface for Database (NLWIDB)
Rukshan Alexander (1), Prashanthi Rukshan (2) and Sinnathamby Mahesan (3) Natural Language Web Interface for Database (NLWIDB) (1) Faculty of Business Studies, Vavuniya Campus, University of Jaffna, Park
Overview of Microsoft Office Word 2007
Overview of Microsoft Office What Is Word Processing? Office is a word processing software application whose purpose is to help you create any type of written communication. A word processor can be used
About the HealthStream Learning Center
About the HealthStream Learning Center HealthStream Learning Center™ Administrator access to features and functions described in the HLC Help documentation is dependent upon the administrator's role
Lottery Looper. User Manual
Lottery Looper User Manual Lottery Looper 1.7 copyright Timersoft. All rights reserved. http://www.timersoft.com The information contained in this document is subject to change without notice. This document
LOCATIONS AROUND ME (ANDROID)
IMPACT: International Journal of Research in Engineering & Technology (IMPACT: IJRET) ISSN(E): 2321-8843; ISSN(P): 2347-4599 Vol. 2, Issue 4, Apr 2014, 193-198 Impact Journals LOCATIONS AROUND ME (ANDROID)
Single Property Website Quickstart Guide
Single Property Website Quickstart Guide Win More Listings. Attract More Buyers. Sell More Homes. TABLE OF CONTENTS Getting Started... 3 First Time Registration...3 Existing Account...6 Administration
Multilingual and Localization Support for Ontologies
Multilingual and Localization Support for Ontologies Mauricio Espinoza, Asunción Gómez-Pérez and Elena Montiel-Ponsoda UPM, Laboratorio de Inteligencia Artificial, 28660 Boadilla del Monte, Spain {jespinoza,
Kentico CMS, 2011 Kentico Software. Contents. Mobile Development using Kentico CMS 6 2 Exploring the Mobile Environment 1
Contents Mobile Development using Kentico CMS 6 2 Exploring the Mobile Environment 1 Time for action - Viewing the mobile sample site 2 What just happened 4 Time for Action - Mobile device redirection
Voluntary Product Accessibility Template Blackboard Learn Release 9.1 April 2014 (Published April 30, 2014)
Voluntary Product Accessibility Template Blackboard Learn Release 9.1 April 2014 (Published April 30, 2014) Contents: Introduction Key Improvements VPAT Section 1194.21: Software Applications and Operating
DynEd International, Inc.
General Description: Proficiency Level: Course Description: Computer-based Tools: Teacher Tools: Assessment: Teacher Materials: is a multimedia course for beginning through advanced-level students of spoken
Microsoft FAST Search Server 2010 for SharePoint Evaluation Guide
Microsoft FAST Search Server 2010 for SharePoint Evaluation Guide 1 www.microsoft.com/sharepoint The information contained in this document represents the current view of Microsoft Corporation on the issues
Modern foreign languages
Modern foreign languages Programme of study for key stage 3 and attainment targets (This is an extract from The National Curriculum 2007) Crown copyright 2007 Qualifications and Curriculum Authority 2007
