E-Blind Examination System

Similar documents
Text-To-Speech Technologies for Mobile Telephony Services

How to start with 3DHOP

Open-Source, Cross-Platform Java Tools Working Together on a Dialogue System

9RLFH$FWLYDWHG,QIRUPDWLRQ(QWU\7HFKQLFDO$VSHFWV

Establishing the Uniqueness of the Human Voice for Security Applications

An Arabic Text-To-Speech System Based on Artificial Neural Networks

A secure face tracking system

E-Helpdesk: Online Helpdesk for College Campus

Speech: A Challenge to Digital Signal Processing Technology for Human-to-Computer Interaction

PSG College of Technology, Coimbatore Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS.

Web Based Maltese Language Text to Speech Synthesiser

Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2

DRAGON NATURALLYSPEAKING 12 FEATURE MATRIX COMPARISON BY PRODUCT EDITION

Develop Software that Speaks and Listens

3DHOP Local Setup. Lezione 14 Maggio 2015

Turkish Radiology Dictation System

Information Leakage in Encrypted Network Traffic

Network Monitoring Tool with LAMP Architecture

Information Technology Career Field Pathways and Course Structure

Thirukkural - A Text-to-Speech Synthesis System

A Research Using Private Cloud with IP Camera and Smartphone Video Retrieval

Hardware Implementation of Probabilistic State Machine for Word Recognition

Membering T M : A Conference Call Service with Speaker-Independent Name Dialing on AIN

AUTOMATIC PHONEME SEGMENTATION WITH RELAXED TEXTUAL CONSTRAINTS

Reading Competencies

TEXT TO SPEECH SYSTEM FOR KONKANI ( GOAN ) LANGUAGE

Robust Methods for Automatic Transcription and Alignment of Speech Signals

UQC103S1 UFCE Systems Development. uqc103s/ufce PHP-mySQL 1

Natural Language to Relational Query by Using Parsing Compiler

Voice Driven Animation System

Ericsson T18s Voice Dialing Simulator

End User Guide The guide for /ftp account owner

RFID Based 3D Indoor Navigation System Integrated with Smart Phones

CONATION: English Command Input/Output System for Computers

WebRTC: Why You Should Care and How Avaya Can Help You. Joel Ezell Lead Architect, Collaboration Environment R&D

Spot me if you can: Uncovering spoken phrases in encrypted VoIP conversations

Dragon speech recognition Nuance Dragon NaturallySpeaking 13 comparison by product. Feature matrix. Professional Premium Home.

Syllabus INFO-UB Design and Development of Web and Mobile Applications (Especially for Start Ups)

APPLYING MFCC-BASED AUTOMATIC SPEAKER RECOGNITION TO GSM AND FORENSIC DATA

Project Code: SPBX. Project Advisor : Aftab Alam. Project Team: Umair Ashraf (Team Lead) Imran Bashir Khadija Akram

Dr. Anuradha et al. / International Journal on Computer Science and Engineering (IJCSE)

DESIGN AND IMPLEMENTATION OF A FILE SHARING APPLICATION FOR ANDROID

Specialty Answering Service. All rights reserved.

Oct 15, Internet : the vast collection of interconnected networks that all use the TCP/IP protocols

VOL. 2, NO. 1, January 2012 ISSN ARPN Journal of Science and Technology ARPN Journals. All rights reserved

Fig. 1 BAN Architecture III. ATMEL BOARD

DESIGN OF CLUSTER OF SIP SERVER BY LOAD BALANCER

Automatic Text Analysis Using Drupal

Natural Language Web Interface for Database (NLWIDB)

Lecture 12: An Overview of Speech Recognition

USABILITY OF A FILIPINO LANGUAGE TOOLS WEBSITE

eattendance System for Academic Institutions

Carla Simões, Speech Analysis and Transcription Software

Things to remember when transcribing speech

Keywords Android, Copyright Protection, Discrete Cosine Transform (DCT), Digital Watermarking, Discrete Wavelet Transform (DWT), YCbCr.

A Tokenization and Encryption based Multi-Layer Architecture to Detect and Prevent SQL Injection Attack

31 Case Studies: Java Natural Language Tools Available on the Web

A SURVEY ON WEB MINING TOOLS

Understanding Web personalization with Web Usage Mining and its Application: Recommender System

SAIP 2012 Performance Engineering

A framework for Itinerary Personalization in Cultural Tourism of Smart Cities

Specialized Programme on Web Application Development using Open Source Tools

A Comparative Study of Web Development Technologies Using Open Source and Proprietary Software

A Web Based Voice Recognition System for Visually Challenged People

Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg

INTERNATIONAL JOURNAL OF ADVANCES IN COMPUTING AND INFORMATION TECHNOLOGY An International online open access peer reviewed journal

Web Design and Implementation for Online Registration at University of Diyala

Framework as a master tool in modern web development

Web Development. How the Web Works 3/3/2015. Clients / Server

Preprocessing Web Logs for Web Intrusion Detection

The Impact of Using Technology in Teaching English as a Second Language

Interwise Connect. Working with Reverse Proxy Version 7.x

Dell Enterprise Reporter 2.5. Configuration Manager User Guide

Online Recruitment - An Intelligent Approach

A Monitored Student Testing Application Using Cloud Computing

Interactive Multimedia Courses-1

So today we shall continue our discussion on the search engines and web crawlers. (Refer Slide Time: 01:02)

Log Analysis Software Architecture

METHODS OF INTEGRATING mvoip IN ADDITION TO A VoIP ENVIRONMENT

IT3504: Web Development Techniques (Optional)

ISSN: A Review: Image Retrieval Using Web Multimedia Mining

Manual. Netumo NETUMO HELP MANUAL Copyright Netumo 2014 All Rights Reserved

To ensure you successfully install Timico VoIP for Business you must follow the steps in sequence:

Syllabus INFO-GB Design and Development of Web and Mobile Applications (Especially for Start Ups)

Word Completion and Prediction in Hebrew

Developing Microsoft Azure Solutions 20532B; 5 Days, Instructor-led

JAVA-BASED FRAMEWORK FOR REMOTE ACCESS TO LABORATORY EXPERIMENTS. Department of Electrical Engineering University of Hagen D Hagen, Germany

Research on HTML5 in Web Development

A Survey on Product Aspect Ranking

STUDY AND SIMULATION OF A DISTRIBUTED REAL-TIME FAULT-TOLERANCE WEB MONITORING SYSTEM

ACKNOWLEDGMENT. I would like to thank Allah for giving me the patience to work hard and overcome all the

Design of Remote data acquisition system based on Internet of Things

Transcription:

IJIRST International Journal for Innovative Research in Science & Technology Volume 1 Issue 11 April 2015 ISSN (online): 2349-6010 E-Blind Examination System Akshay Naik Ajinkya Tandel Kavita Patil Vishal Patil Manjiri Pathak Professor & Guide Abstract The Online-Examination for Blind People is basi-cally an online examination system. It reflects the justification and objectivity of examination. It helps to release the workload of teachers and students. The current online examination systems do not allow blind people to appear for the examination. Hence, the proposed application is designed for the blind people so that they can appear for examination; it is also useful for the other handicapped people with upper limb disability. This research work provides the speech user interface for blind people to interact with the application. The proposed application will dictate the questions to the candidate with the help of Speech Synthesis. The application will accept the answers from candidates through voice commands using Speech Recognition. The proposed system also allows users (examiners/teachers) to add their questions to conduct the examination. The questions can be added easily by uploading a file. As the proposed system is online, result generation will be quick. Keywords: Attribute aware data aggregation, Dynamic routing protocol, Packet driven timing control algorithm, wireless sensor network I. INTRODUCTION A. Current System: In current system, the blind people who want to take the examination require a writer. The writer writes the answers which the blind dictate them. The traditional way for conduct-ing examination for the blind people requires scribe/writer. These candidates face difficulties in using the scribes for examinations. It is hard for them to dictate the answers to the scribes. Because, the scribes might be of lower qualification hence they cannot interpret their words and write them down as they are. B. Proposed System: Online Examination for Blind is a software solution, which allows a particular company or institute to arrange, conduct and manage examinations via online environment. This can be done through the Internet or Local Area Network Environment. Candidate can answer his/her examination paper on the computer and submit answers. The Examination Software evaluates the submitted answers and the results will be avail-able immediately after completion of the examination. The proposed application avoids main drawback of the current online examination system by providing the facility for the blind people to interact with the system. The basic architecture of the system is shown as follows II. SYSTEM ARCHITECTURE All rights reserved by www.ijirst.org 238

Fig. 1: Basic Architecture A. Web Application: The institute which wants to conduct examination will register through web application. Registering institute s ad-ministrator will fill basic information of the institute at the time of registration process. After Registration he will get the login credential of application which can be used for examination purpose. B. Databases: There are two databases; first at the admin server and other at the college server. Admin server will store all the details about the institutes who are using the proposed system. And the college server will store all the accounts of teachers, questions, results, etc. Because of this type of architecture the databases will be secure. The institute need not reveal its private data like teachers information and questions. C. Web Application: Blind user will login with his/her credential and start exam or may be start book reading. Application will dictate questions to handicapped candidates and accept answers through voice commands. Normal candidate can give examination through Graphical User Interface. D. Speech User Interface: The main feature provided in the proposed system is the Speech User Interface for the handicapped people. This is achieved by a series of steps like Speech Recognition (i.e. speech to text conversion) and Speech Synthesis (i.e. text to speech conversion). The description of these steps is explained in following sections. 1) Designing Web Application: For the proposed system HTML5 is preferred. HTML5 adds many new syntactic fea-tures. These include the new video, audio and canvas elements, as well as the integration of scalable vector graphics (SVG) content (that replaces the uses of generic object tags). These features are designed to make it easy to include and handle multimedia and graphical content on the web without having to resort to proprietary plugins and APIs. [1] 2) Designing Databases: The database contains three tables. One table contains all the student details who are appearing for the examination. Other table contains uploaded questions for the examination same as in excel file. The last table contains answers of the question given by the student and true answers of the questions for generating result of the exam. For the Proposed system PHP MySQL is preferred. For the development of robust and dynamic web application PHP MySQL is highly preferred. PHP begins with server side programming language and MySQL is open source relational database Management system software when combined, capable of delivering the unique solution. Another advantage of using this is easy to implement with simplicity. [2]By combining the feature of MySQL, back end database connectivity to the developed web application through PHP XAMPP server must be installed on your system. XAMPP is a free and open source cross-platform web server solution stack package, consisting mainly of the Apache HTTP Server, MySQL database and interpreters for scripts written in the PHP and Perl programming language. [3] 3) Accepting Questions: Examiner can upload questions by using Microsoft Excel (.xls) file. Import and export data between MySQL and Excel has become much easier in Excel 2007 than its previous versions. To explore the feature, you need to install MySQL for Excel. You can install the MySQL For Excel component while installing MySQL Community Server 6. Or you may install it as an add on upon your existing MySQL Server installation [4]. The questions will be read and stored with the options and the correct answer All rights reserved by www.ijirst.org 239

in the database. These questions and answers will be used for conducting the examination and for generating result. The advantage of this is that the examiner will be able to set question paper easily. He can add the questions in the database simply by uploading a file dynamically irrespective of its size. 4) Designing Speech User Interface: a) Speech Synthesis: The speech synthesis process has a series of steps. [5] They are shown in the following figure. Fig. 2: Speech Synthesis 1) Text Analysis: This contains string tokenization and normal-ization. It breaks a string into tokens which could be words, phrases or symbols. [5] Considering that punctuation marks are removed from words, there will be an issue with tokenization. In the normalization phase, [6] text is transformed into a more reliable way to be processed means it will become consistent so that it will be easier to deal with. An example of normalization would be dont and do not. Furthermore, an important step in normalizing text is expanding abbreviations. For example, if the string WWW is encountered, it will be normalized to World Wide Web. [5] 2) Linguistic Analysis: The normalized tokens are assigned with phonetic transcriptions. A phonetic transcription of a word is the visual representation of speech sounds. For ex-ample, the word elephant has the phonetic transcription el uh fuh nt. This process is called text-tophoneme or grapheme-to-phoneme conversion [5]. A grapheme is a unit in a written language such as a letter, a number or a punctuation mark. On the other hand, a phoneme is a unit of sound to represent the utterance of a grapheme such as the letter x which is uttered eks. When the process of text-to-phoneme or grapheme-to-phoneme is finished, the text is divided into prosodic units [7]. These units determine whether the text is a clause, phrase or sentence. To elaborate more, prosody is the rhythm, stress, and intonation of speech. Therefore, when the text is divided into these kinds of units the form of utterance of the text can be determined whether it is a statement, question, command, emphasis and so on [7]. 3) Waveform Generation: After completing the previous steps, the text will be in a Symbolic Linguistic Representation form which is phonetic transcriptions and prosody merged together. The main purpose now is converting this text into sound [5]. A phone is a speech sound and a diaphone is an adjacent pair of speech sounds. Based on this there are four synthesizer technologies that are mostly used. 1) Unit Selection Synthesis: 2) Diaphone Synthesis: 3) Domain-Specific Synthesis: 4) HMM-based Synthesis: The application uses an open source speech synthesizer, Google Translator. Google uses the open-source speech synthesizer espeak for speech synthesis. [8] espeak is a compact open source software speech synthesizer for English and other languages for Linux and Windows. espeak uses a formant synthesis method. This allows many languages to be provided in a small size. The speech is clear and can be used at high speeds but is not as natural or smooth as larger synthesizers which are based on human speech recordings. espeak is available as: 1. A command line program (Linux and Windows) to speak text from a file or from stdin. 2. A shared library version for use by other programs. (On Windows this is a DLL). 3. A SAPI5 version for Windows, so it can be used with screen-readers and other programs that support the Windows SAPI5 interface. 4. espeak has been ported to other platforms, including Android, Mac OSX and Solaris. The questions stored in the database by the examiner will be retrieved and shown to the user. The question and the options will be dictated using Google translator for handicapped people. b) Speech Recognition: The speech recognition process has a series of steps [5] as shown in the following figure. All rights reserved by www.ijirst.org 240

Fig. 3: Speech Recognition 1) Audio Input: User will interact with the application via speech commands using a microphone. 2) Feature Extraction of the Audio Signal: After obtaining the input audio signal it has to be processed and its features have to be extracted [9]. This step takes place when the signal is divided into small intervals (in terms of milliseconds) and a feature vector is generated for each interval. The input utterance is represented as a sequence of these feature vectors where each vector consists of coefficients called cepstral coefficients. 3) Acoustic Modeling: Hidden Markov Models (HMM) are used for acoustic modeling process. Such models are probabilistic models used for pattern recognition, a field in which Speech Recognition is. Every known phoneme has its own HMM represented by a graph with different states (nodes) and transition probabilities [1] from one state to another. These HMMs are obtained after extracting the features of the recorded audios that were used to prepare and train the acoustic model. Every word is made up of a number of phonemes and the HMM of a certain word is made up of the concatenation of the HMMs of the phonemes that constitute the word. These HMMs are present in the acoustic model [1] which the recognition procedure relies on. The obtained features vectors are then matched to the available HMM phonemes to be assigned to the one that most match it (with the highest probability). Then the combined assigned HMMs are used to find the HMM of the whole word to be recognized. Combining phonemes HMMs together is also a probabilistic process. Following figure shows an example of HMMs matching. Matching HMMs, on both phoneme and word levels, is done by the Viterbi algorithm. [9] [10] Fig. 4: Acoustic Modeling 4) Language Modeling: In a similar manner to the one above, the HMM of a meaningful sequence of words is a concatenation of the HMMs of the words included in this sequence. To handle this phrase or sentence HMM formation, a language model is needed that knows how to group different words together to provide the desired output. [9] 5) Textual Output: After the recognizer completes the acoustic modeling and the language modeling, the obtained textual output is returned. Again the proposed system uses Google translator for speech recognition [11]. It is client server-based. This means that the client just has to generate an audio file and then send it to Google s remote voice recognition servers. Google then responds with a formatted string containing the text recognized from the provided audio file. The whole speech recognition engine, including voice models and dynamic algorithms are placed in Google s servers. Thus, we can do speech recognition almost from any lightweight device capable to record audio. All rights reserved by www.ijirst.org 241

III. CONCLUSION The proposed system seems to be far better and efficient in terms of technology and integration point of view. The accuracy of the speech recognition system was among the top challenges. The proposed system will provide a better option for Blind people to appear for the examination. REFERENCES [1] http://en.wikipedia.org/wiki/html5. [2] http://www.slideshare.net/brainvireinfo/doc-php-my-sql-development services11102013. [3] http://en.wikipedia.org/wiki/xampp. [4] http://www.w3resource.com/mysql/exporting-and-importing-data-between-mysql-and-microsoft excel.php. [5] M. Hussein, J. Karim, and F. Marwan, Multi-purpose speech recogni-tion and speech synthesis system, IEEE MULTIDISCIPLINARY ENGI-NEERING EDUCATION MAGAZINE, vol. 6 no.4, December 2011. [6] S. Kumar, Study of various kinds of speech synthesizer technolo-gies and expression for expressive text to speech conversion system, (IJAEST) INTERNATIONAL JOURNAL OF ADVANCED ENGINEER-ING SCIENCES AND TECHNOLOGIES, vol. 8 no.2, pp. 301 305. [7] S. Thomas, Natural sounding text-to-speech synthesis. [8] http://espeak.sourceforge.net. [9] L. R. Rabiner, A tutorial on hidden markov models and selected applications in speech recognition, Proc. of the IEEE, vol. 77 no.2, pp. 257 286, 1989. [10] M. Nilsson and M. Ejnarsson, Speech recognition using hidden markov model performance evaluation in noisy environment. [11] http://panstamp.blogspot.in/2011/04/speech-recognition-from-google new.html. All rights reserved by www.ijirst.org 242