Automated Speech to Text Transcription Evaluation

Size: px
Start display at page:

Download "Automated Speech to Text Transcription Evaluation"

Transcription

1 Automated Speech to Text Transcription Evaluation Ryan H [email protected] Haikal Saliba [email protected] Patrick C [email protected] Bassem Tossoun [email protected] Chad Brantley [email protected] Gagandeep Kohli [email protected] Abstract The California State Legislature is a state governmental body that meets consistently to discuss state legislative action. During these meetings, no full transcriptions of the minutes are generally taken; instead, recordings of the long sessions are taken, should they ever need to be referenced. This presents a problem: videos are hard to extract data from. As part of a project aimed at collecting this data into a knowledge repository, we have worked to evaluate a number of different transcription softwares and services based on their ability to transcribe the data properly, and provide relevant data regarding their costs. Our results point to Microsofts MAVIS technology providing the highest quality transcript; however, we found that this is certainly not the cheapest option, considering the limited presence of open-source alternatives, like Julius and Sphinx. I. INTRODUCTION California State Legislature holds various committee meetings to discuss governmental issues. These meetings are recorded through video and audio and uploaded in bulk to the California Channel website. To obtain access, ordinary citizens and media must either search the California Channel and watch the videos or visit the California State Capitol. Through the use of modern technology, we hope to make California Legislature more easily accessible to the public. This project aims at evaluating the many transcription technologies currently available. Natural Language Processing tools such as OpenCalias will be used to obtain significant key words such as names, places, and events. The keywords obtained from OpenCalias will be used create an ontology map so documents that discuss similar domains or issues are linked together, thus, making the documents searchable. II. BACKGROUND/RELATED WORK Two of the many organizations that have taken the initiative to make US Legislature transparent are OpenCongress and OpenGovernment. OpenCongress is a non-profit, non-partisan public resource that was established when they noticed that US Congress offered few channels for the mass public to voice their opinion to policy makers. They state that there are only a few groups in the US that act on and distribute valuable information about political insiders and lobbyists. Even with technology, websites such as The Library of Congress doesnt offer a clear way for one to read and obtain documents. Therefore, OpenCongress is a webpage that offers governmental data obtained from news, blogs, and social networking to make the government more transparent. They aggregate all the data obtained from the sources mentioned above and classify bills, votes, issues, and people in congress. Finally, they use a userfriendly webpage to allow the open public to read and search for governmental data. In addition, they use social networking, such as Facebook, to allow one to share information with their friends. OpenGovernment is a public website that aims at making data about the United States three branches: executive, legislative, and judicial, free and open to the public and is made by the same founders as OpenCongress. They believe that by making data openly available, the public is more likely to engage in governmental matters, reduce corruption, promote better policy, and create a richer democratic institution. As of November 2010, OpenGovernment contains information about five legislatures: California, Louisiana, Maryland, Texas, and Wisconsin. They obtain governmental information from Open State Project, Google News, Blog Search, TransparencyData, and Project VoteSmart. Their web page is centralized by use of sort-by buttons for browsing bills and people to obtain information about particular domains. A track button allows one to obtain the latest actions of a domain. In addition, they provide users with the ability to comment and share bills or peoples documents, contact elected officials, and organize campaigns.

2 III. FEATURES/REQUIREMENTS EVAL Legislative Transparency is a long-term project with an ultimate goal of allowing the average user to easily search for information about legislative meetings and documents at a centralized place. Therefore, in order to achieve this big goal the project is broken down into iterations. The initial iteration hopes to produce meta-data tags, databases, query types, a white paper detailing the work, and a prototype. As knowledge engineers, Team 2 will focus on evaluating various audio-to-text or transcription software to find one that is lesserror prone and provide a report concluding the evaluation process that will become part of the white paper. The goal of this is to provide Dr. Blakeslee and the rest of the Legislative team with a building block for the future. The chosen software will be used to convert audio from legislative videos into text which will be processed through a Natural Language Process (NLP). NLP software will identify key speakers and information within the audio and its relationship to other meetings, and ultimately allow one to construct a database repository that one can query for desire questions. A. Feature List 1) Evaluation of speech-to-text software 2) Cost Effective (Money, time, computational resources) - Looks at free vs paid software and cost of time B. Requirement List 1) Major, Minor, and Proper Noun Errors produced by various speech-to-text software 2) Time it takes to transcribe an audio file 3) Usability/Accessibility of API, Web services, or etc - the need for human intervention such as breaking up the audio into various chunks or converting the format. C. Evaluation List 1) Chart that displays the breakdown of errors as Major, Minor, and Proper Noun. 2) Time it takes to transcribe an audio file 3) Usability will be measured by a scale of 1-5, in which 5 means the system requires major outside help to pre-process the audio and 1 means no outside work is involved beside uploading the audio and pressing transcribe. A. Technologies Explored Mavis AT&T Dragon Dictation Google Voice Voxforge/Julius IV. IMPLEMENTATION B. Mavis 1) Overview: Microsoft Audio Video Indexing Service (MAVIS) is a Windows Azure application which uses speech recognition technology developed at Microsoft Research to enable searching of digitized spoken content. MAVIS generates automatic closed captions and keywords which can increase accessibility of audio and video files with speech content. MAVIS uses a Deep Neural Net (DNN) based speech recognition] technology, which reduces errors in speech recognition by automatically expanding its vocabulary and storing word alternatives using a technique referred to as Probabilistic Word-Lattice Indexing. More explanation is available at the Microsoft website in the technical background. MAVIS, the technology at the foundation of the Washington Post s Truth Teller Project, was proven to transcribe sessions of Congress and fact check them. It is worth taking a look into the technology. Cost $20 per hour Major Errors Minor Errors Proper Noun Errors Noun Recognition ) Advantages/Strengths: Hosted solution in the cloud Transcribes multiple speakers No initial voice training required Good customer support Better at recognizing names than other technologies Words that are confidently understood are in bold script Wide variety of input files allowed Captions synced to video 4) Disadvantages/Weaknesses: Punctuation and capitalization can appear arbitrary at times Transcription of a 20 minute video can take up to 2 hours Words can tend to be left out altogether if not understand Strange characters can appear in the transcript C. AT&T 1) Overview: AT&Ts Speech API is a cloud-based service meant to transcribe audio to text using AT&Ts Watson speech engine. In order to do this, AT&T requires that you specify a relevant context for it to gather data from; all contexts are built into the service with no ability to specify your own context. In total, AT&T provides and maintains 7 contexts, including: Web Search Business Search Voic To Text SMS

3 Question and Answer TV Generic Being a cloud-based service, most of the hard work is done on AT&Ts platform. As such, the API is able to be called from many different environments and languages to achieve the same results. Requests are made to AT&T servers through an HTTP request, which perform speech-to-text analysis on the input files using Watson speech engine. Input file formats can be of two types: WAV, 16-bit PCM, single channel, 8 khz sampling AMR (narrowband), 12.2 kbit/s, 8 khz sampling (recommended) As an additional constraint, audio files can only be sent 4 minutes at a time. AT&T provides a number of APIs to use their service, supporting the following environments: HTML5 MS RESTful As a result, most languages can give a speech-to-text request to AT&T, include Java, Ruby, and C#. Language Cost RESTful Java $99/yr + $0.01/API call past 1 million/mth Correct Proper Major Errors Minor Errors Noun Recognition ) Advantages: Cheap: 1 yearly fee of $99 + $0.01 per API call past 1 million/month Easy to use and versatile: any language with HTTP support should be able to use it Works on multiple speakers Quick calculation: around 1 min audio / 1 min calculation 4) Disadvantages: 4 minutes at a time; must break up long text Transcription is not very strong; many errors AMR audio format (mostly) required : WAV format worked inconsistently Proper noun recognition is bad: doesnt capitalize except for start of sentence, and often errors in names Poor punctuation: seems arbitrary at times D. Dragon Dictation 1) Overview: Dragon Dictation is speech recognition software that lets you use your voice to create and edit text or interact with applications on your machine. It lets you use your voice to create and edit documents, manage , surf the Web, and more. It also provides digital voice software for mobile devices that let you capture your notes on-the-go and transcribe them with Dragon Dictate. The software is not 100 percent accurate out of the box and depends on the user correcting its dictation as it s used. The more it is used, and the more it s corrected, the better and more accurate its language model becomes. You can even use recordings that you ve made on your mobile device in order to build your personal language model. Although Dragon appeared to be a solid transcription technology for a single user, it proved that it was intended for exactly that: a single user. Output from Dragon also did not have any punctuation. For our purposes, it is not worth pursuing further evaluation of Dragon. Cost Platform $200 Windows, Mac OS X Proper Noun Errors Major Errors Minor Errors ) Advantages: Relatively malleable language model Transcribes audio relatively quickly Can easily load audio files with a range of different formats 4) Disadvantages: Requires voice training Intended to learn a single users speech patterns No punctuation Proper nouns may get lost in the noise E. Google Voice 1) Overview: The Google Voice API is a speech recognition API that supports audio to text automation. It allows you to use your voice to create and edit text or interact with applications on your machine. Google Voice has its own software and also provides the framework and essence of the Closed Captioning feature on YouTube. The software is often used to translate voice mail messages to text in order to provide a message to the user without the user having to listen to it. The Google Voice API can also be found in Android mobile phones, which it provides for Speech Recognition and navigation through applications on the phone. This version of the Google Voice API is actually not public and can support any size videos. A Speech2Text program was written using this version of Google Voice API function calls, which takes in a WAV file and outputs the text it transcribes from the audio file. The software still has a few rough edges and also a fatal flaw when trying to process audio files with sections of little or no sound (variability in frequency). The program does a decent job, and because its code is available and editable, hopefully, can be improved by us.

4 Cost $0 Major Errors Minor Errors Proper Noun Errors Noun Recognition ) Advantages/Strengths: Transcribes audio relatively quickly Free Can transcribe any length video 4) Disadvantages/Weaknesses: Only supports WAV files Has trouble with audio files that includes sections of little or no sound No punctuation Proper nouns may get lost in the noise F. VoxForge/Julius Voxforge is the most complete open-source English speech corpus; it compiles speech into acoustic models for other software systems such as: Julius, Sphinx, and HTK to work with. Using this data, these software systems can match certain sets of the resulting acoustic model to words, or perform other operations on them. Julius is an open-source speech recognition system; its development began in 1997 in Japan and since has been refit to work for many different languages. Julius requires two things to interpret speech: an acoustic model, which Voxforge provides, and a grammar of words to match the audio against. The grammar, however, must be tailored to the acoustic model, and few generic grammars seem to exist; as such, the Julius/Voxforge combo seems like a difficult option, or one that might require more time to get setup and evaluate. V. VALIDATION For the evaluation of various software, a 6-minute sample of a legislative meeting was extracted and manually transcribed. The sample was passed through various transcription software, which produced output transcripts. The location and number of errors made by each software was compared to the manual transcription. A. Error Definition An error is defined from where the first error occurred to the end of where that type of error occurred. Errors are defined this way because an error s beginning is usually the root cause for the rest of a phrase to be invalid. B. Error Types: 1) Major (Red marks):: Continuous stream of incorrect words Continuous stream of missing words 2) Minor (Yellow marks):: One word error Spelling error Grammar error (two/too) Capitalization error Period or thought break error Commas are not counted as minor errors 3) Proper Nouns (Green marks):: Inability to identify proper nouns correctly (USCB, California, Names, Senator). Proper noun errors are counted as either a part of minor or major error. They are major if their context includes a major error, minor otherwise. We consider uncapitalized nouns an error, because Natural Language Processing software relies on correct use of nouns to identify key people and places. Therefore, we would like to minimize the number of errors that will result from Natural Language Processing software by picking a robust transcription software. C. Usability Criteria Transcription software is evaluated on several qualitative measures as well. How readable is the transcript overall (1-5, 5 = most readable). If the reader can understand the content in spite of the errors, readability is high. How easy the software is to set up initially (1-5, 5=easy) How easy the software is to continually use after initial setup (1-5, 5=easy) General advantages / strengths General disadvantages / weaknesses VI. CONCLUSION According to our results Mavis is the best choice for this use case. Even though transcription of a single file may take hours, more than one file can be processed at a time in parallel on Microsofts cloud. The AT&T API, though comparable to Mavis in terms of number of errors, often results in low readability transcripts and requires more effort in manually correcting those errors. Google Voice, though free, results in highly unreadable transcripts with a large number of errors. Using Dragon Dictation results in highly unreadable transcripts as well, mainly due to the fact that Dragon Dictation is not tailored for such a use case, instead training on a single speaker. The main concern towards using Mavis would be the price as the software is not open source, and using it requires a paid subscription. However, even with Mavis, the resulting transcripts are still unreadable, with some major errors, as well as many proper noun errors. Though the reader would be able to follow the logic of the transcript, the document would still require manual correction to achieve correct transcription.

5 System Cost Platform Major Errors Minor Errors Proper Noun General Ease of Ease of Errors Readability Setup Continued (1-5, (1-5, Use (1-5, 5 = readable) 5 = easy) 5 = easy) MAVIS $20/h Microsoft Azure AT&T $99 + RESTful $0.01/API Java call past 1 million Dragon $200 Windows/Mac Dictation Application V. 11 Google $0 Windows/Mac Voice Application TABLE I OVERALL SYSTEM COMPARISON VII. FUTURE WORK The final intent of the legislature project is to allow ordinary citizens and media to search through California State legislature hearing. This white paper mainly focuses on various transcription technology and reaches the conclusion that there isnt an ideal transcription software. Therefore, the legislature team envisions to take the transcription one step further by taking each of the audio transcriptions from the various technologies and process it through OpenCalais. OpenCalais is a web service that analyzes textual documents to find named entities, facts, and events known as metadata. With the help of OpenCalais, union of documents metadata can be used to reduce the noise or transcription errors, and further OpenCalais provides relevance of each metadata. The relevance weight indicated how relevant and important the metadata is. As a result, metadata with relevance score of.4 or above can be used as keywords or tags for searching through the document. Though many of the APIs evaluated did not output human readable documents, we are curious as to whether analyzing the output through a tagging system results in accurate tags. As such, we plan to use OpenCalais to analyze our output files from each of the evaluated APIs and retrieve the tags associated with the output. We then plan to compare the resultant tags against the actual nature of the analyzed audio file to determine whether the tags are valid and represent major themes portrayed in the analyzed file. REFERENCES [1] H. Kopka and P. W. Daly, A Guide to LATEX, 3rd ed. Harlow, England: Addison-Wesley, 1999.

Speech Recognition Software Review

Speech Recognition Software Review Contents 1 Abstract... 2 2 About Recognition Software... 3 3 How to Choose Recognition Software... 4 3.1 Standard Features of Recognition Software... 4 3.2 Definitions... 4 3.3 Models... 5 3.3.1 VoxForge...

More information

Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast

Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast Hassan Sawaf Science Applications International Corporation (SAIC) 7990

More information

COPYRIGHT 2011 COPYRIGHT 2012 AXON DIGITAL DESIGN B.V. ALL RIGHTS RESERVED

COPYRIGHT 2011 COPYRIGHT 2012 AXON DIGITAL DESIGN B.V. ALL RIGHTS RESERVED Subtitle insertion GEP100 - HEP100 Inserting 3Gb/s, HD, subtitles SD embedded and Teletext domain with the Dolby HSI20 E to module PCM decoder with audio shuffler A A application product note COPYRIGHT

More information

Automatic measurement of Social Media Use

Automatic measurement of Social Media Use Automatic measurement of Social Media Use Iwan Timmer University of Twente P.O. Box 217, 7500AE Enschede The Netherlands [email protected] ABSTRACT Today Social Media is not only used for personal

More information

JK WEBCOM TECHNOLOGIES

JK WEBCOM TECHNOLOGIES Who We Are? JK Webcom Technologies has been providing unending services to the audience at large since August 2004. Located in Rajouri Garden in New Delhi, we operate and serve individuals and businesses

More information

Closed captions are better for YouTube videos, so that s what we ll focus on here.

Closed captions are better for YouTube videos, so that s what we ll focus on here. Captioning YouTube Videos There are two types of captions for videos: closed captions and open captions. With open captions, the captions are part of the video itself, as if the words were burned into

More information

Language Translation Services RFP Issued: January 1, 2015

Language Translation Services RFP Issued: January 1, 2015 Language Translation Services RFP Issued: January 1, 2015 The following are answers to questions Brand USA has received to the RFP for Language Translation Services. Thanks to everyone who submitted questions

More information

Digital Asset Management. Content Control for Valuable Media Assets

Digital Asset Management. Content Control for Valuable Media Assets Digital Asset Management Content Control for Valuable Media Assets Overview Digital asset management is a core infrastructure requirement for media organizations and marketing departments that need to

More information

How to Upload and Caption Videos on YouTube

How to Upload and Caption Videos on YouTube How to Upload and Caption Videos on YouTube Criteria: Must have a gmail account to upload a video Video Sign In: Launch YouTube.com Click on Sign In and login with your gmail account and password Uploading

More information

C E D A T 8 5. Innovating services and technologies for speech content management

C E D A T 8 5. Innovating services and technologies for speech content management C E D A T 8 5 Innovating services and technologies for speech content management Company profile 25 years experience in the market of transcription/reporting services; Cedat 85 Group: Cedat 85 srl Subtitle

More information

Enhancing Document Review Efficiency with OmniX

Enhancing Document Review Efficiency with OmniX Xerox Litigation Services OmniX Platform Review Technical Brief Enhancing Document Review Efficiency with OmniX Xerox Litigation Services delivers a flexible suite of end-to-end technology-driven services,

More information

What do Big Data & HAVEn mean? Robert Lejnert HP Autonomy

What do Big Data & HAVEn mean? Robert Lejnert HP Autonomy What do Big Data & HAVEn mean? Robert Lejnert HP Autonomy Much higher Volumes. Processed with more Velocity. With much more Variety. Is Big Data so big? Big Data Smart Data Project HAVEn: Adaptive Intelligence

More information

[Ramit Solutions] www.ramitsolutions.com SEO SMO- SEM - PPC. [Internet / Online Marketing Concepts] SEO Training Concepts SEO TEAM Ramit Solutions

[Ramit Solutions] www.ramitsolutions.com SEO SMO- SEM - PPC. [Internet / Online Marketing Concepts] SEO Training Concepts SEO TEAM Ramit Solutions [Ramit Solutions] www.ramitsolutions.com SEO SMO- SEM - PPC [Internet / Online Marketing Concepts] SEO Training Concepts SEO TEAM Ramit Solutions [2014-2016] By Lathish Difference between Offline Marketing

More information

Industry Guidelines on Captioning Television Programs 1 Introduction

Industry Guidelines on Captioning Television Programs 1 Introduction Industry Guidelines on Captioning Television Programs 1 Introduction These guidelines address the quality of closed captions on television programs by setting a benchmark for best practice. The guideline

More information

U.S. Department of Health and Human Services (HHS) The Office of the National Coordinator for Health Information Technology (ONC)

U.S. Department of Health and Human Services (HHS) The Office of the National Coordinator for Health Information Technology (ONC) U.S. Department of Health and Human Services (HHS) The Office of the National Coordinator for Health Information Technology (ONC) econsent Trial Project Architectural Analysis & Technical Standards Produced

More information

3PlayMedia. Closed Captioning, Transcription, and Subtitling

3PlayMedia. Closed Captioning, Transcription, and Subtitling Closed Captioning, Transcription, and Subtitling 1 Introduction This guide shows you the basics of how to quickly create high quality transcripts, closed captions, translations, and interactive transcripts

More information

Website Accessibility Under Title II of the ADA

Website Accessibility Under Title II of the ADA Chapter 5 Website Accessibility Under Title II of the ADA In this chapter, you will learn how the nondiscrimination requirements of Title II of 1 the ADA apply to state and local government websites. Chapter

More information

AFTER EFFECTS FOR FLASH FLASH FOR AFTER EFFECTS

AFTER EFFECTS FOR FLASH FLASH FOR AFTER EFFECTS and Adobe Press. For ordering information, CHAPTER please EXCERPT visit www.peachpit.com/aeflashcs4 AFTER EFFECTS FOR FLASH FLASH FOR AFTER EFFECTS DYNAMIC ANIMATION AND VIDEO WITH ADOBE AFTER EFFECTS

More information

GOALS FOR TODAY S WORKSHOP

GOALS FOR TODAY S WORKSHOP GOALS FOR TODAY S WORKSHOP UNDERSTANDING WHAT SOCIAL MEDIA IS RIGHT FOR YOUR BUSINESS ONLINE ADVERTISING (SOCIAL, WEB SEO & SEM) COMPUTER NETWORK BASICS AND HOW TO LEVERAGE CLOUD COMPUTING SETTING UP YOUR

More information

Dragon Solutions Transcription Workflow

Dragon Solutions Transcription Workflow Solutions Transcription Workflow summary Improving Transcription and Workflow Efficiency Law firms have traditionally relied on expensive paralegals, legal secretaries, or outside services to transcribe

More information

Closed Captioning and Educational Video Accessibility

Closed Captioning and Educational Video Accessibility the complete guide to Closed Captioning and Educational Video Accessibility MEDIACORE WHY ARE CLOSED CAPTIONS IMPORTANT? Video learning is exploding! Today, video is key to most online and blended courses,

More information

Extracting and Preparing Metadata to Make Video Files Searchable

Extracting and Preparing Metadata to Make Video Files Searchable Extracting and Preparing Metadata to Make Video Files Searchable Meeting the Unique File Format and Delivery Requirements of Content Aggregators and Distributors Table of Contents Executive Overview...

More information

SmallBiz Dynamic Theme User Guide

SmallBiz Dynamic Theme User Guide SmallBiz Dynamic Theme User Guide Table of Contents Introduction... 3 Create Your Website in Just 5 Minutes... 3 Before Your Installation Begins... 4 Installing the Small Biz Theme... 4 Customizing the

More information

DRAGON NATURALLYSPEAKING 12 FEATURE MATRIX COMPARISON BY PRODUCT EDITION

DRAGON NATURALLYSPEAKING 12 FEATURE MATRIX COMPARISON BY PRODUCT EDITION 1 Recognition Accuracy Turns your voice into text with up to 99% accuracy NEW - Up to a 20% improvement to out-of-the-box accuracy compared to Dragon version 11 Recognition Speed Words appear on the screen

More information

Hosted Fax Mail. Hosted Fax Mail. User Guide

Hosted Fax Mail. Hosted Fax Mail. User Guide Hosted Fax Mail Hosted Fax Mail User Guide Contents 1 About this Guide... 2 2 Hosted Fax Mail... 3 3 Getting Started... 4 3.1 Logging On to the Web Portal... 4 4 Web Portal Mailbox... 6 4.1 Checking Messages

More information

Transcription FAQ. Can Dragon be used to transcribe meetings or interviews?

Transcription FAQ. Can Dragon be used to transcribe meetings or interviews? Transcription FAQ Can Dragon be used to transcribe meetings or interviews? No. Given its amazing recognition accuracy, many assume that Dragon speech recognition would be an ideal solution for meeting

More information

interviewscribe User s Guide

interviewscribe User s Guide interviewscribe User s Guide YANASE Inc 2012 Contents 1.Overview! 3 2.Prepare for transcribe! 4 2.1.Assign the audio file! 4 2.2.Playback Operation! 5 2.3.Adjust volume and sound quality! 6 2.4.Adjust

More information

CREATING AND EDITING CONTENT AND BLOG POSTS WITH THE DRUPAL CKEDITOR

CREATING AND EDITING CONTENT AND BLOG POSTS WITH THE DRUPAL CKEDITOR Drupal Website CKeditor Tutorials - Adding Blog Posts, Images & Web Pages with the CKeditor module The Drupal CKEditor Interface CREATING AND EDITING CONTENT AND BLOG POSTS WITH THE DRUPAL CKEDITOR "FINDING

More information

Utilizing Automatic Speech Recognition to Improve Deaf Accessibility on the Web

Utilizing Automatic Speech Recognition to Improve Deaf Accessibility on the Web Utilizing Automatic Speech Recognition to Improve Deaf Accessibility on the Web Brent Shiver DePaul University [email protected] Abstract Internet technologies have expanded rapidly over the past two

More information

YouTube optimisation best practice guide

YouTube optimisation best practice guide YouTube optimisation best practice guide 23 rd April 2015 Alex Ovsianikov, Senior Natural Search Analyst Oliver Robertson, Senior Natural Search Analyst Dan Spry, Digital Promotions Analyst James Allen,

More information

Unit Title: Content Management System Website Creation

Unit Title: Content Management System Website Creation Unit Credit Value: 7 Unit Level: Three Unit Guided Learning Hours: 36 Ofqual Unit Reference Number: H/503/9327 Unit Review Date: 31/12/2016 Unit Sector: 15.3 Business Management Unit Summary This unit

More information

Phone Products. TeleForum. Mobilize Predictive Dialer

Phone Products. TeleForum. Mobilize Predictive Dialer Phone Products TeleForum Mobilize Predictive Dialer Automated (Outbound, Patch-Through, Ringless Voicemail Drops, Polls, Inbound, IVR and Cloud Routing) Why Democracy Partners? Democracy Partners are experienced

More information

MANAGEMENT AND AUTOMATION TOOLS

MANAGEMENT AND AUTOMATION TOOLS MANAGEMENT AND AUTOMATION TOOLS A guide to help with the automation and management of your social media presence 2 April 2012 Version 1.0 Contents Contents 2 Introduction 3 Skill Level 3 Terminology 3

More information

WHITEPAPER. Text Analytics Beginner s Guide

WHITEPAPER. Text Analytics Beginner s Guide WHITEPAPER Text Analytics Beginner s Guide What is Text Analytics? Text Analytics describes a set of linguistic, statistical, and machine learning techniques that model and structure the information content

More information

GRAPHICAL USER INTERFACE, ACCESS, SEARCH AND REPORTING

GRAPHICAL USER INTERFACE, ACCESS, SEARCH AND REPORTING MEDIA MONITORING AND ANALYSIS GRAPHICAL USER INTERFACE, ACCESS, SEARCH AND REPORTING Searchers Reporting Delivery (Player Selection) DATA PROCESSING AND CONTENT REPOSITORY ADMINISTRATION AND MANAGEMENT

More information

Your Individual Website Assessment Includes comparison to June 2008 manufacturing study data NAME of COMPANY for WEBSITENAME

Your Individual Website Assessment Includes comparison to June 2008 manufacturing study data NAME of COMPANY for WEBSITENAME WEBSITE ASSESSMENT Subject: For: Company: Your Individual Website Assessment Includes comparison to June 2008 manufacturing study data NAME of COMPANY for WEBSITENAME COMPANY NOTE: The additional elements

More information

WEB DESIGN & SEO PLANNING WORKSHEET

WEB DESIGN & SEO PLANNING WORKSHEET Company: Contact: Address: Email: State: City: Zip: Phone: Domain Name: Domain Registrar: Host Server: Host Directory: Username: Password: Before ABS Technologies can construct or build your website, we

More information

ASR Resource Websites

ASR Resource Websites ATIM Module ASR Page 1 of 5 ASR Resource Websites Adaptive Solutions, Inc. Dragon Products: voice recognition products and microphones. http://www.talksight.com/home.html Ars technical: Review of Speech

More information

8000hz Mono (single) Sound 16-bit

8000hz Mono (single) Sound 16-bit Recording and Uploading Voice Mail Greetings Using the Clearspan Web Portal, you can store multiple voice mail greetings. To create a voice mail greeting, you must use an audio recorder. We recommend using

More information

Kore Bots Platform Competitive Comparison Overview Kore Bots Platform Competitive Comparison Overview

Kore Bots Platform Competitive Comparison Overview Kore Bots Platform Competitive Comparison Overview Kore Bots Competitive Comparison Overview Kore Bots Competitive Comparison Overview 1 Kore Bots Competitive Comparison Overview Kore The intelligent Bots for the Enterprise Introduction Bots have officially

More information

Embedding Multimedia in Blackboard

Embedding Multimedia in Blackboard Embedding Multimedia in Blackboard Embedding videos Locate the video or podcast you would like the share. This example uses a cat- tastic YouTube video. (Curious? Click the image below.) 1. Find the button

More information

Texas Success Initiative (TSI) Assessment

Texas Success Initiative (TSI) Assessment Texas Success Initiative (TSI) Assessment Interpreting Your Score 1 Congratulations on taking the TSI Assessment! The TSI Assessment measures your strengths and weaknesses in mathematics and statistics,

More information

Voice Driven Animation System

Voice Driven Animation System Voice Driven Animation System Zhijin Wang Department of Computer Science University of British Columbia Abstract The goal of this term project is to develop a voice driven animation system that could take

More information

ITP 342 Mobile App Development. APIs

ITP 342 Mobile App Development. APIs ITP 342 Mobile App Development APIs API Application Programming Interface (API) A specification intended to be used as an interface by software components to communicate with each other An API is usually

More information

Clarified Communications

Clarified Communications Clarified Communications WebWorks Chapter 1 Who We Are WebWorks was founded due to the electronics industry s requirement for User Guides in Danish. The History WebWorks was founded in 2004 as a direct

More information

An elearning platform for distanced collaborative programming

An elearning platform for distanced collaborative programming An elearning platform for distanced collaborative programming Final report by Low Hau Sum Team Member: Chow Tsz Wun, Low Hau Sum, Mok Ka Hei Supervisor: Dr Chui C K FYP14006 2 Table of Contents 1 Introduction...

More information

Using a Digital Recorder with Dragon NaturallySpeaking

Using a Digital Recorder with Dragon NaturallySpeaking Using a Digital Recorder with Dragon NaturallySpeaking For those desiring to record dictation on the go and later have it transcribed by Dragon, the use of a portable digital dictating device is a perfect

More information

SPeach: Automatic Classroom Captioning System for Hearing Impaired

SPeach: Automatic Classroom Captioning System for Hearing Impaired SPeach: Automatic Classroom Captioning System for Hearing Impaired Andres Cedeño, Riya Fukui, Zihe Huang, Aaron Roe, Chase Stewart, Peter Washington Problem Definition Over one in seven Americans have

More information

Microsoft OneNote. Presented by Ben M. Schorr OM42 5/22/2014 2:15 PM - 3:15 PM. May 19-22, 2014, Toronto ON Canada

Microsoft OneNote. Presented by Ben M. Schorr OM42 5/22/2014 2:15 PM - 3:15 PM. May 19-22, 2014, Toronto ON Canada May 19-22, 2014, Toronto ON Canada Microsoft OneNote Presented by Ben M. Schorr OM42 5/22/2014 2:15 PM - 3:15 PM The handouts and presentations attached are copyright and trademark protected and provided

More information

WRITING FOR THE WEB. Lynn Villeneuve [email protected]

WRITING FOR THE WEB. Lynn Villeneuve lynn@astrolabewebsites.ca . WRITING FOR THE WEB Lynn Villeneuve [email protected] Adopting a specialized writing style for the web is important for reasons such as readability, search engine optimization and accessibility.

More information

Automated Lecture Transcription

Automated Lecture Transcription Automated Lecture Transcription Brandon Muramatsu [email protected] MIT, Office of Educational Innovation and Technology (Really, I m just moonlighting as an OCWC Staffer ) Citation: Muramatsu, B. (2009). Automated

More information

The preliminary design of a wearable computer for supporting Construction Progress Monitoring

The preliminary design of a wearable computer for supporting Construction Progress Monitoring The preliminary design of a wearable computer for supporting Construction Progress Monitoring 1 Introduction Jan Reinhardt, TU - Dresden Prof. James H. Garrett,Jr., Carnegie Mellon University Prof. Raimar

More information

media kit 2014 PUBLISH / DEVELOP Global Mobile Ad Network

media kit 2014 PUBLISH / DEVELOP Global Mobile Ad Network media kit 2014 PUBLISH / DEVELOP Global Mobile Ad Network WHY MOBILE PUBLISHING Proliferation of smartphone devices and tablets is shifting the way that customers use Internet, making advertising a key

More information

Unlocking Value from. Patanjali V, Lead Data Scientist, Tiger Analytics Anand B, Director Analytics Consulting,Tiger Analytics

Unlocking Value from. Patanjali V, Lead Data Scientist, Tiger Analytics Anand B, Director Analytics Consulting,Tiger Analytics Unlocking Value from Patanjali V, Lead Data Scientist, Anand B, Director Analytics Consulting, EXECUTIVE SUMMARY Today a lot of unstructured data is being generated in the form of text, images, videos

More information

Understanding Video Lectures in a Flipped Classroom Setting. A Major Qualifying Project Report. Submitted to the Faculty

Understanding Video Lectures in a Flipped Classroom Setting. A Major Qualifying Project Report. Submitted to the Faculty 1 Project Number: DM3 IQP AAGV Understanding Video Lectures in a Flipped Classroom Setting A Major Qualifying Project Report Submitted to the Faculty Of Worcester Polytechnic Institute In partial fulfillment

More information

International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 ISSN 2229-5518

International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 ISSN 2229-5518 International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 INTELLIGENT MULTIDIMENSIONAL DATABASE INTERFACE Mona Gharib Mohamed Reda Zahraa E. Mohamed Faculty of Science,

More information

Sentiment Analysis on Big Data

Sentiment Analysis on Big Data SPAN White Paper!? Sentiment Analysis on Big Data Machine Learning Approach Several sources on the web provide deep insight about people s opinions on the products and services of various companies. Social

More information

A GrAF-compliant Indonesian Speech Recognition Web Service on the Language Grid for Transcription Crowdsourcing

A GrAF-compliant Indonesian Speech Recognition Web Service on the Language Grid for Transcription Crowdsourcing A GrAF-compliant Indonesian Speech Recognition Web Service on the Language Grid for Transcription Crowdsourcing LAW VI JEJU 2012 Bayu Distiawan Trisedya & Ruli Manurung Faculty of Computer Science Universitas

More information

Video Marketing for Financial Advisors How financial advisors can use online video to attract prospects and enhance their reputation

Video Marketing for Financial Advisors How financial advisors can use online video to attract prospects and enhance their reputation How financial advisors can use online video to attract prospects and enhance their reputation Hundreds of people visit your website long before they step foot in your office for this reason, it s important

More information

Controlling the computer with your voice

Controlling the computer with your voice AbilityNet Factsheet August 2015 Controlling the computer with your voice This factsheet provides an overview of how you can control computers (and tablets and smartphones) with your voice. Communication

More information

WASHINGTON STATE LEGISLATURE RSS TUTORIAL HOW TO USE RSS TO BE NOTIFIED WHEN BILLS CHANGE STATUS

WASHINGTON STATE LEGISLATURE RSS TUTORIAL HOW TO USE RSS TO BE NOTIFIED WHEN BILLS CHANGE STATUS WASHINGTON STATE LEGISLATURE RSS TUTORIAL HOW TO USE RSS TO BE NOTIFIED WHEN BILLS CHANGE STATUS January 3, 2007 What is RSS? RSS stands for Really Simple Syndication. RSS programs called newsreaders allow

More information

First, read the Editing Software Overview that follows so that you have a better understanding of the process.

First, read the Editing Software Overview that follows so that you have a better understanding of the process. Instructions In this course, you learn to transcribe and edit reports. When transcribing a report, you listen to dictation and create the entire document. When editing a report, the speech recognition

More information

Video Transcription in MediaMosa

Video Transcription in MediaMosa Video Transcription in MediaMosa Proof of Concept Version 1.1 December 28, 2011 SURFnet/Kennisnet Innovatieprogramma Het SURFnet/ Kennisnet Innovatieprogramma wordt financieel mogelijk gemaakt door het

More information

60% 60% 32 Good Signals. 26 Issues Found. Keyword. Landing Page Audit. UK News. www.bbc.co.uk. Put the important stuff above the fold.

60% 60% 32 Good Signals. 26 Issues Found. Keyword. Landing Page Audit. UK News. www.bbc.co.uk. Put the important stuff above the fold. 32 Good Signals 26 Issues Found Page Grade Put the important stuff above the fold. SPEED SECONDS 3.7 KILOBYTES 1109.09 REQUESTS 40 This page should load quicker This size of this page is ok The number

More information

How To Manage Your Digital Assets On A Computer Or Tablet Device

How To Manage Your Digital Assets On A Computer Or Tablet Device In This Presentation: What are DAMS? Terms Why use DAMS? DAMS vs. CMS How do DAMS work? Key functions of DAMS DAMS and records management DAMS and DIRKS Examples of DAMS Questions Resources What are DAMS?

More information

Legal Informatics Final Paper Submission Creating a Legal-Focused Search Engine I. BACKGROUND II. PROBLEM AND SOLUTION

Legal Informatics Final Paper Submission Creating a Legal-Focused Search Engine I. BACKGROUND II. PROBLEM AND SOLUTION Brian Lao - bjlao Karthik Jagadeesh - kjag Legal Informatics Final Paper Submission Creating a Legal-Focused Search Engine I. BACKGROUND There is a large need for improved access to legal help. For example,

More information

ilegislate The leading mobile application for paperless agendas www.granicus.com You can reach us at: (415) 357-3618 Overview

ilegislate The leading mobile application for paperless agendas www.granicus.com You can reach us at: (415) 357-3618 Overview ilegislate The leading mobile application for paperless agendas connecting government Convenient access to meeting agendas and supporting documents Reduce paper consumption and move to a paperless environment

More information

SEO REPORT. Prepared for searchoptions.com.au

SEO REPORT. Prepared for searchoptions.com.au REPORT Prepared for searchoptions.com.au March 24, 2016 searchoptions.com.au ISSUES FOUND ON YOUR SITE (MARCH 24, 2016) This report shows the issues that, when solved, will improve your site rankings and

More information

Voice-Recognition Software An Introduction

Voice-Recognition Software An Introduction Voice-Recognition Software An Introduction What is Voice Recognition? Voice recognition is an alternative to typing on a keyboard. Put simply, you talk to the computer and your words appear on the screen.

More information

Contents. Meltwater Quick-Start Guide

Contents. Meltwater Quick-Start Guide Meltwater Quick-Start Guide Contents Introduction... 2 Meltwater at a Glance... 2 Logging in... 3 Account Management... 3 Searches... 4 Keyword Search... 6 Advanced Search... 7 Source Selections... 9 Inbox...

More information

Customer Service Plan

Customer Service Plan Customer Service Plan 10/26/11 Executive Summary The United States has a long history of extending a helping hand to those people overseas struggling to make a better life, recover from a disaster or striving

More information

Folksonomies versus Automatic Keyword Extraction: An Empirical Study

Folksonomies versus Automatic Keyword Extraction: An Empirical Study Folksonomies versus Automatic Keyword Extraction: An Empirical Study Hend S. Al-Khalifa and Hugh C. Davis Learning Technology Research Group, ECS, University of Southampton, Southampton, SO17 1BJ, UK {hsak04r/hcd}@ecs.soton.ac.uk

More information

Dragon speech recognition Nuance Dragon NaturallySpeaking 13 comparison by product. Feature matrix. Professional Premium Home.

Dragon speech recognition Nuance Dragon NaturallySpeaking 13 comparison by product. Feature matrix. Professional Premium Home. matrix Recognition accuracy Recognition speed System configuration Turns your voice into text with up to 99% accuracy New - Up to a 15% improvement to out-of-the-box accuracy compared to Dragon version

More information

WHY DIGITAL ASSET MANAGEMENT? WHY ISLANDORA?

WHY DIGITAL ASSET MANAGEMENT? WHY ISLANDORA? WHY DIGITAL ASSET MANAGEMENT? WHY ISLANDORA? Digital asset management gives you full access to and control of to the true value hidden within your data: Stories. Digital asset management allows you to

More information

Genie Gateway Buyer s Guide. Introducing the Features, Functions & Tools

Genie Gateway Buyer s Guide. Introducing the Features, Functions & Tools Genie Gateway Buyer s Guide Introducing the Features, Functions & Tools Welcome to the Genie Gateway Genie Gateway is the faster safer way to pay and get paid online, via mobile devices, in store or by

More information

CONCEPTCLASSIFIER FOR SHAREPOINT

CONCEPTCLASSIFIER FOR SHAREPOINT CONCEPTCLASSIFIER FOR SHAREPOINT PRODUCT OVERVIEW The only SharePoint 2007 and 2010 solution that delivers automatic conceptual metadata generation, auto-classification and powerful taxonomy tools running

More information

Longman English Interactive

Longman English Interactive Longman English Interactive Level 2 Orientation (English version) Quick Start 2 Microphone for Speaking Activities 2 Translation Setting 3 Goals and Course Organization 4 What is Longman English Interactive?

More information

INBOUND MARKETING. should do online. Put up a website? Google Adwords? Facebook Ads? Both? Something else?

INBOUND MARKETING. should do online. Put up a website? Google Adwords? Facebook Ads? Both? Something else? 1 INBOUND MARKETING Digitally marketing a product or service can get complicated. Before digital came along things seemed easier. Consider a farmer s market: a farmer has a product and displays it on a

More information

Glossary of terms used in the survey

Glossary of terms used in the survey Glossary of terms used in the survey 5 October 2015 Term or abbreviation Audio / video capture Refers to the recording of audio and/or video. API Application programming interface, how a computer program

More information

SharePoint & Azure: Digital Asset Management

SharePoint & Azure: Digital Asset Management SharePoint & Azure: Digital Asset Management Project Leadership Microsoft Solutions Provider Proven Results www.attunix.com Introduction Attunix Corporation: A Bellevue, WA based business & technology

More information

The Definitive Guide to. Video SEO. i5 web works Email: [email protected] Phone: 855-367-4599 Web: www.i5ww.com

The Definitive Guide to. Video SEO. i5 web works Email: info@i5ww.com Phone: 855-367-4599 Web: www.i5ww.com The Definitive Guide to Video SEO i5 web works Email: [email protected] Phone: 855-367-4599 Web: www.i5ww.com Incorporating Video SEO into your strategies Video represents a unique place in the SEO world.

More information

KonyOne Server Prerequisites _ MS SQL Server

KonyOne Server Prerequisites _ MS SQL Server KonyOne Server Prerequisites _ MS SQL Server KonyOne Platform Release 5.0 Copyright 2012-2013 Kony Solutions, Inc. All Rights Reserved. Page 1 of 13 Copyright 2012-2013 by Kony Solutions, Inc. All rights

More information

How To Use The Alabama Data Portal

How To Use The Alabama Data Portal 113 The Alabama Metadata Portal: http://portal.gsa.state.al.us By Philip T. Patterson Geological Survey of Alabama 420 Hackberry Lane P.O. Box 869999 Tuscaloosa, AL 35468-6999 Telephone: (205) 247-3611

More information

Free Listing Distribution Website and Report Manager National Listing Distribution with Agent promotion

Free Listing Distribution Website and Report Manager National Listing Distribution with Agent promotion Online Marketing Sites http://www.postlets.com http://www.vflyer.com http://www.listhub.com http://www.zillow.com http://www.trulia.com http://www.sellpoint.com http://www.socialbios.com/create http://www.listing2leads.com

More information