INCITS 456 Speaker Recognition Format for Raw Data Interchange (SIVR-1)



Similar documents
Configuration and Management of Speaker Verification Systems

W3C Speaker Identification and Verification Workshop

VoiceXML Tutorial. Part 1: VoiceXML Basics and Simple Forms

Using Service Oriented Architecture (SOA) for Speaker-Biometrics Applications

VoiceXML and VoIP. Architectural Elements of Next-Generation Telephone Services. RJ Auburn

Interfaces de voz avanzadas con VoiceXML

VOICE INFORMATION RETRIEVAL FOR DOCUMENTS. Except where reference is made to the work of others, the work described in this thesis is.

Speaker Identification and Verification (SIV) Introduction and Best Practices Document

Open Source VoiceXML Interpreter over Asterisk for Use in IVR Applications

Standard Languages for Developing Multimodal Applications

Version 2.6. Virtual Receptionist Stepping Through the Basics

Phone Routing Stepping Through the Basics

Description of Biometric Data Interchange Format Standards

VocalPassword : voice biometrics authentication.

Voice Processing Standards. Mukesh Sundaram Vice President, Engineering Genesys (an Alcatel company)

Signatures. Advanced User s Guide. Version 2.0

Vocalité Version 2.4 Feature Overview

VoiceXML-Based Dialogue Systems

The Future of VoiceXML: VoiceXML 3 Overview. Dan Burnett, Ph.D. Dir. of Speech Technologies, Voxeo Developer Jam Session May 20, 2010

8000hz Mono (single) Sound 16-bit

An Introduction to VoiceXML

XML based Interactive Voice Response System

ARMORVOX IMPOSTORMAPS HOW TO BUILD AN EFFECTIVE VOICE BIOMETRIC SOLUTION IN THREE EASY STEPS

One receptionist can answer calls from predetermined PSTN/IP lines, each identified to a specific company

IVR (Interactive Voice Response) Operation Manual. Copyright 2012 Agile Networks, Inc. All Rights Reserved.

Direct Inward Dial Digit Translation Service

Voice Tools Project (VTP) Creation Review

Cisco IOS VoiceXML Browser

VoiceXML Data Logging Overview

Vxi* VoiceXML browser!

! <?xml version="1.0">! <vxml version="2.0">!! <form>!!! <block>!!! <prompt>hello World!</prompt>!!! </block>!! </form>! </vxml>

User Guide. IP Multimedia m6350 Client

Materials Software Systems Inc (MSSI). Enabling Speech on Touch Tone IVR White Paper

Hitachi ID Password Manager Telephony Integration

Nuance VocalPassword v9 :: Product Description

Workforce Management IVR. A multi-service voice platform

Published International Standards Developed by ISO/IEC JTC 1/SC 37 - Biometrics

Ron Shaham. Expert Witness in Islamic Courts : Medicine and Crafts in the Service of Law. : University of Chicago Press,. p 38

May not be reproduced in any form without permission from the publisher, except fair uses permitted under U.S. or applicable copyright law.

. Perspectives on the Economics of Aging. : University of Chicago Press,. p 3 Copyright University of

Hosted Fax Mail. Hosted Fax Mail. User Guide

Personal Voice Call Assistant: VoiceXML and SIP in a Distributed Environment

Mobile Application Languages XML, Java, J2ME and JavaCard Lesson 03 XML based Standards and Formats for Applications

Measuring Performance in a Biometrics Based Multi-Factor Authentication Dialog. A Nuance Education Paper

How to Build a Simple Virtual Office PBX System Using TekSIP and TekIVR

The Customer Portal will allow you to administrate your Arch system via the Internet. From the portal you can:


Nuance VocalPassword 8.2 :: Product Description

MXSAVE XMLRPC Web Service Guide. Last Revision: 6/14/2012

1. Login to with your User ID and password. Select Virtual Receptionist from the Basic Services tab.

Developing Usable VoiceXML Applications

Building Applications with Vision Media Servers

Dialog planning in VoiceXML

Thin Client Development and Wireless Markup Languages cont. VoiceXML and Voice Portals

Custom Software Development Portfolio. Innovation Collaboration Evolution Results

Algo 8180 Integration with Zultys

Deploying Cisco Unified Contact Center Express Volume 1

Traitement de la Parole

Brekeke PBX Version 2 User Guide Brekeke Software, Inc.

Shared VRU. A Key Link in Your Customer Service Chain Kyle Shadday, Director, Voice Response Strategy

My Account Quick Start

How To Use Voicexml On A Computer Or Phone (Windows)

Web page creation using VoiceXML as Slot filling task. Ravi M H

Single voice command recognition by finite element analysis

ZOIPER Classic CONFIGURATION FILE DOCUMENTATION. PRODUCT RELEASE Zoiper Classic 2.x UPDATED November 2010

CUSTOMER CONFIGURATION AUTO ATTENDANT ADMINISTRATOR S GUIDE

Avaya Media Processing Server 500

Innkeeper PBX. Desktop Digital Hybrid. User Guide. JK Audio

Hermes.Net IVR Designer Page 2 36

This method looks at the patterns found on a fingertip. Patterns are made by the lines on the tip of the finger.

Your Texas Benefits Card

Creating a low cost VoiceXML Gateway to replace IVR systems for rapid deployment of voice applications.

Call Answer/Message Manager with Aliant Voic Online

HQBN USMC HENDERSON HALL Telephone System Call Pilot Voice Messaging

Regions Secure Webmail. Instructions

Docsoft:AV Appliance. User Guide. Version: 3.0 Copyright 2005,2009 Docsoft, Inc.

Dialogic IP Media Server. Erik Pras

Find Me Call Forwarding

IVR CRM Integration. Migrating the Call Center from Cost Center to Profit. Definitions. Rod Arends Cheryl Yaeger BenchMark Consulting International

Form. Settings, page 2 Element Data, page 7 Exit States, page 8 Audio Groups, page 9 Folder and Class Information, page 9 Events, page 10

IVR Quick Start Guide. Getting off the ground with IVR

VOIP PHONE SYSTEM USER GUIDE

Universal Host. Desktop Digital Hybrid. User Guide. JK Audio

Office of Court Administration Automated Registry (AR) Interface Design Document for DSHS - Clinical Management for Behavioral Health Services (CMBHS)

Transcription:

INCITS 456 Speaker Recognition Format for Raw Data Interchange (SIVR-1) Judith A. Markowitz, PhD Chicago, IL www.jmarkowitz.com W3C Workshop on SIV March 6, 2009 1

What is INCITS 456? Data interchange format Storage & exchange of speech/voice data CBEFF Biometric Data Block (BDB) Draft standard First for speech/voice First in XML Developed jointly by ANSI/INCITS M1 and VoiceXML Forum s s Speaker Biometrics Committee 2

What is INCITS 456? Captures, stores, and exchanges RAW data Does not capture features or models Goal is to provide information that will enable recipient to analyze the data Audio format Input device and channel Speaker (sex, age) but not claim Language/dialect 3

Uses of INCITS 456 Data sharing Watch list creation Internal system audit Automatic reenrollment of users Multi-biometric fusion Product/algorithm testing SIV registry/service 4

Structure of INCITS 456 Session header Two levels information that should not change during the session (EX: sex of speaker, date of session, device & channel, audio format ) Instance header (Interaction Turn) information that changes from turn to turn of a dialogue (EX: utterance length, SNR, prompt, content ) 5

Example: Enrollment July 14, 2008 Chicago IVR: Welcome ABC Bank s s VoiceSure enrollment system Please say your account number. Caller: 357128999 ASR processes input SIV Session begins at 1:14 Central Daylight time IVR: Thank you Please say your password Caller : lollapalooza 2.5 seconds IVR: Please say your password again. Caller: lollapalooza 2.2 seconds SIV Session ends at 1:16 Central Daylight time 6

Example: Enrollment Session header (partial) Date & Time start: 2008-07-14T13:14-5:00 Date & Time end:2008-07-14t13:16-5:00 Channel: Digital NonVoIP, 300-3100 Hz Audio format: ByteOrder=0xFF00, streaming, format=ogg Vorbis, mono, sampling rate=8000, bits per sample=8 Speaker: female Input Device: telephone 7

Example: Enrollment Instance 1 header (partial) ASR used: No Prompt used: prompt1.wav Utterance: utt1.wav, 2.5 sec. (20000 samples), content unknown, Volume=68.5 db, SNR 42.1 Quality rating: unknown 8

Code for Example (Session Header) <Session FormatVersion="SIVR-1"> <DateAndTime> <start> 2008-07-14T13:11-5:00 </start> <end> 2008-07-14T13:14-5:00 </end> </DateAndTime> <Channel> <Type> DigitalNonVoIP </Type> <CutoffTop>3100</CutoffTop> <CutoffBottom>300</CutoffBottom> </Channel> <AudioFormatHeader> <ByteOrder>0xFF00</ByteOrder> <Streaming>0</Streaming> <HeaderSize>25</HeaderSize> <FileLengthInSamples>13600</FileLengthInSamples> <AudioFormat> OGG Vorbis </AudioFormat> <ChannelCount>1</ChannelCount> <SamplingRate>8000</SamplingRate> <BitsPerSample>8</BitsPerSample> <AudioFullSecondsOf>6</AudioFullSecondsOf> <AudioRemainderSamples>5600</AudioRemainderSamples> </AudioFormatHeader> Audio Format 9

Code for Example (Session Header cont.) <Speaker> <SpeakerMF> Female </SpeakerMF> </Speaker> <InputDevice> <Type> Telephone </Type> </InputDevice> 10

<Instance> <InstanceNumber>1</InstanceNumber> <ASRUsed> No </ASRUsed? <TypeOfPromptContent> String </TypeOfPromptContent> <StringPromptContent> URL EnrollPrompts/Prompt1.wav </StringPromptContent> <Utterance> <DataType> Pointer </DataType> <Data> 20080714-3124554/Utt1.wav </Data> <FileLengthInSamples>20000</FileLengthInSamples> <AudioFullSecondsOf>2</AudioFullSecondsOf> <AudioRemainderSamples>4000</AudioRemainderSamples> <Content> Unknown </Content> <Volume>68.5</Volume> <SNREstimate>42.1</SNREstimate> <Quality> <Score>254</Score> <AlgorithmVendorID>0</AlgorithmVendorID> <AlgorithmID>0</AlgorithmID> </Quality> </Utterance> </Instance> Code for Example (instance #1) 11

Implementation Application generates format directly The generated format can then become part of an EMMA tag <emma:interpretation id="intp1" emma:medium="acoustic" emma:mode="voice ="voice emma:function="verification"> <DEFF uri="http://example.com/deff ="http://example.com/deff-docs/mydoc12345/> </emma:interpretation emma:interpretation> 12

Implementation Or be used as a resource in an EMMA derivation <emma:derivation> <emma:interpretation id="better"> <emma:derived-from resource=http://www.incits456 http://www.incits456-1.txt composite="false"/>. : </emma:interpretation emma:interpretation> </emma:derivation emma:derivation> 13