ARIB STD-T63-26.451 V12.0.0. Codec for Enhanced Voice Services (EVS); Voice Activity Detection (VAD) (Release 12)



Similar documents
ARIB STD-T V Source code for 3GP file format (Release 7)

3GPP TS V8.1.0 ( )

ARIB TR-T V7.0.0

3GPP TS V6.1.0 ( )

ARIB STD-T V Contact Manager Application Programming Interface (API); Contact Manager API for Java Card (Release 10)

ARIB STD-T V Contact Manager Application Programming Interface; Contact Manager API for Java Card (Release 8)

TS-3GA (Rel7)v7.0.0 Telecommunication management; File Transfer (FT) Integration Reference Point (IRP): Requirements

3GPP TS V9.4.0 ( )

3GPP TS V7.0.0 ( )

TS-3GA (Rel6)v6.0.0 Call Forwarding (CF) supplementary services - Stage 1

3GPP TS V9.0.0 ( )

ETSI TS V ( ) Technical Specification

3GPP TS V8.1.0 ( )

3GPP TS V8.0.0 ( )

ARIB STD-T V Codec for Enhanced Voice Services (EVS); Performance characterization. (Release 12)

3GPP TS V5.0.0 ( )

TECHNICAL REPORT End to End Network Architectures (E2NA); Location of Transcoders for voice and video communications

3GPP TS V8.0.0 ( )

ETSI TS V8.2.0 ( ) Technical Specification

3GPP TS V4.0.1 ( )

ETSI TS V3.0.0 ( )

Technical Specification LTE; Evolved Universal Terrestrial Radio Access (E-UTRA); Layer 2 - Measurements (3GPP TS version 11.1.

3GPP TS V8.0.0 ( )

3GPP TS V ( )

ETSI TS V9.0.0 ( ) Technical Specification

3GPP TR V3.1.0 ( )

ARIB STD-T V Wide area network synchronisation standard

ETSI TS V9.0.0 ( ) Technical Specification

ETSI TS V1.1.1 ( )

Technical Specifications (GPGPU)

Quality of Service and Network Performance (UMTS version 3.1.0)

3GPP TS V6.3.0 ( )

VoIP Technologies Lecturer : Dr. Ala Khalifeh Lecture 4 : Voice codecs (Cont.)

ETSI TS V6.8.0 ( ) Technical Specification

3GPP TS V8.2.0 ( )

3GPP TS V8.1.0 ( )

The 3GPP Enhanced Voice Services (EVS) codec

ETSI TS V6.5.0 ( )

3GPP TS V8.1.0 ( )

ETSI TS V (2016

ETSI TS V8.4.0 ( )

ETSI TS V5.0.0 ( )

UMTS. UMTS V1.0.0 Quality of Service & Network Performance. ETSI SMG Plenary Tdoc SMG 657 / 97 Budapest, October 1997 Agenda Item: 9.

ETSI TR V6.1.0 ( )

How To Understand Gsm (Gsm) And Gsm.Org (Gms)

EUROPEAN pr ETS TELECOMMUNICATION January 1997 STANDARD

ETSI ETR 278 TECHNICAL March 1996 REPORT

ETSI TS V7.1.0 ( ) Technical Specification

Digital Telephone Network - A Practical Definition

CHANGE REQUEST. Work item code: MMS6-Codec Date: 15/03/2005

ETSI TR V8.0.0 ( )

ARIB STD-T64-C.S0042 v1.0 Circuit-Switched Video Conferencing Services

Universal Mobile Telecommunications System (UMTS); Service aspects; Virtual Home Environment (VHE) (UMTS version 3.0.0)

This amendment A1, modifies the European Telecommunication Standard ETS (1990)

3GPP TS V ( )

ARIB STD-T V G Security; Network Domain Security; IP network layer security (Release 6)

Just like speaking face to face.

3GPP TR V ( )

ETSI TR V1.2.1 ( ) Technical Report

ETSI TS V7.2.0 ( ) Technical Specification

ETSI TS V6.1.0 ( )

ETSI SR V1.1.2 ( )

EUROPEAN ETS TELECOMMUNICATION October 1992 STANDARD

GSM GSM TECHNICAL March 1996 SPECIFICATION Version 5.0.0

TECHNICAL PAPER. Fraunhofer Institute for Integrated Circuits IIS

Conferencing Using the IP Multimedia (IM) Core Network (CN) Subsystem

ETSI TR V3.1.0 ( )

3GPP TR V7.2.0 ( )

ETSI TS V3.1.1 ( ) Technical Specification

Universal Mobile Telecommunications System (UMTS); Service aspects; Advanced Addressing (UMTS version 3.0.0)

Basic principles of Voice over IP

ETSI TS V8.0.1 ( ) Technical Specification

3GPP TS v9.0.0 ( )

3GPP TS V9.0.0 ( )

ETSI TS V1.1.2 ( ) Technical Specification

How To Understand The Power Of A Cell Phone Network

TECHNICAL PAPER. Fraunhofer Institute for Integrated Circuits IIS

ETSI TR V3.1.1 ( ) Technical Report

ETSI TS V ( )

ETSI TR V1.1.2 ( )

VoIP Bandwidth Calculation

ETSI ETR 187 TECHNICAL April 1995 REPORT

3GPP TS V ( )

3GPP TS V8.4.0 ( )

Draft EN V1.1.1 ( )

ETSI TS V1.1.1 ( )

GSM GSM TECHNICAL November 1996 SPECIFICATION Version 5.0.0

Transcription:

ARIB STD-T63-26.451 V12.0.0 Codec for Enhanced Voice Services (EVS); Voice Activity Detection (VAD) (Release 12) Refer to Industrial Property Rights (IPR) in the preface of ARIB STD-T63 for Related Industrial Property Rights. Refer to Notice in the preface of ARIB STD-T63 for Copyrights.

TS 26.451 V12.0.0 (2014-09) Technical Specification 3rd Generation Partnership Project; Technical Specification Group Services and System Aspects; Codec for Enhanced Voice Services (EVS); Voice Activity Detection (VAD) (Release 12) The present document has been developed within the 3 rd Generation Partnership Project ( TM ) and may be further elaborated for the purposes of.. The present document has not been subject to any approval process by the Organizational Partners and shall not be implemented. This Specification is provided for future development work within only. The Organizational Partners accept no liability for any use of this Specification. Specifications and Reports for implementation of the TM system should be obtained via the Organizational Partners' Publications Offices.

2 TS 26.451 V12.0.0 (2014-09) Keywords UMTS, LTE, EVS, telephony Postal address support office address 650 Route des Lucioles - Sophia Antipolis Valbonne - FRANCE Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16 Internet http://www.3gpp.org Copyright Notification No part may be reproduced except as authorized by written permission. The copyright and the foregoing restriction extend to reproduction in all media. 2014, Organizational Partners (ARIB, ATIS, CCSA, ETSI, TTA, TTC). All rights reserved. UMTS is a Trade Mark of ETSI registered for the benefit of its members is a Trade Mark of ETSI registered for the benefit of its Members and of the Organizational Partners LTE is a Trade Mark of ETSI registered for the benefit of its Members and of the Organizational Partners GSM and the GSM logo are registered and owned by the GSM Association

3 TS 26.451 V12.0.0 (2014-09) Contents Foreword... 4 1 Scope... 5 2 References... 5 3 Abbreviations... 5 4 General... 6 5 The SAD Algorithm... 6 Annex A (informative): Change history... 7

4 TS 26.451 V12.0.0 (2014-09) Foreword This Technical Specification has been produced by the 3 rd Generation Partnership Project (). The contents of the present document are subject to continuing work within the TSG and may change following formal TSG approval. Should the TSG modify the contents of the present document, it will be re-released by the TSG with an identifying change of release date and an increase in version number as follows: Version x.y.z where: x the first digit: 1 presented to TSG for information; 2 presented to TSG for approval; 3 or greater indicates TSG approved document under change control. y the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, updates, etc. z the third digit is incremented when editorial only changes have been incorporated in the document.

5 TS 26.451 V12.0.0 (2014-09) 1 Scope The present document specifies the Voice Activity Detector (VAD) used in the Discontinuous Transmission (DTX) of the EVS Codec. Although the main application of the VAD algorithm is the detection of speech or voice signals, the algorithm is more accurately described as a Signal Activity Detection (SAD) algorithm. The present document is a high level overview of the functionality with reference to the Codec Detailed Algorithmic Description where the functionality is specified in detail. 2 References The following documents contain provisions which, through reference in this text, constitute provisions of the present document. - References are either specific (identified by date of publication, edition number, version number, etc.) or non-specific. - For a specific reference, subsequent revisions do not apply. - For a non-specific reference, the latest version applies. In the case of a reference to a document (including a GSM document), a non-specific reference implicitly refers to the latest version of that document in the same Release as the present document. [1] TR 21.905: "Vocabulary for Specifications". [2] TS 26.441: "Codec for Enhanced Voice Services (EVS); General Overview". [3] TS 26.445: "Codec for Enhanced Voice Services (EVS); Detailed Algorithmic Description ". [4] TS 26.442: "Codec for Enhanced Voice Services (EVS); ANSI C code (fixed-point)". [5] TS 26.443: "Codec for Enhanced Voice Services (EVS); ANSI C code (floating-point)". [6] TS 26.444: "Codec for Enhanced Voice Services (EVS); Test Sequences". [7] TS 26.446: "Codec for Enhanced Voice Services (EVS); AMR-WB Backward Compatible Functions". [8] TS 26.449: "Codec for Enhanced Voice Services (EVS); Comfort Noise Generation (CNG) Aspects". [9] TS 26.450: "Codec for Enhanced Voice Services (EVS); Discontinuous Transmission (DTX)". [10] TR 26.952: "Codec for Enhanced Voice Services (EVS); Performance Characterization". 3 Abbreviations For the purposes of the present document, the abbreviations given in TR 21.905 [1] and the following apply. An abbreviation defined in the present document takes precedence over the definition of the same abbreviation, if any, in TR 21.905 [1]. ACELP AMR-WB CNG DTX EVS FB Algebraic Code-Excited Linear Prediction Adaptive Multi Rate Wideband (codec) Comfort Noise Generator Discontinuous Transmission Enhanced Voice Services Fullband

6 TS 26.451 V12.0.0 (2014-09) FEC IP JBM MSB MTSI NB PS PSTN SAD SC-VBR SID SWB VAD WB WMOPS Frame Erasure Concealment Internet Protocol Jitter Buffer Management Most Significant Bit Multimedia Telephony Service for IMS Narrowband Packet Switched Public Switched Telephone Network Signal Activity Detection Source Controlled - Variable Bit Rate Silence Insertion Descriptor Super Wideband Voice Activity Detection Wideband Weighted Millions of Operations Per Second 4 General The function of the Enhanced Voice Services coder VAD algorithm, or more accurately the SAD algorithm, is to indicate whether each 20 ms frame contains signals that should be transmitted, e.g. speech, music or other audio. The output of the SAD algorithm is a Boolean flag ( f SAD ) that is set to one for the active signal, which is any useful signal bearing some meaningful information. Otherwise, the flag is set to zero indicating an inactive signal, which has no meaningful information. The inactive signal is mostly a pause or background noise. The procedure of the present document is mandatory for implementation in all network entities and User Equipment (UE)s supporting the EVS coder. The present document does not describe the ANSI-C code of this procedure. In the case of discrepancy between the procedure described in the present document and its ANSI-C code specifications contained in [4] the procedure defined by the [4] prevails. 5 The SAD Algorithm The Enhanced Voice Services codec signal activity detection (SAD) module described in the present document consists of three sub-sad modules; SAD1, SAD2 and SAD3. SAD1 and SAD2 are combined initially to provide an efficient preliminary activity decision. This preliminary decision is then modified by the third sub-sad module, SAD3, depending upon the codec mode of operation. The efficient preliminary activity output is used as the final SAD decision for the AMR-WB IO modes, while the activity output with SAD3 is used as the final SAD decision for all other bit-rates. Sub-clause 5.1.12 in [3] describes the operation of the SAD and the algorithms involved in the three sub-sad modules in detail.

7 TS 26.451 V12.0.0 (2014-09) Annex A (informative): Change history Change history Date TSG # TSG Doc. CR Rev Subject/Comment Old New 2014-09 65 SP-140466 Presented at TSG-SA #65 for approval 1.0.0 2014-09 65 Approved at TSG SA~65 1.0.0 12.0.0