230622 - DSAP - Digital Speech and Audio Processing



Similar documents
SCPD - System on Chip Physical Design

ITSM - Information Technology Service Management

BMAC - Basic Mathematics for Algebraic Coding Theory with Applications to Cryptography

NS - Network Security

AMC - Advanced Mobile Communications

EDM - Electronic Devices Modelling

DARFM - Design and Analysis of RF and Microwave Systems for Communications

Course overview Processamento de sinais 2009/10 LEA

240EO016 - Process Automation

IMIT - Innovation Management and International Trade

DSPS - Sustainable Design of Products and Services

Simple Voice over IP (VoIP) Implementation

A TOOL FOR TEACHING LINEAR PREDICTIVE CODING

Speech Signal Processing: An Overview

METNUMER - Numerical Methods

Advanced Speech-Audio Processing in Mobile Phones and Hearing Aids

GPS - Software Project Management

FBIO - Fundations of Bioinformatics

SICSB - Information Systems and Communications for Health Services

COMPC2 - Cost Accounting II

ATV - Lifetime Data Analysis

DAIC - Advanced Experimental Design in Clinical Research

Artificial Neural Network for Speech Recognition

240IOI21 - Operations Management

Carla Simões, Speech Analysis and Transcription Software

240EI032 - Human Resources Management

MUSICAL INSTRUMENT FAMILY CLASSIFICATION

CRIPT - Cryptography and Network Security

DIRRH - Human Resource Management

DARFM - Design and Analysis of RF and Microwave Systems for Communications

ACMSM - Computer Applications in Solids Mechanics

IES - Introduction to Software Engineering

PSEAEREE - Electronic System Design Applied to Renewable Energy and Energy Efficiency

MICRO - General and Ocular Microbiology

MUSC 1327 Audio Engineering I Syllabus Addendum McLennan Community College, Waco, TX

Environmental Science and Technology

DIGITAL SIGNAL PROCESSING - APPLICATIONS IN MEDICINE

Search keywords: Connect, Meeting, Collaboration, Voice over IP, VoIP, Acoustic Magic, audio, web conferencing, microphone, best practices

Curriculum for the Bachelor programme in sound engineering

ET - Electronics for Telecommunications

QOT - Quantum Optical Technologies

WSE - Writing Skills for Engineering

Computer Networks and Internets, 5e Chapter 6 Information Sources and Signals. Introduction

How To Pass A Psychology Course

Annotated bibliographies for presentations in MUMT 611, Winter 2006

Program curriculum for graduate studies in Speech and Music Communication

COURSE PROFILE. Business Intelligence MIS531 Fall

Signal and Information Processing

Establishing the Uniqueness of the Human Voice for Security Applications

The details of the programme are given below. For a short description of the courses, Click here.

CONTROL, COMMUNICATION & SIGNAL PROCESSING (CCSP)

BLIND SOURCE SEPARATION OF SPEECH AND BACKGROUND MUSIC FOR IMPROVED SPEECH RECOGNITION

FOPR-I1O23 - Fundamentals of Programming

Energy and Construction

PI - Internet Protocols

AND - Non-Destructive Testing

Advanced Signal Processing and Digital Noise Reduction

FOURIER TRANSFORM BASED SIMPLE CHORD ANALYSIS. UIUC Physics 193 POM

A. Master of Science Programme (120 credits) in Social Studies of Gender (Masterprogram i genusstudier)

THE BACHELOR S DEGREE IN SPANISH

TXC - Computer Network Technology

St. Thomas University. BUS 323 Human Resource Management. Spring Room 210 FFC

MBA 6931, Project Management Strategy and Tactics Course Syllabus. Course Description. Prerequisites. Course Textbook. Course Learning Objectives

Emotion Detection from Speech

HD VoIP Sounds Better. Brief Introduction. March 2009

AIGI - Smart Airport and Facility Management

Myanmar Continuous Speech Recognition System Based on DTW and HMM

What Audio Engineers Should Know About Human Sound Perception. Part 2. Binaural Effects and Spatial Hearing

English Language Teaching 5000 Level Modules 2010/11 August credits from ET ET5109, and 20 credits from ET5124 and ET5125

Management. PLANNING Internal and External Organization Environment SWOT analysis

EE 210 Introduction to Electrical Engineering Fall 2009 COURSE SYLLABUS. Massimiliano Laddomada, PhD Assistant Professor

A. Master of Science Programme (120 credits) in Global Studies (Masterprogram i globala studier)

240EO026 - Technical Entrepreneurship

IS Management Information Systems

Department of Management College of Business and Economics California State University Northridge. Course Syllabus, Fall 2010

SLHS 1301W The Physics and Biology of the Spoken Language. Spring Semester 2010

Final Year Project Progress Report. Frequency-Domain Adaptive Filtering. Myles Friel. Supervisor: Dr.Edward Jones

Transcription:

Coordinating unit: Teaching unit: Academic year: Degree: ECTS credits: 2015 230 - ETSETB - Barcelona School of Telecommunications Engineering 739 - TSC - Department of Signal Theory and Communications DEGREE IN TELECOMMUNICATIONS ENGINEERING (Syllabus 1992). (Teaching unit Optional) MASTER'S DEGREE IN INFORMATION AND COMMUNICATION TECHNOLOGIES (Syllabus 2009). (Teaching unit Optional) MASTER'S DEGREE IN TELECOMMUNICATIONS ENGINEERING (Syllabus 2013). (Teaching unit Optional) 5 Teaching languages: English Teaching staff Coordinator: Climent Nadeu Prior skills Signal Processing Requirements Signal processing Degree competences to which the subject contributes Specific: 1. Ability to apply information theory methods, adaptive modulation and channel coding, as well as advanced techniques of digital signal processing to communication and audiovisual systems. Transversal: 2. TEAMWORK: Being able to work in an interdisciplinary team, whether as a member or as a leader, with the aim of contributing to projects pragmatically and responsibly and making commitments in view of the resources that are available. 3. EFFECTIVE USE OF INFORMATION RESOURCES: Managing the acquisition, structuring, analysis and display of data and information in the chosen area of specialisation and critically assessing the results obtained. 4. FOREIGN LANGUAGE: Achieving a level of spoken and written proficiency in a foreign language, preferably English, that meets the needs of the profession and the labour market. Teaching methodology - Lectures (50%) - Application classes (with Matlab or similar) (50%) - Team work: project, presentation - Individual work: preparation and completion (out classroom) of application activities Learning objectives of the subject Learning objectives of the subject Understanding and being competent on a relevant set of concepts and techniques in the field of digital audio processing, and their application to problems arising from real applications. Especially, speech and music signals and applications will be considered. 1 / 5

Learning results: Ability to digitally process, in an application-oriented context, audio and speech signals, in order to analyze, model, extract information from, clean, modify, and generate/synthesize them. Study load Total learning time: 125h Hours large group: 39h 31.20% Hours medium group: Hours small group: Guided activities: Self study: 86h 68.80% 2 / 5

Content 1. Introduction Learning time: 1 Theory classes: 3h Self study : 7h Course presentation Audio diversity Characteristics of speech and music. Production model Hearing and auditory modeling The short-time Fourier transform 2. Short-term analysis-synthesis of (cuasi)periodic signals Learning time: 2 Theory classes: 6h Self study : 14h Filter-bank analysis/synthesis. The phase vocoder Filter-bank and spectrogram Time-scale and pitch modification QMF filters. MP3 coding. 3. Modeling and representation of speech signals Learning time: 1 Theory classes: 3h Self study : 7h Production-based all-pole modeling Pitch determination for speech and music LPC-based coding used in mobile telephony 3 / 5

4. Enhancement of speech and audio signals Learning time: 2 Theory classes: 6h Self study : 14h Cancellation: echo, interference Denoising: spectral subtraction, Wiener-based filtering, wavelets Blind source separation: ICA, CASA, NMF 5. Multi-microphone audio processing Learning time: 12h Theory classes: 4h Self study : 8h Room acoustics Array beamforming Acoustic source localization and tracking Specific objectives: 6. Recognition and detection of audio and speech Learning time: 15h Theory classes: 5h Self study : 1 6. Recognition and detection of audio and speech Pattern-matching approaches Audio activity detection Application to speech and speaker recognition Qualification system Attendance/participation in class (10%) Tests (30%) Project (50%) Presentation (10%) 4 / 5

Bibliography Basic: Quatieri, T.F. Discrete-time speech signal processing: principles and practice. Upper Saddle River, NJ: Prentice Hall, 2002. ISBN 013242942X. Gold, B.; Morgan, N.; Ellis, D. Speech and audio signal processing: processing and perception of speech and music. 2nd rev. ed. Wiley-Blackwell, 2011. ISBN 978-0-470-19536-9. Dutoit, T.; Marqués, F.; Rabiner, L.R. Applied signal processing: a MATLAB-based proof of concept. New York ; London: Springer, 2009. ISBN 978-0-38774534-3. Complementary: Rabiner, L.R.; Schafer, R.W. Theory and applications of digital speech processing. Prentice Hall, 2010. ISBN 9780136034285. Huang, Y.A.; Benesty, J. (eds.). Audio signal processing for next-generation multimedia communication systems [on line]. New York: Kluwer Academic Publishing, 2004 [Consultation: 23/07/2013]. Available on: <http://link.springer.com/book/10.1007/b117685/page/1>. ISBN 1402077688. Others resources: Lecture slides Practical work statements and programs Audiovisual material Slides Slides used in lectures Computer material Codi programes Software codes in Matlab or similar 5 / 5