Multimedia Databases. Wolf-Tilo Balke Younès Ghammad Institut für Informationssysteme Technische Universität Braunschweig
|
|
- Job Warner
- 7 years ago
- Views:
Transcription
1 Multimedia Databases Wolf-Tilo Balke Younès Ghammad Institut für Informationssysteme Technische Universität Braunschweig
2 Previous Lecture Shape-based Features - Chain Codes - Area-based Retrieval - Moment Invariants - Query by Example Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 2
3 6 Audio Retrieval 6 Audio Retrieval 6.1 Basics of Audio Data 6.2 Audio Information in Databases 6.3 Audio Retrieval Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 3
4 6.1 Basics of Audio Data Information transfer through sound Audio (Latin, "I hear") Three different types of data: Music Spoken text Noise Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 4
5 6.1 Basics Auditory perception through pressure fluctuations in the air Eardrum vibrates synchronously Ear bones amplify and transmit the vibrations Auditory hair cells in the ear cochlea, are stimulated by the vibrations Neurons produce electrical impulses Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 5
6 6.1 Basics 3D model of the human ear Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 6
7 6.1 Basics Our brain only interprets two major properties of sound: Pitch Volume Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 7
8 6.1 Basics Quantitative performance of the sound wave Amplitude as volume Logarithmic perception (tenfold increase in amplitude doubles the perceived loudness) Frequency as pitch Number of periods per unit time is known as frequency (measured in hertz) Hearing range between 20 Hz and 20 khz Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 8
9 6.1 Basics Audio signals are time-dependent (overlapping) waveforms Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 9
10 6.1 Basics Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 10
11 6.1 Basics Constructive and destructive interference Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 11
12 6.1 Sound Creation Musical instruments are classified based on the vibration generator E.g., string-, blowing-, percussion Acoustics depends on the vibration generator E.g., strings-, air, membrane instruments Synthetic creation needs an oscillator The oscillator generates voltage oscillations Speaker transmit the voltage changes on a membrane Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 12
13 6.1 Sound Creation Influence of the oscillator Higher voltage Higher frequency (Moog, 1964) Amplifier influences the amplitude thus the volume ADSR (attack-decay-sustain-release) envelope influences the loudness of a sound in time Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 13
14 6.1 Sound Creation Moog 901B (1964) Modular Moog Synthesizer (1967) Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 14
15 6.1 Sound Creation Emerson, Lake & Palmer: The Great Gates of Kiev (1974) Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 15
16 6.1 ADSR envelope Synthesized sounds seem rather metallic. For producing a single synthesized sound, consider four typical phases: Attack: speed and strength of the signal rise Decay: lowering the level Sustain: actual pitch Release: end of the signal Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 16
17 6.1 Digitalization of Audio Data Transformation of the continuous sound wave into a discrete representation Sampling: save at regular intervals, the current amplitude value of vibration Clearly, we have to reconstruct audio signals from these values Amplitude Time Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 17
18 6.1 Sampling Basic characteristics Sampling rate: how many times per unit time is the value of the continuous signal tapped? Resolution: which accuracy are the values recorded with? Often, a resolution of 16 bits is used (2 16 different amplitude values) The sampling rate is application dependent: Audio CD: Hz Phone: 8000 Hz Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 18
19 6.1 Sampling Rate It is very important to uniquely reconstruct the initial oscillation The higher the sampling frequency, the more values must be saved Minimum sampling frequency? Sampling theorem (Nyquist, 1928) Sampling rate must be at least twice as large as the highest frequency occurring in the signal Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 19
20 6.1 Sampling Rate Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 20
21 6.1 Sampling Rate Phone: 8000 Hz DVD audio: 96,000 Hz or 192,000 Hz Audio CD: 44,100 measurements per second for two stereo channels with 16 bits per measurement results in kb/s ca 10 MB/min, i.e., 635 MB/h Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 21
22 6.1 Audio Formats For space reasons, is digital data usually stored in compressed form Known uncompressed formats: AIFF: *. aif (Apple Inter opportunity File Format) Wave: *. wav (Windows) IRCAM: *. snd (Institut de Recherche et Coordination Acoustique / Musique) AU: *. au (Sun audio) Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 22
23 6.1 Compression Data reduction: with (lossy) or without information loss (lossless) Lossless compression methods generally don t compress very much Free Lossless Audio Codec 50 60% of their original size Apple Lossless WavPack... Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 23
24 6.1 Compression Lossy compression algorithms typically are based on simple transformations Modified Discrete Cosine Transformation (MDCT) or wavelets Encoding: transformation of the waveform in frequency sequences (sampling) Decoding: Reconstruction of waveform from these values What data can we afford loosing? Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 24
25 6.1 Compression Change of the data without changing the subjective perception Omit very high/low frequencies Save superimposed frequencies with less precision Use of other effects according to psychoacoustic model, e.g., low tones before/after very loud sounds and frequency changes at a minimum distance are impossible to hear... Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 25
26 6.1 Compression MPEG-1 Audio Layers I, II and III (MP3) CD quality at bit rates of 128 kb/s Coarse approach to MP3 Channel coupling of the stereo signal by using the difference Cut off inaudible frequencies Eliminate redundancy by considering the psychoacoustic effects Compress data using Huffman coding Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 26
27 6.1 Compression AAC (Advanced Audio Coding) Industry-driven improvement of the MP3 format (supported by the MPEG) Used in TV-/radio broadcasts, Apple itunes Store,... Better quality at same file size Support for multi-channel audio Supports 48 main sound channels with up to 96 khz sampling rate, 16 low-frequency channels (limited to 120 Hz) and 15 data flows Ogg Vorbis, Real Audio, WMA 9,... Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 27
28 6.1 Compression Lossless compression, important factors are: De-/compression speed Compression rate Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 28
29 6.1 Compression Lossy compression, important factors are: De-/compression speed Compression rate Most important, the compressed audio quality! Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 29
30 6.1 Compression Lossy compression, results Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 30
31 6.1 Compression Lossy compression, results Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 31
32 6.1 The MIDI Format Communication protocol For transmission, recording and playing musical control information between digital instruments and the PC Statements are not sounds, but commands that can be used e.g., by sound cards Some commands: Note-on, note-off, key velocity, pitch, type of instrument Example: MIDI: Original: Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 32
33 6.1 The MIDI Format 10 minutes music are about 200 KB of MIDI data Significant savings compared to sampling, but no original sound Data are input to the PC via keyboard and output via synthesizer Sequencer for caching data and changes Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 33
34 6.2 Audio Information in Databases Audio data Music, CDs Sound effects, Earcons Audio data represent most of information transfer Storage of historical speeches Recordings of conversations, phone calls or negotiations Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 34
35 6.2 Special Applications Three main applications of audio signals in the context of databases Identification of audio signals (audio as query) Classification and search of similar signals (matching of audio) Phonetic synchronization Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 35
36 6.2 Identification of Audio Signals Find the title, etc. for this music piece Monitoring of audio streams Control of broadcasting of advertisements on radio Copyright Control (GEMA) (Remote) diagnosis based on noise Audio on Demand Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 36
37 6.2 Classification and Matching Find perceptionally similar audio signals E.g., similar pieces of music, the same quotation,... Recommendation E.g., bands with similar music Genre classification (rock, classical,...) E.g., in audio libraries Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 37
38 6.2 Synchronization Synchronization of audio streams Speech text, notes audio,... Retrieval of text from or to speech Find specific points in a speech Verbal query to text documents Following of audio scores in concerts, etc. Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 38
39 6.2 State of the Art Identification The simplest of the three problems, in recent years, successful research Classification and Matching Often still manual annotations Automatic classification only works roughly, on small collections Matching is still largely unresolved Synchronization Meanwhile, tolerable error rates (language text) Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 39
40 6.2 Persistent Storage (Compressed) audio files are stored in the database as (smart) BLOBs Additionally, are metadata and feature vectors stored for the realization of the search functionality Language: transcription as text Music: musical notation or MIDI Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 40
41 6.3 Audio Retrieval Search in audio data: metadata describe the audio file Semantic metadata: difficult to generate title, artist, speaker, keywords,... File information: can be automatically generated e.g., time/place of recording, filename, file size,... Widely used, e.g., music exchange markets, online shops,... Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 41
42 6.3 Metadata-based Search Manual indexing is extremely labor intensive and expensive Information is often incomplete, partial and subjective (e.g., genre classification) No possibility to Query by Example ( "Sounds like...") Search only with SQL, approximate string search, etc. Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 42
43 6.3 Content-based Search Using content of audio files Compare measure vs., measure Not very promising, inefficient Differences in sampling rate and resolution Sounds can be differentiated by certain characteristics Low Level Features Logical Features Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 43
44 6.3 Low Level Features Acoustic features Same basic idea as in image databases Description of signal information by means of characteristic features In contrast to image information we don t use a single feature vector, but a time-dependent vector function Time-point of the acoustic characteristics, rather than being contained in the audio file Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 44
45 6.3 Low Level Features Typical Low Level Features Mean amplitude, loudness Frequency distribution Pitch Brightness Bandwidth Measured in the... Time domain (amplitude versus time)... Frequency domain (intensity versus frequency) Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 45
46 6.3 Features in the Time Domain Amplitude Pressure fluctuations around the zero point Silence is equivalent to 0 amplitude Average energy Characterizes the volume of the signal with N as the total number of measurements and x n as n th measurement Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 46
47 6.3 Features in the Time Domain Zero-Crossing Rate Frequency of the sign change in the signal with sgn as a sign function (signum) Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 47
48 6.3 Features in the Time Domain Silence Ratio Proportion of values that belong to a period of complete silence We must first establish: The amplitude value below which a pitch is considered to be silence The minimum number of consecutive readings that need to be silent, to form a silence period Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 48
49 6.3 Features in the Frequency Domain Fourier transform of the signal Decomposition into frequency components with coefficients (Fourier coefficients) Representation of frequency spectrum of the signal Size of the coefficients of the frequency (represents the amount of energy per frequency) Usually measured in decibels (db) Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 49
50 6.3 Frequency Spectrum "Ahhh" sound and Fourier spectrum Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 50
51 6.3 Features in the Frequency Domain Bandwidth Interval of occurring frequencies Difference between the largest and smallest frequency in the spectrum (the minimum frequency is considered to be the first frequency above the silence threshold) Can also be used for classification, e.g., bandwidth in music is higher than for voice Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 51
52 6.3 Features in the Frequency Domain Power Distribution Can be read directly from the spectrum Distinction of frequencies with high/low energy Calculation of frequency bands with high/low energy Centroid as the center of the spectral energy distribution (brightness) Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 52
53 6.3 Features in the Frequency Domain Harmony The lowest of all the loud frequencies is called the fundamental frequency Harmony of a signal increases when the dominant components in the spectrum are multiples of the fundamental frequency E.g., standard pitch A, as the fundamental frequency (440 Hz) produced on a violin creates harmonic oscillations at 880 Hz, 1320 Hz, 1760 Hz, etc. Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 53
54 6.3 Harmony Harmonic oscillations Frequency spectrum of a sound played on an instrument Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 54
55 6.3 Features in the Frequency Domain Pitch Can be approximated by means of the Fourier spectrum The value is calculated from the frequencies and amplitudes of the peaks Related to the fundamental frequency, which is often used as an approximation Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 55
56 This Lecture Audio Retrieval - Basics of Audio Data - Audio Information in Databases - Audio Retrieval Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 56
57 Next lecture Classification and Retrieval of Audio Low level Audio Features Difference Limen Pitch Detection Multimedia Databases Wolf-Tilo Balke Institut für Informationssysteme TU Braunschweig 57
Computer Networks and Internets, 5e Chapter 6 Information Sources and Signals. Introduction
Computer Networks and Internets, 5e Chapter 6 Information Sources and Signals Modified from the lecture slides of Lami Kaya (LKaya@ieee.org) for use CECS 474, Fall 2008. 2009 Pearson Education Inc., Upper
More informationMPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music
ISO/IEC MPEG USAC Unified Speech and Audio Coding MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music The standardization of MPEG USAC in ISO/IEC is now in its final
More informationPreservation Handbook
Preservation Handbook Digital Audio Author Gareth Knight & John McHugh Version 1 Date 25 July 2005 Change History Page 1 of 8 Definition Sound in its original state is a series of air vibrations (compressions
More informationCreating Content for ipod + itunes
apple Apple Education Creating Content for ipod + itunes This guide provides information about the file formats you can use when creating content compatible with itunes and ipod. This guide also covers
More informationTrigonometric functions and sound
Trigonometric functions and sound The sounds we hear are caused by vibrations that send pressure waves through the air. Our ears respond to these pressure waves and signal the brain about their amplitude
More informationIntroduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles
Introduction and Comparison of Common Videoconferencing Audio Protocols I. Digital Audio Principles Sound is an energy wave with frequency and amplitude. Frequency maps the axis of time, and amplitude
More informationThe Design and Implementation of Multimedia Software
Chapter 10 Auditory Content The Design and Implementation of Multimedia Software David Bernstein Jones and Bartlett Publishers www.jbpub.com David Bernstein (jbpub.com) Multimedia Software Jones and Bartlett
More informationDigital Audio Compression: Why, What, and How
Digital Audio Compression: Why, What, and How An Absurdly Short Course Jeff Bier Berkeley Design Technology, Inc. 2000 BDTI 1 Outline Why Compress? What is Audio Compression? How Does it Work? Conclusions
More informationAUDIO CODING: BASICS AND STATE OF THE ART
AUDIO CODING: BASICS AND STATE OF THE ART PACS REFERENCE: 43.75.CD Brandenburg, Karlheinz Fraunhofer Institut Integrierte Schaltungen, Arbeitsgruppe Elektronische Medientechnolgie Am Helmholtzring 1 98603
More informationencoding compression encryption
encoding compression encryption ASCII utf-8 utf-16 zip mpeg jpeg AES RSA diffie-hellman Expressing characters... ASCII and Unicode, conventions of how characters are expressed in bits. ASCII (7 bits) -
More informationAudacity 1.2.4 Sound Editing Software
Audacity 1.2.4 Sound Editing Software Developed by Paul Waite Davis School District This is not an official training handout of the Educational Technology Center, Davis School District Possibilities...
More informationAnalog Representations of Sound
Analog Representations of Sound Magnified phonograph grooves, viewed from above: The shape of the grooves encodes the continuously varying audio signal. Analog to Digital Recording Chain ADC Microphone
More informationUNIVERSITY OF CALICUT
UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION BMMC (2011 Admission) V SEMESTER CORE COURSE AUDIO RECORDING & EDITING QUESTION BANK 1. Sound measurement a) Decibel b) frequency c) Wave 2. Acoustics
More informationPRIMER ON PC AUDIO. Introduction to PC-Based Audio
PRIMER ON PC AUDIO This document provides an introduction to various issues associated with PC-based audio technology. Topics include the following: Introduction to PC-Based Audio Introduction to Audio
More informationAudio Editing. Using Audacity Matthew P. Fritz, DMA Associate Professor of Music Elizabethtown College
Audio Editing Using Audacity Matthew P. Fritz, DMA Associate Professor of Music Elizabethtown College What is sound? Sounds are pressure waves of air Pressure pushes air molecules outwards in all directions
More informationVoltage. Oscillator. Voltage. Oscillator
fpa 147 Week 6 Synthesis Basics In the early 1960s, inventors & entrepreneurs (Robert Moog, Don Buchla, Harold Bode, etc.) began assembling various modules into a single chassis, coupled with a user interface
More informationAudio Coding Algorithm for One-Segment Broadcasting
Audio Coding Algorithm for One-Segment Broadcasting V Masanao Suzuki V Yasuji Ota V Takashi Itoh (Manuscript received November 29, 2007) With the recent progress in coding technologies, a more efficient
More informationA Sound Analysis and Synthesis System for Generating an Instrumental Piri Song
, pp.347-354 http://dx.doi.org/10.14257/ijmue.2014.9.8.32 A Sound Analysis and Synthesis System for Generating an Instrumental Piri Song Myeongsu Kang and Jong-Myon Kim School of Electrical Engineering,
More informationDigitizing Sound Files
Digitizing Sound Files Introduction Sound is one of the major elements of multimedia. Adding appropriate sound can make multimedia or web page powerful. For example, linking text or image with sound in
More informationVoice---is analog in character and moves in the form of waves. 3-important wave-characteristics:
Voice Transmission --Basic Concepts-- Voice---is analog in character and moves in the form of waves. 3-important wave-characteristics: Amplitude Frequency Phase Voice Digitization in the POTS Traditional
More informationAcoustic Terms, Definitions and General Information
Acoustic Terms, Definitions and General Information Authored by: Daniel Ziobroski Acoustic Engineer Environmental and Acoustic Engineering GE Energy Charles Powers Program Manager Environmental and Acoustic
More informationTECHNICAL LISTENING TRAINING: IMPROVEMENT OF SOUND SENSITIVITY FOR ACOUSTIC ENGINEERS AND SOUND DESIGNERS
TECHNICAL LISTENING TRAINING: IMPROVEMENT OF SOUND SENSITIVITY FOR ACOUSTIC ENGINEERS AND SOUND DESIGNERS PACS: 43.10.Sv Shin-ichiro Iwamiya, Yoshitaka Nakajima, Kazuo Ueda, Kazuhiko Kawahara and Masayuki
More informationEUROPEAN COMPUTER DRIVING LICENCE. Multimedia Audio Editing. Syllabus
EUROPEAN COMPUTER DRIVING LICENCE Multimedia Audio Editing Syllabus Purpose This document details the syllabus for ECDL Multimedia Module 1 Audio Editing. The syllabus describes, through learning outcomes,
More informationDIGITAL MUSIC DAY 1 WHAT IS SOUND? ANALOG AND DIGITAL EARLY RECORDING WAX FOR YOUR EARS ROUND BUT FLAT WIRE AND TAPE PURE SOUND
DIGITAL MUSIC DAY 1 WHAT IS SOUND? 1. Making a sound playing a musical instrument, moves the surrounding the instrument. 2. Making a sound sends out air which hit your ears. 3. Waves of changing air pressure
More informationLecture 4: Jan 12, 2005
EE516 Computer Speech Processing Winter 2005 Lecture 4: Jan 12, 2005 Lecturer: Prof: J. Bilmes University of Washington Dept. of Electrical Engineering Scribe: Scott Philips
More informationGraham s Guide to Synthesizers (part 1) Analogue Synthesis
Graham s Guide to Synthesizers (part ) Analogue Synthesis Synthesizers were originally developed to imitate or synthesise the sounds of acoustic instruments electronically. Early synthesizers used analogue
More informationMusic technology. Draft GCE A level and AS subject content
Music technology Draft GCE A level and AS subject content July 2015 Contents The content for music technology AS and A level 3 Introduction 3 Aims and objectives 3 Subject content 4 Recording and production
More informationDT3: RF On/Off Remote Control Technology. Rodney Singleton Joe Larsen Luis Garcia Rafael Ocampo Mike Moulton Eric Hatch
DT3: RF On/Off Remote Control Technology Rodney Singleton Joe Larsen Luis Garcia Rafael Ocampo Mike Moulton Eric Hatch Agenda Radio Frequency Overview Frequency Selection Signals Methods Modulation Methods
More informationBroadband Networks. Prof. Dr. Abhay Karandikar. Electrical Engineering Department. Indian Institute of Technology, Bombay. Lecture - 29.
Broadband Networks Prof. Dr. Abhay Karandikar Electrical Engineering Department Indian Institute of Technology, Bombay Lecture - 29 Voice over IP So, today we will discuss about voice over IP and internet
More informationPCM Encoding and Decoding:
PCM Encoding and Decoding: Aim: Introduction to PCM encoding and decoding. Introduction: PCM Encoding: The input to the PCM ENCODER module is an analog message. This must be constrained to a defined bandwidth
More informationMP3 Player CSEE 4840 SPRING 2010 PROJECT DESIGN. zl2211@columbia.edu. ml3088@columbia.edu
MP3 Player CSEE 4840 SPRING 2010 PROJECT DESIGN Zheng Lai Zhao Liu Meng Li Quan Yuan zl2215@columbia.edu zl2211@columbia.edu ml3088@columbia.edu qy2123@columbia.edu I. Overview Architecture The purpose
More informationBasics of Digital Recording
Basics of Digital Recording CONVERTING SOUND INTO NUMBERS In a digital recording system, sound is stored and manipulated as a stream of discrete numbers, each number representing the air pressure at a
More informationAudio Coding, Psycho- Accoustic model and MP3
INF5081: Multimedia Coding and Applications Audio Coding, Psycho- Accoustic model and MP3, NR Torbjørn Ekman, Ifi Nils Christophersen, Ifi Sverre Holm, Ifi What is Sound? Sound waves: 20Hz - 20kHz Speed:
More informationMultimedia Databases. Wolf-Tilo Balke Philipp Wille Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.
Multimedia Databases Wolf-Tilo Balke Philipp Wille Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de 0 Organizational Issues Lecture 21.10.2014 03.02.2015
More informationAnalog and Digital Signals, Time and Frequency Representation of Signals
1 Analog and Digital Signals, Time and Frequency Representation of Signals Required reading: Garcia 3.1, 3.2 CSE 3213, Fall 2010 Instructor: N. Vlajic 2 Data vs. Signal Analog vs. Digital Analog Signals
More informationYour Hearing ILLUMINATED
Your Hearing ILLUMINATED INFORMATION FROM YOUR HEARING CARE PROFESSIONAL REDISCOVER your hearing and reconnect 1 with the important things you might have been missing. Your sense of hearing is a vital
More informationT = 1 f. Phase. Measure of relative position in time within a single period of a signal For a periodic signal f(t), phase is fractional part t p
Data Transmission Concepts and terminology Transmission terminology Transmission from transmitter to receiver goes over some transmission medium using electromagnetic waves Guided media. Waves are guided
More informationLecture 2 Outline. EE 179, Lecture 2, Handout #3. Information representation. Communication system block diagrams. Analog versus digital systems
Lecture 2 Outline EE 179, Lecture 2, Handout #3 Information representation Communication system block diagrams Analog versus digital systems Performance metrics Data rate limits Next lecture: signals and
More informationThis document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore.
This document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore. Title Transcription of polyphonic signals using fast filter bank( Accepted version ) Author(s) Foo, Say Wei;
More informationTutorial about the VQR (Voice Quality Restoration) technology
Tutorial about the VQR (Voice Quality Restoration) technology Ing Oscar Bonello, Solidyne Fellow Audio Engineering Society, USA INTRODUCTION Telephone communications are the most widespread form of transport
More informationNetworked AV Systems Pretest
Networked AV Systems Pretest Instructions Choose the best answer for each question. Score your pretest using the key on the last page. If you miss three or more out of questions 1 11, consider taking Essentials
More informationTutorial. Part One -----Class1, 02/05/2015
2.1.1 Tutorial Part One -----Class1, 02/05/2015 Download Audacity and LAME Encoder Audacity is an open source cross-platform (It can be used in Windows, Macs, and Linux) audio editor. You can download
More informationSampling Theorem Notes. Recall: That a time sampled signal is like taking a snap shot or picture of signal periodically.
Sampling Theorem We will show that a band limited signal can be reconstructed exactly from its discrete time samples. Recall: That a time sampled signal is like taking a snap shot or picture of signal
More informationWelcome to the United States Patent and TradeMark Office
Welcome to the United States Patent and TradeMark Office an Agency of the United States Department of Commerce United States Patent 5,159,703 Lowery October 27, 1992 Silent subliminal presentation system
More informationSpeech Signal Processing: An Overview
Speech Signal Processing: An Overview S. R. M. Prasanna Department of Electronics and Electrical Engineering Indian Institute of Technology Guwahati December, 2012 Prasanna (EMST Lab, EEE, IITG) Speech
More informationLecture 1-6: Noise and Filters
Lecture 1-6: Noise and Filters Overview 1. Periodic and Aperiodic Signals Review: by periodic signals, we mean signals that have a waveform shape that repeats. The time taken for the waveform to repeat
More informationDepartment of Electrical and Computer Engineering Ben-Gurion University of the Negev. LAB 1 - Introduction to USRP
Department of Electrical and Computer Engineering Ben-Gurion University of the Negev LAB 1 - Introduction to USRP - 1-1 Introduction In this lab you will use software reconfigurable RF hardware from National
More informationIntroduzione alle Biblioteche Digitali Audio/Video
Introduzione alle Biblioteche Digitali Audio/Video Biblioteche Digitali 1 Gestione del video Perchè è importante poter gestire biblioteche digitali di audiovisivi Caratteristiche specifiche dell audio/video
More informationFor Articulation Purpose Only
E305 Digital Audio and Video (4 Modular Credits) This document addresses the content related abilities, with reference to the module. Abilities of thinking, learning, problem solving, team work, communication,
More informationA Comparison of the ATRAC and MPEG-1 Layer 3 Audio Compression Algorithms Christopher Hoult, 18/11/2002 University of Southampton
A Comparison of the ATRAC and MPEG-1 Layer 3 Audio Compression Algorithms Christopher Hoult, 18/11/2002 University of Southampton Abstract This paper intends to give the reader some insight into the workings
More informationEECC694 - Shaaban. Transmission Channel
The Physical Layer: Data Transmission Basics Encode data as energy at the data (information) source and transmit the encoded energy using transmitter hardware: Possible Energy Forms: Electrical, light,
More informationRF Measurements Using a Modular Digitizer
RF Measurements Using a Modular Digitizer Modern modular digitizers, like the Spectrum M4i series PCIe digitizers, offer greater bandwidth and higher resolution at any given bandwidth than ever before.
More informationGUIDELINES FOR THE CREATION OF DIGITAL COLLECTIONS
GUIDELINES FOR THE CREATION OF DIGITAL COLLECTIONS Digitization Best Practices for Audio This document sets forth guidelines for digitizing audio materials for CARLI Digital Collections. The issues described
More informationData Transmission. Data Communications Model. CSE 3461 / 5461: Computer Networking & Internet Technologies. Presentation B
CSE 3461 / 5461: Computer Networking & Internet Technologies Data Transmission Presentation B Kannan Srinivasan 08/30/2012 Data Communications Model Figure 1.2 Studying Assignment: 3.1-3.4, 4.1 Presentation
More informationMathematical Harmonies Mark Petersen
1 Mathematical Harmonies Mark Petersen What is music? When you hear a flutist, a signal is sent from her fingers to your ears. As the flute is played, it vibrates. The vibrations travel through the air
More informationConvention Paper Presented at the 112th Convention 2002 May 10 13 Munich, Germany
Audio Engineering Society Convention Paper Presented at the 112th Convention 2002 May 10 13 Munich, Germany This convention paper has been reproduced from the author's advance manuscript, without editing,
More informationFigure 1: Relation between codec, data containers and compression algorithms.
Video Compression Djordje Mitrovic University of Edinburgh This document deals with the issues of video compression. The algorithm, which is used by the MPEG standards, will be elucidated upon in order
More informationFOURIER TRANSFORM BASED SIMPLE CHORD ANALYSIS. UIUC Physics 193 POM
FOURIER TRANSFORM BASED SIMPLE CHORD ANALYSIS Fanbo Xiang UIUC Physics 193 POM Professor Steven M. Errede Fall 2014 1 Introduction Chords, an essential part of music, have long been analyzed. Different
More informationMusic Compression Algorithms and Why You Should Care. Alexander Benjamin 12/09/10
Music Compression Algorithms and Why You Should Care Alexander Benjamin 12/09/10 TABLE OF CONTENTS 1.0 Introduction... 2 2.0 General Data Compression Techniques... 3 2.1 Lossless versus Lossy Compression...
More informationSTUDY OF MUTUAL INFORMATION IN PERCEPTUAL CODING WITH APPLICATION FOR LOW BIT-RATE COMPRESSION
STUDY OF MUTUAL INFORMATION IN PERCEPTUAL CODING WITH APPLICATION FOR LOW BIT-RATE COMPRESSION Adiel Ben-Shalom, Michael Werman School of Computer Science Hebrew University Jerusalem, Israel. {chopin,werman}@cs.huji.ac.il
More informationANALYZER BASICS WHAT IS AN FFT SPECTRUM ANALYZER? 2-1
WHAT IS AN FFT SPECTRUM ANALYZER? ANALYZER BASICS The SR760 FFT Spectrum Analyzer takes a time varying input signal, like you would see on an oscilloscope trace, and computes its frequency spectrum. Fourier's
More information4 Digital Video Signal According to ITU-BT.R.601 (CCIR 601) 43
Table of Contents 1 Introduction 1 2 Analog Television 7 3 The MPEG Data Stream 11 3.1 The Packetized Elementary Stream (PES) 13 3.2 The MPEG-2 Transport Stream Packet.. 17 3.3 Information for the Receiver
More informationDAB + The additional audio codec in DAB
DAB + The additional audio codec in DAB 2007 Contents Why DAB + Features of DAB + Possible scenarios with DAB + Comparison of DAB + and DMB for radio services Performance of DAB + Status of standardisation
More informationDigital Transmission of Analog Data: PCM and Delta Modulation
Digital Transmission of Analog Data: PCM and Delta Modulation Required reading: Garcia 3.3.2 and 3.3.3 CSE 323, Fall 200 Instructor: N. Vlajic Digital Transmission of Analog Data 2 Digitization process
More informationNoise. CIH Review PDC March 2012
Noise CIH Review PDC March 2012 Learning Objectives Understand the concept of the decibel, decibel determination, decibel addition, and weighting Know the characteristics of frequency that are relevant
More informationSound Perception. Sensitivity to Sound. Sensitivity to Sound 1/9/11. Not physically sensitive to all possible sound frequencies Range
Sound Perception Similarities between sound and light perception Characteristics of sound waves Wavelength = Pitch Purity = Timbre Amplitude = loudness Sensitivity to Sound Not physically sensitive to
More informationARTICLE. Sound in surveillance Adding audio to your IP video solution
ARTICLE Sound in surveillance Adding audio to your IP video solution Table of contents 1. First things first 4 2. Sound advice 4 3. Get closer 5 4. Back and forth 6 5. Get to it 7 Introduction Using audio
More informationBasic Acoustics and Acoustic Filters
Basic CHAPTER Acoustics and Acoustic Filters 1 3 Basic Acoustics and Acoustic Filters 1.1 The sensation of sound Several types of events in the world produce the sensation of sound. Examples include doors
More information1. (Ungraded) A noiseless 2-kHz channel is sampled every 5 ms. What is the maximum data rate?
Homework 2 Solution Guidelines CSC 401, Fall, 2011 1. (Ungraded) A noiseless 2-kHz channel is sampled every 5 ms. What is the maximum data rate? 1. In this problem, the channel being sampled gives us the
More informationMatlab GUI for WFB spectral analysis
Matlab GUI for WFB spectral analysis Jan Nováček Department of Radio Engineering K13137, CTU FEE Prague Abstract In the case of the sound signals analysis we usually use logarithmic scale on the frequency
More informationStep by step guide to using Audacity
Step by step guide to using Audacity Contents 1 - Introduction... 1 2 - Getting Started... 2 2.1 - Starting Audacity... 2 3 Recording, Saving and Editing Your Audio... 3 3.1 Recording your audio... 3 3.2
More informationCBS RECORDS PROFESSIONAL SERIES CBS RECORDS CD-1 STANDARD TEST DISC
CBS RECORDS PROFESSIONAL SERIES CBS RECORDS CD-1 STANDARD TEST DISC 1. INTRODUCTION The CBS Records CD-1 Test Disc is a highly accurate signal source specifically designed for those interested in making
More informationAudio Engineering Society. Convention Paper. Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA
Audio Engineering Society Convention Paper Presented at the 129th Convention 2010 November 4 7 San Francisco, CA, USA The papers at this Convention have been selected on the basis of a submitted abstract
More informationMUSC 1327 Audio Engineering I Syllabus Addendum McLennan Community College, Waco, TX
MUSC 1327 Audio Engineering I Syllabus Addendum McLennan Community College, Waco, TX Instructor Brian Konzelman Office PAC 124 Phone 299-8231 WHAT IS THIS COURSE? AUDIO ENGINEERING I is the first semester
More informationVideo compression: Performance of available codec software
Video compression: Performance of available codec software Introduction. Digital Video A digital video is a collection of images presented sequentially to produce the effect of continuous motion. It takes
More informationWeb-Conferencing System SAViiMeeting
Web-Conferencing System SAViiMeeting Alexei Machovikov Department of Informatics and Computer Technologies National University of Mineral Resources Mining St-Petersburg, Russia amachovikov@gmail.com Abstract
More informationIntroduction to Digital Audio
Introduction to Digital Audio Before the development of high-speed, low-cost digital computers and analog-to-digital conversion circuits, all recording and manipulation of sound was done using analog techniques.
More informationAN1200.04. Application Note: FCC Regulations for ISM Band Devices: 902-928 MHz. FCC Regulations for ISM Band Devices: 902-928 MHz
AN1200.04 Application Note: FCC Regulations for ISM Band Devices: Copyright Semtech 2006 1 of 15 www.semtech.com 1 Table of Contents 1 Table of Contents...2 1.1 Index of Figures...2 1.2 Index of Tables...2
More informationDoppler. Doppler. Doppler shift. Doppler Frequency. Doppler shift. Doppler shift. Chapter 19
Doppler Doppler Chapter 19 A moving train with a trumpet player holding the same tone for a very long time travels from your left to your right. The tone changes relative the motion of you (receiver) and
More informationTiming Errors and Jitter
Timing Errors and Jitter Background Mike Story In a sampled (digital) system, samples have to be accurate in level and time. The digital system uses the two bits of information the signal was this big
More informationVideo Conferencing Glossary of Terms
Video Conferencing Glossary of Terms A Algorithm A step-by-step problem-solving procedure. Transmission of compressed video over a communications network requires sophisticated compression algorithms.
More informationThe Effects of Ultrasonic Sound Generated by Ultrasonic Cleaning Systems on Human Hearing and Physiology
The Effects of Ultrasonic Sound Generated by Ultrasonic Cleaning Systems on Human Hearing and Physiology Questions about the effects of ultrasonic energy on hearing and other human physiology arise from
More informationFREE TV AUSTRALIA OPERATIONAL PRACTICE OP 60 Multi-Channel Sound Track Down-Mix and Up-Mix Draft Issue 1 April 2012 Page 1 of 6
Page 1 of 6 1. Scope. This operational practice sets out the requirements for downmixing 5.1 and 5.0 channel surround sound audio mixes to 2 channel stereo. This operational practice recommends a number
More informationThe Sonometer The Resonant String and Timbre Change after plucking
The Sonometer The Resonant String and Timbre Change after plucking EQUIPMENT Pasco sonometers (pick up 5 from teaching lab) and 5 kits to go with them BK Precision function generators and Tenma oscilloscopes
More informationBuilding Design for Advanced Technology Instruments Sensitive to Acoustical Noise
Building Design for Advanced Technology Instruments Sensitive to Acoustic Noise Michael Gendreau Colin Gordon & Associates Presentation Outline! High technology research and manufacturing instruments respond
More informationImage Compression through DCT and Huffman Coding Technique
International Journal of Current Engineering and Technology E-ISSN 2277 4106, P-ISSN 2347 5161 2015 INPRESSCO, All Rights Reserved Available at http://inpressco.com/category/ijcet Research Article Rahul
More informationCurrent Status and Problems in Mastering of Sound Volume in TV News and Commercials
Current Status and Problems in Mastering of Sound Volume in TV News and Commercials Doo-Heon Kyon, Myung-Sook Kim and Myung-Jin Bae Electronics Engineering Department, Soongsil University, Korea kdhforce@gmail.com,
More informationSound and stringed instruments
Sound and stringed instruments Lecture 14: Sound and strings Reminders/Updates: HW 6 due Monday, 10pm. Exam 2, a week today! 1 Sound so far: Sound is a pressure or density fluctuation carried (usually)
More informationLittle LFO. Little LFO. User Manual. by Little IO Co.
1 Little LFO User Manual Little LFO by Little IO Co. 2 Contents Overview Oscillator Status Switch Status Light Oscillator Label Volume and Envelope Volume Envelope Attack (ATT) Decay (DEC) Sustain (SUS)
More informationDr. Abdel Aziz Hussein Lecturer of Physiology Mansoura Faculty of Medicine
Physiological Basis of Hearing Tests By Dr. Abdel Aziz Hussein Lecturer of Physiology Mansoura Faculty of Medicine Introduction Def: Hearing is the ability to perceive certain pressure vibrations in the
More informationFREE TV AUSTRALIA OPERATIONAL PRACTICE OP- 59 Measurement and Management of Loudness in Soundtracks for Television Broadcasting
Page 1 of 9 1. SCOPE This Operational Practice is recommended by Free TV Australia and refers to the measurement of audio loudness as distinct from audio level. It sets out guidelines for measuring and
More informationitunes 7.0 Fall 07 fall 2007
itunes 7.0 Fall 07 fall 2007 Table of Contents Introduction 3 Layout of itunes 3 Playlists 4 Create a Playlist 4 Create a Smart Playlist 5 Burning to CD 5 Burning Preferences 5 Importing Files 6 Encoding
More informationNew Methods of Stereo Encoding for FM Radio Broadcasting Based on Digital Technology
12 P. STRAŇÁK, NEW METHODS OF STEREO ENCODING FOR FM RADIO BROADCASTING BASED ON DIGITAL TECHNOLOGY New Methods of Stereo Encoding for FM Radio Broadcasting Based on Digital Technology Pavel STRAŇÁK Phobos
More informationQuarterly Progress and Status Report. Measuring inharmonicity through pitch extraction
Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Measuring inharmonicity through pitch extraction Galembo, A. and Askenfelt, A. journal: STL-QPSR volume: 35 number: 1 year: 1994
More informationTeaching Fourier Analysis and Wave Physics with the Bass Guitar
Teaching Fourier Analysis and Wave Physics with the Bass Guitar Michael Courtney Department of Chemistry and Physics, Western Carolina University Norm Althausen Lorain County Community College This article
More informationThe Phase Modulator In NBFM Voice Communication Systems
The Phase Modulator In NBFM Voice Communication Systems Virgil Leenerts 8 March 5 The phase modulator has been a point of discussion as to why it is used and not a frequency modulator in what are called
More informationHearing and Deafness 1. Anatomy & physiology
Hearing and Deafness 1. Anatomy & physiology Chris Darwin Web site for lectures, lecture notes and filtering lab: http://www.lifesci.susx.ac.uk/home/chris_darwin/ safari 1 Outer, middle & inner ear Capture;
More informationSpectrum Level and Band Level
Spectrum Level and Band Level ntensity, ntensity Level, and ntensity Spectrum Level As a review, earlier we talked about the intensity of a sound wave. We related the intensity of a sound wave to the acoustic
More informationAnalog vs. Digital Transmission
Analog vs. Digital Transmission Compare at two levels: 1. Data continuous (audio) vs. discrete (text) 2. Signaling continuously varying electromagnetic wave vs. sequence of voltage pulses. Also Transmission
More informationBorn-digital media for long term preservation and access: Selection or deselection of media independent music productions
Submitted on: 29.07.2015 Born-digital media for long term preservation and access: Selection or deselection of media independent music productions Trond Valberg Head Curator, Music Section, National Library
More information