Habilitation. Bonn University. Information Retrieval. Dec. 2007. PhD students. General Goals. Music Synchronization: Audio-Audio



Similar documents
Beethoven, Bach und Billionen Bytes

Automatic Organization of Digital Music Documents Sheet Music and Audio

Recent advances in Digital Music Processing and Indexing

Music Information Research and Its Importance

Annotated bibliographies for presentations in MUMT 611, Winter 2006

LOCAL SURFACE PATCH BASED TIME ATTENDANCE SYSTEM USING FACE.

EE3414 Multimedia Communication Systems Part I

ISSN: A Review: Image Retrieval Using Web Multimedia Mining

engin erzin the use of speech processing applications is expected to surge in multimedia-rich scenarios

Semantic Video Annotation by Mining Association Patterns from Visual and Speech Features

UNIVERSITY OF CENTRAL FLORIDA AT TRECVID Yun Zhai, Zeeshan Rasheed, Mubarak Shah

Automatic Transcription: An Enabling Technology for Music Analysis

Speaker: Prof. Mubarak Shah, University of Central Florida. Title: Representing Human Actions as Motion Patterns

Open issues and research trends in Content-based Image Retrieval

What is Multimedia? Derived from the word Multi and Media

Geometric Constraints

4.3: Multimedia Database Systems Multimedia Database Management System Data Structure Operations on Data Integration in a Database Model

NeMO - NeDiMAH Methods Ontology. Use Case manual: How to report your case using throughout the Excel template

A MACHINE LEARNING APPROACH TO FILTER UNWANTED MESSAGES FROM ONLINE SOCIAL NETWORKS

DYNAMIC CHORD ANALYSIS FOR SYMBOLIC MUSIC

Introduction to Computer Graphics

Limits and Possibilities of Markerless Human Motion Estimation

Multimedia Databases. Wolf-Tilo Balke Philipp Wille Institut für Informationssysteme Technische Universität Braunschweig

Level: 3 Credit value: 5 GLH: 40 Assessment type:

MULTIMEDIA MINING RESEARCH AN OVERVIEW

Presentation Video Retrieval using Automatically Recovered Slide and Spoken Text

Internet Video Streaming and Cloud-based Multimedia Applications. Outline

AUTOMATIC VIDEO STRUCTURING BASED ON HMMS AND AUDIO VISUAL INTEGRATION

Modelling 3D Avatar for Virtual Try on

Automatic Annotation Wrapper Generation and Mining Web Database Search Result

Prof. Dr. D. W. Cunningham, Berliner Strasse 35A, Cottbus, Germany

Information Model for Multimedia Medical Record in Telemedicine

Outline. CIW Web Design Specialist. Course Content

Survey: Retrieval of Video Using Content (Speech &Text) Information

M3039 MPEG 97/ January 1998

HYPER MEDIA MESSAGING

Movie Classification Using k-means and Hierarchical Clustering

Web Mining Seminar CSE 450. Spring 2008 MWF 11:10 12:00pm Maginnes 113

CHAPTER 6: GRAPHICS, DIGITAL MEDIA, AND MULTIMEDIA

Database Systems. Multimedia Database Management System. Application. User. Application. Chapter 2: Basics

A Case study based Software Engineering Education using Open Source Tools

Introduction to Pattern Recognition

MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts

Introduction. Selim Aksoy. Bilkent University

Unit 351: Website Software Level 3

Universidad Autónoma de Guadalajara Unidad Académica de Educación Secundaria y Media Superior Middle School Guide of classes for the student

Interactive Multimedia Courses-1

Music Technology Programs

Mining Signatures in Healthcare Data Based on Event Sequences and its Applications

A comprehensive survey on various ETC techniques for secure Data transmission

ANIMATION a system for animation scene and contents creation, retrieval and display

WESTERN KENTUCKY UNIVERSITY. Web Accessibility. Objective

WEST JEFFERSON HILLS SCHOOL DISTRICT THOMAS JEFFERSON HIGH SCHOOL DIGITAL DESIGN GRADES

MMGD0203 Multimedia Design MMGD0203 MULTIMEDIA DESIGN. Chapter 3 Graphics and Animations

Immersive Medien und 3D-Video

NAVIGATING SCIENTIFIC LITERATURE A HOLISTIC PERSPECTIVE. Venu Govindaraju

Where to Find the Highest Audio Engineer Salary. How Education and Training can affect the Audio Engineer Salary

Multimedia Technology Bachelor of Science

Web Mining. Margherita Berardi LACAM. Dipartimento di Informatica Università degli Studi di Bari

The School-assessed Task has three components. They relate to: Unit 3 Outcome 2 Unit 3 Outcome 3 Unit 4 Outcome 1.

FAST MIR IN A SPARSE TRANSFORM DOMAIN

Component visualization methods for large legacy software in C/C++

Modern Databases. Database Systems Lecture 18 Natasha Alechina

Spam Filtering in Online Social Networks Using Machine Learning Technique

Filtering Noisy Contents in Online Social Network by using Rule Based Filtering System

1 o Semestre 2007/2008

Development of Enterprise Architecture of PPDR Organisations W. Müller, F. Reinert

Combating Anti-forensics of Jpeg Compression

Course Overview. CSCI 480 Computer Graphics Lecture 1. Administrative Issues Modeling Animation Rendering OpenGL Programming [Angel Ch.

Web Design Specialist

Natural Language Querying for Content Based Image Retrieval System

Music Technology II. What are some additional applications understand midi sequencing. for music production software?

Semantic Concept Based Retrieval of Software Bug Report with Feedback

DINAMIC AND STATIC CENTRE OF PRESSURE MEASUREMENT ON THE FORCEPLATE. F. R. Soha, I. A. Szabó, M. Budai. Abstract

Transcription:

Perspektivenvorlesung Information Retrieval Music and Motion Bonn University Prof. Dr. Michael Clausen PD Dr. Frank Kurth Dipl.-Inform. Christian Fremerey Dipl.-Inform. David Damm Dipl.-Inform. Sebastian Ewert Dr. Tido Röder Habilitation Meinard Müller Saarland University and MPI Informatik meinard@mpi-inf.mpg.de Dec. 2007 PhD students Winter Term 2008/2009 Dipl.-Inform. Andreas Baak Dipl.-Math. Verena Konz Dipl.-Ing. Peter Grosche Dipl.-Inform. Thomas Helten (DFG) (MMCI) (MMCI) (DFG) Music Data Music Data Various interpretations Beethoven s Fifth Bernstein Karajan Scherbakov (piano) MIDI (piano) General Goals Music Synchronization: Audio-Audio Beethoven s Fifth Automated organization of complex and inhomogeneous music collections Karajan Generation of annotations and cross-links Tools and methods for multimodal search, navigation and interaction Scherbakov Music Information Retrieval (MIR)

Music Synchronization: Audio-Audio Beethoven s Fifth Music Synchronization: Audio-Audio Feature extraction: chroma features Karajan Karajan Scherbakov C C# 1 0.9 C C# 1 0.9 D 0.8 D 0.8 D# 0.7 D# 0.7 Scherbakov E F F# G G# A 0.6 0.5 0.4 0.3 0.2 E F F# G G# A 0.6 0.5 0.4 0.3 0.2 A# B 2 4 6 8 10 12 14 16 18 0.1 0 A# B 5 10 15 20 0.1 0 Synchronization: Karajan Scherbakov Music Synchronization: Audio-Audio Cost matrix Music Synchronization: Audio-Audio Cost-minimizing warping path 1 1 18 0.9 18 0.9 16 0.8 16 0.8 14 0.7 14 0.7 Karajan 12 10 8 0.6 0.5 0.4 Karajan 12 10 8 0.6 0.5 0.4 6 0.3 6 0.3 4 0.2 4 0.2 2 0.1 2 0.1 2 4 6 8 10 12 14 16 18 20 0 2 4 6 8 10 12 14 16 18 20 0 Scherbakov Scherbakov System: SyncPlayer/AudioSwitcher Music Synchronization: MIDI-Audio

Music Synchronization: MIDI-Audio Music Synchronization: Scan-Audio MIDI = meta data Automated annotation Audio recording Sonification of annotations Music Synchronization: Scan-Audio Music Synchronization: Scan-Audio Scanned Sheet Music Scanned Sheet Music Symbolic Note Events OMR Correspondence Correspondence Audio Recording Audio Recording Music Synchronization: Scan-Audio Music Synchronization: Scan-Audio Scanned Sheet Music Symbolic Note Events Scanned Sheet Music Symbolic Note Events OMR High Qualtity OMR Dirty but hidden Correspondence Correspondence High Qualtity Audio Recording Audio Recording

System: SyncPlayer/SheetMusic Music Synchronization: Lyrics-Audio Difficult task! Music Synchronization: Lyrics-Audio Lyrics-Audio Lyrics-MIDI + MIDI-Audio Music Synchronization Turetsky/Ellis (ISMIR 2003) Soulez/Rodet/Schwarz (ISMIR 2003) Arifi/Clausen/Kurth/Müller (ISMIR 2003) Hu/Dannenberg/Tzanetakis (WASPAA 2003) Müller/Kurth/Röder (ISMIR 2004) Raphael (ISMIR 2004) Dixon/Widmer (ISMIR 2005) Müller/Mattes/Kurth (ISMIR 2006) Dannenberg /Raphael (Special Issue ACM 2006) Fujihara/Goto (ICASSP 2008) Wang/Iskandar/New/Shenoy (IEEE T-ASLP 2008)

Similarity cluster

Global structure

Global structure Global structure System: SyncPlayer/AudioStructure Dannenberg/Hu (ISMIR 2002) Peeters/Burthe/Rodet (ISMIR 2002) Cooper/Foote (ISMIR 2002) Goto (ICASSP 2003) Chai/Vercoe (ACM Multimedia 2003) Lu/Wang/Zhang (ACM Multimedia 2004) Bartsch/Wakefield (IEEE Trans. Multimedia 2005) Goto (IEEE Trans. Audio 2006) Müller/Kurth (EURASIP 2007) Music Information Retrieval Multimodal Computing and Interaction Sheet Music (Image) MIDI CD / MP3 (Audio) Music Synchronization Audio Matching Music

Multimodal Computing and Interaction Motion Capture Data Sheet Music (Image) MIDI CD / MP3 (Audio) Digital 3D representations of motions MusicXML (Text) Music Singing / Voice (Audio) Computer animation Sports Music Literature (Text) Music Film (Video) Dance / Motion (Mocap) Gait analysis Motion Capture Data Motion Capture Data Motion Retrieval Motion Retrieval = MoCap database = query motion clip Goal: find all motion clips in similar to

Notion of Similarity Relational Features Numerical similarity vs. logical similarity Logically related motions may exhibit significant spatiotemporal variations Relational Features Relational Features Induced feature sequence: Relational Features Motion Retrieval

Relational Features Motion Templates Spatio-temporal invariance Indexing Efficient retrieval & preselection Problem: feature design & selection Motion Templates Motion Templates Motion Templates Motion Templates

MT-based Motion Retrieval MT-based Motion Retrieval: Jumping Jack MT-based Motion Retrieval: Jumping Jack MT-based Motion Retrieval: Elbow-To-Knee τ MT-based Motion Retrieval: Elbow-To-Knee MT-based Motion Retrieval τ

MT-based Motion Retrieval Conclusions Automated data organization Exploiting multimodality Synchronization Handling object deformations Indexing and efficiency Conclusions Lecture Music Processing Textbook Summer Term 2009 Thursdays 16-18 MPI, Room 024 3 Credit Points Website Meinard Müller Information Retrieval for Music and Motion 2007, XVI. 318 pages 136 illus. 39 in Color, Hardcover ISBN: 978-3-540-74047-6 www.springer.com/978-3-540-74047-6/ 69,50 EUR Selected Publications M. Müller (2007): Information Retrieval for Music and Motion. Monograph, Springer, 318 pages F. Kurth, M. Müller (2008): Efficient Index-Based Audio Matching. IEEE Trans. Audio, Speech & Language Processing, Vol. 16, No. 2, 382-395. M. Müller, D. Appelt (2008): Path-constrained Partial Music Synchronization. Proc. International Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2008) M. Müller, F. Kurth (2006): Enhancing Similarity Matrices for Music Audio Analysis. Proc. International Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2006) M. Müller, T. Röder (2006): Motion Templates for Automatic Classification and Retrieval of Mocap Data. Proc. ACM SIGGRAPH / Eurographics Symposium on Computer Animation (SCA 2006) M. Müller, T. Röder, M. Clausen (2005): Efficient Content-Based Retrieval of Motion Capture Data. ACM Trans. Graph. 24 (SIGGRAPH 2005)