Habilitation. Bonn University. Information Retrieval. Dec. 2007. PhD students. General Goals. Music Synchronization: Audio-Audio




Perspectives Lecture (Perspektivenvorlesung), Winter Term 2008/2009
Information Retrieval
Music and Motion
Meinard Müller
Saarland University and MPI Informatik
meinard@mpi-inf.mpg.de

Bonn University
Prof. Dr. Michael Clausen
PD Dr. Frank Kurth
Dipl.-Inform. Christian Fremerey
Dipl.-Inform. David Damm
Dipl.-Inform. Sebastian Ewert
Dr. Tido Röder
Habilitation: Dec. 2007

PhD students
Dipl.-Inform. Andreas Baak (DFG)
Dipl.-Math. Verena Konz (MMCI)
Dipl.-Ing. Peter Grosche (MMCI)
Dipl.-Inform. Thomas Helten (DFG)

Music Data
Various interpretations of Beethoven's Fifth: Bernstein, Karajan, Scherbakov (piano), MIDI (piano)

General Goals
Automated organization of complex and inhomogeneous music collections
Generation of annotations and cross-links
Tools and methods for multimodal search, navigation and interaction
Music Information Retrieval (MIR)

Music Synchronization: Audio-Audio
Beethoven's Fifth: Karajan, Scherbakov

Music Synchronization: Audio-Audio
Beethoven's Fifth: Karajan, Scherbakov

Music Synchronization: Audio-Audio
Feature extraction: chroma features
[Figure: chroma representations of the Karajan and Scherbakov recordings. Vertical axis: the twelve pitch classes C, C#, D, D#, E, F, F#, G, G#, A, A#, B. Horizontal axis: time. Value scale: 0 to 1.]
Synchronization: Karajan and Scherbakov

Music Synchronization: Audio-Audio
Cost matrix
[Figure: cost matrix. Vertical axis: Karajan. Horizontal axis: Scherbakov. Value scale: 0 to 1.]

Music Synchronization: Audio-Audio
Cost-minimizing warping path
[Figure: the same cost matrix with the cost-minimizing warping path.]

System: SyncPlayer/AudioSwitcher

Music Synchronization: MIDI-Audio
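The three steps named on these slides (chroma features, cost matrix, cost-minimizing warping path) can be illustrated with classical dynamic time warping. The following is only a minimal sketch, not the method of the SyncPlayer system: the cosine-based local cost, the three step directions and all function names are assumptions made for illustration, and the chroma vectors are taken as given 12-dimensional lists rather than extracted from audio.

```python
import math


def cosine_cost(x, y):
    """Local cost between two chroma vectors: 1 - cosine similarity (assumed measure)."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    if norm_x == 0.0 or norm_y == 0.0:
        return 1.0  # assumption: an all-zero (silent) frame gets the maximal cost
    return 1.0 - dot / (norm_x * norm_y)


def cost_matrix(X, Y):
    """Cost matrix C with C[i][j] = cost between frame i of X and frame j of Y."""
    return [[cosine_cost(x, y) for y in Y] for x in X]


def warping_path(C):
    """Return (total cost, path) of a cost-minimizing warping path through C.

    The path runs from (0, 0) to (n-1, m-1) using the steps
    (1, 0), (0, 1) and (1, 1).
    """
    if not C or not C[0]:
        raise ValueError("cost matrix must be non-empty")
    n, m = len(C), len(C[0])
    inf = float("inf")
    # D[i][j] holds the minimal accumulated cost of any path ending at (i, j).
    D = [[inf] * m for _ in range(n)]
    D[0][0] = C[0][0]
    for i in range(n):
        for j in range(m):
            if i == 0 and j == 0:
                continue
            best = inf
            if i > 0:
                best = min(best, D[i - 1][j])
            if j > 0:
                best = min(best, D[i][j - 1])
            if i > 0 and j > 0:
                best = min(best, D[i - 1][j - 1])
            D[i][j] = C[i][j] + best
    # Backtrack from the end cell to the start cell along minimal predecessors.
    i, j = n - 1, m - 1
    path = [(i, j)]
    while (i, j) != (0, 0):
        candidates = []
        if i > 0 and j > 0:
            candidates.append((D[i - 1][j - 1], i - 1, j - 1))
        if i > 0:
            candidates.append((D[i - 1][j], i - 1, j))
        if j > 0:
            candidates.append((D[i][j - 1], i, j - 1))
        _, i, j = min(candidates)
        path.append((i, j))
    path.reverse()
    return D[n - 1][m - 1], path
```

Each pair (i, j) on the returned path links frame i of one recording to frame j of the other, which is the kind of correspondence needed to switch between two interpretations at the same musical position.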

Music Synchronization: MIDI-Audio
MIDI = meta data
Automated annotation
Audio recording
Sonification of annotations

Music Synchronization: Scan-Audio
Scanned Sheet Music
Correspondence
Audio Recording

Music Synchronization: Scan-Audio
Scanned Sheet Music (high quality)
OMR (optical music recognition) yields Symbolic Note Events (dirty but hidden)
Correspondence
Audio Recording (high quality)

System: SyncPlayer/SheetMusic

Music Synchronization: Lyrics-Audio
Difficult task!
Lyrics-Audio: Lyrics-MIDI + MIDI-Audio

Music Synchronization
Turetsky/Ellis (ISMIR 2003)
Soulez/Rodet/Schwarz (ISMIR 2003)
Arifi/Clausen/Kurth/Müller (ISMIR 2003)
Hu/Dannenberg/Tzanetakis (WASPAA 2003)
Müller/Kurth/Röder (ISMIR 2004)
Raphael (ISMIR 2004)
Dixon/Widmer (ISMIR 2005)
Müller/Mattes/Kurth (ISMIR 2006)
Dannenberg/Raphael (Special Issue ACM 2006)
Fujihara/Goto (ICASSP 2008)
Wang/Iskandar/New/Shenoy (IEEE T-ASLP 2008)

Similarity cluster

Global structure

Global structure
System: SyncPlayer/AudioStructure

Global structure
Dannenberg/Hu (ISMIR 2002)
Peeters/Burthe/Rodet (ISMIR 2002)
Cooper/Foote (ISMIR 2002)
Goto (ICASSP 2003)
Chai/Vercoe (ACM Multimedia 2003)
Lu/Wang/Zhang (ACM Multimedia 2004)
Bartsch/Wakefield (IEEE Trans. Multimedia 2005)
Goto (IEEE Trans. Audio 2006)
Müller/Kurth (EURASIP 2007)

Music Information Retrieval
Music Synchronization
Audio Matching

Multimodal Computing and Interaction
Music: Sheet Music (Image), MIDI, CD / MP3 (Audio)

Multimodal Computing and Interaction
Music: Sheet Music (Image), MIDI, CD / MP3 (Audio), MusicXML (Text), Singing / Voice (Audio), Music Literature (Text), Music Film (Video), Dance / Motion (Mocap)

Motion Capture Data
Digital 3D representations of motions
Computer animation
Sports
Gait analysis

Motion Retrieval
Given: a MoCap database and a query motion clip
Goal: find all motion clips in the database that are similar to the query

Notion of Similarity
Numerical similarity vs. logical similarity
Logically related motions may exhibit significant spatio-temporal variations

Relational Features
Induced feature sequence: [figure not preserved]

Motion Retrieval
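The slides name relational features and the feature sequence they induce, but the transcript does not preserve a definition. As a hedged illustration only, the sketch below assumes one common kind of relational feature: a boolean test of whether a joint lies in front of an oriented plane spanned by three other joints. The joint names and the frame layout (a dict from joint name to a 3D position) are hypothetical.

```python
def sub(a, b):
    """Component-wise difference a - b of two 3D points."""
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def cross(a, b):
    """Cross product of two 3D vectors."""
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )


def dot(a, b):
    """Dot product of two 3D vectors."""
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def in_front_of_plane(p1, p2, p3, q):
    """True if q lies on the positive side of the oriented plane through p1, p2, p3.

    The orientation is given by the normal (p2 - p1) x (p3 - p1).
    """
    normal = cross(sub(p2, p1), sub(p3, p1))
    return dot(normal, sub(q, p1)) > 0.0


def right_foot_in_front(frame):
    """Hypothetical relational feature; the joint names are assumptions."""
    return in_front_of_plane(
        frame["root"], frame["left_hip"], frame["left_toes"], frame["right_toes"]
    )


def feature_sequence(frames, feature):
    """Apply a boolean feature to every frame, giving the induced 0/1 sequence."""
    return [1 if feature(frame) else 0 for frame in frames]
```

Because such a feature records only a qualitative relation between body parts, two performances of the same logical motion can yield the same 0/1 sequence even when their joint coordinates differ considerably.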

Relational Features
Spatio-temporal invariance
Indexing
Efficient retrieval & preselection
Problem: feature design & selection

Motion Templates

MT-based Motion Retrieval

MT-based Motion Retrieval: Jumping Jack

MT-based Motion Retrieval: Elbow-To-Knee
[Figure annotation: τ]

MT-based Motion Retrieval
[Figure annotation: τ]
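The slides show retrieval results for the motion classes "Jumping Jack" and "Elbow-To-Knee" together with the symbol τ, but the matching procedure itself is not preserved in the transcript. The sketch below assumes that τ is a distance threshold and substitutes a plain sliding-window comparison for whatever matching the lecture actually uses. The template format (a short sequence of boolean feature vectors) and the function names are likewise assumptions.

```python
def distance_curve(template, sequence):
    """Distance between the template and every window of the sequence.

    Both arguments are lists of equal-length 0/1 feature vectors. The distance
    of a window is the fraction of feature entries that differ from the template.
    """
    if not template or not template[0]:
        raise ValueError("template must contain at least one feature entry")
    length = len(template)
    curve = []
    for start in range(len(sequence) - length + 1):
        mismatches = 0
        total = 0
        for offset in range(length):
            for a, b in zip(template[offset], sequence[start + offset]):
                total += 1
                if a != b:
                    mismatches += 1
        curve.append(mismatches / total)
    return curve


def hits(curve, tau):
    """Start positions whose distance does not exceed the threshold tau."""
    return [position for position, d in enumerate(curve) if d <= tau]
```

Under these assumptions, a small τ returns only windows that agree almost exactly with the template, while a larger τ admits more variation at the price of more false positives.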

MT-based Motion Retrieval

Conclusions
Automated data organization
Exploiting multimodality
Synchronization
Handling object deformations
Indexing and efficiency

Lecture
Music Processing
Summer Term 2009
Thursdays 16-18
MPI, Room 024
3 Credit Points
Website

Textbook
Meinard Müller: Information Retrieval for Music and Motion
2007, XVI, 318 pages, 136 illus., 39 in color, hardcover
ISBN: 978-3-540-74047-6
www.springer.com/978-3-540-74047-6/
69,50 EUR

Selected Publications
M. Müller (2007): Information Retrieval for Music and Motion. Monograph, Springer, 318 pages.
F. Kurth, M. Müller (2008): Efficient Index-Based Audio Matching. IEEE Trans. Audio, Speech & Language Processing, Vol. 16, No. 2, 382-395.
M. Müller, D. Appelt (2008): Path-constrained Partial Music Synchronization. Proc. International Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2008).
M. Müller, F. Kurth (2006): Enhancing Similarity Matrices for Music Audio Analysis. Proc. International Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2006).
M. Müller, T. Röder (2006): Motion Templates for Automatic Classification and Retrieval of Mocap Data. Proc. ACM SIGGRAPH / Eurographics Symposium on Computer Animation (SCA 2006).
M. Müller, T. Röder, M. Clausen (2005): Efficient Content-Based Retrieval of Motion Capture Data. ACM Trans. Graph. 24 (SIGGRAPH 2005).