Presentation outline. An In-Depth Evaluation of Multimodal Video Genre Categorization. State-of-the-art many approaches (more than 10 years), e.g.
|
|
- Alvin Sherman
- 7 years ago
- Views:
Transcription
1 An In-Depth Evaluation of Multimodal Video Genre Categorization Presentation outline Problem statement, state-of-the-art and contribution Video content description Ionuț MIRONICĂ 1 imironica@imag.pub.ro Bogdan IONESCU 1,2 bionescu@imag.pub.ro Peter KNEES 3 peter.knees@jku.at Patrick LAMBERT 2 Data fusion Experimental results 11th International Workshop on Content-Based Multimedia Indexing,, Veszprém, Hungary, June 17-19, Conclusions 1 University POLITEHNICA of Bucharest Problem statement data indexing is based on extracting content-based descriptors, (numeric/compact representation of rich text-audio-visual information); content annotation (key process) goal: determine relevant content descriptors to facilitate automatic applications: classification into predefined categories,i.e., video genres: automatic categorization of videos (e.g., TV programs, video selling platforms); genre based visualization of web media (e.g., YouTube, blip.tv) State-of-the-art many approaches (more than 10 years), e.g.,: [X. Yuan, W. Lai, T. Mei, X.S. Hua, X.Q. Wu, S. Li 06] spatio-temporal - annotation: temporal (avg. shot length, cut %, camera motion) & spatial (face frames %, avg. brightness, color entropy); - classifier: Decision Trees & several s; - genres: - movie, commercial, news, music & sports - movies: action, comedy, horror & cartoon - sports: baseball, football, volleyball, tennis, basketball & soccer [Y. Song, Y.-D. Zhang, X. Zhang, J. Cao, J.-T. Li 09] only text - annotation: contextual and social information: metadata, user behavior, viewer s behavior and video relevance; - classifier: incremental ; - genres: large-scale web categorization. 3 4 State-of-the-art 2 [M. Montagnuolo, A. Messina 09] multi-modal - annotation: visual-perceptual (colour, texture, motion), temporal (shot length, distribution, rhythm, etc.), cognitive (face properties) & aural (text, sound caracteristics); - classifier: parallel Neural Network system; - genres: football, cartoons, music, weather forecast, newscast, talk shows & commercials. [S. Schmiedeke, C. Kofler, I. Ferrane 12] truly multi-modal Genre Tagging MediaEval Benchmarking Initiative for Multimedia Evaluation (2011, 2012) evaluation framework. - annotation: aural, visual, text from ASR, text from web metadata; - genres: 26 blip.tv genres, Internet videos; Contribution to the state-of-the-art in this context of the state-of-the-art, we attempt to respond to several research questions: to what extent aural and visual information (which can be extracted automatically) can lead to similar performance or even surpass the semantic textual descriptors? how efficient would be an adequate combination of various modalities in achieving highly accurate classification? how really important is the contribution of visual modalities in improving the accuracy of using textual data? 5
2 Approach > challenge: find a way to assign (genre) tags to unknown videos; > approach: machine learning paradigm; labeled data web food autos classifier train unlabeled data labeled data + Content description - audio Standard audio features (audio frame-based) f 1 f 2 f n var{f 2 } var{f n } time global feature = mean & variance Zero-Crossing Rate, Predictive Coefficients, Line Spectral Pairs, Mel-Frequency Cepstral Coefficients, spectral centroid, flux, rolloff, and kurtosis, + variance of each feature over a certain window. tagged video video database 7 [B. Mathieu et al., Yaafe toolbox, ISMIR 10, Netherlands] MPEG-7 & color/texture descriptors (visual frame-based) f 1 f 2 f n time global feature = mean & dispersion & skewness & kurtosis & median & root mean square Local Binary Pattern, Autocorrelogram, Color Coherence Vector, Color Layout Pattern, Edge Histogram, Classic color histogram, Scalable Color Descriptor, Color moments. descriptors (Bag-of-Words) dictionary of 4,096 words; rgbsift and spatial pyramids (2x2); Detection on interest points Codewords Dictionary Generate BoW histograms Train classifier [OpenCV toolbox, [CIVR 2009, J. Uijlings et all] Histogram of oriented Gradients - HoG divides the image into 3x3 cells and for each of them builds a pixel-wise histogram of edge orientations. Structural descriptors describes structural information in terms of contours and their relations ( scalespace representation); σ=1 σ=3 b : degree of curvature (proportional to the maximum amplitude of the bowness space); straight vs. bow ζ : degree of circularity; ½ circle vs. full circle e : edginess parameter zig-zag vs. sinusoid; y : symmetry parameter irregular vs. even edginess symmetry [CITS 2009, O. Ludwig,et all] [IJCV, C. Rasche 10] 12
3 Content description - textual TF-IDF descriptors (Term Frequency-Inverse Document Frequency) text sources: ASR and metadata from Internet, 1. remove XML markups, Data fusion multimodal integration Early fusion: 2. remove terms <5%-percentile of the frequency distribution, Global Descriptor Classifier Global Confidence score Decision 3. select term corpus: retaining for each genre class m terms (e.g., m = 150 for ASR and 20 for metadata) with the highest χ2 values that occur more frequently than in complement classes, 4. video descriptor: the TF-IDF values. extraction Normalization concatenation Classification Step Obtain the Global Confidence Score 14 Data fusion multimodal integration 2 Late fusion: Classifier 1 Classifier 2 Confidence value 1 Confidence value 2 cv 1 cv 2 design the aggregation function Global Confidence score Decision Experimental results Data set: MediaEval 2012 Genre Tagging Task 14,838 episodes from 2,249 shows (3,260 hours); 26 video genres (art, autos, business, comedy, gaming...); Classifier n Confidence value n cv n extraction Classification Step Confidence Scores Normalization Global Confidence Score we test: Experimental results 2 Classification scheme: - we have selected five of the most popular approaches: Support Vector Machines with linear, Radial Basis Function and Chi-square kernels; k-nearest Neighbour; Random and Extremely Random ; - we perform training on 5,288 videos and testing on 9,550; - classifier parameters and late fusion weights were optimized on training dataset. Evaluation metrics: Mean Average Precision summarizes rankings from multiple queries by averaging average precision; Experimental results 3 RBF - Chi 5-NN Random s Performance of visual descriptors: - best performance with MPEG-7 (ERF) and HOG (-RBF); - Bag-of-Words is not performing very well! 17 18
4 Experimental results 4 RBF - Chi 5-NN Random s Performance of audio descriptors: - best performance with Extremely Random s; - provide higher discriminative power than visual features. 19 Experimental results 5 RBF - Chi 5-NN Random s Performance of text descriptors: - best performance with metadata and Random ; - ASR provides lower performance than the use of audio descriptors; - metadata TF-IDF outperforms all the other approaches. 20 Experimental results 6 (2) Performance of multimodal Integration SUM Mean MNZ Rank Early Fusion all visual 35.82% 36.76% 38.21% 30.90% 30.11% all audio 43.86% 44.19% 44.50% 41.81% 42.33% all text 62.62% 62.81% 62.69% 50.60% 55.68% audio-visualtext 64.24% 65.61% 65.82% 53.84% 60.12% Performance of fusion techniques: - late fusion provides higher performance than early fusion; - the use of all modalities is better; - MNZ tends to provide the most accurate results, MAP is up to 65.82% which is quite significant; 21 Experimental results 7 (3) Comparison to state-of-the-art (from MediaEval 2012) Team Modality Method MAP proposed all Late Fusion MNZ with all descriptors 65.82% proposed text Late Fusion Mean with TF-IDF of ASR and metadata 62.81% TUB text Naive Bayes with Bag of Words on text (metadata) 52.25% proposed all Late Fusion MNZ with all descriptors except for metadata 51.9% proposed audio Late Fusion Mean with standard audio descriptors 44.50% proposed visual Late Fusion Mean with MPEG-7 related, structural, HoG and B-o-VW with rgbsift 38.21% ARF text linear on early fusion of TF-IDF of ASR and metadata 37.93% TUD visual & text Late Fusion of with B-o-W (visual word, ASR & metadata) 35.81% KIT visual with Visual descriptors (color, texture, B-o-VW with rgbsift) 35.81% TUD-MM text Dynamic Bayesian networks on text (ASR & metadata) 25.00% UNICAMP - UFMG visual Late fusion (KNN, Naive Bayes,, Random s) with BOW (text ASR) 21.12% ARF audio linear with block-based audio features 18.92% 22 Experimental results 8 (3) Comparison to state-of-the-art (from MediaEval 2012) Team Modality Method MAP proposed all Late Fusion MNZ with all descriptors 65.82% proposed text Late Fusion Mean with TF-IDF of ASR and metadata 62.81% TUB text Naive Bayes with Bag of Words on text (metadata) 52.25% proposed all Late Fusion MNZ with all descriptors except for metadata 51.9% metadata provides the highest discriminative power but cannot be generated automatically from video contents MAP 52.25%; proposed audio Late Fusion Mean with standard audio descriptors 44.50% proposed visual Late Fusion Mean with MPEG-7 related, structural, HoG and 38.21% B-o-VW with rgbsift the use of automatic content descriptors and late fusion allow for similar performance MAP 51.9% (surpassing even some metadata based approaches); ARF text linear on early fusion of TF-IDF of ASR and metadata 37.93% TUD visual & text Late Fusion of with B-o-W (visual word, ASR & metadata) 35.81% KIT visual with Visual descriptors (color, texture, B-o-VW with rgbsift) 35.81% TUD-MM text Dynamic Bayesian networks on text (ASR & metadata) 25.00% the inclusion of audio-visual information improves performance of text, visual 21.12% which is also the best performing approach MAP 65.82% UNICAMP - UFMG Late fusion (KNN, Naive Bayes,, Random s) with BOW (text ASR) ARF audio linear with block-based audio features 18.92% 23 Conclusions (1) we provided an in-depth evaluation of truly multimodal video description approaches; (2) we demonstrated the potential of appropriate late fusion to genre categorization; (3) we proved that notwithstanding the superiority of user-text based descriptors, late fusion can boost performance of automated content descriptors to achieve close performance; (4) we setup a new baseline for the 2012 Genre Tagging Task by outperforming the performance of the other participants; Acknowledgements: - we thank Prof. Nicu Sebe and Dr. Jasper Uijlings from University of Trento for their support. - we also acknowledge the 2012 Genre Tagging Task of the MediaEval Multimedia Benchmark for the dataset (
5 Thank you! Questions? 25
Big Data: Image & Video Analytics
Big Data: Image & Video Analytics How it could support Archiving & Indexing & Searching Dieter Haas, IBM Deutschland GmbH The Big Data Wave 60% of internet traffic is multimedia content (images and videos)
More informationEvent Detection in Basketball Video Using Multiple Modalities
Event Detection in Basketball Video Using Multiple Modalities Min Xu, Ling-Yu Duan, Changsheng Xu, *Mohan Kankanhalli, Qi Tian Institute for Infocomm Research, 21 Heng Mui Keng Terrace, Singapore, 119613
More informationCees Snoek. Machine. Humans. Multimedia Archives. Euvision Technologies The Netherlands. University of Amsterdam The Netherlands. Tree.
Visual search: what's next? Cees Snoek University of Amsterdam The Netherlands Euvision Technologies The Netherlands Problem statement US flag Tree Aircraft Humans Dog Smoking Building Basketball Table
More informationSocial Media Mining. Data Mining Essentials
Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers
More informationAnnotated bibliographies for presentations in MUMT 611, Winter 2006
Stephen Sinclair Music Technology Area, McGill University. Montreal, Canada Annotated bibliographies for presentations in MUMT 611, Winter 2006 Presentation 4: Musical Genre Similarity Aucouturier, J.-J.
More informationEvaluating Sources and Strategies for Learning Video Concepts from Social Media
Evaluating Sources and Strategies for Learning Video Concepts from Social Media Svetlana Kordumova Intelligent Systems Lab Amsterdam University of Amsterdam The Netherlands Email: s.kordumova@uva.nl Xirong
More informationMultimedia Data Mining: A Survey
Multimedia Data Mining: A Survey Sarla More 1, and Durgesh Kumar Mishra 2 1 Assistant Professor, Truba Institute of Engineering and information Technology, Bhopal 2 Professor and Head (CSE), Sri Aurobindo
More informationRecommender Systems: Content-based, Knowledge-based, Hybrid. Radek Pelánek
Recommender Systems: Content-based, Knowledge-based, Hybrid Radek Pelánek 2015 Today lecture, basic principles: content-based knowledge-based hybrid, choice of approach,... critiquing, explanations,...
More informationData Mining - Evaluation of Classifiers
Data Mining - Evaluation of Classifiers Lecturer: JERZY STEFANOWSKI Institute of Computing Sciences Poznan University of Technology Poznan, Poland Lecture 4 SE Master Course 2008/2009 revised for 2010
More informationRecognition. Sanja Fidler CSC420: Intro to Image Understanding 1 / 28
Recognition Topics that we will try to cover: Indexing for fast retrieval (we still owe this one) History of recognition techniques Object classification Bag-of-words Spatial pyramids Neural Networks Object
More informationMultimedia data mining: state of the art and challenges
Multimed Tools Appl (2011) 51:35 76 DOI 10.1007/s11042-010-0645-5 Multimedia data mining: state of the art and challenges Chidansh Amitkumar Bhatt Mohan S. Kankanhalli Published online: 16 November 2010
More informationOnline Play Segmentation for Broadcasted American Football TV Programs
Online Play Segmentation for Broadcasted American Football TV Programs Liexian Gu 1, Xiaoqing Ding 1, and Xian-Sheng Hua 2 1 Department of Electronic Engineering, Tsinghua University, Beijing, China {lxgu,
More informationDESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.
DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,
More informationFlorida International University - University of Miami TRECVID 2014
Florida International University - University of Miami TRECVID 2014 Miguel Gavidia 3, Tarek Sayed 1, Yilin Yan 1, Quisha Zhu 1, Mei-Ling Shyu 1, Shu-Ching Chen 2, Hsin-Yu Ha 2, Ming Ma 1, Winnie Chen 4,
More informationCLASSIFYING NETWORK TRAFFIC IN THE BIG DATA ERA
CLASSIFYING NETWORK TRAFFIC IN THE BIG DATA ERA Professor Yang Xiang Network Security and Computing Laboratory (NSCLab) School of Information Technology Deakin University, Melbourne, Australia http://anss.org.au/nsclab
More informationAutomated News Item Categorization
Automated News Item Categorization Hrvoje Bacan, Igor S. Pandzic* Department of Telecommunications, Faculty of Electrical Engineering and Computing, University of Zagreb, Croatia {Hrvoje.Bacan,Igor.Pandzic}@fer.hr
More informationIntroduction to Data Mining
Introduction to Data Mining 1 Why Data Mining? Explosive Growth of Data Data collection and data availability Automated data collection tools, Internet, smartphones, Major sources of abundant data Business:
More informationChapter 6. The stacking ensemble approach
82 This chapter proposes the stacking ensemble approach for combining different data mining classifiers to get better performance. Other combination techniques like voting, bagging etc are also described
More informationThe Delicate Art of Flower Classification
The Delicate Art of Flower Classification Paul Vicol Simon Fraser University University Burnaby, BC pvicol@sfu.ca Note: The following is my contribution to a group project for a graduate machine learning
More informationM3039 MPEG 97/ January 1998
INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND ASSOCIATED AUDIO INFORMATION ISO/IEC JTC1/SC29/WG11 M3039
More informationUNIVERSITY OF CENTRAL FLORIDA AT TRECVID 2003. Yun Zhai, Zeeshan Rasheed, Mubarak Shah
UNIVERSITY OF CENTRAL FLORIDA AT TRECVID 2003 Yun Zhai, Zeeshan Rasheed, Mubarak Shah Computer Vision Laboratory School of Computer Science University of Central Florida, Orlando, Florida ABSTRACT In this
More informationActive Learning SVM for Blogs recommendation
Active Learning SVM for Blogs recommendation Xin Guan Computer Science, George Mason University Ⅰ.Introduction In the DH Now website, they try to review a big amount of blogs and articles and find the
More informationData Mining Techniques Chapter 6: Decision Trees
Data Mining Techniques Chapter 6: Decision Trees What is a classification decision tree?.......................................... 2 Visualizing decision trees...................................................
More informationMachine Learning using MapReduce
Machine Learning using MapReduce What is Machine Learning Machine learning is a subfield of artificial intelligence concerned with techniques that allow computers to improve their outputs based on previous
More informationAssessment. Presenter: Yupu Zhang, Guoliang Jin, Tuo Wang Computer Vision 2008 Fall
Automatic Photo Quality Assessment Presenter: Yupu Zhang, Guoliang Jin, Tuo Wang Computer Vision 2008 Fall Estimating i the photorealism of images: Distinguishing i i paintings from photographs h Florin
More informationData, Measurements, Features
Data, Measurements, Features Middle East Technical University Dep. of Computer Engineering 2009 compiled by V. Atalay What do you think of when someone says Data? We might abstract the idea that data are
More informationMusic Genre Classification
Music Genre Classification Michael Haggblade Yang Hong Kenny Kao 1 Introduction Music classification is an interesting problem with many applications, from Drinkify (a program that generates cocktails
More informationMassive Labeled Solar Image Data Benchmarks for Automated Feature Recognition
Massive Labeled Solar Image Data Benchmarks for Automated Feature Recognition Michael A. Schuh1, Rafal A. Angryk2 1 Montana State University, Bozeman, MT 2 Georgia State University, Atlanta, GA Introduction
More informationAn Introduction to Data Mining
An Introduction to Intel Beijing wei.heng@intel.com January 17, 2014 Outline 1 DW Overview What is Notable Application of Conference, Software and Applications Major Process in 2 Major Tasks in Detail
More informationNEW MEXICO Grade 6 MATHEMATICS STANDARDS
PROCESS STANDARDS To help New Mexico students achieve the Content Standards enumerated below, teachers are encouraged to base instruction on the following Process Standards: Problem Solving Build new mathematical
More informationLocal features and matching. Image classification & object localization
Overview Instance level search Local features and matching Efficient visual recognition Image classification & object localization Category recognition Image classification: assigning a class label to
More informationEmotion Detection from Speech
Emotion Detection from Speech 1. Introduction Although emotion detection from speech is a relatively new field of research, it has many potential applications. In human-computer or human-human interaction
More informationRecognizing Cats and Dogs with Shape and Appearance based Models. Group Member: Chu Wang, Landu Jiang
Recognizing Cats and Dogs with Shape and Appearance based Models Group Member: Chu Wang, Landu Jiang Abstract Recognizing cats and dogs from images is a challenging competition raised by Kaggle platform
More informationBig Data Text Mining and Visualization. Anton Heijs
Copyright 2007 by Treparel Information Solutions BV. This report nor any part of it may be copied, circulated, quoted without prior written approval from Treparel7 Treparel Information Solutions BV Delftechpark
More informationThe Artificial Prediction Market
The Artificial Prediction Market Adrian Barbu Department of Statistics Florida State University Joint work with Nathan Lay, Siemens Corporate Research 1 Overview Main Contributions A mathematical theory
More informationMultimodal fusion for multimedia analysis: a survey
Multimedia Systems (2010) 16:345 379 DOI 10.1007/s00530-010-0182-0 REGULAR PAPER Multimodal fusion for multimedia analysis: a survey Pradeep K. Atrey M. Anwar Hossain Abdulmotaleb El Saddik Mohan S. Kankanhalli
More informationVideoStory A New Mul1media Embedding for Few- Example Recogni1on and Transla1on of Events
VideoStory A New Mul1media Embedding for Few- Example Recogni1on and Transla1on of Events Amirhossein Habibian, Thomas Mensink, Cees Snoek ISLA, University of Amsterdam Problem statement Recognize and
More informationMIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts
MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts Julio Villena-Román 1,3, Sara Lana-Serrano 2,3 1 Universidad Carlos III de Madrid 2 Universidad Politécnica de Madrid 3 DAEDALUS
More informationbusiness statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar
business statistics using Excel Glyn Davis & Branko Pecar OXFORD UNIVERSITY PRESS Detailed contents Introduction to Microsoft Excel 2003 Overview Learning Objectives 1.1 Introduction to Microsoft Excel
More informationlarge-scale machine learning revisited Léon Bottou Microsoft Research (NYC)
large-scale machine learning revisited Léon Bottou Microsoft Research (NYC) 1 three frequent ideas in machine learning. independent and identically distributed data This experimental paradigm has driven
More informationHOG AND SUBBAND POWER DISTRIBUTION IMAGE FEATURES FOR ACOUSTIC SCENE CLASSIFICATION. Victor Bisot, Slim Essid, Gaël Richard
HOG AND SUBBAND POWER DISTRIBUTION IMAGE FEATURES FOR ACOUSTIC SCENE CLASSIFICATION Victor Bisot, Slim Essid, Gaël Richard Institut Mines-Télécom, Télécom ParisTech, CNRS LTCI, 37-39 rue Dareau, 75014
More informationData Mining. Nonlinear Classification
Data Mining Unit # 6 Sajjad Haider Fall 2014 1 Nonlinear Classification Classes may not be separable by a linear boundary Suppose we randomly generate a data set as follows: X has range between 0 to 15
More informationENHANCED WEB IMAGE RE-RANKING USING SEMANTIC SIGNATURES
International Journal of Computer Engineering & Technology (IJCET) Volume 7, Issue 2, March-April 2016, pp. 24 29, Article ID: IJCET_07_02_003 Available online at http://www.iaeme.com/ijcet/issues.asp?jtype=ijcet&vtype=7&itype=2
More informationSemantic Analysis of Song Lyrics
Semantic Analysis of Song Lyrics Beth Logan, Andrew Kositsky 1, Pedro Moreno Cambridge Research Laboratory HP Laboratories Cambridge HPL-2004-66 April 14, 2004* E-mail: Beth.Logan@hp.com, Pedro.Moreno@hp.com
More informationSeparation and Classification of Harmonic Sounds for Singing Voice Detection
Separation and Classification of Harmonic Sounds for Singing Voice Detection Martín Rocamora and Alvaro Pardo Institute of Electrical Engineering - School of Engineering Universidad de la República, Uruguay
More informationKnowledge Discovery and Data Mining. Bootstrap review. Bagging Important Concepts. Notes. Lecture 19 - Bagging. Tom Kelsey. Notes
Knowledge Discovery and Data Mining Lecture 19 - Bagging Tom Kelsey School of Computer Science University of St Andrews http://tom.host.cs.st-andrews.ac.uk twk@st-andrews.ac.uk Tom Kelsey ID5059-19-B &
More informationData Mining Yelp Data - Predicting rating stars from review text
Data Mining Yelp Data - Predicting rating stars from review text Rakesh Chada Stony Brook University rchada@cs.stonybrook.edu Chetan Naik Stony Brook University cnaik@cs.stonybrook.edu ABSTRACT The majority
More informationPredicting the Stock Market with News Articles
Predicting the Stock Market with News Articles Kari Lee and Ryan Timmons CS224N Final Project Introduction Stock market prediction is an area of extreme importance to an entire industry. Stock price is
More informationInvestigation of Support Vector Machines for Email Classification
Investigation of Support Vector Machines for Email Classification by Andrew Farrugia Thesis Submitted by Andrew Farrugia in partial fulfillment of the Requirements for the Degree of Bachelor of Software
More informationSearch and Information Retrieval
Search and Information Retrieval Search on the Web 1 is a daily activity for many people throughout the world Search and communication are most popular uses of the computer Applications involving search
More informationComparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data
CMPE 59H Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data Term Project Report Fatma Güney, Kübra Kalkan 1/15/2013 Keywords: Non-linear
More informationThe Need for Training in Big Data: Experiences and Case Studies
The Need for Training in Big Data: Experiences and Case Studies Guy Lebanon Amazon Background and Disclaimer All opinions are mine; other perspectives are legitimate. Based on my experience as a professor
More informationGame ON! Predicting English Premier League Match Outcomes
Game ON! Predicting English Premier League Match Outcomes Aditya Srinivas Timmaraju adityast@stanford.edu Aditya Palnitkar aditpal@stanford.edu Vikesh Khanna vikesh@stanford.edu Abstract Among the different
More informationCloud Analytics for Capacity Planning and Instant VM Provisioning
Cloud Analytics for Capacity Planning and Instant VM Provisioning Yexi Jiang Florida International University Advisor: Dr. Tao Li Collaborator: Dr. Charles Perng, Dr. Rong Chang Presentation Outline Background
More informationView-Invariant Dynamic Texture Recognition using a Bag of Dynamical Systems
View-Invariant Dynamic Texture Recognition using a Bag of Dynamical Systems Avinash Ravichandran, Rizwan Chaudhry and René Vidal Center for Imaging Science, Johns Hopkins University, Baltimore, MD 21218,
More informationVEHICLE LOCALISATION AND CLASSIFICATION IN URBAN CCTV STREAMS
VEHICLE LOCALISATION AND CLASSIFICATION IN URBAN CCTV STREAMS Norbert Buch 1, Mark Cracknell 2, James Orwell 1 and Sergio A. Velastin 1 1. Kingston University, Penrhyn Road, Kingston upon Thames, KT1 2EE,
More informationIBM SPSS Direct Marketing 23
IBM SPSS Direct Marketing 23 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 23, release
More informationModelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches
Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches PhD Thesis by Payam Birjandi Director: Prof. Mihai Datcu Problematic
More informationLearning from Diversity
Learning from Diversity Epitope Prediction with Sequence and Structure Features using an Ensemble of Support Vector Machines Rob Patro and Carl Kingsford Center for Bioinformatics and Computational Biology
More informationClassifying Manipulation Primitives from Visual Data
Classifying Manipulation Primitives from Visual Data Sandy Huang and Dylan Hadfield-Menell Abstract One approach to learning from demonstrations in robotics is to make use of a classifier to predict if
More informationIBM SPSS Direct Marketing 22
IBM SPSS Direct Marketing 22 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 22, release
More informationNeural Network based Vehicle Classification for Intelligent Traffic Control
Neural Network based Vehicle Classification for Intelligent Traffic Control Saeid Fazli 1, Shahram Mohammadi 2, Morteza Rahmani 3 1,2,3 Electrical Engineering Department, Zanjan University, Zanjan, IRAN
More informationClassification of Bad Accounts in Credit Card Industry
Classification of Bad Accounts in Credit Card Industry Chengwei Yuan December 12, 2014 Introduction Risk management is critical for a credit card company to survive in such competing industry. In addition
More informationTowards Effective Recommendation of Social Data across Social Networking Sites
Towards Effective Recommendation of Social Data across Social Networking Sites Yuan Wang 1,JieZhang 2, and Julita Vassileva 1 1 Department of Computer Science, University of Saskatchewan, Canada {yuw193,jiv}@cs.usask.ca
More informationData Mining Algorithms Part 1. Dejan Sarka
Data Mining Algorithms Part 1 Dejan Sarka Join the conversation on Twitter: @DevWeek #DW2015 Instructor Bio Dejan Sarka (dsarka@solidq.com) 30 years of experience SQL Server MVP, MCT, 13 books 7+ courses
More informationPresentation Video Retrieval using Automatically Recovered Slide and Spoken Text
Presentation Video Retrieval using Automatically Recovered Slide and Spoken Text Matthew Cooper FX Palo Alto Laboratory Palo Alto, CA 94034 USA cooper@fxpal.com ABSTRACT Video is becoming a prevalent medium
More informationAUTOMATIC VIDEO STRUCTURING BASED ON HMMS AND AUDIO VISUAL INTEGRATION
AUTOMATIC VIDEO STRUCTURING BASED ON HMMS AND AUDIO VISUAL INTEGRATION P. Gros (1), E. Kijak (2) and G. Gravier (1) (1) IRISA CNRS (2) IRISA Université de Rennes 1 Campus Universitaire de Beaulieu 35042
More informationSupervised Learning (Big Data Analytics)
Supervised Learning (Big Data Analytics) Vibhav Gogate Department of Computer Science The University of Texas at Dallas Practical advice Goal of Big Data Analytics Uncover patterns in Data. Can be used
More informationCATEGORIZATION OF SIMILAR OBJECTS USING BAG OF VISUAL WORDS AND k NEAREST NEIGHBOUR CLASSIFIER
TECHNICAL SCIENCES Abbrev.: Techn. Sc., No 15(2), Y 2012 CATEGORIZATION OF SIMILAR OBJECTS USING BAG OF VISUAL WORDS AND k NEAREST NEIGHBOUR CLASSIFIER Piotr Artiemjew, Przemysław Górecki, Krzysztof Sopyła
More informationAdvanced Ensemble Strategies for Polynomial Models
Advanced Ensemble Strategies for Polynomial Models Pavel Kordík 1, Jan Černý 2 1 Dept. of Computer Science, Faculty of Information Technology, Czech Technical University in Prague, 2 Dept. of Computer
More informationBeating the NCAA Football Point Spread
Beating the NCAA Football Point Spread Brian Liu Mathematical & Computational Sciences Stanford University Patrick Lai Computer Science Department Stanford University December 10, 2010 1 Introduction Over
More informationExperiments in Web Page Classification for Semantic Web
Experiments in Web Page Classification for Semantic Web Asad Satti, Nick Cercone, Vlado Kešelj Faculty of Computer Science, Dalhousie University E-mail: {rashid,nick,vlado}@cs.dal.ca Abstract We address
More informationSearch Taxonomy. Web Search. Search Engine Optimization. Information Retrieval
Information Retrieval INFO 4300 / CS 4300! Retrieval models Older models» Boolean retrieval» Vector Space model Probabilistic Models» BM25» Language models Web search» Learning to Rank Search Taxonomy!
More informationSimilarity Search in a Very Large Scale Using Hadoop and HBase
Similarity Search in a Very Large Scale Using Hadoop and HBase Stanislav Barton, Vlastislav Dohnal, Philippe Rigaux LAMSADE - Universite Paris Dauphine, France Internet Memory Foundation, Paris, France
More informationThe Scientific Data Mining Process
Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In
More informationData Mining and Knowledge Discovery in Databases (KDD) State of the Art. Prof. Dr. T. Nouri Computer Science Department FHNW Switzerland
Data Mining and Knowledge Discovery in Databases (KDD) State of the Art Prof. Dr. T. Nouri Computer Science Department FHNW Switzerland 1 Conference overview 1. Overview of KDD and data mining 2. Data
More informationProbabilistic Latent Semantic Analysis (plsa)
Probabilistic Latent Semantic Analysis (plsa) SS 2008 Bayesian Networks Multimedia Computing, Universität Augsburg Rainer.Lienhart@informatik.uni-augsburg.de www.multimedia-computing.{de,org} References
More informationUniversity of Central Florida at TRECVID 2004
University of Central Florida at TRECVID 2004 Yun Zhai, Xiaochun Cao, Yunjun Zhang, Omar Javed, Alper Yilmaz Fahd Rafi, Saad Ali, Orkun Alatas, Saad Khan, and Mubarak Shah Computer Vision Laboratory University
More informationTracking and Recognition in Sports Videos
Tracking and Recognition in Sports Videos Mustafa Teke a, Masoud Sattari b a Graduate School of Informatics, Middle East Technical University, Ankara, Turkey mustafa.teke@gmail.com b Department of Computer
More informationSTATISTICA. Financial Institutions. Case Study: Credit Scoring. and
Financial Institutions and STATISTICA Case Study: Credit Scoring STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Web-based Analytics Table of Contents INTRODUCTION: WHAT
More informationKnowledge Discovery and Data Mining
Knowledge Discovery and Data Mining Unit # 6 Sajjad Haider Fall 2014 1 Evaluating the Accuracy of a Classifier Holdout, random subsampling, crossvalidation, and the bootstrap are common techniques for
More informationGlencoe. correlated to SOUTH CAROLINA MATH CURRICULUM STANDARDS GRADE 6 3-3, 5-8 8-4, 8-7 1-6, 4-9
Glencoe correlated to SOUTH CAROLINA MATH CURRICULUM STANDARDS GRADE 6 STANDARDS 6-8 Number and Operations (NO) Standard I. Understand numbers, ways of representing numbers, relationships among numbers,
More informationBases de données avancées Bases de données multimédia
Bases de données avancées Bases de données multimédia Université de Cergy-Pontoise Master Informatique M1 Cours BDA Multimedia DB Multimedia data include images, text, video, sound, spatial data Data of
More informationMusic Mood Classification
Music Mood Classification CS 229 Project Report Jose Padial Ashish Goel Introduction The aim of the project was to develop a music mood classifier. There are many categories of mood into which songs may
More informationVideo Classification and Audio BasedAProaches
IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS, VOL. UNKNOWN, NO. UNKNOWN, UNKNOWN 2007 1 Automatic Video Classification: A Survey of the Literature Darin Brezeale and Diane J. Cook, Senior Member,
More informationOpen issues and research trends in Content-based Image Retrieval
Open issues and research trends in Content-based Image Retrieval Raimondo Schettini DISCo Universita di Milano Bicocca schettini@disco.unimib.it www.disco.unimib.it/schettini/ IEEE Signal Processing Society
More informationInternational Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014
RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer
More informationExample: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C
More informationInternational Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 3, May-June 2015
RESEARCH ARTICLE OPEN ACCESS Data Mining Technology for Efficient Network Security Management Ankit Naik [1], S.W. Ahmad [2] Student [1], Assistant Professor [2] Department of Computer Science and Engineering
More informationVideo Mining Using Combinations of Unsupervised and Supervised Learning Techniques
MERL A MITSUBISHI ELECTRIC RESEARCH LABORATORY http://www.merl.com Video Mining Using Combinations of Unsupervised and Supervised Learning Techniques Ajay Divakaran, Koji Miyahara, Kadir A. Peker, Regunathan
More informationIdentifying At-Risk Students Using Machine Learning Techniques: A Case Study with IS 100
Identifying At-Risk Students Using Machine Learning Techniques: A Case Study with IS 100 Erkan Er Abstract In this paper, a model for predicting students performance levels is proposed which employs three
More informationCITY UNIVERSITY OF HONG KONG 香 港 城 市 大 學
CITY UNIVERSITY OF HONG KONG 香 港 城 市 大 學 Audio Musical Genre Classification using Convolutional Neural Networks and Pitch and Tempo Transformations 使 用 捲 積 神 經 網 絡 及 聲 調 速 度 轉 換 的 音 頻 音 樂 流 派 分 類 研 究 Submitted
More informationFilling the Semantic Gap: A Genetic Programming Framework for Content-Based Image Retrieval
INSTITUTE OF COMPUTING University of Campinas Filling the Semantic Gap: A Genetic Programming Framework for Content-Based Image Retrieval Ricardo da Silva Torres rtorres@ic.unicamp.br www.ic.unicamp.br/~rtorres
More informationData Mining and Visualization
Data Mining and Visualization Jeremy Walton NAG Ltd, Oxford Overview Data mining components Functionality Example application Quality control Visualization Use of 3D Example application Market research
More informationEmail Spam Detection A Machine Learning Approach
Email Spam Detection A Machine Learning Approach Ge Song, Lauren Steimle ABSTRACT Machine learning is a branch of artificial intelligence concerned with the creation and study of systems that can learn
More informationPeople today have access to more
[3B2-9] mmu2009030001.3d 17/6/09 16:13 Page 2 Feature Article Learning Video Preferences Using Visual Features and Closed Captions An approach to identifying a viewer s video preferences uses hidden Markov
More informationPredict the Popularity of YouTube Videos Using Early View Data
000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050
More informationLearning is a very general term denoting the way in which agents:
What is learning? Learning is a very general term denoting the way in which agents: Acquire and organize knowledge (by building, modifying and organizing internal representations of some external reality);
More informationAudio Content Analysis for Online Audiovisual Data Segmentation and Classification
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 9, NO. 4, MAY 2001 441 Audio Content Analysis for Online Audiovisual Data Segmentation and Classification Tong Zhang, Member, IEEE, and C.-C. Jay
More information