Rank aggregation via nuclear norm minimization

Similar documents
Manifold Learning Examples PCA, LLE and ISOMAP

How To Solve The Kd Cup 2010 Challenge

Lecture Topic: Low-Rank Approximations

Similarity and Diagonalization. Similar Matrices

Alok Gupta. Dmitry Zhdanov

Tensor Methods for Machine Learning, Computer Vision, and Computer Graphics

Games Manipulators Play

7. LU factorization. factor-solve method. LU factorization. solving Ax = b with A nonsingular. the inverse of a nonsingular matrix

Bilinear Prediction Using Low-Rank Models

ANALYTIC HIERARCHY PROCESS AS A RANKING TOOL FOR DECISION MAKING UNITS

Group Recommendations with Rank Aggregation and Collaborative Filtering

An Overview Of Software For Convex Optimization. Brian Borchers Department of Mathematics New Mexico Tech Socorro, NM

NMR Measurement of T1-T2 Spectra with Partial Measurements using Compressive Sensing

1 Review of Least Squares Solutions to Overdetermined Systems

CHOOSING A COLLEGE. Teacher s Guide Getting Started. Nathan N. Alexander Charlotte, NC

Solution to Homework 2

6. Cholesky factorization

26 Linear Programming

Use of Game Theory in Executive Support System

CHAPTER VII CONCLUSIONS

Statistical Machine Translation: IBM Models 1 and 2

The Image Deblurring Problem

Least-Squares Intersection of Lines

Permutation Betting Markets: Singleton Betting with Extra Information

Linear Algebraic Equations, SVD, and the Pseudo-Inverse

Math 1050 Khan Academy Extra Credit Algebra Assignment

FUZZY CLUSTERING ANALYSIS OF DATA MINING: APPLICATION TO AN ACCIDENT MINING SYSTEM

About the inverse football pool problem for 9 games 1

Big Data - Lecture 1 Optimization reminders

Part II Redundant Dictionaries and Pursuit Algorithms

COMPARING MATRIX-BASED AND GRAPH-BASED REPRESENTATIONS FOR PRODUCT DESIGN

Supervised Feature Selection & Unsupervised Dimensionality Reduction

Functional-Repair-by-Transfer Regenerating Codes

Application of Synchrophasor Data to Power System Operations

CI6227: Data Mining. Lesson 11b: Ensemble Learning. Data Analytics Department, Institute for Infocomm Research, A*STAR, Singapore.

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION

Predicting Influentials in Online Social Networks

L25: Ensemble learning

Community Mining from Multi-relational Networks

Statistical machine learning, high dimension and big data

Nimble Algorithms for Cloud Computing. Ravi Kannan, Santosh Vempala and David Woodruff

Epipolar Geometry. Readings: See Sections 10.1 and 15.6 of Forsyth and Ponce. Right Image. Left Image. e(p ) Epipolar Lines. e(q ) q R.

A New Interpretation of Information Rate

Permutation Betting Markets: Singleton Betting with Extra Information

The Assignment Problem and the Hungarian Method

Reformulating the monitor placement problem: Optimal Network-wide wide Sampling

CS 5614: (Big) Data Management Systems. B. Aditya Prakash Lecture #18: Dimensionality Reduc7on

(67902) Topics in Theory and Complexity Nov 2, Lecture 7

! Solve problem to optimality. ! Solve problem in poly-time. ! Solve arbitrary instances of the problem. #-approximation algorithm.

Chapter 6. Orthogonality

Chapter Load Balancing. Approximation Algorithms. Load Balancing. Load Balancing on 2 Machines. Load Balancing: Greedy Scheduling

P versus NP, and More

8. Linear least-squares

The Need for Training in Big Data: Experiences and Case Studies

Knowledge Discovery and Data Mining. Structured vs. Non-Structured Data

How to assess the risk of a large portfolio? How to estimate a large covariance matrix?

DERIVATIVES AS MATRICES; CHAIN RULE

Lecture Notes 2: Matrices as Systems of Linear Equations

University of Lille I PC first year list of exercises n 7. Review

How To Bet On An Nfl Football Game With A Machine Learning Program

Introduction Epipolar Geometry Calibration Methods Further Readings. Stereo Camera Calibration

APPM4720/5720: Fast algorithms for big data. Gunnar Martinsson The University of Colorado at Boulder

Alpha Cut based Novel Selection for Genetic Algorithm

Background: State Estimation

Metrics on SO(3) and Inverse Kinematics

Performance of Dynamic Load Balancing Algorithms for Unstructured Mesh Calculations

Integer Factorization using the Quadratic Sieve

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 10

Nonlinear Optimization: Algorithms 3: Interior-point methods

So, how do you pronounce. Jilles Vreeken. Okay, now we can talk. So, what kind of data? binary. * multi-relational

7 Gaussian Elimination and LU Factorization

Notes on Symmetric Matrices

ON THE COMPLEXITY OF THE GAME OF SET.

Notes on Factoring. MA 206 Kurt Bryan

! Solve problem to optimality. ! Solve problem in poly-time. ! Solve arbitrary instances of the problem. !-approximation algorithm.

Keywords: Image Generation and Manipulation, Video Processing, Video Factorization, Face Morphing

Linear Codes. Chapter Basics

Soft Clustering with Projections: PCA, ICA, and Laplacian

Applied Algorithm Design Lecture 5

Why? A central concept in Computer Science. Algorithms are ubiquitous.

THE SCHEDULING OF MAINTENANCE SERVICE

2.3 Convex Constrained Optimization Problems

Recommendations in Mobile Environments. Professor Hui Xiong Rutgers Business School Rutgers University. Rutgers, the State University of New Jersey

ANALYSIS, THEORY AND DESIGN OF LOGISTIC REGRESSION CLASSIFIERS USED FOR VERY LARGE SCALE DATA MINING

An Application of Visual Cryptography To Financial Documents

Sharing Classifiers among Ensembles from Related Problem Domains

Xiaoqiao Meng, Vasileios Pappas, Li Zhang IBM T.J. Watson Research Center Presented by: Payman Khani

JUST-IN-TIME SCHEDULING WITH PERIODIC TIME SLOTS. Received December May 12, 2003; revised February 5, 2004

OPTIMAL DESIGN OF A MULTITIER REWARD SCHEME. Amir Gandomi *, Saeed Zolfaghari **

Markov random fields and Gibbs measures

Large-Scale Similarity and Distance Metric Learning

Modern Optimization Methods for Big Data Problems MATH11146 The University of Edinburgh

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 13. Random Variables: Distribution and Expectation

Mathematics for Computer Science/Software Engineering. Notes for the course MSM1F3 Dr. R. A. Wilson

One last point: we started off this book by introducing another famously hard search problem:

Financial Mathematics and Simulation MATH Spring 2011 Homework 2

0.1 Phase Estimation Technique

Lattice-Based Threshold-Changeability for Standard Shamir Secret-Sharing Schemes

Metric Multidimensional Scaling (MDS): Analyzing Distance Matrices

Multivariate Analysis of Ecological Data

Transcription:

Rank aggregation via nuclear norm minimization David F. Gleich Purdue University @dgleich Lek-Heng Lim University of Chicago KDD2011 San Diego, CA Lek funded by NSF CAREER award (DMS-1057064); David funded by DOE John von Neumann and NSERC David F. Gleich (Purdue) KDD 2011 1/20

Which is a better list of good DVDs? Lord of the Rings 3: The Return of Lord of the Rings 1: The Fellowship Lord of the Rings 2: The Two Towers Lost: Season 1 Battlestar Galactica: Season 1 Fullmetal Alchemist Trailer Park Boys: Season 4 Trailer Park Boys: Season 3 Tenchi Muyo! Shawshank Redemption Lord of the Rings 3: The Return of Lord of the Rings 1: The Fellowship Lord of the Rings 2: The Two Towers Star Wars V: Empire Strikes Back Raiders of the Lost Ark Star Wars IV: A New Hope Shawshank Redemption Star Wars VI: Return of the Jedi Lord of the Rings 3: Bonus DVD The Godfather Standard rank aggregation (the mean rating) Nuclear Norm based rank aggregation (not matrix completion on the netflix rating matrix) David F. Gleich (Purdue) KDD 2011 2/20

Rank Aggregation Given partial orders on subsets of items, rank aggregation is the problem of finding an overall ordering. Voting Find the winning candidate Program committees Find the best papers given reviews Dining Find the best restaurant in San Diego (subject to a budget?) David F. Gleich (Purdue) KDD 2011 3/20

Ranking is really hard Ken Arrow John Kemeny Dwork, Kumar, Naor, Sivikumar All rank aggregations involve some measure of compromise A good ranking is the average ranking under a permutation distance NP hard to compute Kemeny s ranking David F. Gleich (Purdue) KDD 2011 4/20

Embody chair John Cantrell (flickr) Given a hard problem, what do you do? Numerically relax! It ll probably be easier. David F. Gleich (Purdue) KDD 2011 5/20

Suppose we had scores Let be the score of the ith movie/song/paper/team to rank Suppose we can compare the ith to jth: Then is skew-symmetric, rank 2. Also works for with an extra log. Numerical ranking is intimately intertwined with skew-symmetric matrices Kemeny and Snell, Mathematical Models in Social Sciences (1978) David F. Gleich (Purdue) KDD 2011 6/20

Using ratings as comparisons Ratings induce various skew-symmetric matrices. Arithmetic Mean Log-odds David 1988 The Method of Paired Comparisons David F. Gleich (Purdue) KDD 2011 7/20

Movie Pairs Extracting the scores Given with all entries, then is the Borda count, the least-squares solution to 10 7 10 5 How many Most. Do we trust all? Not really. do we have? 10 1 10 1 10 5 Number of Comparisons Netflix data 17k movies, 500k users, 100M ratings 99.17% filled David F. Gleich (Purdue) KDD 2011 8/20

Only partial info? Complete it! Let be known for We trust these scores. Goal Find the simplest skew-symmetric matrix that matches the data noiseless noisy Both of these are NP-hard too. David F. Gleich (Purdue) KDD 2011 9/20

Solution Go Nuclear From a French nuclear test in 1970, image from http://picdit.wordpress.com/2008/07/21/8-insane-nuclear-explosions/ David F. Gleich (Purdue) KDD 2011 10

The nuclear norm The analog of the 1-norm or for matrices. For vectors For matrices Let be the SVD. is NP-hard while is convex and gives the same answer under appropriate circumstances best convex underestimator of rank on unit ball. David F. Gleich (Purdue) KDD 2011 11/20

Heuristic Only partial info? Complete it! Let be known for We trust these scores. Goal Find the simplest skew-symmetric matrix that matches the data NP hard Convex David F. Gleich (Purdue) KDD 2011 12/20

Solving the nuclear norm problem Use a LASSO formulation 1. 2. REPEAT 3. = rank-k SVD of 4. Jain et al. propose SVP for this problem without 5. 6. UNTIL Jain et al. NIPS 2010 David F. Gleich (Purdue) KDD 2011 13/20

Skew-symmetric SVDs Let be an skew-symmetric matrix with eigenvalues, where and. Then the SVD of is given by for and given in the proof. Proof Use the Murnaghan-Wintner form and the SVD of a 2x2 skew-symmetric block This means that SVP will give us the skewsymmetric constraint for free David F. Gleich (Purdue) KDD 2011 14/20

Exact recovery results David Gross showed how to recover Hermitian matrices. i.e. the conditions under which we get the exact Note that is Hermitian. Thus our new result! Gross arxiv 2010. David F. Gleich (Purdue) KDD 2011 15/20

Recovery Discussion and Experiments Confession If, then just look at differences from a connected set. Constants? Not very good. Intuition for the truth. David F. Gleich (Purdue) KDD 2011 16/20

The Ranking Algorithm 0. INPUT (ratings data) and c (for trust on comparisons) 1. Compute from 2. Discard entries with fewer than c comparisons 3. Set to be indices and values of what s left 4. = SVP( ) 5. OUTPUT David F. Gleich (Purdue) KDD 2011 17/20

Synthetic Results Construct an Item Response Theory model. Vary number of ratings per user and a noise/error level Our Algorithm! The Average Rating David F. Gleich (Purdue) KDD 2011 18/20

Conclusions and Future Work aggregate, then complete 1. Compare against others Rank aggregation with the nuclear norm is principled easy to compute The results are much better than simple approaches. 2. Noisy recovery! More realistic sampling. 3. Skew-symmetric Lanczos based SVD? David F. Gleich (Purdue) KDD 2011 19/20

Google nuclear ranking gleich