Regret Minimization for Reserve Prices in Second-Price Auctions



Similar documents
Chapter 7. Sealed-bid Auctions

Optimal Auctions Continued

CPC/CPA Hybrid Bidding in a Second Price Auction

Cournot s model of oligopoly

ECON 459 Game Theory. Lecture Notes Auctions. Luca Anderlini Spring 2015

Next Tuesday: Amit Gandhi guest lecture on empirical work on auctions Next Wednesday: first problem set due

A Bidding Strategy of Intermediary in Display Advertisement Auctions

Introduction to Auction Design

A Theory of Auction and Competitive Bidding

Monotone multi-armed bandit allocations

8 Modeling network traffic using game theory

Computational Learning Theory Spring Semester, 2003/4. Lecture 1: March 2

Internet Advertising and the Generalized Second Price Auction:

An Introduction to Sponsored Search Advertising

Simple and efficient online algorithms for real world applications

Lecture 11: Sponsored search

A Complete Guide To. OpenRTB 2.3. Making Sense Of Programmatic Native Advertising

Handling Advertisements of Unknown Quality in Search Advertising

Managerial Economics

The Programmatic (R)evolution Causes, Effects & a Framework to thrive in it!

Bayesian Nash Equilibrium

Sharing Online Advertising Revenue with Consumers

Display Advertising. Xuelin Cao Yahoo! Beijing R&D Center

THE REAL WORLD OF REAL-TIME ADVERTISING

Advertising in Google Search Deriving a bidding strategy in the biggest auction on earth.

Adaptive Online Gradient Descent

An Auction Mechanism for a Cloud Spot Market

Chapter 9. Auctions. 9.1 Types of Auctions

For example: Standard Banners: Images displayed alongside, above or below content on a webpage.

WHY ADVERTISING THROUGH REAL-TIME BIDDING NEEDS A NEW BUYING MODEL

Auctions. Information Sheet

Single-Period Balancing of Pay Per-Click and Pay-Per-View Online Display Advertisements

OPTIMIZE YOUR BUSINESS WITH SUCCESSFUL ONLINE ADVERTISING & ADWORDS GENERATION

Algorithms for VWAP and Limit Order Trading

Part II: Bidding, Dynamics and Competition. Jon Feldman S. Muthukrishnan

Beat the Mean Bandit

Internet Advertising and the Generalized Second-Price Auction: Selling Billions of Dollars Worth of Keywords

Gambling and Data Compression

UCLA. Department of Economics Ph. D. Preliminary Exam Micro-Economic Theory

Economic Insight from Internet Auctions. Patrick Bajari Ali Hortacsu

THE CENTRAL LIMIT THEOREM TORONTO

The Arrival of Real-Time Bidding and What it Means for Media Buyers

Best Practices for Maximizing Revenue with MoPub. Publisher Playbook

Un point de vue bayésien pour des algorithmes de bandit plus performants

How To Solve A Sequential Mca Problem

Foundations of Machine Learning On-Line Learning. Mehryar Mohri Courant Institute and Google Research

VIEWABILITY ATA DSP ADEXCHANGE RTB. Cookie. Retargeting Look-alike IMPRESSION. Site Buys. SSP Behavioral targeting PROGRAMMATIC BUYING.

CS 522 Computational Tools and Methods in Finance Robert Jarrow Lecture 1: Equity Options

GAMING MANAGEMENT SEMINAR SERIES. Prices of Casino Games. Jason Zhicheng Gao 高 志 成

Online Network Revenue Management using Thompson Sampling

Monte Carlo and Empirical Methods for Stochastic Inference (MASM11/FMS091)

CITY UNIVERSITY OF HONG KONG. Revenue Optimization in Internet Advertising Auctions

Stapled Finance. Paul Povel and Raj Singh. Bauer College of Business, Univ. of Houston Carlson School of Management, Univ.

Monte Carlo-based statistical methods (MASM11/FMS091)

Digital Media and Analytics

Financial Methods for Online Advertising

User Guide Thank you for purchasing the DX90

Overview: Auctions and Bidding. Examples of Auctions

Introduction to Computational Advertising

How To Price A Call Option

Online Algorithms: Learning & Optimization with No Regret.

Ad Networks vs. Ad Exchanges: How They Stack Up

6.254 : Game Theory with Engineering Applications Lecture 1: Introduction

Transcription:

Regret Minimization for Reserve Prices in Second-Price Auctions Nicolò Cesa-Bianchi Università degli Studi di Milano Joint work with: Claudio Gentile (Varese) and Yishay Mansour (Tel-Aviv) N. Cesa-Bianchi (UNIMI) Second-Price Auctions 1 / 21

Real-Time Bidding From: wifiadexchange.com N. Cesa-Bianchi (UNIMI) Second-Price Auctions 2 / 21

Real-Time Bidding SSP (supply-side platforms): assist the seller (publisher) e.g., by optimizing the reserve price DSP (demand-side platforms): assist the buyer (advertiser) e.g., by optimizing bids RTB process 1 A user visits the publisher website creating an impression 2 A call for bids is sent to ad exchanges through SSP 3 The ad exchanges query DSP for advertisers bids 4 The bids are submitted 5 Winner is selected and winner s ad displayed N. Cesa-Bianchi (UNIMI) Second-Price Auctions 3 / 21

Second-price auctions Mechanism [Myerson, 1981] Highest bid B (1) wins, but the price is reduced to the second highest bid B (2) Alice bids $0.80 and Bob bids $0.55. Alice wins and pays $0.55 Theorem: Bidding the true value is a dominant strategy Assume Alice bids B = v, her true value for the impression Bid less and lose: Alice bids B < v and loses the auction. Then B < B (2) < v and she lost a payoff of v B (2) Bid more and win: Alice bids B > v and wins the auction. Then v < B (1) < B and she ends up paying B (1) > v N. Cesa-Bianchi (UNIMI) Second-Price Auctions 4 / 21

Reserve price: how much the seller values the item 1 Mechanism If the highest bid B (1) 0.8 is below the reserve price, 0.6 then the item is not sold Otherwise, the winning 0.4 bid is reduced to the maximum between the 0.2 second highest bid B (2) and the reserve price 0 0 0.2 0.4 0.6 0.8 1 Theorem B (2) B (1) Bidding the true value is a dominant strategy even in the presence of a nonzero reserve price N. Cesa-Bianchi (UNIMI) Second-Price Auctions 5 / 21

Additional components Soft floor Allows to lower the reserve price by running first-price auctions when the highest bid is between reserve price and soft floor Estimated Clear Price (ECP) An estimate of a bid that is likely to win based on historical win rates Features available to SSP and/or DSP location and carrier (if user is mobile) publisher name website section placement (identifies a space where the ad can be shown) banner identifier user identifier ECP N. Cesa-Bianchi (UNIMI) Second-Price Auctions 6 / 21

Some actual data N. Cesa-Bianchi (UNIMI) Second-Price Auctions 7 / 21

Some actual data N. Cesa-Bianchi (UNIMI) Second-Price Auctions 8 / 21

The formal auction model For each impression: 1 Seller chooses reserve price p 2 Bids B 1,..., B m are drawn (hidden from seller) 3 Bidder with highest bid B (1) wins the auction 4 Revenue R(p, B 1,..., B m ) is revealed to seller Some uncomfortable assumptions In each auction, bids are drawn i.i.d. from some unknown distribution The bid distribution is the same across multiple auctions Seller knows number of bidders m (or its distribution) N. Cesa-Bianchi (UNIMI) Second-Price Auctions 9 / 21

The revenue function Second-price auction with reserve price R ( p, B (1), B (2)) = B (2) I { B (2) > p } + p I { B (2) p B (1)} 1 Notation for bids: B (1) B (2) 0.8 0.6 Note: 0.4 Revenue only depends on p and two highest 0.2 bids B (1), B (2) 0 0 0.2 0.4 0.6 0.8 1 B (2) B (1) N. Cesa-Bianchi (UNIMI) Second-Price Auctions 10 / 21

The expected revenue (for a fixed number of bidders) [ µ(p) = E R ( p, B (1), B (2))] 1 = P ( B (2) > x ) dx + p P ( B (2) p B (1)) p 1 0.8 0.6 0.4 0.2 0 0.2 0.4 0.6 0.8 1 N. Cesa-Bianchi (UNIMI) Second-Price Auctions 11 / 21

Remark If the bid distribution satisfies the monotone hazard rate assumption, then the optimal reserve price does not depend on number of bidders: p = 1 F(p ) f(p ) where F(p) = P ( B p ) and f(p) = df(p) dp We do not assume the monotone hazard rate assumption N. Cesa-Bianchi (UNIMI) Second-Price Auctions 12 / 21

Publisher s regret when using p 1, p 2,... ] [ T ( E µ(p ) µ(p t )) where p = argmax µ(p ) p t=1 1 0.8 0.6 0.4 0.2 0 0.2 0.4 0.6 0.8 1 p t p N. Cesa-Bianchi (UNIMI) Second-Price Auctions 13 / 21

Control regret in a sequence of T auctions Since bids are in [0, 1] regret after T auctions is always O(T) First approach Partition reserve prices in K = 1/ε bins and pick a reserve price p k for each bin k Run a multiarmed bandit algorithm over the set of prices p 1,..., p K T Regret(T) + = O ( T 2/3) for ε = T 1/3 ε ε T }{{} approx. cost } {{ } estim. cost This holds without any special assumption on the expected revenue function N. Cesa-Bianchi (UNIMI) Second-Price Auctions 14 / 21

Control regret in a sequence of T auctions Second approach Run a stochastic optimization algorithm that computes a sequence p 1, p 2,... of reserve prices to find the maximum of µ Using p t gives a stochastic revenue R t such that E[R t ] = µ(p t ) Regret grows like O ( T ) only under specific assumptions on µ (unimodality, smoothness, etc.) Main question Can we get T 1/2 regret without any assumption on the expected revenue function µ? N. Cesa-Bianchi (UNIMI) Second-Price Auctions 15 / 21

Main algorithmic idea Rewrite expected revenue function µ(p) = E [ B (2)] + } {{ } constant p 0 P ( B (2) x ) dx p P ( B p ) m } {{ } function of F(p) Express P ( B p ) m in terms of F(p) = P ( B (2) p ) Compute approximation of F( ) = P ( B (2) ) by sampling B (2) Obtain approximation of expected revenue µ N. Cesa-Bianchi (UNIMI) Second-Price Auctions 16 / 21

Sampling second prices Goal: Sample B (2) in order to approximate µ 1 If p = 0, then revenue is always B (2) However, E [ B (2)] may be much smaller than µ(p ) Hence, approximating µ well by setting p = 0 is potentially wasteful 0.8 0.6 0.4 0.2 0 0.2 0.4 0.6 0.8 1 p N. Cesa-Bianchi (UNIMI) Second-Price Auctions 17 / 21

Approximating the expected revenue 1 Find a rough approximation of µ by 0.8 using p = 0 a few times Use this approximation to find a region of prices that includes p with high probability Recurse on this region 0.6 0.4 0.2 using least price as 0 reserve price 0.1 0.3 0.5 0.7 1 p i p B (2) is sampled in green region using a nearly optimal reserve price N. Cesa-Bianchi (UNIMI) Second-Price Auctions 18 / 21

Controlling regret in each phase For each phase i = 1, 2,..., S Refine current region using T i auctions with reserve price p i (least price in current region) Regret in phase i when phase length is set to T i = T 1 2 i ( T i µ(p ) µ( p i ) ) C T i = CT T i 1 } {{ } confidence interval from previous phase Note: Choice of phase length implies that number of phases is S = 2 log log T N. Cesa-Bianchi (UNIMI) Second-Price Auctions 19 / 21

Finishing up T ( ) ( ) S ( ) µ(p ) µ(p t ) µ(p ) µ(0) T } {{ } }{{} 1 + µ(p ) µ( p i ) 1 = i=2 T S ( ) T + µ(p ) µ( p i ) t=1 T + i=2 S T i i=2 C T i 1 T + S CT = O ( (log log T) T ) T i T i N. Cesa-Bianchi (UNIMI) Second-Price Auctions 20 / 21

Open problems Extend to generalized second-price auction (multiple impressions sold in each auction) What if number m of bidders is unknown? What if bidders correlate (using, e.g., ECP)? N. Cesa-Bianchi (UNIMI) Second-Price Auctions 21 / 21