E0 370 Statistical Learning Theory Lecture 20 (Nov 17, 2011)



Similar documents
MTH6121 Introduction to Mathematical Finance Lesson 5

I. Basic Concepts (Ch. 1-4)

Task is a schedulable entity, i.e., a thread

Multiobjective Prediction with Expert Advice

The naive method discussed in Lecture 1 uses the most recent observations to forecast future values. That is, Y ˆ t + 1

ANALYSIS AND COMPARISONS OF SOME SOLUTION CONCEPTS FOR STOCHASTIC PROGRAMMING PROBLEMS

Parameter-Free Convex Learning through Coin Betting

An Online Learning-based Framework for Tracking

Single-machine Scheduling with Periodic Maintenance and both Preemptive and. Non-preemptive jobs in Remanufacturing System 1

The Application of Multi Shifts and Break Windows in Employees Scheduling

Table of contents Chapter 1 Interest rates and factors Chapter 2 Level annuities Chapter 3 Varying annuities

INTEREST RATE FUTURES AND THEIR OPTIONS: SOME PRICING APPROACHES

Chapter 7. Response of First-Order RL and RC Circuits

Chapter 8: Regression with Lagged Explanatory Variables

SPEC model selection algorithm for ARCH models: an options pricing evaluation framework

4 Convolution. Recommended Problems. x2[n] 1 2[n]

TSG-RAN Working Group 1 (Radio Layer 1) meeting #3 Nynashamn, Sweden 22 nd 26 th March 1999

Risk Modelling of Collateralised Lending

PROFIT TEST MODELLING IN LIFE ASSURANCE USING SPREADSHEETS PART ONE

Morningstar Investor Return

AP Calculus AB 2013 Scoring Guidelines

Predicting Stock Market Index Trading Signals Using Neural Networks

Working Paper Social security systems, human capital, and growth in a small open economy

Small and Large Trades Around Earnings Announcements: Does Trading Behavior Explain Post-Earnings-Announcement Drift?

Technical Appendix to Risk, Return, and Dividends

Mathematics in Pharmacokinetics What and Why (A second attempt to make it clearer)

Markit Excess Return Credit Indices Guide for price based indices

Stochastic Optimal Control Problem for Life Insurance

Multiprocessor Systems-on-Chips

On Stochastic and Worst-case Models for Investing

A Generalized Bivariate Ornstein-Uhlenbeck Model for Financial Assets

Improper Integrals. Dr. Philippe B. laval Kennesaw State University. September 19, f (x) dx over a finite interval [a, b].

Permutations and Combinations

AP Calculus AB 2007 Scoring Guidelines

Longevity 11 Lyon 7-9 September 2015

Present Value Methodology

Hotel Room Demand Forecasting via Observed Reservation Information

17 Laplace transform. Solving linear ODE with piecewise continuous right hand sides

Capacitors and inductors

Dynamic programming models and algorithms for the mutual fund cash balance problem

11/6/2013. Chapter 14: Dynamic AD-AS. Introduction. Introduction. Keeping track of time. The model s elements

Fair Valuation and Risk Assessment of Dynamic Hybrid Products in Life Insurance: A Portfolio Consideration

Time Consisency in Porfolio Managemen

Forecasting, Ordering and Stock- Holding for Erratic Demand

Newton s Laws of Motion

Usefulness of the Forward Curve in Forecasting Oil Prices

Bayesian Inference of Arrival Rate and Substitution Behavior from Sales Transaction Data with Stockouts

Imagine a Source (S) of sound waves that emits waves having frequency f and therefore

A Universal Pricing Framework for Guaranteed Minimum Benefits in Variable Annuities *

An Analysis of Tax Revenue Forecast Errors

Return Calculation of U.S. Treasury Constant Maturity Indices

Why Did the Demand for Cash Decrease Recently in Korea?

Modeling a distribution of mortgage credit losses Petr Gapko 1, Martin Šmíd 2

On the degrees of irreducible factors of higher order Bernoulli polynomials

MODEL AND ALGORITHMS FOR THE REAL TIME MANAGEMENT OF RESIDENTIAL ELECTRICITY DEMAND. A. Barbato, G. Carpentieri

The Grantor Retained Annuity Trust (GRAT)

Real-time Particle Filters

Strategic Optimization of a Transportation Distribution Network

Differential Equations. Solving for Impulse Response. Linear systems are often described using differential equations.

Fifth Quantitative Impact Study of Solvency II (QIS 5) National guidance on valuation of technical provisions for German SLT health insurance

WHAT ARE OPTION CONTRACTS?

CALCULATION OF OMX TALLINN

METHODS OF LEARNING IN STATISTICAL EDUCATION: A RANDOMIZED TRIAL OF PUBLIC HEALTH GRADUATE STUDENTS 3

Stock Trading with Recurrent Reinforcement Learning (RRL) CS229 Application Project Gabriel Molina, SUID

Modeling VIX Futures and Pricing VIX Options in the Jump Diusion Modeling

= r t dt + σ S,t db S t (19.1) with interest rates given by a mean reverting Ornstein-Uhlenbeck or Vasicek process,

Term Structure of Prices of Asian Options

Online Algorithms: Learning & Optimization with No Regret.

Description of the CBOE S&P 500 BuyWrite Index (BXM SM )

Dependent Interest and Transition Rates in Life Insurance

Large Scale Online Learning.

INTRODUCTION TO MARKETING PERSONALIZATION. How to increase your sales with personalized triggered s

Niche Market or Mass Market?

Follow the Leader If You Can, Hedge If You Must

A Re-examination of the Joint Mortality Functions

adaptive control; stochastic systems; certainty equivalence principle; long-term

The Impact of Surplus Distribution on the Risk Exposure of With Profit Life Insurance Policies Including Interest Rate Guarantees.

Forecasting Model for Crude Oil Price Using Artificial Neural Networks and Commodity Futures Prices

Chapter 4: Exponential and Logarithmic Functions

The Impact of Surplus Distribution on the Risk Exposure of With Profit Life Insurance Policies Including Interest Rate Guarantees

2.4 Network flows. Many direct and indirect applications telecommunication transportation (public, freight, railway, air, ) logistics

GoRA. For more information on genetics and on Rheumatoid Arthritis: Genetics of Rheumatoid Arthritis. Published work referred to in the results:

Chapter 1.6 Financial Management

THE DETERMINATION OF PORT FACILITIES MANAGEMENT FEE WITH GUARANTEED VOLUME USING OPTIONS PRICING MODEL

Supplementary Appendix for Depression Babies: Do Macroeconomic Experiences Affect Risk-Taking?

Transcription:

E0 370 Saisical Learning Theory Lecure 0 (ov 7, 0 Online Learning from Expers: Weighed Majoriy and Hedge Lecurer: Shivani Agarwal Scribe: Saradha R Inroducion In his lecure, we will look a he problem of learning from muliple expers in an online fashion. There are finie number of expers, who give heir predicions ξ,..., ξ. The learning algorihm has o use he predicor values and come up wih an oucome ŷ. The oal number of misakes made by he algorihm is compared wih he performance of he bes exper in consideraion. Online Predicion from Expers A general online predicion problem proceeds as follows. Online (binary predicion using muliple expers For =,..., T : Receive exper predicors ξ (x,..., ξ (x {±} Predic ŷ {±} Receive rue label y {±} Incur loss l(y, ŷ. Halving Algorihm Here we assume ha he se of expers ha we consider has an exper which would give he correc label for all insances. In he halving algorihm, for every ieraion, only he consisen expers are reained. If a predicor makes a misake i will no more be conribuing in he predicion process. Halving Algorihm Iniiae weighs wi = i ] For =,..., T : Receive exper predicors ξ (x,..., ξ (x {±} Predic ŷ = sign( n j= w j.ξ j (majoriy voe Receive rue label y {±} Incur loss l(y, ŷ Updae:- Updae:- i... : Ifξi y hen i 0 else i wi

Online Learning from Expers: Weighed Majoriy and Hedge Thus he maximum number of misakes, or he sum of loss over any given sequence is bounded by he logarihm of number of predicors. i.e. L 0- S Halving] log.. Weighed Majoriy (WM Algorihm In he halving algorihm, when a predicor makes even one misake, i will no be able o conribue o he predicion in he successive ieraions. When we don have an exper ha would predic correcly for all samples, his would no be a suiable approach. The weighed majoriy algorihm works well in such siuaions. Here every predicor is assigned equal weigh, say, iniially. Laer as hey make binary predicions on insances, he weighs of he predicors are decreased using muliplicaive updae, when hey commi misakes. The rae a which he weighs are updaed is governed by he parameer. Weighed majoriy Algorihm Iniiae weighs wi = i ] For =,..., T : Receive exper predicors ξ (x,..., ξ (x {±} Predic ŷ = sign( n j= w j.ξ j Receive rue label y {±} Incur loss l(y, ŷ Updae:- If ŷ y i... i w i exp(.i(y ξ i (majoriy voe Theorem.. Le ξ,..., ξ {±} T. Le S = (y,..., y T {±} and le > 0. Then he oal number of misakes ( L 0- S W eighedm ajoriy(] +exp(. min L 0- S ξ i ] + i. +exp(. Proof. Denoe L 0- S W eighedmajoriy] = L For each rial on which here is a misake, we have + = wi. exp (.I(y ξ i. ( = w i. exp + w i ( i:y ξ i i:y =ξ i = exp.w maj + W min (3 For all misake rials, we have + Therefore summing over =,..., T gives exp.w maj + W min + exp (W maj W min (4 = + exp.(w maj + W min +exp. For oher rials, + W W T + W T = + exp.( (5 ( + exp L. (6

Online Learning from Expers: Weighed Majoriy and Hedge 3 L ln W ln W T + +exp (. (7 Finding he lower bound on ln W T + + = j= w T + j w T + j exp.li w i ( i. (8 L ln W.L i ln wi +e ( = ln +.L i +e ( (9 (0 for all w j > 0 j Thus we obain he resul..3 Weighed Majoriy: Coninuous Version (WMC We now see he coninuous version of weighed majoriy algorihm. The final predicion is a weighed average of he exper predicor values. Here ỹ = ŷ = y = 0, ] Weighed majoriy Algorihm :Coninuous Version (WMC Iniiae weighs wi = i ] For =,..., T : Receive exper predicors ξ (x,..., ξ (x 0, ] Predic ŷ = w i.ξ i 0, ] (Weighed Average w i Receive rue label y 0, ] Incur loss l abs (y, ŷ = y ŷ Updae:- i... i wi. exp. ξ i y Theorem.. Le ξ,..., ξ 0, ] T. Le S = (y,..., y T 0, ] and le > 0. Then he oal number of misakes S W MC(] (. min exp i L abs L abs S ξ i ] +.. exp Proof. Denoe L abs S W MC(] = L For each rial we have + = wi. exp. y ξ i. ( wi. ( exp y ξi ]. (

4 Online Learning from Expers: Weighed Majoriy and Hedge + wi ( exp w i y ξi (3 w i. ( exp w i y ξi (4 w i =. ( exp ŷ y ] (5 ] +. exp ( exp. ŷ y. (6 + exp ( exp. T = ŷ y ] = exp ( exp.l exp ( exp. ŷ y ] (7 (8 Finding he lower bound on ln W T + L ln W ln W T + (exp. (9 + exp.li w i ( i. (0 Thus we obain he resul L ln W T + +.L i ln wi (exp ( ln +.L i (exp. ( 3 Online Allocaion The problem of online allocaion occurs in scenarios where we need o allocae differen fracion of resources ino differen opions. The loss associaed wih every opion is available a he end of every ieraion. We would like o reduce he oal loss suffered for he paricular allocaion. The allocaion for he nex ieraion is hen revised, based on he oal loss suffered in he curren ieraion using muliplicaive updae. Hedge Algorihm( Iniiae weighs wi = i ] For =,..., T : Make allocaion p p = w ; w i Receive vecor of loses l = (l,..., l 0, ] Incur loss p.l = p i.l i Updae:- i... i wi. exp (l i

Online Learning from Expers: Weighed Majoriy and Hedge 5 Theorem 3.. Le l,..., l T 0, ] The cumulaive loss of he algorihm is LA] = T p.l If he loss of a paricular opion over he T ieraions is given by Then L T l i. LHedge(] (. min L exp i +.. i exp Proof. Denoe LHedge(] = L For each rial we have + = wi. exp.l i. (3 wi. ( exp. w i.l i. (4 w i + wi ( exp p.l ]. (5 + exp ( exp.p.l ] (6 ] exp ( exp.l (7 L ln W ln W T + (exp. (8 Finding he lower bound on ln W T + + exp.li w i ( i. (9 Thus we obain he resul L ln W T + +.L i ln wi (exp (30 ln +.L i (exp. (3 4 ex Lecure In he nex lecure, we will inroduce he idea of minimax regre, in an adversarial learning seing. References