Surprise and Curiosity for Big Data Robotics

Size: px
Start display at page:

Download "Surprise and Curiosity for Big Data Robotics"

Transcription

1 Sequenial Decision-Making wih Big Daa: Papers from he AAAI-14 Workshop Surprise and Curiosiy for Big Daa Roboics Adam Whie, Joseph Modayil, Richard S. Suon Reinforcemen Learning and Arificial Inelligence Laboraory Universiy of Albera, Edmonon, Albera, Canada Absrac This paper inroduces a new perspecive on curiosiy and inrinsic moivaion, viewed as he problem of generaing behavior daa for parallel off-policy learning. We provide 1) he firs measure of surprise based on off-policy general value funcion learning progress, 2) he firs invesigaion of reacive behavior conrol wih parallel gradien emporal difference learning and funcion approximaion, and 3) he firs demonsraion of using curiosiy driven conrol o reac o a non-saionary learning ask all on a mobile robo. Our approach improves scalabiliy over previous off-policy, robo learning sysems, essenial for making progress on he ulimae big-daa decision making problem life-long robo learning. Off-policy, life-long, robo learning is an immense big-daa decision making problem. In life-long learning he agen s ask is o learn from an effecively infinie sream of ineracion. For example, a robo updaing a 100 imes a second, running 8 hours a day, wih a few dozen sensors can produce gigabyes of raw observaion daa every year of is life. Beyond he emporal scale of he problem, off-policy life-long learning enables addiional scaling in he number of hings ha can be learned in parallel, as demonsraed by recen predicive, learning sysems (see Modayil e al 2012, Whie e al 2013). A special challenge in off-policy, life-long learning is o selec acions in way ha provides effecive raining daa for poenially housands or millions of predicion learners wih diverse needs, which is he subjec of his sudy. Surprise and curiosiy play an imporan role in any learning sysem. These ideas have been explored in he conex of opion learning (Singh e al 2005, Simsek and Baro 2006, Schembri e al 2007), developmenal robo exploraion (Schmidhuber 1991, Oudeyer e al, 2007), and exploraion and exploiaion in reinforcemen learning (see Baldassarre and Mirolli 2013 for an overview). Informally, surprise is an unexpeced predicion error. For example, a robo migh be surprised abou is curren draw as i drives across sand for he firs ime. An agen migh be surprised if is reward funcion suddenly changed sign, producing large unexpeced negaive rewards. An agen should, however, be unsurprised Copyrigh c 2014, Associaion for he Advancemen of Arificial Inelligence ( All righs reserved. if is predicion of fuure sensory evens falls wihin he error induced by sensor noise. Equipped wih a measure of surprise, an agen can reac change how i is behaving o unexpeced siuaions o encourage relearning. This reacive adapaion we call curious behavior. In his paper we sudy how surprise and curiosiy can be used o adjus a robo s behavior in he face of changing world. In paricular, we focus on he siuaion where a robo has already learned wo off-policy predicions abou wo disinc policies. The robo hen experiences a physical change ha significanly impacs he predicive accuracy of a single predicion. The robo observes is own inabiliy o accuraely predic fuure baery curren draw when i execues a roaion command, exciing is inernal surprise measure. The robo s behavior responds by selecing acions o speed relearning of he incorrec predicion spinning in place unil he robo is no longer surprised hen reurning o normal operaion. This paper provides he firs empirical demonsraion of surprise and curiosiy based on off-policy learning progress on a mobile robo. Our specific insaniaion of surprise is based on he insananeous emporal difference error, raher han novely, salience, or prediced error (all explored in previous work). Our measure is unique because 1) i balances knowledge and compeence-based learning and 2) i uses error generaed by off-policy reinforcemen learning algorihms on real robo daa. Our experimen uses commodiy off-he-shelf irobo Creae and simple camera resuling in real-ime adapive conrol wih visual feaures. We focus on he paricular case of responding o a dramaic increase in surprise due o a change in he world raher han iniial learning. The approach described in his paper scales naurally o massive emporal sreams, high dimensional feaures, and many independen off-policy learners common in life-long robo learning. Background We model an agen s ineracion wih he world (including he robo s body) as a discree ime dynamical sysem. On each ime sep, he agen observes a feaure vecor x X R n, ha only parially characerizes he environmenal sae s S. We assume s is he curren sae of some unobservable Markov Decision Process (MDP), and hus x is compued from any informaion available o he 19

2 agen a ime. On each sep he agen akes an acion a A, resuling in an ransiion in he underlying MDP, and he agen observes a new feaure vecor x +1. In convenional reinforcemen learning, he agen s objecive is o predic he oal discouned fuure reward on every ime sep. The reward is a special scalar signal r +1 = r(x +1 ) R, ha is emied by he environmen on each sep. To predic reward he agen learns a value funcion v : S R. The ime scale of he predicion is conrolled by a discoun facor γ [0, 1). The precise quaniy o be prediced is he reurn g = k=0 γk r +k+1, and he value funcion is he expeced value of he reurn, [ ] v(s) = E π γ k S R +k+1 = s, k=0 where he expecaion is condiional on he acions (afer ) seleced according o a paricular policy π : X A [0, 1] as denoed by he subscrip on he expecaion operaor. As is common in reinforcemen learning, we esimae v wih a linear approximaion, v w (s ) = w x v(s ), where w R n. We use generalized noions of reward and erminaion o enable learning a broader class of predicions han is ypically considered in convenional reinforcemen learning. Firs noice we can define he reward o be any bounded funcion of x, such as he insananeous value of an IR sensors, which we call pseudo reward. Second, γ need no be consan, bu can also be defined as a funcion he feaures γ : X [0, 1]. These changes require he definiion of a general value funcion (GVF) [ ( v(s) = E Π k j=1 γ(x +j ) ) ] r(x +k+1 ), k=0 bu no special algorihmic modificaions are required o learn GVFs. See Modayil e al (2014) for a more deailed explanaion of GVF predicion. In order o learn many value funcions in parallel, each condiioned on a differen policy, we require off-policy learning. Off-policy reinforcemen learning allows he agen o learn predicions abou one policy while following anoher. In his seing, he policy ha condiions he value funcion he arge policy π is differen from he policy used o selec acions and conrol he robo, called he behavior policy µ : X A [0, 1]. Separaing he daa generaion policy from he policy o be learned allows learning abou many value funcions in parallel from a single sream of experience. Each predicion V (i) = v (i) w (s ) is abou fuure pseudo reward r (i) (x ) observed if he agen follows π (i) wih pseudo erminaion according o γ (i) (x ). These policy-coningen predicions can each be learned by independen insances of off-policy reinforcemen learning algorihms. One such off-policy algorihm is GTD(λ) which preforms sochasic gradien descen on he Mean Squared Projeced Bellman Error (MSPBE), wih linear ime and space complexiy. The updae equaions for GTD(λ), e π(x, a ) µ(x, a ) (x + γ(x )λe 1 ) w +1 w + α(δ e γ(x +1 )(1 λ)(e h )x +1 ) h +1 h + β(δ e (x h )x ), require an eligibiliy race vecor e R n, scalar learning rae parameers α and β, and he usual emporal difference error δ = r(x +1 ) + γ(x +1 )w x +1 w x. The MSPBE is a convex objecive ha can be direcly esimaed from daa, MSPBE(w) = E µ [δe] T E µ [xx T ] 1 E µ [δe], useful for racking he learning progress of off-policy predicions (Whie e al 2013). Experimen The objecive of his paper is o invesigae one poenial benefi of using surprise o influence decision making. Specifically we seek a concree demonsraion of adaping he behavior policy auomaically, in response o a perurbaion o he agen s sensorimoor sream. The change is unexpeced and no modelled by he agen. The change is designed o influence he accuracy of only one GVF predicion. Consider a measure of surprise based on he emporal difference error of each GVF Z (i) δ = (i) var[δ (i) ], (1) where denoes an exponenially weighed average. This measure of surprise increases when insananeous errors fall considerably ouside he mean error. A curious behavior is any policy ha uses a measure of surprise from several GVFs o influence acion selecion. Here we consider a rule-based curious behavior ha uses surprise o deermine wheher he behavior should coninue selecing acions according o arge policy π (i) or swich o anoher arge π (j). Assuming he behavior µ had seleced acions according o π (i) for k consecuive seps, he agen decides o coninue following π (i) for k more seps, or swich o a new arge policy: if Z (i) < τ hen j = argmax j i Z (j) if Z (j) < τ (2) pick j randomly µ = π (j) follow µ for k consecuive seps In our experimen repored here k was se o 120 seps or approximaely four seconds, τ was se o 0.2, and he decay rae of he exponenial average in Equaion 1 was Inuiively we expec a curious behavior o selec acions ha encourage or faciliae re-learning of an inaccurae GVF. In he case of a perurbaion ha affecs a single GVF v (i), 20

3 non-reacive behavior predicion horizon), and r (1) = 1.0. Each GVF had a unique feaure represenaion. The roaion GVF s binary feaure vecor was produced by a single iling of a decaying race of he observed baery curren, wih a ile widh of 1/16h. The forward GVF used a feaure vecor consruced from web-camera images sampled a 30 frames per second. A he sar of he experimen, 100 pixels were seleced a random, and from hese pixels eiher he luminance or color channel was seleced a random, and hese seleced values were used o consruc feaures from he mos recen image. Each value (beween 0 and 255) was independenly iled ino 16 non-overlapping iles of widh 16, producing a binary feaure vecor x, wih componens, of which 100 were acive on each ime sep (see Suon and Baro (1998) for a review of ile coding). The camera was mouned direcly on op of he robo facing forward. Boh GVFs were learned using a separae insances of GTD(λ = 0.9) wih α = 0.05 and β = All learning was done direcly on a raspberry pi direcly conneced o an irobo Creae wih an updae cycle of 30 ms. The perurbaion involved puing a heavy load in he back of he robo, which changes he curren draw and direcly affecs he roaion GVF. The drive speed and hus he forward GVF predicion will be unaffeced. Our experimen involved wo phases. During he firs phase (roughly en mins) he robo followed a hand-coded non-reacive behavior policy ha alernaed beween driving forward unil bumping, roaing couner clockwise in free-space (no agains he wall), and roaing away from he wall afer bump. Figure 1 shows a visualizaion of he noncurious behav load Figure 1: Non-reacive conrol of he Creae in is pen. we expec he curious behavior (2) o coninually selec acions according o he corresponding arge policy π (i) unil he surprise has subsided, and hus he new siuaion has been learned. To invesigae his hypohesis, we conduced an experimen in which wo GVFs were learned from off-policy experience. The firs GVF prediced fuure discouned baery curren draw while couner-clockwise spinning, encoded as π (0) (x, roaeccw) = 1.0, γ (0) (x) = 0.8 x X, and r (0) = baery curren. We used a simple discree acion se of {forward, reverse, sop, roae cw, and roae ccw}. The second GVF prediced he expeced number of ime seps unil bump if he robo drove forward, encoded as π (1) (x, forward) = 1.0 x X, γ (1) (x) = 0.0 on bump and 0.95 oherwise (a 6.6 second surprise surprise non-reacive behavior curious behav load ime seps Figure 2: A ime-series plo of he surprise measure for he Forward (op) and Roaion (boom) GVFs. Boh graphs show he hree phases of he experimen: 1) iniial learning under he non-reacive behavior, 2) learning under he curious behavior, and 3) learning afer load was added o he robo. Noice ha during phase 1, a spike in roaion surprise is presen. This was he robo s firs roaion jus afer bumping, which generaes a novel curren profile. A second spike in surprise occurs afer he load is added in phase 3. The laer spike is larger, bu reduces faser because he robo effecively modifies is behavior. reacive behavior during he firs phase of learning. Phase one was inerruped afer each GVF had learned o an accepable level of accuracy, afer which ime he behavior was swiched o a curious behavior policy (described in 2). Afer abou wo minues, a 5 pound load was placed in he cargo bay of he Creae. The load had a significan effec on he baery curren draw, bu was no heavy enough o affec he robo s abiliy o achieve he requesed wheel velociy for he drive forward acions. Figure 2 shows he effec of he load on each GVF s predicion via he surprise measures. The forward GVF s predicion is largely unaffeced by he load; he robo was unsurprised. The roaion GVF s pseudo reward was based on curren draw, and was herefore significanly affeced as seen in a subsanial increase in surprise. The exra load generaed a clear and obvious change in he robo s behavior. The firs ime he curious behavior seleced a couner-clockwise roaion (afer he load was added) a large increase in surprise was produced. The increased surprise caused an exended couner-clockwise roaion. When neiher surprise measure exceeds he hreshold, he curious behavior will drives forward, only occasionally execuing roaions in free space. The effec of he load produced a very disincive and exended change in behavior. Figure 3 provides wo quaniaive measures of how he acions were seleced before he load was added, during he surprise period, and afer relearning. Afer several seconds of roaion he surprise for roaing subsided below τ, consan roaion ceased, and he behavior reurned o normal operaion. The experimen was repeaed several imes varying he 21

4 roae arge acivaion forward arge acivaion number of roae acions roaion seleced... forward seleced... pause ime seps Figure 3: The op figure shows which GVF policy is acive while he robo was under curious behavior conrol. The boom hisogram shows he number of ime seps on which he roaion acion was seleced over he same ime period. The upper figure clearly shows ha he robo coninually seleced acions according o he roae arge policy afer he load was added (marked by he red bar), and hen reurned o normal operaion. The lower figure illusraes he same phenomenon, and also shows he variabiliy in he number of roaions before he load was added o he robo. lighing condiions, wall-clock duraion of each phase, and camera orienaion. The increase in surprise and resulan behavior modificaion was reliably demonsraed each ime. Discussion Our experimen was designed o highligh one way in which he off-policy learning progress of muliple GVFs can influence conrol. We crafed he GVFs and inroduced a change ha would be immediaely deeced, and would produce a disincive change in behavior (consan roaion). Alhough limied, our demonsraion is novel, and demonsraes a fundamenal idea in a scalable way. Surprisingly his work is he firs demonsraion of adapive conrol for he behavior policy of an off-policy learning sysem on a robo. All previous GVF learning sysems used non-adapive behaviors (see Suon e al 2010, Modayil e al 2012, Whie e al 2013). Typically inrinsically moived sysems are segregaed ino wo camps: 1) knowledge-driven sequenial ask learning (Schmidhuber 1991, Oudeyer e al 2007), and 2) compeence-driven opion or subask raining (Singh e al 2005, Simsek and Baro 2006, Schembri e al 2007). Our approach unifies hese wo approaches. A GVF can be used o represen a predicion condiioned on a fixed arge policy (as used here) and encode an sae-acionvalue funcion used o learn a policy (learned wih greedy- GQ(λ)). Therefore adaping behavior based-on GVF error drives boh predicive knowledge acquisiion and improving compeence. Finally, our work is he firs o highligh and demonsrae he role of surprise and curiosiy in a nonsaionary seing. The approach described here is small, bu surprisingly scalable. Consider an alernaive approach o reacing o change like following each arge policy in sequence. The robo migh be learning housands of differen GVFs condiioned on housands of arge polices, as in previous work (Whie e al 2013). I would be infeasible o rely on sequencing arge policies o efficienly deec and reac o changes when he number of GVFs is large. Assuming any change o he robo s world only affecs a subse of he GVFs, our approach o curiosiy will provide daa o GVF s ha are capable of more learning. A more general approach would be o learn he curious behavior wih reinforcemen learning and a surprise-based reward. If he behavior were generaed via average reward acor criic, hen he robo could balance he needs of many GVFs in is acion selecion wihou he resricion of following any arge policy. The approach explored here is he firs sep in exploring he naural synergy beween parallel off-policy learning and curious behavior. Alhough surprise is useful for adaping o non-saionariy, i can be usefully deployed for a wide range of seings. Imagine a seing where new GVFs are coninually creaed over ime. A curious behavior, in a similar way o adaping o a perurbaion, can adjus acion selecion o provide relevan daa for new GVFs. We focused here on using curiosiy afer iniial learning each GVF had been learned o high accuracy. Curiosiy can also be used during iniial learning o avoid he inheren limiaions of hand-coded behaviors. Finally, wha does an robo do when is no-longer surprised or bored? Could a curious behavior selec acions in such a way o ready iself o reac efficienly o new perurbaions or new GVFs? These quesions are lef o fuure work. Conclusion This paper provides 1) he firs measure of surprise based on off-policy GVF learning progress, 2) he firs invesigaion of reacive behavior conrol wih parallel gradien TD learning and funcion approximaion, and 3) he firs demonsraion of using curiosiy driven conrol o reac o non-saionariy all on a mobile robo. The abiliy o deermine which off-policy predicions are subsanially inaccurae, and modifying robo behavior online o improve learning efficiency is paricularly imporan in large-scale, parallel, off-policy learning sysems. Acknowledgemens This work was suppored by grans from Albera Innovaes Technology Fuures and he Naional Science and Engineering Research Council of Canada. References Baldassarre, G., Mirolli, M. (Eds.). (2013). Inrinsically moivaed learning in naural and arificial sysems. Berlin: Springer. Maei, H. R. (2011). Gradien Temporal-Difference Learning Algorihms. PhD hesis, Universiy of Albera. 22

5 Modayil, J., Whie, A., Suon, R. S. (2012). Muli-imescale nexing in a reinforcemen learning robo. In From Animals o Animas 12, Oudeyer, P. Y., Kaplan, F., Hafner, V. (2007). Inrinsic Moivaion Sysems for Auonomous Menal Developmen. In IEEE Transacions on Evoluionary Compuaion 11, Schembri, M., Mirolli, M., Baldassarre, G. (2007). Evolving inernal reinforcers for an inrinsically moivaed reinforcemenlearning robo. In Developmen and Learning, Schmidhuber J. (1991). A possibiliy for implemening curiosiy and boredom in model-building neural conrollers. In Proceedings of he 1s Inernaional Conference on Simulaion of Adapive Behavior, Simsek, O., Baro, A. G. (2006). An inrinsic reward mechanism for efficien exploraion. In Proceedings of he 23rd inernaional conference on Machine learning, Singh S., Baro, A. G., Chenanez, N. (2005). Inrinsically moivaed reinforcemen learning. In Advances in Neural Informaion Processing Sysems 17, Suon, R. S., Baro, A. G. (1998). Reinforcemen Learning: An Inroducion. MIT Press. Suon, R. S., Maei, H. R., Precup, D., Bhanagar, S., Silver, D., Szepesvári, Cs., Wiewiora, E. (2009). Fas gradien-descen mehods for emporal-difference learning wih linear funcion approximaion. In Proceedings of he 26h Inernaional Conference on Machine Learning. Suon, R. S., Modayil, J., Delp, M., Degris, T., Pilarski, P. M., Whie, A., Precup, D. (2011). Horde: A scalable real-ime archiecure for learning knowledge from unsupervised sensorimoor ineracion. In Proceedings of he10h Inernaional Conference on Auonomous Agens and Muliagen Sysems. Whie, A., Modayil, J., Suon, R. S. (2012). Scaling life-long offpolicy learning. In Developmen and Learning and Epigeneic Roboics,

TEMPORAL PATTERN IDENTIFICATION OF TIME SERIES DATA USING PATTERN WAVELETS AND GENETIC ALGORITHMS

TEMPORAL PATTERN IDENTIFICATION OF TIME SERIES DATA USING PATTERN WAVELETS AND GENETIC ALGORITHMS TEMPORAL PATTERN IDENTIFICATION OF TIME SERIES DATA USING PATTERN WAVELETS AND GENETIC ALGORITHMS RICHARD J. POVINELLI AND XIN FENG Deparmen of Elecrical and Compuer Engineering Marquee Universiy, P.O.

More information

PROFIT TEST MODELLING IN LIFE ASSURANCE USING SPREADSHEETS PART ONE

PROFIT TEST MODELLING IN LIFE ASSURANCE USING SPREADSHEETS PART ONE Profi Tes Modelling in Life Assurance Using Spreadshees PROFIT TEST MODELLING IN LIFE ASSURANCE USING SPREADSHEETS PART ONE Erik Alm Peer Millingon 2004 Profi Tes Modelling in Life Assurance Using Spreadshees

More information

Real-time Particle Filters

Real-time Particle Filters Real-ime Paricle Filers Cody Kwok Dieer Fox Marina Meilă Dep. of Compuer Science & Engineering, Dep. of Saisics Universiy of Washingon Seale, WA 9895 ckwok,fox @cs.washingon.edu, [email protected] Absrac

More information

Chapter 8: Regression with Lagged Explanatory Variables

Chapter 8: Regression with Lagged Explanatory Variables Chaper 8: Regression wih Lagged Explanaory Variables Time series daa: Y for =1,..,T End goal: Regression model relaing a dependen variable o explanaory variables. Wih ime series new issues arise: 1. One

More information

Multiprocessor Systems-on-Chips

Multiprocessor Systems-on-Chips Par of: Muliprocessor Sysems-on-Chips Edied by: Ahmed Amine Jerraya and Wayne Wolf Morgan Kaufmann Publishers, 2005 2 Modeling Shared Resources Conex swiching implies overhead. On a processing elemen,

More information

How To Predict A Person'S Behavior

How To Predict A Person'S Behavior Informaion Theoreic Approaches for Predicive Models: Resuls and Analysis Monica Dinculescu Supervised by Doina Precup Absrac Learning he inernal represenaion of parially observable environmens has proven

More information

The naive method discussed in Lecture 1 uses the most recent observations to forecast future values. That is, Y ˆ t + 1

The naive method discussed in Lecture 1 uses the most recent observations to forecast future values. That is, Y ˆ t + 1 Business Condiions & Forecasing Exponenial Smoohing LECTURE 2 MOVING AVERAGES AND EXPONENTIAL SMOOTHING OVERVIEW This lecure inroduces ime-series smoohing forecasing mehods. Various models are discussed,

More information

Duration and Convexity ( ) 20 = Bond B has a maturity of 5 years and also has a required rate of return of 10%. Its price is $613.

Duration and Convexity ( ) 20 = Bond B has a maturity of 5 years and also has a required rate of return of 10%. Its price is $613. Graduae School of Business Adminisraion Universiy of Virginia UVA-F-38 Duraion and Convexiy he price of a bond is a funcion of he promised paymens and he marke required rae of reurn. Since he promised

More information

TSG-RAN Working Group 1 (Radio Layer 1) meeting #3 Nynashamn, Sweden 22 nd 26 th March 1999

TSG-RAN Working Group 1 (Radio Layer 1) meeting #3 Nynashamn, Sweden 22 nd 26 th March 1999 TSG-RAN Working Group 1 (Radio Layer 1) meeing #3 Nynashamn, Sweden 22 nd 26 h March 1999 RAN TSGW1#3(99)196 Agenda Iem: 9.1 Source: Tile: Documen for: Moorola Macro-diversiy for he PRACH Discussion/Decision

More information

Market Liquidity and the Impacts of the Computerized Trading System: Evidence from the Stock Exchange of Thailand

Market Liquidity and the Impacts of the Computerized Trading System: Evidence from the Stock Exchange of Thailand 36 Invesmen Managemen and Financial Innovaions, 4/4 Marke Liquidiy and he Impacs of he Compuerized Trading Sysem: Evidence from he Sock Exchange of Thailand Sorasar Sukcharoensin 1, Pariyada Srisopisawa,

More information

Mathematics in Pharmacokinetics What and Why (A second attempt to make it clearer)

Mathematics in Pharmacokinetics What and Why (A second attempt to make it clearer) Mahemaics in Pharmacokineics Wha and Why (A second aemp o make i clearer) We have used equaions for concenraion () as a funcion of ime (). We will coninue o use hese equaions since he plasma concenraions

More information

SELF-EVALUATION FOR VIDEO TRACKING SYSTEMS

SELF-EVALUATION FOR VIDEO TRACKING SYSTEMS SELF-EVALUATION FOR VIDEO TRACKING SYSTEMS Hao Wu and Qinfen Zheng Cenre for Auomaion Research Dep. of Elecrical and Compuer Engineering Universiy of Maryland, College Park, MD-20742 {wh2003, qinfen}@cfar.umd.edu

More information

MACROECONOMIC FORECASTS AT THE MOF A LOOK INTO THE REAR VIEW MIRROR

MACROECONOMIC FORECASTS AT THE MOF A LOOK INTO THE REAR VIEW MIRROR MACROECONOMIC FORECASTS AT THE MOF A LOOK INTO THE REAR VIEW MIRROR The firs experimenal publicaion, which summarised pas and expeced fuure developmen of basic economic indicaors, was published by he Minisry

More information

USE OF EDUCATION TECHNOLOGY IN ENGLISH CLASSES

USE OF EDUCATION TECHNOLOGY IN ENGLISH CLASSES USE OF EDUCATION TECHNOLOGY IN ENGLISH CLASSES Mehme Nuri GÖMLEKSİZ Absrac Using educaion echnology in classes helps eachers realize a beer and more effecive learning. In his sudy 150 English eachers were

More information

Stock Trading with Recurrent Reinforcement Learning (RRL) CS229 Application Project Gabriel Molina, SUID 5055783

Stock Trading with Recurrent Reinforcement Learning (RRL) CS229 Application Project Gabriel Molina, SUID 5055783 Sock raing wih Recurren Reinforcemen Learning (RRL) CS9 Applicaion Projec Gabriel Molina, SUID 555783 I. INRODUCION One relaively new approach o financial raing is o use machine learning algorihms o preic

More information

CHARGE AND DISCHARGE OF A CAPACITOR

CHARGE AND DISCHARGE OF A CAPACITOR REFERENCES RC Circuis: Elecrical Insrumens: Mos Inroducory Physics exs (e.g. A. Halliday and Resnick, Physics ; M. Sernheim and J. Kane, General Physics.) This Laboraory Manual: Commonly Used Insrumens:

More information

Journal Of Business & Economics Research September 2005 Volume 3, Number 9

Journal Of Business & Economics Research September 2005 Volume 3, Number 9 Opion Pricing And Mone Carlo Simulaions George M. Jabbour, (Email: [email protected]), George Washingon Universiy Yi-Kang Liu, ([email protected]), George Washingon Universiy ABSTRACT The advanage of Mone Carlo

More information

The Grantor Retained Annuity Trust (GRAT)

The Grantor Retained Annuity Trust (GRAT) WEALTH ADVISORY Esae Planning Sraegies for closely-held, family businesses The Granor Reained Annuiy Trus (GRAT) An efficien wealh ransfer sraegy, paricularly in a low ineres rae environmen Family business

More information

Acceleration Lab Teacher s Guide

Acceleration Lab Teacher s Guide Acceleraion Lab Teacher s Guide Objecives:. Use graphs of disance vs. ime and velociy vs. ime o find acceleraion of a oy car.. Observe he relaionship beween he angle of an inclined plane and he acceleraion

More information

A Natural Feature-Based 3D Object Tracking Method for Wearable Augmented Reality

A Natural Feature-Based 3D Object Tracking Method for Wearable Augmented Reality A Naural Feaure-Based 3D Objec Tracking Mehod for Wearable Augmened Realiy Takashi Okuma Columbia Universiy / AIST Email: [email protected] Takeshi Kuraa Universiy of Washingon / AIST Email: [email protected]

More information

II.1. Debt reduction and fiscal multipliers. dbt da dpbal da dg. bal

II.1. Debt reduction and fiscal multipliers. dbt da dpbal da dg. bal Quarerly Repor on he Euro Area 3/202 II.. Deb reducion and fiscal mulipliers The deerioraion of public finances in he firs years of he crisis has led mos Member Saes o adop sizeable consolidaion packages.

More information

Optimal Investment and Consumption Decision of Family with Life Insurance

Optimal Investment and Consumption Decision of Family with Life Insurance Opimal Invesmen and Consumpion Decision of Family wih Life Insurance Minsuk Kwak 1 2 Yong Hyun Shin 3 U Jin Choi 4 6h World Congress of he Bachelier Finance Sociey Torono, Canada June 25, 2010 1 Speaker

More information

Distributing Human Resources among Software Development Projects 1

Distributing Human Resources among Software Development Projects 1 Disribuing Human Resources among Sofware Developmen Proecs Macario Polo, María Dolores Maeos, Mario Piaini and rancisco Ruiz Summary This paper presens a mehod for esimaing he disribuion of human resources

More information

MODEL AND ALGORITHMS FOR THE REAL TIME MANAGEMENT OF RESIDENTIAL ELECTRICITY DEMAND. A. Barbato, G. Carpentieri

MODEL AND ALGORITHMS FOR THE REAL TIME MANAGEMENT OF RESIDENTIAL ELECTRICITY DEMAND. A. Barbato, G. Carpentieri MODEL AND ALGORITHMS FOR THE REAL TIME MANAGEMENT OF RESIDENTIAL ELECTRICITY DEMAND A. Barbao, G. Carpenieri Poliecnico di Milano, Diparimeno di Eleronica e Informazione, Email: [email protected], [email protected]

More information

11/6/2013. Chapter 14: Dynamic AD-AS. Introduction. Introduction. Keeping track of time. The model s elements

11/6/2013. Chapter 14: Dynamic AD-AS. Introduction. Introduction. Keeping track of time. The model s elements Inroducion Chaper 14: Dynamic D-S dynamic model of aggregae and aggregae supply gives us more insigh ino how he economy works in he shor run. I is a simplified version of a DSGE model, used in cuing-edge

More information

Individual Health Insurance April 30, 2008 Pages 167-170

Individual Health Insurance April 30, 2008 Pages 167-170 Individual Healh Insurance April 30, 2008 Pages 167-170 We have received feedback ha his secion of he e is confusing because some of he defined noaion is inconsisen wih comparable life insurance reserve

More information

Nikkei Stock Average Volatility Index Real-time Version Index Guidebook

Nikkei Stock Average Volatility Index Real-time Version Index Guidebook Nikkei Sock Average Volailiy Index Real-ime Version Index Guidebook Nikkei Inc. Wih he modificaion of he mehodology of he Nikkei Sock Average Volailiy Index as Nikkei Inc. (Nikkei) sars calculaing and

More information

Chapter 2 Problems. 3600s = 25m / s d = s t = 25m / s 0.5s = 12.5m. Δx = x(4) x(0) =12m 0m =12m

Chapter 2 Problems. 3600s = 25m / s d = s t = 25m / s 0.5s = 12.5m. Δx = x(4) x(0) =12m 0m =12m Chaper 2 Problems 2.1 During a hard sneeze, your eyes migh shu for 0.5s. If you are driving a car a 90km/h during such a sneeze, how far does he car move during ha ime s = 90km 1000m h 1km 1h 3600s = 25m

More information

Chapter 7. Response of First-Order RL and RC Circuits

Chapter 7. Response of First-Order RL and RC Circuits Chaper 7. esponse of Firs-Order L and C Circuis 7.1. The Naural esponse of an L Circui 7.2. The Naural esponse of an C Circui 7.3. The ep esponse of L and C Circuis 7.4. A General oluion for ep and Naural

More information

Relationships between Stock Prices and Accounting Information: A Review of the Residual Income and Ohlson Models. Scott Pirie* and Malcolm Smith**

Relationships between Stock Prices and Accounting Information: A Review of the Residual Income and Ohlson Models. Scott Pirie* and Malcolm Smith** Relaionships beween Sock Prices and Accouning Informaion: A Review of he Residual Income and Ohlson Models Sco Pirie* and Malcolm Smih** * Inernaional Graduae School of Managemen, Universiy of Souh Ausralia

More information

Performance Center Overview. Performance Center Overview 1

Performance Center Overview. Performance Center Overview 1 Performance Cener Overview Performance Cener Overview 1 ODJFS Performance Cener ce Cener New Performance Cener Model Performance Cener Projec Meeings Performance Cener Execuive Meeings Performance Cener

More information

Task is a schedulable entity, i.e., a thread

Task is a schedulable entity, i.e., a thread Real-Time Scheduling Sysem Model Task is a schedulable eniy, i.e., a hread Time consrains of periodic ask T: - s: saring poin - e: processing ime of T - d: deadline of T - p: period of T Periodic ask T

More information

Statistical Analysis with Little s Law. Supplementary Material: More on the Call Center Data. by Song-Hee Kim and Ward Whitt

Statistical Analysis with Little s Law. Supplementary Material: More on the Call Center Data. by Song-Hee Kim and Ward Whitt Saisical Analysis wih Lile s Law Supplemenary Maerial: More on he Call Cener Daa by Song-Hee Kim and Ward Whi Deparmen of Indusrial Engineering and Operaions Research Columbia Universiy, New York, NY 17-99

More information

Term Structure of Prices of Asian Options

Term Structure of Prices of Asian Options Term Srucure of Prices of Asian Opions Jirô Akahori, Tsuomu Mikami, Kenji Yasuomi and Teruo Yokoa Dep. of Mahemaical Sciences, Risumeikan Universiy 1-1-1 Nojihigashi, Kusasu, Shiga 525-8577, Japan E-mail:

More information

Gene Regulatory Network Discovery from Time-Series Gene Expression Data A Computational Intelligence Approach

Gene Regulatory Network Discovery from Time-Series Gene Expression Data A Computational Intelligence Approach Gene Regulaory Nework Discovery from Time-Series Gene Expression Daa A Compuaional Inelligence Approach Nikola K. Kasabov 1, Zeke S. H. Chan 1, Vishal Jain 1, Igor Sidorov 2 and Dimier S. Dimirov 2 1 Knowledge

More information

Chapter 1.6 Financial Management

Chapter 1.6 Financial Management Chaper 1.6 Financial Managemen Par I: Objecive ype quesions and answers 1. Simple pay back period is equal o: a) Raio of Firs cos/ne yearly savings b) Raio of Annual gross cash flow/capial cos n c) = (1

More information

Direc Manipulaion Inerface and EGN algorithms

Direc Manipulaion Inerface and EGN algorithms A Direc Manipulaion Inerface for 3D Compuer Animaion Sco Sona Snibbe y Brown Universiy Deparmen of Compuer Science Providence, RI 02912, USA Absrac We presen a new se of inerface echniques for visualizing

More information

DOES TRADING VOLUME INFLUENCE GARCH EFFECTS? SOME EVIDENCE FROM THE GREEK MARKET WITH SPECIAL REFERENCE TO BANKING SECTOR

DOES TRADING VOLUME INFLUENCE GARCH EFFECTS? SOME EVIDENCE FROM THE GREEK MARKET WITH SPECIAL REFERENCE TO BANKING SECTOR Invesmen Managemen and Financial Innovaions, Volume 4, Issue 3, 7 33 DOES TRADING VOLUME INFLUENCE GARCH EFFECTS? SOME EVIDENCE FROM THE GREEK MARKET WITH SPECIAL REFERENCE TO BANKING SECTOR Ahanasios

More information

The Transport Equation

The Transport Equation The Transpor Equaion Consider a fluid, flowing wih velociy, V, in a hin sraigh ube whose cross secion will be denoed by A. Suppose he fluid conains a conaminan whose concenraion a posiion a ime will be

More information

Cointegration: The Engle and Granger approach

Cointegration: The Engle and Granger approach Coinegraion: The Engle and Granger approach Inroducion Generally one would find mos of he economic variables o be non-saionary I(1) variables. Hence, any equilibrium heories ha involve hese variables require

More information

Making a Faster Cryptanalytic Time-Memory Trade-Off

Making a Faster Cryptanalytic Time-Memory Trade-Off Making a Faser Crypanalyic Time-Memory Trade-Off Philippe Oechslin Laboraoire de Securié e de Crypographie (LASEC) Ecole Polyechnique Fédérale de Lausanne Faculé I&C, 1015 Lausanne, Swizerland [email protected]

More information

Bayesian Filtering with Online Gaussian Process Latent Variable Models

Bayesian Filtering with Online Gaussian Process Latent Variable Models Bayesian Filering wih Online Gaussian Process Laen Variable Models Yali Wang Laval Universiy [email protected] Marcus A. Brubaker TTI Chicago [email protected] Brahim Chaib-draa Laval Universiy

More information

OPERATION MANUAL. Indoor unit for air to water heat pump system and options EKHBRD011ABV1 EKHBRD014ABV1 EKHBRD016ABV1

OPERATION MANUAL. Indoor unit for air to water heat pump system and options EKHBRD011ABV1 EKHBRD014ABV1 EKHBRD016ABV1 OPERAION MANUAL Indoor uni for air o waer hea pump sysem and opions EKHBRD011ABV1 EKHBRD014ABV1 EKHBRD016ABV1 EKHBRD011ABY1 EKHBRD014ABY1 EKHBRD016ABY1 EKHBRD011ACV1 EKHBRD014ACV1 EKHBRD016ACV1 EKHBRD011ACY1

More information

Motion Along a Straight Line

Motion Along a Straight Line Moion Along a Sraigh Line On Sepember 6, 993, Dave Munday, a diesel mechanic by rade, wen over he Canadian edge of Niagara Falls for he second ime, freely falling 48 m o he waer (and rocks) below. On his

More information

Morningstar Investor Return

Morningstar Investor Return Morningsar Invesor Reurn Morningsar Mehodology Paper Augus 31, 2010 2010 Morningsar, Inc. All righs reserved. The informaion in his documen is he propery of Morningsar, Inc. Reproducion or ranscripion

More information

An Online Learning-based Framework for Tracking

An Online Learning-based Framework for Tracking An Online Learning-based Framework for Tracking Kamalika Chaudhuri Compuer Science and Engineering Universiy of California, San Diego La Jolla, CA 9293 Yoav Freund Compuer Science and Engineering Universiy

More information

Single-machine Scheduling with Periodic Maintenance and both Preemptive and. Non-preemptive jobs in Remanufacturing System 1

Single-machine Scheduling with Periodic Maintenance and both Preemptive and. Non-preemptive jobs in Remanufacturing System 1 Absrac number: 05-0407 Single-machine Scheduling wih Periodic Mainenance and boh Preempive and Non-preempive jobs in Remanufacuring Sysem Liu Biyu hen Weida (School of Economics and Managemen Souheas Universiy

More information

The Greek financial crisis: growing imbalances and sovereign spreads. Heather D. Gibson, Stephan G. Hall and George S. Tavlas

The Greek financial crisis: growing imbalances and sovereign spreads. Heather D. Gibson, Stephan G. Hall and George S. Tavlas The Greek financial crisis: growing imbalances and sovereign spreads Heaher D. Gibson, Sephan G. Hall and George S. Tavlas The enry The enry of Greece ino he Eurozone in 2001 produced a dividend in he

More information

Option Put-Call Parity Relations When the Underlying Security Pays Dividends

Option Put-Call Parity Relations When the Underlying Security Pays Dividends Inernaional Journal of Business and conomics, 26, Vol. 5, No. 3, 225-23 Opion Pu-all Pariy Relaions When he Underlying Securiy Pays Dividends Weiyu Guo Deparmen of Finance, Universiy of Nebraska Omaha,

More information

CALCULATION OF OMX TALLINN

CALCULATION OF OMX TALLINN CALCULATION OF OMX TALLINN CALCULATION OF OMX TALLINN 1. OMX Tallinn index...3 2. Terms in use...3 3. Comuaion rules of OMX Tallinn...3 3.1. Oening, real-ime and closing value of he Index...3 3.2. Index

More information

SPEC model selection algorithm for ARCH models: an options pricing evaluation framework

SPEC model selection algorithm for ARCH models: an options pricing evaluation framework Applied Financial Economics Leers, 2008, 4, 419 423 SEC model selecion algorihm for ARCH models: an opions pricing evaluaion framework Savros Degiannakis a, * and Evdokia Xekalaki a,b a Deparmen of Saisics,

More information

Can Individual Investors Use Technical Trading Rules to Beat the Asian Markets?

Can Individual Investors Use Technical Trading Rules to Beat the Asian Markets? Can Individual Invesors Use Technical Trading Rules o Bea he Asian Markes? INTRODUCTION In radiional ess of he weak-form of he Efficien Markes Hypohesis, price reurn differences are found o be insufficien

More information

A Re-examination of the Joint Mortality Functions

A Re-examination of the Joint Mortality Functions Norh merican cuarial Journal Volume 6, Number 1, p.166-170 (2002) Re-eaminaion of he Join Morali Funcions bsrac. Heekung Youn, rkad Shemakin, Edwin Herman Universi of S. Thomas, Sain Paul, MN, US Morali

More information

Dispatch-and-Search: Dynamic Multi-Ferry Control in Partitioned Mobile Networks

Dispatch-and-Search: Dynamic Multi-Ferry Control in Partitioned Mobile Networks Dispach-and-Search: Dynamic Muli-Ferry Conrol in Pariioned Mobile Newors ing He, Ananhram Swami, and Kang-Won Lee IBM Research, Hawhorne, NY 1532 USA {he,angwon}@us.ibm.com Army Research Laboraory, Adelphi,

More information

GUIDE GOVERNING SMI RISK CONTROL INDICES

GUIDE GOVERNING SMI RISK CONTROL INDICES GUIDE GOVERNING SMI RISK CONTROL IND ICES SIX Swiss Exchange Ld 04/2012 i C O N T E N T S 1. Index srucure... 1 1.1 Concep... 1 1.2 General principles... 1 1.3 Index Commission... 1 1.4 Review of index

More information

LEVENTE SZÁSZ An MRP-based integer programming model for capacity planning...3

LEVENTE SZÁSZ An MRP-based integer programming model for capacity planning...3 LEVENTE SZÁSZ An MRP-based ineger programming model for capaciy planning...3 MELINDA ANTAL Reurn o schooling in Hungary before and afer he ransiion years...23 LEHEL GYÖRFY ANNAMÁRIA BENYOVSZKI ÁGNES NAGY

More information

UNDERSTANDING THE DEATH BENEFIT SWITCH OPTION IN UNIVERSAL LIFE POLICIES. Nadine Gatzert

UNDERSTANDING THE DEATH BENEFIT SWITCH OPTION IN UNIVERSAL LIFE POLICIES. Nadine Gatzert UNDERSTANDING THE DEATH BENEFIT SWITCH OPTION IN UNIVERSAL LIFE POLICIES Nadine Gazer Conac (has changed since iniial submission): Chair for Insurance Managemen Universiy of Erlangen-Nuremberg Lange Gasse

More information

LEASING VERSUSBUYING

LEASING VERSUSBUYING LEASNG VERSUSBUYNG Conribued by James D. Blum and LeRoy D. Brooks Assisan Professors of Business Adminisraion Deparmen of Business Adminisraion Universiy of Delaware Newark, Delaware The auhors discuss

More information

Towards a Generic Trust Model Comparison of Various Trust Update Algorithms

Towards a Generic Trust Model Comparison of Various Trust Update Algorithms Towards a Generic Trus Model Comparison of Various Trus Updae Algorihms Michael Kinaeder, Erneso Baschny, and Kur Rohermel Insiue of Parallel and Disribued Sysems (IPVS) Universiä Sugar Universiässr. 38

More information

Random Walk in 1-D. 3 possible paths x vs n. -5 For our random walk, we assume the probabilities p,q do not depend on time (n) - stationary

Random Walk in 1-D. 3 possible paths x vs n. -5 For our random walk, we assume the probabilities p,q do not depend on time (n) - stationary Random Walk in -D Random walks appear in many cones: diffusion is a random walk process undersanding buffering, waiing imes, queuing more generally he heory of sochasic processes gambling choosing he bes

More information

We addressed the problem of developing a model to simulate at a high level of detail the movements of over

We addressed the problem of developing a model to simulate at a high level of detail the movements of over Published online ahead of prin Augus 15, 28 Aricles in Advance, pp. 1 2 issn 41-1655 eissn 1526-5447 informs doi 1.1287/rsc.18.238 28 INFORMS An Approximae Dynamic Programming Algorihm for Large-Scale

More information

Market Analysis and Models of Investment. Product Development and Whole Life Cycle Costing

Market Analysis and Models of Investment. Product Development and Whole Life Cycle Costing The Universiy of Liverpool School of Archiecure and Building Engineering WINDS PROJECT COURSE SYNTHESIS SECTION 3 UNIT 11 Marke Analysis and Models of Invesmen. Produc Developmen and Whole Life Cycle Cosing

More information

Present Value Methodology

Present Value Methodology Presen Value Mehodology Econ 422 Invesmen, Capial & Finance Universiy of Washingon Eric Zivo Las updaed: April 11, 2010 Presen Value Concep Wealh in Fisher Model: W = Y 0 + Y 1 /(1+r) The consumer/producer

More information

Small and Large Trades Around Earnings Announcements: Does Trading Behavior Explain Post-Earnings-Announcement Drift?

Small and Large Trades Around Earnings Announcements: Does Trading Behavior Explain Post-Earnings-Announcement Drift? Small and Large Trades Around Earnings Announcemens: Does Trading Behavior Explain Pos-Earnings-Announcemen Drif? Devin Shanhikumar * Firs Draf: Ocober, 2002 This Version: Augus 19, 2004 Absrac This paper

More information

4. International Parity Conditions

4. International Parity Conditions 4. Inernaional ariy ondiions 4.1 urchasing ower ariy he urchasing ower ariy ( heory is one of he early heories of exchange rae deerminaion. his heory is based on he concep ha he demand for a counry's currency

More information

Capacity Planning and Performance Benchmark Reference Guide v. 1.8

Capacity Planning and Performance Benchmark Reference Guide v. 1.8 Environmenal Sysems Research Insiue, Inc., 380 New York S., Redlands, CA 92373-8100 USA TEL 909-793-2853 FAX 909-307-3014 Capaciy Planning and Performance Benchmark Reference Guide v. 1.8 Prepared by:

More information

Improved Techniques for Grid Mapping with Rao-Blackwellized Particle Filters

Improved Techniques for Grid Mapping with Rao-Blackwellized Particle Filters 1 Improved Techniques for Grid Mapping wih Rao-Blackwellized Paricle Filers Giorgio Grisei Cyrill Sachniss Wolfram Burgard Universiy of Freiburg, Dep. of Compuer Science, Georges-Köhler-Allee 79, D-79110

More information

Chapter 6: Business Valuation (Income Approach)

Chapter 6: Business Valuation (Income Approach) Chaper 6: Business Valuaion (Income Approach) Cash flow deerminaion is one of he mos criical elemens o a business valuaion. Everyhing may be secondary. If cash flow is high, hen he value is high; if he

More information

Table of contents Chapter 1 Interest rates and factors Chapter 2 Level annuities Chapter 3 Varying annuities

Table of contents Chapter 1 Interest rates and factors Chapter 2 Level annuities Chapter 3 Varying annuities Table of conens Chaper 1 Ineres raes and facors 1 1.1 Ineres 2 1.2 Simple ineres 4 1.3 Compound ineres 6 1.4 Accumulaed value 10 1.5 Presen value 11 1.6 Rae of discoun 13 1.7 Consan force of ineres 17

More information

Analogue and Digital Signal Processing. First Term Third Year CS Engineering By Dr Mukhtiar Ali Unar

Analogue and Digital Signal Processing. First Term Third Year CS Engineering By Dr Mukhtiar Ali Unar Analogue and Digial Signal Processing Firs Term Third Year CS Engineering By Dr Mukhiar Ali Unar Recommended Books Haykin S. and Van Veen B.; Signals and Sysems, John Wiley& Sons Inc. ISBN: 0-7-380-7 Ifeachor

More information

DC-DC Boost Converter with Constant Output Voltage for Grid Connected Photovoltaic Application System

DC-DC Boost Converter with Constant Output Voltage for Grid Connected Photovoltaic Application System DC-DC Boos Converer wih Consan Oupu Volage for Grid Conneced Phoovolaic Applicaion Sysem Pui-Weng Chan, Syafrudin Masri Universii Sains Malaysia E-mail: [email protected], [email protected] Absrac

More information

9. Capacitor and Resistor Circuits

9. Capacitor and Resistor Circuits ElecronicsLab9.nb 1 9. Capacior and Resisor Circuis Inroducion hus far we have consider resisors in various combinaions wih a power supply or baery which provide a consan volage source or direc curren

More information

The Kinetics of the Stock Markets

The Kinetics of the Stock Markets Asia Pacific Managemen Review (00) 7(1), 1-4 The Kineics of he Sock Markes Hsinan Hsu * and Bin-Juin Lin ** (received July 001; revision received Ocober 001;acceped November 001) This paper applies he

More information

Optimal Stock Selling/Buying Strategy with reference to the Ultimate Average

Optimal Stock Selling/Buying Strategy with reference to the Ultimate Average Opimal Sock Selling/Buying Sraegy wih reference o he Ulimae Average Min Dai Dep of Mah, Naional Universiy of Singapore, Singapore Yifei Zhong Dep of Mah, Naional Universiy of Singapore, Singapore July

More information

Multi-camera scheduling for video production

Multi-camera scheduling for video production Muli-camera scheduling for video producion Fahad Daniyal and Andrea Cavallaro Queen Mary Universiy of London Mile End Road, E 4S London, Unied Kingdom Email: {fahad.daniyal, andrea.cavallaro}@eecs.qmul.ac.uk

More information

Trends in TCP/IP Retransmissions and Resets

Trends in TCP/IP Retransmissions and Resets Trends in TCP/IP Reransmissions and Reses Absrac Concordia Chen, Mrunal Mangrulkar, Naomi Ramos, and Mahaswea Sarkar {cychen, mkulkarn, msarkar,naramos}@cs.ucsd.edu As he Inerne grows larger, measuring

More information

Markov Chain Modeling of Policy Holder Behavior in Life Insurance and Pension

Markov Chain Modeling of Policy Holder Behavior in Life Insurance and Pension Markov Chain Modeling of Policy Holder Behavior in Life Insurance and Pension Lars Frederik Brand Henriksen 1, Jeppe Woemann Nielsen 2, Mogens Seffensen 1, and Chrisian Svensson 2 1 Deparmen of Mahemaical

More information

Hedging with Forwards and Futures

Hedging with Forwards and Futures Hedging wih orwards and uures Hedging in mos cases is sraighforward. You plan o buy 10,000 barrels of oil in six monhs and you wish o eliminae he price risk. If you ake he buy-side of a forward/fuures

More information

Why Did the Demand for Cash Decrease Recently in Korea?

Why Did the Demand for Cash Decrease Recently in Korea? Why Did he Demand for Cash Decrease Recenly in Korea? Byoung Hark Yoo Bank of Korea 26. 5 Absrac We explores why cash demand have decreased recenly in Korea. The raio of cash o consumpion fell o 4.7% in

More information

A DISCRETE-TIME MODEL OF TCP WITH ACTIVE QUEUE MANAGEMENT

A DISCRETE-TIME MODEL OF TCP WITH ACTIVE QUEUE MANAGEMENT A DISCRETE-TIME MODEL OF TCP WITH ACTIVE QUEUE MANAGEMENT by Hui Zhang B.Sc., Insiuion of Informaion and Communicaion, Dong Hua Universiy, P. R. China, 1 THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE

More information

Finance and Economics Discussion Series Divisions of Research & Statistics and Monetary Affairs Federal Reserve Board, Washington, D.C.

Finance and Economics Discussion Series Divisions of Research & Statistics and Monetary Affairs Federal Reserve Board, Washington, D.C. Finance and Economics Discussion Series Divisions of Research & Saisics and Moneary Affairs Federal Reserve Board, Washingon, D.C. The Effecs of Unemploymen Benefis on Unemploymen and Labor Force Paricipaion:

More information

Measuring macroeconomic volatility Applications to export revenue data, 1970-2005

Measuring macroeconomic volatility Applications to export revenue data, 1970-2005 FONDATION POUR LES ETUDES ET RERS LE DEVELOPPEMENT INTERNATIONAL Measuring macroeconomic volailiy Applicaions o expor revenue daa, 1970-005 by Joël Cariolle Policy brief no. 47 March 01 The FERDI is a

More information

INTRODUCTION TO FORECASTING

INTRODUCTION TO FORECASTING INTRODUCTION TO FORECASTING INTRODUCTION: Wha is a forecas? Why do managers need o forecas? A forecas is an esimae of uncerain fuure evens (lierally, o "cas forward" by exrapolaing from pas and curren

More information

Economic Analysis of 4G Network Upgrade

Economic Analysis of 4G Network Upgrade Economic Analysis of ework Upgrade Lingjie Duan, Jianwei Huang, and Jean Walrand Absrac As he successor o he sandard, provides much higher daa raes o address cellular users ever-increasing demands for

More information

Automatic measurement and detection of GSM interferences

Automatic measurement and detection of GSM interferences Auomaic measuremen and deecion of GSM inerferences Poor speech qualiy and dropped calls in GSM neworks may be caused by inerferences as a resul of high raffic load. The radio nework analyzers from Rohde

More information

ABSTRACT KEYWORDS. Term structure, duration, uncertain cash flow, variable rates of return JEL codes: C33, E43 1. INTRODUCTION

ABSTRACT KEYWORDS. Term structure, duration, uncertain cash flow, variable rates of return JEL codes: C33, E43 1. INTRODUCTION THE VALUATION AND HEDGING OF VARIABLE RATE SAVINGS ACCOUNTS BY FRANK DE JONG 1 AND JACCO WIELHOUWER ABSTRACT Variable rae savings accouns have wo main feaures. The ineres rae paid on he accoun is variable

More information

Vector Autoregressions (VARs): Operational Perspectives

Vector Autoregressions (VARs): Operational Perspectives Vecor Auoregressions (VARs): Operaional Perspecives Primary Source: Sock, James H., and Mark W. Wason, Vecor Auoregressions, Journal of Economic Perspecives, Vol. 15 No. 4 (Fall 2001), 101-115. Macroeconomericians

More information

The Application of Multi Shifts and Break Windows in Employees Scheduling

The Application of Multi Shifts and Break Windows in Employees Scheduling The Applicaion of Muli Shifs and Brea Windows in Employees Scheduling Evy Herowai Indusrial Engineering Deparmen, Universiy of Surabaya, Indonesia Absrac. One mehod for increasing company s performance

More information