THE ABRACADABRA PROBLEM



Similar documents
MARTINGALES AND A BASIC APPLICATION

Infinite Sequences and Series

Incremental calculation of weighted mean and variance

Asymptotic Growth of Functions

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method

A probabilistic proof of a binomial identity

Our aim is to show that under reasonable assumptions a given 2π-periodic function f can be represented as convergent series

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 13

Overview of some probability distributions.

SAMPLE QUESTIONS FOR FINAL EXAM. (1) (2) (3) (4) Find the following using the definition of the Riemann integral: (2x + 1)dx

Lecture 5: Span, linear independence, bases, and dimension

5 Boolean Decision Trees (February 11)

Week 3 Conditional probabilities, Bayes formula, WEEK 3 page 1 Expected value of a random variable

UC Berkeley Department of Electrical Engineering and Computer Science. EE 126: Probablity and Random Processes. Solutions 9 Spring 2006

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008

Cooley-Tukey. Tukey FFT Algorithms. FFT Algorithms. Cooley

Factors of sums of powers of binomial coefficients

A Recursive Formula for Moments of a Binomial Distribution

Taking DCOP to the Real World: Efficient Complete Solutions for Distributed Multi-Event Scheduling

Department of Computer Science, University of Otago

THIN SEQUENCES AND THE GRAM MATRIX PAMELA GORKIN, JOHN E. MCCARTHY, SANDRA POTT, AND BRETT D. WICK

Estimating Probability Distributions by Observing Betting Practices

Properties of MLE: consistency, asymptotic normality. Fisher information.

3 Basic Definitions of Probability Theory

FIBONACCI NUMBERS: AN APPLICATION OF LINEAR ALGEBRA. 1. Powers of a matrix

2. Degree Sequences. 2.1 Degree Sequences

THE HEIGHT OF q-binary SEARCH TREES

CS103A Handout 23 Winter 2002 February 22, 2002 Solving Recurrence Relations

I. Chi-squared Distributions

Chapter 7 Methods of Finding Estimators

Lecture 4: Cauchy sequences, Bolzano-Weierstrass, and the Squeeze theorem

CS103X: Discrete Structures Homework 4 Solutions

A sharp Trudinger-Moser type inequality for unbounded domains in R n

Annuities Under Random Rates of Interest II By Abraham Zaks. Technion I.I.T. Haifa ISRAEL and Haifa University Haifa ISRAEL.


Optimal Strategies from Random Walks

Universal coding for classes of sources

Math 113 HW #11 Solutions

PROCEEDINGS OF THE YEREVAN STATE UNIVERSITY AN ALTERNATIVE MODEL FOR BONUS-MALUS SYSTEM

A Constant-Factor Approximation Algorithm for the Link Building Problem

A PROBABILISTIC VIEW ON THE ECONOMICS OF GAMBLING

Lecture 4: Cheeger s Inequality

Soving Recurrence Relations

Sequences and Series

TO: Users of the ACTEX Review Seminar on DVD for SOA Exam MLC

Normal Distribution.

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean

1. MATHEMATICAL INDUCTION

Chapter 5: Inner Product Spaces

Hypothesis testing. Null and alternative hypotheses

WHEN IS THE (CO)SINE OF A RATIONAL ANGLE EQUAL TO A RATIONAL NUMBER?

The Stable Marriage Problem

Elementary Theory of Russian Roulette

Vladimir N. Burkov, Dmitri A. Novikov MODELS AND METHODS OF MULTIPROJECTS MANAGEMENT

Analysis Notes (only a draft, and the first one!)

Class Meeting # 16: The Fourier Transform on R n

Sequences and Series Using the TI-89 Calculator

Theorems About Power Series

Chapter 7 - Sampling Distributions. 1 Introduction. What is statistics? It consist of three major areas:

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES

Entropy of bi-capacities

Heavy Traffic Analysis of a Simple Closed Loop Supply Chain

Math C067 Sampling Distributions

Plug-in martingales for testing exchangeability on-line

1 Computing the Standard Deviation of Sample Means

NOTES ON PROBABILITY Greg Lawler Last Updated: March 21, 2016

Designing Incentives for Online Question and Answer Forums

MAXIMUM LIKELIHOODESTIMATION OF DISCRETELY SAMPLED DIFFUSIONS: A CLOSED-FORM APPROXIMATION APPROACH. By Yacine Aït-Sahalia 1

Exploratory Data Analysis

Notes on exponential generating functions and structures.

Section 11.3: The Integral Test

ON AN INTEGRAL OPERATOR WHICH PRESERVE THE UNIVALENCE

Approximating Area under a curve with rectangles. To find the area under a curve we approximate the area using rectangles and then use limits to find

Solving Logarithms and Exponential Equations

Ekkehart Schlicht: Economic Surplus and Derived Demand

Convexity, Inequalities, and Norms

Solutions to Exercises Chapter 4: Recurrence relations and generating functions

Ramsey-type theorems with forbidden subgraphs

INFINITE SERIES KEITH CONRAD

arxiv: v1 [math.st] 21 Aug 2009

Swaps: Constant maturity swaps (CMS) and constant maturity. Treasury (CMT) swaps

A Mathematical Perspective on Gambling

Integer Factorization Algorithms

Maximum Likelihood Estimators.

ON THE DENSE TRAJECTORY OF LASOTA EQUATION

Subject CT5 Contingencies Core Technical Syllabus

1. C. The formula for the confidence interval for a population mean is: x t, which was

THE ARITHMETIC OF INTEGERS. - multiplication, exponentiation, division, addition, and subtraction

Repeating Decimals are decimal numbers that have number(s) after the decimal point that repeat in a pattern.

Transcription:

THE ABRACADABRA PROBLEM FRANCESCO CARAVENNA Abstract. We preset a detailed solutio of Exercise E0.6 i [Wil9]: i a radom sequece of letters, draw idepedetly ad uiformly from the Eglish alphabet, the expected time for the first appearace of the word ABRACADABRA is 26 + 26 4 + 26. We adopt the covetios N := {, 2, 3,...} ad N 0 := N {0}.. Formulatio of the problem Let U i ) i N deote radom letters draw idepedetly ad uiformly from the Eglish alphabet. More precisely, we assume that U i ) i N are idepedet ad idetically distributed radom variables, uiformly distributed i the set E := {A, B, C, D,..., X, Y, Z}, defied o some probability space Ω, A, P). For m, N with m, we use U [m,] as a shortcut for the vector U m, U m+,..., U ). Defie τ as the radom time i which the word ABRACADABRA first appears: τ := mi{ N, : U [ 0,] = ABRACADABRA},.) with the covetio mi := +. Our goal is to prove the followig result. Theorem. E[τ] = 26 + 26 4 + 26. 2. Strategy The proof is based o martigales. Let F ) N0 be the atural filtratio of U i ) i N, i.e., F 0 := {, Ω} ad F := σu,..., U ). We are goig to prove the followig results. Propositio 2. τ is a stoppig time with E[τ] <. Propositio 3. There exists a martigale M = M ) N0 ) M 0 = 0 ad M τ = 26 + 26 4 + 26 τ; such that: 2) M has bouded icremets: C 0, ) such that M + M C, for all N 0. Let us recall a special case of) Doob s optioal samplig theorem, cf. [Wil9, 0.0]. Theorem 4. If M = M ) N0 is a martigale with bouded icremets ad τ is a stoppig time with fiite mea, the E[M τ ] = E[M 0 ]. Combiig this with Propositios 2 ad 3, oe obtais immediately the proof of Theorem. Date: April 0, 205.

2 FRANCESCO CARAVENNA 3. Proof of Propositio 2 Recall that τ is a stoppig time if ad oly if {τ } F for every N 0. Note that {τ } = if 0, while for {τ } = {U [i 0,i] = ABRACADABRA}, i= which shows that the evet {τ } is i F it is expressed as a fuctio of U, U 2,..., U ). To prove that E[τ] <, we argue as i [Wil9, 0.; Exercise E0.5]. Lemma 5. For a positive radom variable τ, i order to have E[τ] < it is sufficiet that N N, ε > 0 : Pτ + N τ > ) ε N 0. 3.) This result is proved i Appedix A below. I order to apply it, let A be the evet that the word ABRACADABRA appears ot ecessarily for the first time!) at time + : A := {U [+,+] = ABRACADABRA}. By assumptio U i are idepedet ad uiformly chose letters, hece PA ) = p > 0, where p := E = 26. Sice A {τ + }, we have Pτ + τ > ) PA τ > ) for every N. However the evets A ad {τ > } are idepedet A is a fuctio of U [+,+], while {τ > } = {τ } c F is a fuctio of U [,] ), hece PA τ > ) = PA ). Thus Pτ + τ > ) PA ) = p, i.e. relatio 3.) holds with N = ad ε = p. It follows by Lemma 5 that E[τ] <. 4. Proof of Propositio 3 The required martigale M = M ) N0 will be costructed as the total et gai of a suitable family of gamblers, built as follows. At time 0 a first gambler eters the game, with a iitial capital of e. She bets o the evet that the first letter U is A the first letter of the word ABRACADABRA ). If she loses, her capital at time drops to 0e ad she stops playig i.e. her capital will stay 0e at all later times). O the other had, if she wis, her capital at time becomes 26e ad she goes o playig, bettig o the evet that the secod letter U 2 is B the secod letter of ABRACADABRA ). If she loses, her capital at time 2 is 0e ad she stops playig, while if she wis, her capital at time 2 equals 26) 2 e ad she goes o, bettig o the evet that the third letter U 3 is R the third letter of ABRACADABRA ), ad so o, util time. The gambler s capital at time is either 26) e, if the letters U [,] have formed exactly the word ABRACADABRA, or 0e otherwise. I ay case, the gambler stops playig after time, hece her capital will stay costat at all later times. Let us deote by x i be the i-th letter of the word ABRACADABRA, for i so that x = A, x 2 = B, x 3 = R,..., x = A). The capital i e) of this first gambler at time is the give by the process K ) N0 defied as follows: if = 0 K := K 26 {U=x} if. K if 2

THE ABRACADABRA PROBLEM 3 Note that if K = 0, the K = 0 irrespectively of U, as described above.) Now a secod gambler arrives, playig the same game, but with oe time uit of delay. Her iitial capital stays e at time 0 ad at time, the she bets o the evet that U 2 = x = A: if she loses, her capital at time 2 is 0e ad she stops playig, while if she wis, her capital at time 2 is 26e ad she goes o playig, bettig o the evet that U 3 = x 2 = B, etc. At time 2, the secod gambler s capital will be either 26) e or 0e, accordig to whether the letters U [2,2] have formed precisely the word ABRACADABRA or ot. At this poit she stops playig ad her capital stays costat at all later times. Geeralizig the picture, imagie that for each j N there is a j-th gambler with a iitial capital of e, who starts playig just before time j, bettig o the evet that U j = x, the if she wis) o U j+ = x 2,..., ad fially if she has wo all the previous bets) o U j+0 = x. After time j + 0 the gambler stops playig ad her capital stays costat. Deotig by K j) the capital i e) of the j-th gambler at time, for N 0, we have if < j K j) := 26 {U =x j)+ } if j j + 0. 4.) j+0 if > j + 0 We ca fially defie the process we are lookig for, that will be show to be a martigale: ) M 0 := 0, M := K j) 0 ) = K j). 4.2) Thus M is the sum of the et gais equality i 4.2), recall that 0 = for all j N. 0 of the first gamblers at time. For the Lemma 6. For τ defied as i.), oe has M τ = 26) + 26) 4 + 26 τ. Proof. We eed to evaluate M τ = τ Recall that K τ j) is the capital at time τ of the gambler who starts bettig just before time j. It suffices to show that K τ j) = 0 except for j {τ 0, τ 3, τ}, for which τ ) τ. K τ 0) τ = 26), K τ 3) τ = 26) 4, K τ) τ = 26. Sice the complete word ABRACADABRA appears at time τ, the gambler who started playig just before time τ 0 has a capital of 26), i.e. K τ τ 0) = 26). The gambler who started playig just before time τ 3 has a capital K τ τ 3) = 26) 4, because the last four letters of ABRACADABRA are ABRA ad coicide with the first four letters of that word. Aalogously, sice the last letter A is the same as the first letter, the gambler who started playig just before time τ has wo his first bet ad his capital is K τ τ) = 26. Fially, for all j {τ 0, τ 3, τ} all gamblers have lost at least oe bet ad their capital is K τ j) = 0, because τ is the first time the word ABRACADABRA appears. We could have equivaletly summed the et gais of all gamblers, defiig M := Kj) because = 0 for j >. 0 ),

4 FRANCESCO CARAVENNA To complete the proof of Propositio 3, it remais to show that M = M ) N0 is a martigale with bouded icremets. We start lookig at the capital processes. Lemma 7. For every fixed j N, the capital process ) N0 is a martigale. Proof. We argue for fixed j N. Plaily, 0 = is F 0 -measurable. By 4.), K j) is a measurable fuctio of ad U, assumig iductively that is F -measurable, it follows that Sice is F -measurable. This shows that K j) ) N0 is a adapted process. by 4.), it follows iductively that Kj) 26 for all N, 26 are bouded ad, i particular, itegrable). Fially, the relatio E[K j) F ] = is trivially satisfied if < j or if > j + 0, while for {j,..., j + 0}, agai by 4.), hece the radom variables E[K j) F ] = E[ 26 {U =x j)+ } F ] = 26 PU = x j)+ ) =, because U is idepedet of F ad PU = a) = 26 for every a E. Lemma 8. The capital processes ) N0 have uiformly bouded icremets: 25, j, N. 4.3) Proof. Oe has K j) = 0 if < j or > j+0, by 4.), while for {j,..., j+0} Sice j =, relatio 4.3) follows. 26 {U =x j)+ } 25 Kj). We ca fially show that M is a martigale. Note that M is F -measurable ad i L, for every N, because by 4.2) M is a fiite sum of K j), each of which is F -measurable ad i L by Lemma 7. Furthermore, agai by 4.2), for all N we ca write E[M F ] = E[K j) F ] =. However for j = we have = K) = by defiitio, cf. 4.), hece E[M F ] = + = ) = M. This shows that M is a martigale. Fially, for all N M M = ) ) = K j) ), agai because for j = we have = K) =. Now observe that, agai by 4.), for j + oe has K j) = = Kj) j+0, hece M M = j= 0 K j) ) j= 0 25, havig applied 4.3). This shows that M has bouded icremets, completig the proof.

THE ABRACADABRA PROBLEM 5 The assumptios imply that Appedix A. Proof of Lemma 5 as we show below. We are goig to use the formula Pτ > ln) ε) l l N 0, A.) E[τ] = 0 Pτ > t) dt, A.2) valid for every radom variable τ takig values i [0, ]. Breakig up the itegral i the sub-itervals [ln, l + )N], with l N 0, sice Pτ > t) Pτ > ln) for t ln, we get E[τ] = l+)n Pτ > t) dt Pτ > ln) l N 0 l N 0 = ln N ε) = N ε <, l+)n ln dt l N 0 ε) l N havig applied the geometric series N 0 q = q. This shows that E[τ] <, as required. It remais to prove A.), which we do by iductio. For l = 0 there is othig to prove. For every l N 0, sice {τ > l + )N} {τ > ln}, we ca write Pτ > l + )N) = Pτ > l + )N, τ > ln) = Pτ > ln) Pτ > l + )N τ > ln). The iductio step yields Pτ > ln) ε)l, while assumptio 3.) gives Pτ > + N τ > ) ε), N. A.3) Choosig = ln yields Pτ > l + )N τ > ln) ε), which plugged ito A.3) yields Pτ > l + )N) ε) l+, as required. Refereces [Wil9] D. Williams 99), Probability with martigales, Cambridge Uiversity Press Dipartimeto di Matematica e Applicazioi, Uiversità degli Studi di Milao-Bicocca, via Cozzi 55, 2025 Milao, Italy E-mail address: fracesco.caravea@uimib.it For every T [0, ] oe has T = T dt = 0 0 {T t} dt, hece τω) = 0 {τω)>t} dt for every radom variable τ takig values i [0, ]. Takig expectatios of both sides ad exchagig the expectatio with the itegral which is justified by Fubii-Toelli, thaks to positivity) oe obtais A.2).