# SMALE S 17TH PROBLEM: AVERAGE POLYNOMIAL TIME TO COMPUTE AFFINE AND PROJECTIVE SOLUTIONS.

Save this PDF as:

Size: px
Start display at page:

Download "SMALE S 17TH PROBLEM: AVERAGE POLYNOMIAL TIME TO COMPUTE AFFINE AND PROJECTIVE SOLUTIONS."

## Transcription

1 SMALE S 17TH PROBLEM: AVERAGE POLYNOMIAL TIME TO COMPUTE AFFINE AND PROJECTIVE SOLUTIONS. CARLOS BELTRÁN AND LUIS MIGUEL PARDO Abstract. Smale s 17th Problem asks: Can a zero of n complex polynomial equations in n unknowns be found approximately, on the average, in polynomial time with a uniform algorithm?. We give a positive answer to this question. Namely, we describe a uniform probabilistic algorithm that computes an approximate zero of systems of polynomial equations f : C n C n, performing a number of arithmetic operations which is polynomial in the size of the input, on the average. 1. Introduction In the series of papers [SS93a, SS93b, SS93c, SS94, SS96], Shub and Smale defined and studied in depth a homotopy method for solving systems of polynomial equations. Some articles preceding this new treatment were [Kan49, Sma86, Ren87, Kim89, Shu93]. Other authors have also treated this approach in [Mal94, Yak95, Ded97, BCSS98, Ded01, Ded06, MR0] and more recently in [Shu08, BS08]. In a previous paper [BP08], the authors furthered the program initiated in the series [SS93a] to [SS94], describing an algorithm that computes projective approximate zeros of systems of polynomial equations in polynomial running time, with bounded (small) probability of failure. In this paper, we give an updated version of some of the concepts introduced in [BP08], and we develop two important extensions of the results therein. On one hand, we describe a procedure that, instead of assuming a small probability of error, finds an approximate zero of systems of polynomial equations, on the average, in polynomial time (cf. Theorem 1.10). This improvement is made in order to answer explicitly the question posted by Smale in his 17th problem. On the other hand, we extend the result to the computation of affine solutions of systems of polynomial equations. This new result requires us to understand the probability distribution of the norm of the affine solutions of polynomial systems of equations (Theorem 1.9 below). These results may be summarized as follows. Theorem 1.1 (Main). There exists a uniform probabilistic algorithm that computes an approximate zero - both projective and affine - of systems of polynomial equations (with probability of success 1). The average number of arithmetic operations of this algorithm is polynomial in the size of the input. 000 Mathematics Subject Classification. Primary 65H0, 14Q0. Secondary 1Y05, 90C30. Key words and phrases. Approximate zero, Homotopy methods, Complexity of methods in Algebraic Geometry. Research was partially supported by Fecyt, Spanish Ministry of Science, MTM and a NSERC grant. 1

2 CARLOS BELTRÁN AND LUIS MIGUEL PARDO More specifically, the kind of algorithm that we obtain belongs to the class Average ZPP (for Zero error probability, Probabilistic, Average Polynomial Time), or equivalently Average Las Vegas. At the end of this Introduction we have added an Appendix where we clarify the terminology. In this paper, c denotes a positive constant Background. Let n N be a positive integer. Let (d) = (d 1,...,d n ) N n be a list of positive degrees, and let H (d) be the space of all systems f = [f 1,...,f n ] : C n C n of n polynomial equations and n unknowns, of respective degrees bounded by d 1,...,d n. Observe that H (d) is also equivalent to the space of all systems of homogeneous polynomial equations f : C n+1 C n of degrees d i. Indeed, for each system in the unknowns X 1,...,X n, we can consider its homogeneous counterpart, adding a new unknown X 0 to homogenize all the monomials of each equation to the same degree d i. The set of projective zeros of f = [f 1,...,f n ] H (d) is V IP (f) = {x IP n (C) : f i (x) = 0,1 i n} IP n (C), where the f i are seen as polynomial homogeneous mappings f i : C n+1 C. Observe that this set is naturally contained in the complex n-dimensional projective space IP n (C). Namely, a point x C n+1 is a zero of f i, 1 i n, if and only if λx C n+1 is a zero of f i, where 0 λ C is any complex number. On the other hand, we may consider the set of affine solutions of an element f H (d), V C (f) = {x C n : f i (x) = 0,1 i n} C n, where the f i are seen as polynomial mappings f i : C n C. Let ϕ 0 be the standard embedding of C n into IP n (C), (1.1) ϕ 0 : C n IP n (C) \ {x 0 = 0}. (x 1,...,x n ) (1 : x 1 :... : x n ) Then, we may write V C (f) = ϕ 1 0 (V IP(f)). Namely, x V C (f) = ϕ 0 (x) V IP (f), z = (z 0 : : z n ) V IP (f),z 0 0 = ϕ 1 0 (z) V C(f). We denote by d = max{d i : 1 i n} the maximum of the degrees, and by D = d 1 d n the Bézout number associated with the list (d). From now on, we assume that d. We denote by N +1 the complex dimension of H (d) as a vector space. We may consider N as the size of the input of our target systems. We summarize here the notation. n Number of equations (d) = (d 1,...,d n ) List of degrees d max{d 1,...,d n } H (d) Space of systems of equations associated with (d) N + 1 Complex dimension of H (d) D Bezóut number=d 1 d n As in [SS93a], we consider H (d) equipped with the Bombieri-Weyl Hermitian product, and the associated Hermitian structure (see Section for a precise

3 SMALE S 17TH PROBLEM 3 definition). We denote by S (d) the sphere in H (d) for this Hermitian product, with the inherited Riemannian structure. We will also consider the incidence variety W, W = {(f,ζ) H (d) IP n (C) : f 0, ζ V IP (f)}. As pointed out in [BCSS98], W is a differentiable manifold of complex dimension N + 1. The projective Newton operator was initially described in [Shu93], and studied in depth in the series of papers by Shub and Smale. Let f H (d) be seen as a system of homogeneous polynomial equations and let z C n+1. Let df(z) be the tangent mapping of f at z, considering f as a map from C n+1 to C n, and let z be the Hermitian complement of z in C n+1. Assume that df(z) z is a bijective mapping. Then, we define N f (z) = z ( df(z) z ) 1 f(z) IPn (C). (k) We denote by N f (z) = N f k N f (z) the projective point obtained after k (k) applications of Nf, starting at z. Assume that z IP n (C) is such that N f (z) is defined for every k 0, and that there exists a point ζ V IP (f) such that ( d N(k) T f (z),ζ ) 1 k 1 d T(z,ζ), k 0, where d T is the tangent of the Riemannian distance in IP n (C). Then, we say that z is a projective approximate zero of f, with associated (true) zero ζ. Note that using z as a starting point for projective Newton s operator, we may quickly obtain an approximation as close as wanted to ζ. Hence, a major objective is the efficient computation of projective approximate zeros. In [SS93a], Shub & Smale studied the behavior of the projective Newton operator in terms of a normalized condition number of polynomial systems. For f H (d) and z IP n (C), they defined a quantity µ norm (f,z) (see equation (3.1) for a precise definition) that satisfies the following property. Proposition 1. (Shub & Smale). Let (f,ζ) W, and let z IP n (C) be such that d T (z,ζ) 3 7 d 3/ µ norm (f,ζ). Then, z is a projective approximate zero of f with associated zero ζ. Now we briefly describe the Homotopic Deformation procedure (hd for short), a procedure that attempts to find projective approximate zeros of systems developed by Shub & Smale (see [SS93a, SS96] and mainly [SS94]). Let f S (d) be a target system, and let g S (d) be another system that has a known solution ζ 0 IP n (C). Consider the segment {f t = tf + (1 t)g,t [0, 1]} joining g and f. Under some regularity hypothesis (see Proposition 3.1), the Implicit Function Theorem defines a differentiable curve {(f t,ζ t ),t [0,1]} H (d) IP n (C), such that f t (ζ t ) = 0, t [0,1]. This curve will be denoted by Γ(f,g,ζ 0 ). The hd is a procedure that constructs a polygonal path that closely follows Γ(f,g,ζ 0 ). This path has initial vertex (g,ζ 0 ) and final vertex (f,z 1 ), for some z 1 IP n (C). The output of the algorithm is z 1 IP n (C). The polygonal path is constructed by homotopy steps (path following methods) each of which is

4 4 CARLOS BELTRÁN AND LUIS MIGUEL PARDO an application of the projective Newton operator, with an appropriate step size selection. In [SS94], Shub & Smale proved the existence of optimal initial pairs that guarantee a good performance of this algorithm, but their existential proof does not give any hint on how to compute these pairs (a full discussion of these aspects with historical comments may be read in detail in [BP06]). The lack of hints on how to find these initial pairs leads both to Shub & Smale s Conjecture (as in [SS94]) and to the problem mentioned in the title of this paper, Smale s 17th Problem: Can a zero of n complex polynomial equations in n unknowns be found approximately, on the average, in polynomial time with a uniform algorithm? Here, the term uniform emphasizes the fact that the algorithm demanded by Smale must be described explicitly (see the discussion in the Appendix to the Introduction), and average is used in the sense of the Bombieri-Weyl product. That is, in our notation, average on S (d). Smale also wrote: Certainly finding zeros of polynomials and polynomial systems is one of the oldest and most central problems of mathematics. Our problem asks if, under some conditions specified in the problem, it can be solved systematically by computers. 1.. Statement of the main outcomes. The formal statement of our main results requires the introduction of some precise terminology. We also recall some concepts and results from our previous paper [BP08] Projective solutions. An important quantity helps to control the number of homotopy steps used in hd algorithms, and, hence the complexity of the algorithm. For a pair f,g S (d) and a solution ζ V (g), we define µ norm (f,g,ζ) = sup [µ norm (h,z)]. (h,z) Γ(f,g,ζ) Definition 1.3. We say that (g,ζ) is an efficient initial pair if where β = 1 log D. E f S(d) [µ norm (f,g,ζ) β ] 10 5 n 5 N d 3/ log D. Note that the dependence on ε of this concept (appearing in our previous paper [BP08]) has disappeared. The following result follows from the arguments of Shub and Smale in [SS94] (see Section 3.3 for a proof). Theorem 1.4. Assume that (g,ζ 0 ) is an efficient initial pair. Then, the average number of projective Newton steps performed by Shub & Smale s Homotopy with initial pair (g,ζ 0 ) is less than or equal to cn 5 N d 3 log D. Hence, the average number of arithmetic operations is less than or equal to cn 6 N 3 d 3 log d log D.

5 SMALE S 17TH PROBLEM 5 Thus, the knowledge of an efficient initial pair yields an algorithm based on hd that finds a projective approximate zero of systems of polynomial equations in polynomial time, on the average. However, up to this moment such an initial pair is not explicitly known. In [BP08], the authors described a simple probabilistic method to find these pairs. To this end, the authors defined and analyzed the behavior of the quantity (1.) A ε (g,ζ) = Prob f S(d) [µ norm (f,g,ζ) > ε 1 ]. where (g,ζ) W, and ε > 0 is some positive real number. Definition 1.5. Let G W be a subset with a probability measure. We say that G is a strong questor set for initial pairs if for every ε > 0, where E means expectation. E G [A ε ] 10 4 n 5 N d 3/ ε, The importance of Definition 1.5 relies on the following theorem, that will be obtained as a consequence of a lemma from Probability Theory (see subsection 3.4.1). Theorem 1.6. Let G H (d) IP n (C) be a strong questor set for initial pairs. Then, where β = 1 log D. E ((g,ζ),f) G S(d) [µ norm (f,g,ζ) β ] 10 4 n 5 N d 3/ log D, Fubini s Theorem and Markov s Inequality immediately yield the following corollary. Corollary 1.7 (Existence of good initial pairs). Let σ (0,1). Then, there exists a measurable set C σ G such that (1.3) Prob (g,ζ) G [(g,ζ) C σ ] 1 σ, and such that for every (g,ζ) C σ, E f S(d) [µ norm (f,g,ζ) β ] 104 n 5 N d 3/ log D. σ In particular, with probability of at least 4/5, a randomly chosen pair (g,ζ) G (d) is an efficient initial pair. This result means the following: if we are able to find a computationally tractable strong questor set, then we can probabilistically find an efficient initial pair (g,ζ). Hence, we obtain an algorithm that finds a projective zero of systems, on the average, in polynomial running time. A similar idea (i.e. the usage of a set that contains good initial points for iterative algorithms) has been recently developed in [HSS01], for the case of univariate polynomial solving. In [BP08], the authors explicitly described a family of systems G (d) associated with the list of degrees (d), such that one can easily choose at random a point (g,ζ) G (d). This rather technical family will be described with detail in Subsection 3.5. The main result in [BP08] may be written as follows (see [BP08, Proposition 36]):

6 6 CARLOS BELTRÁN AND LUIS MIGUEL PARDO Theorem 1.8. The family G (d) is a strong questor set for initial pairs. Namely, for every ε > 0: E G(d) [A ε ] 10 4 n 5 N d 3/ ε. Hence, we have an explicit and efficient description of a strong questor set, for each list of degrees (d). This theorem simply means that we can easily compute an efficient initial pair, using a probabilistic method. Hence, from Corollary 1.7, we can compute a projective approximate zero of a system of equations, on the average, in polynomial running time. This is the projective version of our Main Theorem (Theorem 1.1) Affine solutions. Usually, we are interested in the search of affine (not projective) approximate zeros of systems. We recall some properties of the affine Newton operator. Let f H (d), f : C n C n, and let z C n be such that the differential matrix df(z) M n (C) is of maximal rank. The affine Newton operator of f at z is: N f (z) = z (df(z)) 1 f(z) C n. We denote by N (k) f (z) = N f k N f (z) the affine point obtained after k applications of N f to z, if it exists. Assume there exists a true zero ζ C n of f such that for every positive integer k N, the vector N (k) f (z) is defined and N (k) f (z) ζ 1 k 1 z ζ. Then, we say that z is an affine approximate zero of f, with associated zero ζ. As we have seen, the affine solutions of f are related to some of its projective solutions via the standard embedding ϕ 0 of equation (1.1). This suggests that, in order to find an affine approximate zero of f, we may first find a projective approximate zero z IP n (C) of f, with associated zero ζ IP n (C), and then we may consider the affine point ϕ 1 0 (z) Cn, if it exists. An initial drawback is that ϕ 1 0 (z) may not be an affine approximate zero associated with ϕ 1 0 (ζ). In Proposition 4.5 below we will prove that that this process can be done, if we are able to state an upper bound for the quantity 0 (ζ). Hence, the probability distribution of the norm of the affine solutions of systems in H (d) turns out to be an essential ingredient for the analysis of the complexity of finding affine approximate zeros. We prove the following result (Corollary 4.9 of Section 4). Theorem 1.9. Let δ > 0. Then, the probability that a randomly chosen system f S (d) has an affine solution ζ C n with ζ > δ is at most D πn. δ Now we describe an algorithm that finds affine approximate zeros of systems. For every (g,ζ) H (d) IP n (C), we define the following procedure (which will be called Affine Homotopic Deformation, ahd for short). Algorithm: ahd Input: f S (d).

7 SMALE S 17TH PROBLEM 7 PART ONE Apply Shub & Smale s hd to the segment [g,f], starting at the initial pair (g,ζ). Let z IP n (C) be the output of this procedure. PART TWO For i from 1 to do Let z i (i) = N f (z) IP n(c), where N f is the projective Newton operator associated with f. Check (using Smale s α-theorem [Ded06]) if the affine point ϕ 1 0 (zi ) C n is an affine approximate zero of f. In this case, halt. Output: an affine approximate zero of f. The proof of the following result relies on Theorem 1.9. Theorem Let (g,ζ) W be an efficient initial pair. Then, the average number of arithmetic operations performed by ahd is less than or equal to cn 6 N 3 d 3 log d(log D). From Theorem 1.10, the knowledge of an efficient initial pair (g,ζ) yields an algorithm that finds an affine approximate zero of systems of polynomial equations, on the average, in polynomial running time. Finally, Corollary 1.7 and Theorem 1.8 above allow us to find (g,ζ), using a simple probabilistic procedure. This is the affine version of our Main Theorem (Theorem 1.1). This paper is structured as follows. In Section we describe in some detail the notations, concepts and previous results related to our main results. In Section 3 we recall the main results which lead to the resolution of the projective case and we describe in detail the construction of the strong questor set G (d). We will also prove Theorem 1.6. Section 4 is devoted to proving theorems 1.9 and Appendix to the Introduction. At this point we must clarify what the terms uniform algorithm and Average ZPP mean. For a precise discussion we direct the reader to [BCSS98] and references therein. Non uniform algorithms have been known in theoretical computer science literature for a long time. A classical definition of a non uniform complexity class in the boolean setting is Nick Pippenger s NC classes. In the continuous setting, we have the notions of NC R classes in [CMP95, MP98] or [BCSS98, Chpt. 18] and the references therein. Roughly speaking, a non-uniform algorithm is a sequence of circuits C N such that for every N, the circuit solves instances of length N. Non uniform upper complexity bounds are sometimes useful (and give hints) to achieve lower complexity bounds and uniform efficient algorithms. However, they stay at a theoretical level, since non uniform algorithms are not implementable. Non uniform lower complexity bounds are also essential to understand the troubles with complexity in many limit problems. On the other hand, there are several definitions of what a uniform algorithm is. In the boolean setting, the Turing machine model of complexity seems to be the natural one. As S. Cook observed in [Coo85], NC classes also accept different forms of uniformity. In the continuous setting the work by Blum, Shub and Smale [BSS89] established a context that defines what a uniform algorithm should

8 8 CARLOS BELTRÁN AND LUIS MIGUEL PARDO be. Roughly speaking, a uniform algorithm is an algorithm that has a finite machine that performs the required computations. In terms of circuit (non uniform) complexity classes, the machine contains a procedure that implicitly generates the sequence of circuits (cf. for example [CMP9]). Another simplified description is that a uniform algorithm is something that you may implement as a single program in any standard programming language, whereas non uniform algorithms require an implementation for each input length (and hence an infinite number of programs ). Uniform algorithms can be deterministic or non-deterministic. This distinction leads to the classical P NP question of Cook s Conjecture in the boolean setting and the two problems concerning Hilbert Nullstellensatz (complex) and 4 Feasible (real) in the continuous setting (cf. [Koi96] or [BCSS98] and the references therein). A particularly useful class of non-deterministic, uniform algorithms are probabilistic algorithms. A probabilistic algorithm is a uniform algorithm that starts its computations by randomly guessing an instance from an appropriate class of data associated with the given input length. When the probability that the guessed instance leads to an error is bounded (usually by a constant smaller than 1/), they are called bounded error probability algorithms. If that probability is equal to 0 they are called zero error probability algorithms. Note that the running time (number of arithmetic operations) of a probabilistic algorithm may depend on the initially-guessed instance. Thus, we must fix a way of measuring the time of the algorithm on a given input. There are two widely accepted ways of doing so. The time of the algorithm for a given input is defined as the worst-case time among all choices of initial guesses, or the average of the time among all choices of initial guesses. In this paper, we use the second of these two options. Namely, for every polynomial system f S (d), we have t(f) = E (g,ζ) G(d) [ Arithmetic ops. of hd with input f and guess (g,ζ)]. With this terminology, when the running time of a zero error probability algorithm is bounded by a polynomial in the input length, the problem is said to be in the class ZPP (Zero error probability, Probabilistic, Polynomial time). For bounded error probability algorithms (measuring the running time as the worst case instead of average among guesses), the term is BPP. Note that, in the Turing setting, ZPP BPP (cf. for example [DK00, Chapter 8]). As an example, the primality tests in [SS77, SS78] or [Mil76, Rab80] are BPP, but not ZPP. Usually the complexity classes as ZPP or BPP are used for decisional problems (problems with YES/NO answer). It is common to use the term Las Vegas for the non-decisional version of ZPP. The algorithm described in the introduction of this paper can be written as Input: a polynomial system f (normalized such that its norm equals to 1). Let (d) be the list of degrees of the polynomials in f. Guess at random (g,ζ) G (d).

9 SMALE S 17TH PROBLEM 9 Compute the polygonal path of the Homotopy Deformation in [g,f] (described in [SS94]). Output: a projective approximate zero of f (if desired, apply ahd above to obtain an affine approximate zero of f). Note that for some input systems, the algorithm may run forever, and no output will be obtained. Then, Theorem 1.6 (using Theorem 3. below) reads: the average running time of this algorithm is bounded by a polynomial in the size of the input. More specifically, there is a constant c such that for every multiindex (d), E f S(d) [t(f)] N c. Moreover, note that Theorem 1.6 also implies that for every input system f S (d) (outside of some zero measure set), the probability that a random choice of (g,ζ) G (d) leads to an approximate zero of f is exactly equal to 1. Thus, our algorithm differs from a usual ZPP (or Las Vegas) algorithm in that the polynomial running time is not the worst-case but average case, and zeromeasure sets can be omitted. In the language of Theoretical Computer Science, the running time t(n) is not defined as the maximum of the running times for inputs of length N, but as the average of these running times. Hence, we say that our algorithm is Average ZPP, or Average Las Vegas. It is not deterministic (as in probabilistic primality tests cited above) although it is certainly uniform. Thus, it gives an affirmative answer to Smale s 17th Problem. The result as stated asks the algorithm to be able to choose random real numbers with the normal distribution (which leads to random choice of points in spheres).. Notions and notations Recall that we have fixed a positive integer number n N that will be the number of equations in our target systems. For every positive integer number l N, we fix some ordering in the set E l = {α = (α 0,...,α n ) N n+1 : α α n = l}. Then, we define H l = α E l C. Observe that the vector space H l may be understood in two different ways: as the space of homogeneous polynomial mappings C n+1 C of degree l. In fact, for (a α ) α El H l, we may consider the mapping C n+1 C (x 0,...,x n ) α=(α 0,...,α n) E l a α x α0 0 xαn n. as the space of polynomial mappings C n C of degree at most l. In fact, for (a α ) α El H l, we may consider the mapping C n C (x 1,...,x n ) α=(α 0,...,α n) E l a α x α1 1 xαn n.

10 10 CARLOS BELTRÁN AND LUIS MIGUEL PARDO Now, let (d) = (d 1,...,d n ) N n be a list of positive integers. We consider the vector space of systems n H (d) = H di. i=1 As, H (d) may be understood as the set of homogeneous polynomial mappings C n+1 C n of respective degrees given by the list (d 1,...,d n ), or as the set of (not necessarily homogeneous) polynomial mappings C n C n of degrees bounded by the same list. We may denote by f,g,... the elements in H (d). We denote by N +1 the complex dimension of H (d). Namely, ( n + d1 N + 1 = dim(h (d) ) = dim(h d1 ) + + dim(h dn ) = n ) + + ( n + dn An element f = [f 1,...,f n ] H (d) is usually given as the list of coefficients of f 1,...,f n. Hence, N + 1 may be seen as the size of the input. As said in the introduction, we denote by d = max{d i : 1 i n} the maximum of the degrees, and by D = d 1 d n d n the Bézout number associated with the list (d). From now on, we assume that d. We consider H (d) equipped with the Bombieri-Weyl Hermitian product,. We denote by the diagonal matrix given by ( (di ) ) 1/ = Diag α, 1 i m α E di where ( d i α) is the multinomial coefficient. Namely, ( ) di d i! = α α 0! α n! N. Then, f,g = f, g, f,g H m (d), where, is the usual Hermitian product in H (d). The main property of the Bombieri-Weyl Hermitian product is its unitary invariance (see for example [BCSS98, Teor. 1, pag 18]): for every unitary matrix U U n+1, and for every pair of elements f,g H (d), we have (.1) f,g = f U,g U. For any element f H (d), we may consider the derivative of f as a linear map. This concept changes if we see f as a map with domain in C n+1 or C n. Hence, if we consider f : C n+1 C n as a homogeneous polynomial mapping, then we denote by df(x) the derivative of f at x, where x C n+1 is a point. On the other hand, if we consider f : C n C n as a (possibly not homogeneous) polynomial mapping, then we denote by df(x) the derivative of f at x, where x C n is a point. For every ζ IP n (C), let V ζ be the set of systems that have a projective zero at ζ. Namely, V ζ = {f H (d) : ζ V IP (f)} H (d). n ).

11 SMALE S 17TH PROBLEM 11 Let e 0 = (1 : 0 : : 0) IP n (C) be a point that we can fix as a north pole. We will use the following subspaces of H (d) : L e0 = {g = [g 1,...,g n ] H (d) : g i = X di 1 0 n a ij X j,1 i n} V e0, j=1 L e 0 = {g V e0 : g,f = 0, f L e0 } V e0. Namely, L e0 is the set of homogeneous systems that have a zero at e 0 and are linear in the variables X 1,...,X n. The set L e 0 may be seen as the set of homogeneous systems of order at e 0. For an element f H (d) and a point x C n+1, let T x f be the restriction of the tangent mapping df(x) to the Hermitian complement of x. Namely, We will also use the mapping where (d) 1/ is given by the formula T x f = ( df(x)) x. ψ e0 : L e0 M n (C), g (d) 1/ T e0 g (d) 1/ = Diag(d 1/ 1,...,d 1/ n ) M n (R). Observe that ψ e0 is a linear isometry (see for example [BCSS98, Lemma 17, page 35]). We consider IP n (C) equipped with the canonical Riemannian metric. The Riemannian distance between two points x,y IP n (C) will be denoted by d R (x,y). As in [BCSS98], we will frequently use the so-called tangent distance d T (x,y) defined as: x d T (x,y) = tan d R (x,y) = y x,y 1, where some affine representatives of x and y have been chosen. The tangent distance d T is not quite a distance, as it does not satisfy the triangle inequality. But if x and y are near (in terms of d R ), then d T (x,y) is a good approximation of d R (x,y)..1. Some Geometric Integration Theory. The Coarea Formula is a classic integral formula which generalizes Fubini s Theorem. The most general version we know is Federer s Coarea Formula (cf. [Fed69]), but for our purposes a smooth version as used in [BCSS98, SS93b] or [How] suffices. Definition.1. Let X and Y be Riemannian manifolds, and let F : X Y be a C 1 surjective map. Let k = dim(y ) be the real dimension of Y. For every point x X such that the differential DF(x) is surjective, let v1,...,v x k x be an orthonormal basis of Ker(DF(x)). Then, we define the Normal Jacobian of F at x, NJ x F, as the volume in the tangent space T F(x) Y of the parallelepiped spanned by DF(x)(v1),...,DF(x)(v x k x ). In the case that DF(x) is not surjective, we define NJ x F = 0. Theorem. (Coarea Formula). Let X, Y be two Riemannian manifolds of respective dimensions k 1 k. Let F : X Y be a C 1 surjective map, such that

12 1 CARLOS BELTRÁN AND LUIS MIGUEL PARDO df(x) is surjective for almost all x X. Let ψ : X R be an integrable mapping. Then, 1 (.) ψ dx = ψ(x) NJ x F d(f 1 (y)) dy, X y Y x F 1 (y) where NJ x F is the normal jacobian of F at x. Observe that the integral on the right-hand side of equation (.) may be interpreted as follows: from Sard s Theorem, for every y Y except for a zero measure set, y is a regular value of F. Then, F 1 (y) is a differentiable manifold of dimension k 1 k, and it inherits from X a structure of Riemannian manifold. Thus, it makes sense to integrate functions on F 1 (y). The following Proposition is easy to prove (see for example [BCSS98]). Proposition.3. Let X,Y be two Riemannian manifolds, and let F : X Y be a C 1 map. Let x 1,x X be two points. Assume that there exist isometries ϕ X : X X and ϕ Y : Y Y such that ϕ X (x 1 ) = x, and Then, F ϕ X = ϕ Y F. NJ x1 F = NJ x F. Moreover, if F is bijective and G = F 1 : Y X, then NJ x F = 1 NJ F(x) G, x X. 3. Smale s 17th Problem: projective solutions The projective Newton operator. The normalized condition number of [SS93a] is defined as follows. For f H (d) and z IP n (C), (3.1) µ norm (f,z) = f (T z f) 1 Diag( z di 1 d 1 i ), and µ norm (f,z) = + if T z f is not onto. Also, µ norm (f,z) = µ norm (λf,ηz) for every λ,η C \ {0} (i.e. µ norm depends only on the projective class of f and z). The following set plays a crucial role in the study of projective Newton operator: Σ = {(f,ζ) W : det(t z f) = 0}, where W is the incidence variety defined in the Introduction. Observe that Σ (usually called the Discriminant variety) consists of the set of pairs (f,ζ) W such that ζ is a singular solution of f. Let π 1 : W H (d) \ {0} be the projection onto the first coordinate. Then, the set of critical points of π 1 is exactly Σ (cf. [BCSS98], [SS93b]). 3.. NHD in the space of systems. Let f,g S (d) be two polynomial systems, f ±g, and let ζ IP n (C) be a solution of g. Let [g,f] be the segment joining g and f. Namely, [g,f] = {tf + (1 t)g,t [0,1]} H (d). We also use the notation (g, h) to represent the corresponding open interval. Namely, [g,f] = {tf + (1 t)g,t (0,1)} H (d). The following result is a consequence of the Implicit Function Theorem (cf. for example [BCSS98]).

13 SMALE S 17TH PROBLEM 13 Proposition 3.1. Let (f,g,ζ 0 ) H (d) W be such that f = g = 1, f ±g. Let Γ(f,g,ζ 0 ) be the connected component of π1 1 ([g,f]) W that contains the point (g,ζ 0 ). If Γ(f,g,ζ 0 ) Σ =, then it is a smooth curve. Moreover, for each h [g,f], there exists a unique solution ζ 0 V IP (h) of h such that (h,ζ 0) Γ(f,g,ζ 0 ). Proof. Let p = π 1 Γ(f,g,ζ0): Γ(f,g,ζ 0 ) [g,f] be the restriction of π 1 to Γ(f,g,ζ 0 ). Note that both Γ(f,g,ζ 0 ) and [g,f] are Haursdoff, compact and path-wise connected. Thus, the image of p is a closed interval inside [g,f], and the Implicit Function Theorem applied to π 1 implies that p is surjective. Now, p is obviously proper and we conclude that it is a covering map (see for example [Ho75]) and a global homeomorphism. The proposition immediately follows. In the hypothesis of Proposition 3.1, let ζ = (π 1 Γ(f,g,ζ0)) 1 (f) be the (unique) solution of f that belongs to Γ(f,g,ζ 0 ). The following result is one of the main outcomes of the series of papers due to Shub and Smale (cf. [SS94, Prop. 7.]). Theorem 3. (Shub & Smale). Let 0 ν 1 and (g,ζ 0 ) W. Let f S (d) \{±g}. Then, k c(µ norm (f,g,ζ 0 )) ν D ν d 3/ steps of projective Newton operator, starting from (g,ζ 0 ), are sufficient to produce an approximate zero z of f. Moreover, the (true) projective zero associated with z is ζ and (3.) d T (z,ζ)) 3 7 d 3/ µ norm (f,ζ) Proof of Theorem 1.4. Let (g,ζ 0 ) be an efficient initial pair, and let β =. From Definition 1.3, 1 log D E f S(d) [µ norm (f,g,ζ) β ] 10 5 n 5 N d 3/ log D. From Theorem 3., taking ν = β so that D ν is a universal constant, we conclude that the average number of steps of the hd with initial pair (g,ζ 0 ) is at most as wanted. cd 3/ E f S(d) [µ norm (f,g,ζ 0 )) β ] cn 5 N d 3 log D, 3.4. Further properties of strong questor sets. In this subsection we will prove Theorem 1.6. We will use a well-known fact from probability theory (for a proof see for example [BP07b, Lemma 43]). Lemma 3.3. Let ξ be a positive real valued random variable such that for every positive real number t > 0 Prob[ξ > t] < ct α, where P rob[ ] holds for Probability, and c > 0, α > 1 are some positive constants. Then, E[ξ] c 1 α α α 1. Lemma 3.4. Let X,Y be two probability spaces and let X Y be endowed with the product probability measure. Let ξ : X Y [0, ) be a positive real random variable and let T : X (0, ) [0,1] (x,ε) Prob y Y [ξ(x,y) > ε 1 ]

14 14 CARLOS BELTRÁN AND LUIS MIGUEL PARDO Assume that for every ε > 0 we have E x X [T(x,ε)] Kε, for some K 1 independent of ε. Then, for every 0 < β <, E X Y [ξ β ] K β Proof. Let t > 0. Fubini s Theorem yields Prob (x,y) X Y [ξ(x,y) β > t] = x X From Lemma 3.3, we conclude that x X T(x,t 1 β )dx Kt β. E (x,y) X Y [ξ(x,y) β ] K β Prob y Y [ξ(x,y) β > t]dx = β K β Proof of Theorem 1.6. Apply Lemma 3.4 with X = G, Y = S (d) and ξ((g,ζ),f) = µ norm (f,g,ζ) The family of good initial pairs. Now we recall the description of the strong questor set for initial pairs found in [BP08]. Let Y = [0,1] B 1 (L e 0 ) B 1 (M n (n+1) (C)) R C N+1, where B 1 (L e 0 ) is the closed ball of radius one in L e 0 for the canonical Hermitian metric and B 1 (M n (n+1) (C)) is the closed ball of radius one in the space of n (n + 1) complex matrices for the standard Frobenius norm. We assume that Y is endowed with the product of the respective Riemannian structures and the corresponding measures and probabilities. Let τ R be the real number given by n + n τ = N, and let us fix any (a.e. continuous) mapping φ : M n (n+1) (C) U n+1 such that for every matrix M M n (n+1) (C) of maximal rank, φ associates a unitary matrix φ(m) U n+1 satisfying Mφ(M)e 0 = 0. In other words, φ(m) transforms e 0 into a vector in the kernel Ker(M) of M. Our statements below are independent of the chosen mapping φ that satisfies this property. There are several different ways to define this mapping φ, using simple procedures from Linear Algebra. Observe that the first column of the matrix Mφ(M) is zero. Namely, Mφ(M) = (0 C), where C M n (C) is a square matrix. As no confusion is possible, we also denote by Mφ(M) the matrix C. Then, we consider the associated system where ψ e0 is as defined in Section. ψ 1 e 0 (Mφ(M)) L e0 H (d),

15 SMALE S 17TH PROBLEM 15 Then, we define a mapping G (d) : Y V e0 as follows. For every (t,h,m) Y, let ( ) G (d) (t,h,m) = 1 τ 1 1/ t n +n 1 ( ) h 1 + τt n +n ψ 1 Mφ(M) e h 0 V e0. M F Finally, we define the set G (d) W as G (d) = {(g,e 0 ) W : y Y,G (d) (y) = g}. Namely, G (d) is the set of pairs (g,ζ) H (d) IP n (C) such that ζ = e 0 and g is in the image of Y under G (d). The probability distribution in G (d) is the one inherited from Y by G (d). Namely, in order to choose a random point in G (d) we choose a random point y Y and we consider the pair (G (d) (y),e 0 ) G (d). As said in the introduction, the main result in [BP08] is Theorem 1.8 and in that paper the authors use it to describe an algorithm that computes a projective approximate zero of systems f H (d) in polynomial time, assuming a small probability of failure. 4. Smale s 17th Problem: affine solutions. Now we turn to the case of affine solutions of systems of polynomial equations. Our space of inputs is again H (d) (or the sphere S (d) ), and we look for affine approximate zeros (points in C n ) of our systems From projective to affine approximate zeros. In this section, we will show how to obtain an affine approximate zero of f from the information contained in a projective approximate zero of f. We start with the following result, which is a consequence of the γ and µ theories of Shub & Smale (cf. for example [Sma86, SS93a, Ded06]). Proposition 4.1. With the notations above, let ζ = (ζ 0 : : ζ n ) IP n (C) be a projective zero of f, such that ζ 0 0, and let z = (z 0 : : z n ) IP n (C) be a projective point such that z 0 0. Assume that 0 (z) ϕ 1 0 (ζ) 3 7 d 3/ µ norm (f,ζ). Then, ϕ 1 0 (z) is an affine approximate zero of f, with associated zero ϕ 1 0 (ζ). Proof. From [Sma86, Th C], it suffices to prove that 0 (z) ϕ 1 0 (ζ) 3 7 γ(f,ϕ 1 0 (ζ)), where γ(f,ϕ 1 0 (ζ)) is the quantity defined in [Sma86]. Now, from [SS93a, Prop. 3 of Section I.3 and Th. of Section I.4] we know that (4.1) γ(f,ϕ 1 d3/ 0 (ζ)) µ norm(f,ζ), and the proposition follows. Note that the notation in [SS93b] is slightly different, as µ norm is denoted µ proj. See also [BP07a, Prop. 3.4] for a proof of equation (4.1). Lemma 4.. Let v,w C n be two complex vectors. Then, v w (1 + v )(1 + w ) 1 + v,w.

16 16 CARLOS BELTRÁN AND LUIS MIGUEL PARDO Proof. Let R and I be the respective real and imaginary parts of v,w. On one hand, we have On the other hand, v w = v w,v w = v + w R. (1 + v )(1 + w ) 1 + v,w = (1 + v )(1 + w ) 1 + R + 1I = (1+ v )(1+ w ) (1+R) I = 1+ v + w + v w 1 R (R +I ) = Hence, v + w R + v w v,w (1 + v )(1 + w ) 1 + v,w = v w + v w v,w v w, as wanted. Lemma 4.3. Let ζ = (ζ 0 : : ζ n ),z = (z 0 : : z n ) IP n (C) be two projective points, such that ζ 0,z 0 0. Then, 0 (z) ϕ 1 0 (ζ) (1 + ϕ 1 0 (ζ) ϕ 1 0 (z) ) d T(z,ζ). Proof. We denote v = ϕ 1 0 (ζ), w = ϕ 1 0 (z). Then, the point (1,v) Cn+1 is a representative of ζ and the point (1,w) C n+1 is a representative of z. Hence, v w d T (z,ζ) = v w (1,v) (1,w) (1,v),(1,w) 1 = v w (1,v) (1,w) (1,v),(1,w) (1,v),(1,w) = v w (1 + v )(1 + w ) 1 + v,w 1 + v,w. From Lemma 4., we conclude that Thus, as wanted. v w d T (z,ζ) 1 + v,w. v w 1 + v,w 1 + v,w 1 + v w, d T (z,ζ) Lemma 4.4. Let ζ = (ζ 0 : : ζ n ) and z = (z 0 : : z n ) be two projective points such that ζ 0 0, and let ε 0 be such that d T (z,ζ) ε. Moreover, assume that (4.) d R (ζ,e 0 ) + ε < π. Then, z 0 0 and 0 (z) ϕ ϕ 1 0 (ζ) (ζ) 1 ε. (ζ) ε 0

17 SMALE S 17TH PROBLEM 17 Proof. Let β z = d R (z,e 0 ) and β ζ = d R (ζ,e 0 ) be the respective Riemannian distances from z,ζ to e 0. Observe that Moreover, β z β ζ + d R (z,ζ) β z + d T (z,ζ) β z + ε. 0 (z) = tanβ z, 0 (ζ) = tan β ζ. Now, tan is an increasing function in [0, π ). From inequality (4.), we conclude, 0 (z) tan(β ζ + d R (z,ζ)) = tan β ζ + d T (z,ζ) 1 tan β ζ d T (z,ζ) ϕ 1 0 (ζ) + ε 1 0 (ζ) ε. From Lemma 4.3, we conclude that ( 0 (z) ϕ 1 0 (ζ) 1 + ϕ (ζ) (ζ) + ε ) 1 0 (ζ) ε and the lemma follows. ε, The following result yields a method to obtain an affine approximate zero of f from a projective approximate zero of f, if some properties are satisfied. Proposition 4.5. Let f H (d). Let ζ V IP (f) be a projective zero of f, ζ 0 0, and let z = (z 0,...,z n ) IP n (C) be a projective approximate zero of f with associated zero ζ, such that Let z k = Then, d T (z,ζ) (k) N f (z), where k N is such that 3 7 d 3/ µ norm (f,ζ). k log log (4(1 + 0 (ζ) )). 0 (zk ) ϕ 1 0 (ζ) 3 7 d 3/ µ norm (f,ζ). In particular (from Proposition 4.1), ϕ 1 0 (zk ) is an affine approximate zero of f with associated zero ϕ 1 0 (ζ). Proof. From the definition of projective approximate zero, d T (z k,ζ) 1 d 1 k 1 T(z,ζ) (1 + 0 (ζ) ) 3 7 d 3/ µ norm (f,ζ). The reader may check that we are under the hypotheses of Lemma 4.4. We conclude that where ε = 1 (1+ 0 (zk ) ϕ ϕ (ζ) (ζ) 1 ε, (ζ) ε (ζ) ) d 3/ µ norm(f,ζ) 0. The proposition easily follows.

18 18 CARLOS BELTRÁN AND LUIS MIGUEL PARDO 4.. The average size of affine solutions. Proposition 4.5 yields a way of obtaining an affine approximate zero ϕ 1 0 (y) of f from a projective approximate zero of f with associated zero ζ. However, we need to control 0 (ζ) in order to estimate the number of steps of the projective Newton operator necessary to guarantee that ϕ 1 0 (y) is really an affine approximate zero of f. Now we dedicate some paragraphs to study the probability distribution of 0 (ζ) under our hypotheses. Lemma 4.6. Let 1 n α <. Then, 1 0 ν[ip n (C)] (x) α dip n (C) = Γ( ) ( ) 1 α Γ n + α, Γ(n) x IP n(c) where ν[ip n (C)] = π n /Γ(n + 1) is the volume of the complex projective space. Proof. In [BP07b, Lemma 1], the authors proved that 1 NJ x ϕ 0 = (1 + x ) n+1. Then, from Theorem. we have x Cn x α (1 + x ) n+1 dcn = 0 (z) α dip n (C). z IP n(c) On the other hand, using polar coordinates, x Cn x α (1 + x ) n+1 dcn = πn Γ(n) The lemma follows, as 0 ν[ip n (C)] = t n 1+α πn (1 + t dt = ) n+1 Γ(n) π n Γ(n + 1). Γ ( ) ( ) 1 α Γ n + α. Γ(n + 1) Theorem 4.7. Let 1 n α <. Let B(d) α be the expected value of the norm of the affine solutions of a random system of polynomial equations, powered to α. Namely, B(d) α = ν [S (d) ] f S (d) D (ζ) α ds (d). Then, Proof. Let Observe that I α (d) = ζ V IP (f) B(d) α = Γ( ) ( ) 1 α Γ n + α. Γ(n) f S (d) ζ V IP (f) I α (d) = ν [S (d) ]DB α (d), (4.3) I 0 (d) = ν [S (d) ]D. 0 (ζ) α ds (d). In fact, equality (4.3) is due to the fact that a generic system f H (d) has exactly D projective solutions (cf. for example [BCSS98]). Let Ŵ = {(f,ζ) S (d) IP n (C) : ζ V IP (f)} be the incidence variety intersected with S (d) IP n (C). Ŵ is a (real) manifold of real dimension N + 1 (see for example [BP08]). Let p 1 : Ŵ S (d)

19 SMALE S 17TH PROBLEM 19 and p : Ŵ IP n(c) be the canonical projections. Then, from Theorem. applied to p 1, we have that I(d) α = 0 (x) α NJ (f,x) p 1 dŵ. (f,x) Ŵ Moreover, from Theorem. applied to p, 0 (x) α NJ (f,x) p 1 dŵ = (f,x) Ŵ NJ (f,x)p 1 x IP n(c) f p 1 (x) 0 (x) α dp 1 NJ (f,x) p (x) dip n(c). where for x IP n (C), p 1 (x) S (d) {x} may be identified with the set of systems that have x as a projective zero. We conclude, (4.4) I(d) α = NJ (f,x)p 1 0 (x) α dp x IP n(c) f p 1 1 (x) NJ (f,x) p (x) dip n(c). Let x IP n (C) be a projective point and let U U n+1 be a unitary matrix such that Ue 0 = x. From equality (.1), the mapping from p 1 (x) onto p 1 (e 0) sending each (f,x) to the pair (f U,e 0 ) is an isometry. Moreover, from Proposition.3, for every f p 1 (e 0), we have that NJ (f,e0)p i = NJ (f U 1,x)p i, i = 1,. Hence, from Theorem. we have that NJ (f,x)p 1 dp 1 f p 1 (x) NJ (f,x) p (x) = NJ (f U 1,x)p 1 dp f p 1 1 (e0) NJ (f U 1,x)p (e 0) = NJ (f,e0)p 1 dp 1 f p 1 (e0) NJ (e 0), (f,e0)p and we conclude that the inner integral in equation (4.4) does not depend on the choice of x IP n (C). Hence, ( ) I(d) α = NJ (f,e0)p 1 dp 1 f p 1 (e0) NJ (e 0) 0 (f,e0)p (x) α dip n (C). x IP n(c) First, assume that α = 0. Then, from equation (4.3), ν [S (d) ]D = I(d) 0 = ν[ip NJ (f,e0)p 1 n(c)] dp 1 f p 1 (e0) NJ (e 0), (f,e0)p and we obtain that We conclude that Hence, NJ (f,e0)p 1 dp 1 f p 1 (e0) NJ (e 0) = ν [S (d) ]D (f,e0)p ν[ip n (C)]. I α (d) = ν [S (d) ]D ν[ip n (C)] B α (d) = 1 ν[ip n (C)] The theorem follows from Lemma 4.6. x IP n(c) x IP n(c) 0 (x) α dip n (C). 0 (x) α dip n (C). An immediate consequence of Theorem 4.7 is the following result.

20 0 CARLOS BELTRÁN AND LUIS MIGUEL PARDO Corollary 4.8. Let δ > 0. Let P δ be the probability that a randomly chosen system f S (d) has an affine solution ζ V C (f) with ζ > δ. Then, P δ D Γ( ) ( ) 1 α Γ n + α Γ(n)δ α. Here, α [0,) is any positive real number. Proof. Observe that [ ] 1 E f S(d) max ζ V C (f) ζ α ν [S (d) ] f S (d) ζ V IP (f) From Theorem 4.7, this last term equals D Γ( ) ( ) 1 α Γ n + α. Γ(n) 0 (ζ) α ds (d). From Markov s Inequality, for every positive real number δ > 0, we conclude, Prob f S(d) [ ζ V C n(f) : ζ δ] = Prob f S(d) [ ζ V C n(f) : ζ α δ α ] D Γ( ) ( ) 1 α Γ n + α Γ(n)δ α. Corollary 4.9. Let δ > 0. Let P δ be the probability that a randomly chosen system f S (d) has an affine solution ζ V C (f) with ζ > δ. Then, P δ D πn, δ δ > 0, P δ ed n ( ) δ δ ln, n δ e n. Proof. This comes directly from Corollary 4.8, using that Γ ( ) n + α n α/, α [1,) Γ(n) (cf. for example [EGP00]). We also use that ( Γ 1 α ) α, α [1,). For the first inequality of the corollary, let α = 1. For the second one, let ( ) δ ln n 1 α = ln ( ). δ n Theorem 1.9 in the Introduction is a weak form of this Corollary 4.9. For the purposes of numerical solving of systems of equations, the result as stated in the Introduction suffices.

### IRREDUCIBLE OPERATOR SEMIGROUPS SUCH THAT AB AND BA ARE PROPORTIONAL. 1. Introduction

IRREDUCIBLE OPERATOR SEMIGROUPS SUCH THAT AB AND BA ARE PROPORTIONAL R. DRNOVŠEK, T. KOŠIR Dedicated to Prof. Heydar Radjavi on the occasion of his seventieth birthday. Abstract. Let S be an irreducible

### THE FUNDAMENTAL THEOREM OF ALGEBRA VIA PROPER MAPS

THE FUNDAMENTAL THEOREM OF ALGEBRA VIA PROPER MAPS KEITH CONRAD 1. Introduction The Fundamental Theorem of Algebra says every nonconstant polynomial with complex coefficients can be factored into linear

### ON COMPLETELY CONTINUOUS INTEGRATION OPERATORS OF A VECTOR MEASURE. 1. Introduction

ON COMPLETELY CONTINUOUS INTEGRATION OPERATORS OF A VECTOR MEASURE J.M. CALABUIG, J. RODRÍGUEZ, AND E.A. SÁNCHEZ-PÉREZ Abstract. Let m be a vector measure taking values in a Banach space X. We prove that

### Continued Fractions and the Euclidean Algorithm

Continued Fractions and the Euclidean Algorithm Lecture notes prepared for MATH 326, Spring 997 Department of Mathematics and Statistics University at Albany William F Hammond Table of Contents Introduction

### BANACH AND HILBERT SPACE REVIEW

BANACH AND HILBET SPACE EVIEW CHISTOPHE HEIL These notes will briefly review some basic concepts related to the theory of Banach and Hilbert spaces. We are not trying to give a complete development, but

### Mathematics Course 111: Algebra I Part IV: Vector Spaces

Mathematics Course 111: Algebra I Part IV: Vector Spaces D. R. Wilkins Academic Year 1996-7 9 Vector Spaces A vector space over some field K is an algebraic structure consisting of a set V on which are

### 1 Local Brouwer degree

1 Local Brouwer degree Let D R n be an open set and f : S R n be continuous, D S and c R n. Suppose that the set f 1 (c) D is compact. (1) Then the local Brouwer degree of f at c in the set D is defined.

### Course 221: Analysis Academic year , First Semester

Course 221: Analysis Academic year 2007-08, First Semester David R. Wilkins Copyright c David R. Wilkins 1989 2007 Contents 1 Basic Theorems of Real Analysis 1 1.1 The Least Upper Bound Principle................

### 2.1 Complexity Classes

15-859(M): Randomized Algorithms Lecturer: Shuchi Chawla Topic: Complexity classes, Identity checking Date: September 15, 2004 Scribe: Andrew Gilpin 2.1 Complexity Classes In this lecture we will look

### Inner Product Spaces and Orthogonality

Inner Product Spaces and Orthogonality week 3-4 Fall 2006 Dot product of R n The inner product or dot product of R n is a function, defined by u, v a b + a 2 b 2 + + a n b n for u a, a 2,, a n T, v b,

### ATheoryofComplexity,Conditionand Roundoff

ATheoryofComplexity,Conditionand Roundoff Felipe Cucker Berkeley 2014 Background L. Blum, M. Shub, and S. Smale [1989] On a theory of computation and complexity over the real numbers: NP-completeness,

### The Ideal Class Group

Chapter 5 The Ideal Class Group We will use Minkowski theory, which belongs to the general area of geometry of numbers, to gain insight into the ideal class group of a number field. We have already mentioned

### Similarity and Diagonalization. Similar Matrices

MATH022 Linear Algebra Brief lecture notes 48 Similarity and Diagonalization Similar Matrices Let A and B be n n matrices. We say that A is similar to B if there is an invertible n n matrix P such that

### WHICH LINEAR-FRACTIONAL TRANSFORMATIONS INDUCE ROTATIONS OF THE SPHERE?

WHICH LINEAR-FRACTIONAL TRANSFORMATIONS INDUCE ROTATIONS OF THE SPHERE? JOEL H. SHAPIRO Abstract. These notes supplement the discussion of linear fractional mappings presented in a beginning graduate course

### CURVES WHOSE SECANT DEGREE IS ONE IN POSITIVE CHARACTERISTIC. 1. Introduction

Acta Math. Univ. Comenianae Vol. LXXXI, 1 (2012), pp. 71 77 71 CURVES WHOSE SECANT DEGREE IS ONE IN POSITIVE CHARACTERISTIC E. BALLICO Abstract. Here we study (in positive characteristic) integral curves

### MA651 Topology. Lecture 6. Separation Axioms.

MA651 Topology. Lecture 6. Separation Axioms. This text is based on the following books: Fundamental concepts of topology by Peter O Neil Elements of Mathematics: General Topology by Nicolas Bourbaki Counterexamples

### INDISTINGUISHABILITY OF ABSOLUTELY CONTINUOUS AND SINGULAR DISTRIBUTIONS

INDISTINGUISHABILITY OF ABSOLUTELY CONTINUOUS AND SINGULAR DISTRIBUTIONS STEVEN P. LALLEY AND ANDREW NOBEL Abstract. It is shown that there are no consistent decision rules for the hypothesis testing problem

### 6. Metric spaces. In this section we review the basic facts about metric spaces. d : X X [0, )

6. Metric spaces In this section we review the basic facts about metric spaces. Definitions. A metric on a non-empty set X is a map with the following properties: d : X X [0, ) (i) If x, y X are points

### Numerical Analysis Lecture Notes

Numerical Analysis Lecture Notes Peter J. Olver 5. Inner Products and Norms The norm of a vector is a measure of its size. Besides the familiar Euclidean norm based on the dot product, there are a number

### On the representability of the bi-uniform matroid

On the representability of the bi-uniform matroid Simeon Ball, Carles Padró, Zsuzsa Weiner and Chaoping Xing August 3, 2012 Abstract Every bi-uniform matroid is representable over all sufficiently large

### Section 1.1. Introduction to R n

The Calculus of Functions of Several Variables Section. Introduction to R n Calculus is the study of functional relationships and how related quantities change with each other. In your first exposure to

### FACTORING POLYNOMIALS IN THE RING OF FORMAL POWER SERIES OVER Z

FACTORING POLYNOMIALS IN THE RING OF FORMAL POWER SERIES OVER Z DANIEL BIRMAJER, JUAN B GIL, AND MICHAEL WEINER Abstract We consider polynomials with integer coefficients and discuss their factorization

### The determinant of a skew-symmetric matrix is a square. This can be seen in small cases by direct calculation: 0 a. 12 a. a 13 a 24 a 14 a 23 a 14

4 Symplectic groups In this and the next two sections, we begin the study of the groups preserving reflexive sesquilinear forms or quadratic forms. We begin with the symplectic groups, associated with

### Row Ideals and Fibers of Morphisms

Michigan Math. J. 57 (2008) Row Ideals and Fibers of Morphisms David Eisenbud & Bernd Ulrich Affectionately dedicated to Mel Hochster, who has been an inspiration to us for many years, on the occasion

### Ideal Class Group and Units

Chapter 4 Ideal Class Group and Units We are now interested in understanding two aspects of ring of integers of number fields: how principal they are (that is, what is the proportion of principal ideals

### FIXED POINT SETS OF FIBER-PRESERVING MAPS

FIXED POINT SETS OF FIBER-PRESERVING MAPS Robert F. Brown Department of Mathematics University of California Los Angeles, CA 90095 e-mail: rfb@math.ucla.edu Christina L. Soderlund Department of Mathematics

### Introduction to Algebraic Geometry. Bézout s Theorem and Inflection Points

Introduction to Algebraic Geometry Bézout s Theorem and Inflection Points 1. The resultant. Let K be a field. Then the polynomial ring K[x] is a unique factorisation domain (UFD). Another example of a

### 1 Sets and Set Notation.

LINEAR ALGEBRA MATH 27.6 SPRING 23 (COHEN) LECTURE NOTES Sets and Set Notation. Definition (Naive Definition of a Set). A set is any collection of objects, called the elements of that set. We will most

### Metric Spaces. Chapter 7. 7.1. Metrics

Chapter 7 Metric Spaces A metric space is a set X that has a notion of the distance d(x, y) between every pair of points x, y X. The purpose of this chapter is to introduce metric spaces and give some

### Continuous first-order model theory for metric structures Lecture 2 (of 3)

Continuous first-order model theory for metric structures Lecture 2 (of 3) C. Ward Henson University of Illinois Visiting Scholar at UC Berkeley October 21, 2013 Hausdorff Institute for Mathematics, Bonn

### International Journal of Information Technology, Modeling and Computing (IJITMC) Vol.1, No.3,August 2013

FACTORING CRYPTOSYSTEM MODULI WHEN THE CO-FACTORS DIFFERENCE IS BOUNDED Omar Akchiche 1 and Omar Khadir 2 1,2 Laboratory of Mathematics, Cryptography and Mechanics, Fstm, University of Hassan II Mohammedia-Casablanca,

### F. ABTAHI and M. ZARRIN. (Communicated by J. Goldstein)

Journal of Algerian Mathematical Society Vol. 1, pp. 1 6 1 CONCERNING THE l p -CONJECTURE FOR DISCRETE SEMIGROUPS F. ABTAHI and M. ZARRIN (Communicated by J. Goldstein) Abstract. For 2 < p

### Inner Product Spaces

Math 571 Inner Product Spaces 1. Preliminaries An inner product space is a vector space V along with a function, called an inner product which associates each pair of vectors u, v with a scalar u, v, and

### NOTES ON LINEAR TRANSFORMATIONS

NOTES ON LINEAR TRANSFORMATIONS Definition 1. Let V and W be vector spaces. A function T : V W is a linear transformation from V to W if the following two properties hold. i T v + v = T v + T v for all

### Rotation Rate of a Trajectory of an Algebraic Vector Field Around an Algebraic Curve

QUALITATIVE THEORY OF DYAMICAL SYSTEMS 2, 61 66 (2001) ARTICLE O. 11 Rotation Rate of a Trajectory of an Algebraic Vector Field Around an Algebraic Curve Alexei Grigoriev Department of Mathematics, The

### About the inverse football pool problem for 9 games 1

Seventh International Workshop on Optimal Codes and Related Topics September 6-1, 013, Albena, Bulgaria pp. 15-133 About the inverse football pool problem for 9 games 1 Emil Kolev Tsonka Baicheva Institute

### Metric Spaces. Chapter 1

Chapter 1 Metric Spaces Many of the arguments you have seen in several variable calculus are almost identical to the corresponding arguments in one variable calculus, especially arguments concerning convergence

### Sets of Fibre Homotopy Classes and Twisted Order Parameter Spaces

Communications in Mathematical Physics Manuscript-Nr. (will be inserted by hand later) Sets of Fibre Homotopy Classes and Twisted Order Parameter Spaces Stefan Bechtluft-Sachs, Marco Hien Naturwissenschaftliche

### COMMUTATIVITY DEGREES OF WREATH PRODUCTS OF FINITE ABELIAN GROUPS

Bull Austral Math Soc 77 (2008), 31 36 doi: 101017/S0004972708000038 COMMUTATIVITY DEGREES OF WREATH PRODUCTS OF FINITE ABELIAN GROUPS IGOR V EROVENKO and B SURY (Received 12 April 2007) Abstract We compute

### Linear Codes. Chapter 3. 3.1 Basics

Chapter 3 Linear Codes In order to define codes that we can encode and decode efficiently, we add more structure to the codespace. We shall be mainly interested in linear codes. A linear code of length

### 5. Convergence of sequences of random variables

5. Convergence of sequences of random variables Throughout this chapter we assume that {X, X 2,...} is a sequence of r.v. and X is a r.v., and all of them are defined on the same probability space (Ω,

### LOW-DEGREE PLANAR MONOMIALS IN CHARACTERISTIC TWO

LOW-DEGREE PLANAR MONOMIALS IN CHARACTERISTIC TWO PETER MÜLLER AND MICHAEL E. ZIEVE Abstract. Planar functions over finite fields give rise to finite projective planes and other combinatorial objects.

### 1. R In this and the next section we are going to study the properties of sequences of real numbers.

+a 1. R In this and the next section we are going to study the properties of sequences of real numbers. Definition 1.1. (Sequence) A sequence is a function with domain N. Example 1.2. A sequence of real

### MINIMAL GENERATOR SETS FOR FINITELY GENERATED SHIFT-INVARIANT SUBSPACES OF L 2 (R n )

MINIMAL GENERATOR SETS FOR FINITELY GENERATED SHIFT-INVARIANT SUBSPACES OF L 2 (R n ) MARCIN BOWNIK AND NORBERT KAIBLINGER Abstract. Let S be a shift-invariant subspace of L 2 (R n ) defined by N generators

### 8.1 Examples, definitions, and basic properties

8 De Rham cohomology Last updated: May 21, 211. 8.1 Examples, definitions, and basic properties A k-form ω Ω k (M) is closed if dω =. It is exact if there is a (k 1)-form σ Ω k 1 (M) such that dσ = ω.

### Mathematical Methods of Engineering Analysis

Mathematical Methods of Engineering Analysis Erhan Çinlar Robert J. Vanderbei February 2, 2000 Contents Sets and Functions 1 1 Sets................................... 1 Subsets.............................

### Lecture 15 An Arithmetic Circuit Lowerbound and Flows in Graphs

CSE599s: Extremal Combinatorics November 21, 2011 Lecture 15 An Arithmetic Circuit Lowerbound and Flows in Graphs Lecturer: Anup Rao 1 An Arithmetic Circuit Lower Bound An arithmetic circuit is just like

### ON TOPOLOGICAL SEQUENCE ENTROPY AND CHAOTIC MAPS ON INVERSE LIMIT SPACES. 1. Introduction

Acta Math. Univ. Comenianae Vol. LXVIII, 2(1999), pp. 205 211 205 ON TOPOLOGICAL SEQUENCE ENTROPY AND CHAOTIC MAPS ON INVERSE LIMIT SPACES J. S. CANOVAS Abstract. The aim of this paper is to prove the

### Computing divisors and common multiples of quasi-linear ordinary differential equations

Computing divisors and common multiples of quasi-linear ordinary differential equations Dima Grigoriev CNRS, Mathématiques, Université de Lille Villeneuve d Ascq, 59655, France Dmitry.Grigoryev@math.univ-lille1.fr

### Introduction to Flocking {Stochastic Matrices}

Supelec EECI Graduate School in Control Introduction to Flocking {Stochastic Matrices} A. S. Morse Yale University Gif sur - Yvette May 21, 2012 CRAIG REYNOLDS - 1987 BOIDS The Lion King CRAIG REYNOLDS

### CHAPTER II THE LIMIT OF A SEQUENCE OF NUMBERS DEFINITION OF THE NUMBER e.

CHAPTER II THE LIMIT OF A SEQUENCE OF NUMBERS DEFINITION OF THE NUMBER e. This chapter contains the beginnings of the most important, and probably the most subtle, notion in mathematical analysis, i.e.,

### Sequences and Convergence in Metric Spaces

Sequences and Convergence in Metric Spaces Definition: A sequence in a set X (a sequence of elements of X) is a function s : N X. We usually denote s(n) by s n, called the n-th term of s, and write {s

### Primality - Factorization

Primality - Factorization Christophe Ritzenthaler November 9, 2009 1 Prime and factorization Definition 1.1. An integer p > 1 is called a prime number (nombre premier) if it has only 1 and p as divisors.

### You know from calculus that functions play a fundamental role in mathematics.

CHPTER 12 Functions You know from calculus that functions play a fundamental role in mathematics. You likely view a function as a kind of formula that describes a relationship between two (or more) quantities.

### Sets and functions. {x R : x > 0}.

Sets and functions 1 Sets The language of sets and functions pervades mathematics, and most of the important operations in mathematics turn out to be functions or to be expressible in terms of functions.

### Classification of Cartan matrices

Chapter 7 Classification of Cartan matrices In this chapter we describe a classification of generalised Cartan matrices This classification can be compared as the rough classification of varieties in terms

### DISTRIBUTIONS AND FOURIER TRANSFORM

DISTRIBUTIONS AND FOURIER TRANSFORM MIKKO SALO Introduction. The theory of distributions, or generalized functions, provides a unified framework for performing standard calculus operations on nonsmooth

### Math 4310 Handout - Quotient Vector Spaces

Math 4310 Handout - Quotient Vector Spaces Dan Collins The textbook defines a subspace of a vector space in Chapter 4, but it avoids ever discussing the notion of a quotient space. This is understandable

### Alex, I will take congruent numbers for one million dollars please

Alex, I will take congruent numbers for one million dollars please Jim L. Brown The Ohio State University Columbus, OH 4310 jimlb@math.ohio-state.edu One of the most alluring aspectives of number theory

### MATH 304 Linear Algebra Lecture 9: Subspaces of vector spaces (continued). Span. Spanning set.

MATH 304 Linear Algebra Lecture 9: Subspaces of vector spaces (continued). Span. Spanning set. Vector space A vector space is a set V equipped with two operations, addition V V (x,y) x + y V and scalar

### A COUNTEREXAMPLE FOR SUBADDITIVITY OF MULTIPLIER IDEALS ON TORIC VARIETIES

Communications in Algebra, 40: 1618 1624, 2012 Copyright Taylor & Francis Group, LLC ISSN: 0092-7872 print/1532-4125 online DOI: 10.1080/00927872.2011.552084 A COUNTEREXAMPLE FOR SUBADDITIVITY OF MULTIPLIER

### Course 421: Algebraic Topology Section 1: Topological Spaces

Course 421: Algebraic Topology Section 1: Topological Spaces David R. Wilkins Copyright c David R. Wilkins 1988 2008 Contents 1 Topological Spaces 1 1.1 Continuity and Topological Spaces...............

### Basic Concepts of Point Set Topology Notes for OU course Math 4853 Spring 2011

Basic Concepts of Point Set Topology Notes for OU course Math 4853 Spring 2011 A. Miller 1. Introduction. The definitions of metric space and topological space were developed in the early 1900 s, largely

### The van Hoeij Algorithm for Factoring Polynomials

The van Hoeij Algorithm for Factoring Polynomials Jürgen Klüners Abstract In this survey we report about a new algorithm for factoring polynomials due to Mark van Hoeij. The main idea is that the combinatorial

### Two classes of ternary codes and their weight distributions

Two classes of ternary codes and their weight distributions Cunsheng Ding, Torleiv Kløve, and Francesco Sica Abstract In this paper we describe two classes of ternary codes, determine their minimum weight

### 1 Orthogonal projections and the approximation

Math 1512 Fall 2010 Notes on least squares approximation Given n data points (x 1, y 1 ),..., (x n, y n ), we would like to find the line L, with an equation of the form y = mx + b, which is the best fit

### COMMUTATIVITY DEGREES OF WREATH PRODUCTS OF FINITE ABELIAN GROUPS

COMMUTATIVITY DEGREES OF WREATH PRODUCTS OF FINITE ABELIAN GROUPS IGOR V. EROVENKO AND B. SURY ABSTRACT. We compute commutativity degrees of wreath products A B of finite abelian groups A and B. When B

### MATH31011/MATH41011/MATH61011: FOURIER ANALYSIS AND LEBESGUE INTEGRATION. Chapter 4: Fourier Series and L 2 ([ π, π], µ) ( 1 π

MATH31011/MATH41011/MATH61011: FOURIER ANALYSIS AND LEBESGUE INTEGRATION Chapter 4: Fourier Series and L ([, π], µ) Square Integrable Functions Definition. Let f : [, π] R be measurable. We say that f

### An example of a computable

An example of a computable absolutely normal number Verónica Becher Santiago Figueira Abstract The first example of an absolutely normal number was given by Sierpinski in 96, twenty years before the concept

### Applied Algorithm Design Lecture 5

Applied Algorithm Design Lecture 5 Pietro Michiardi Eurecom Pietro Michiardi (Eurecom) Applied Algorithm Design Lecture 5 1 / 86 Approximation Algorithms Pietro Michiardi (Eurecom) Applied Algorithm Design

### Introduction to Topology

Introduction to Topology Tomoo Matsumura November 30, 2010 Contents 1 Topological spaces 3 1.1 Basis of a Topology......................................... 3 1.2 Comparing Topologies.......................................

### On the Union of Arithmetic Progressions

On the Union of Arithmetic Progressions Shoni Gilboa Rom Pinchasi August, 04 Abstract We show that for any integer n and real ɛ > 0, the union of n arithmetic progressions with pairwise distinct differences,

### MATH 304 Linear Algebra Lecture 20: Inner product spaces. Orthogonal sets.

MATH 304 Linear Algebra Lecture 20: Inner product spaces. Orthogonal sets. Norm The notion of norm generalizes the notion of length of a vector in R n. Definition. Let V be a vector space. A function α

### 1 VECTOR SPACES AND SUBSPACES

1 VECTOR SPACES AND SUBSPACES What is a vector? Many are familiar with the concept of a vector as: Something which has magnitude and direction. an ordered pair or triple. a description for quantities such

### 4. Expanding dynamical systems

4.1. Metric definition. 4. Expanding dynamical systems Definition 4.1. Let X be a compact metric space. A map f : X X is said to be expanding if there exist ɛ > 0 and L > 1 such that d(f(x), f(y)) Ld(x,

### Let H and J be as in the above lemma. The result of the lemma shows that the integral

Let and be as in the above lemma. The result of the lemma shows that the integral ( f(x, y)dy) dx is well defined; we denote it by f(x, y)dydx. By symmetry, also the integral ( f(x, y)dx) dy is well defined;

### Notes 11: List Decoding Folded Reed-Solomon Codes

Introduction to Coding Theory CMU: Spring 2010 Notes 11: List Decoding Folded Reed-Solomon Codes April 2010 Lecturer: Venkatesan Guruswami Scribe: Venkatesan Guruswami At the end of the previous notes,

### Compactness in metric spaces

MATHEMATICS 3103 (Functional Analysis) YEAR 2012 2013, TERM 2 HANDOUT #2: COMPACTNESS OF METRIC SPACES Compactness in metric spaces The closed intervals [a, b] of the real line, and more generally the

### 2.3 Convex Constrained Optimization Problems

42 CHAPTER 2. FUNDAMENTAL CONCEPTS IN CONVEX OPTIMIZATION Theorem 15 Let f : R n R and h : R R. Consider g(x) = h(f(x)) for all x R n. The function g is convex if either of the following two conditions

### Duality of linear conic problems

Duality of linear conic problems Alexander Shapiro and Arkadi Nemirovski Abstract It is well known that the optimal values of a linear programming problem and its dual are equal to each other if at least

### Biinterpretability up to double jump in the degrees

Biinterpretability up to double jump in the degrees below 0 0 Richard A. Shore Department of Mathematics Cornell University Ithaca NY 14853 July 29, 2013 Abstract We prove that, for every z 0 0 with z

### P NP for the Reals with various Analytic Functions

P NP for the Reals with various Analytic Functions Mihai Prunescu Abstract We show that non-deterministic machines in the sense of [BSS] defined over wide classes of real analytic structures are more powerful

### MAT2400 Analysis I. A brief introduction to proofs, sets, and functions

MAT2400 Analysis I A brief introduction to proofs, sets, and functions In Analysis I there is a lot of manipulations with sets and functions. It is probably also the first course where you have to take

### Communication on the Grassmann Manifold: A Geometric Approach to the Noncoherent Multiple-Antenna Channel

IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 48, NO. 2, FEBRUARY 2002 359 Communication on the Grassmann Manifold: A Geometric Approach to the Noncoherent Multiple-Antenna Channel Lizhong Zheng, Student

### MATH1231 Algebra, 2015 Chapter 7: Linear maps

MATH1231 Algebra, 2015 Chapter 7: Linear maps A/Prof. Daniel Chan School of Mathematics and Statistics University of New South Wales danielc@unsw.edu.au Daniel Chan (UNSW) MATH1231 Algebra 1 / 43 Chapter

### 24. The Branch and Bound Method

24. The Branch and Bound Method It has serious practical consequences if it is known that a combinatorial problem is NP-complete. Then one can conclude according to the present state of science that no

### 11 Ideals. 11.1 Revisiting Z

11 Ideals The presentation here is somewhat different than the text. In particular, the sections do not match up. We have seen issues with the failure of unique factorization already, e.g., Z[ 5] = O Q(

### Properties of BMO functions whose reciprocals are also BMO

Properties of BMO functions whose reciprocals are also BMO R. L. Johnson and C. J. Neugebauer The main result says that a non-negative BMO-function w, whose reciprocal is also in BMO, belongs to p> A p,and

### RANDOM INVOLUTIONS AND THE NUMBER OF PRIME FACTORS OF AN INTEGER 1. INTRODUCTION

RANDOM INVOLUTIONS AND THE NUMBER OF PRIME FACTORS OF AN INTEGER KIRSTEN WICKELGREN. INTRODUCTION Any positive integer n factors uniquely as n = p e pe pe 3 3 pe d d where p, p,..., p d are distinct prime

### SHARP BOUNDS FOR THE SUM OF THE SQUARES OF THE DEGREES OF A GRAPH

31 Kragujevac J. Math. 25 (2003) 31 49. SHARP BOUNDS FOR THE SUM OF THE SQUARES OF THE DEGREES OF A GRAPH Kinkar Ch. Das Department of Mathematics, Indian Institute of Technology, Kharagpur 721302, W.B.,

### Double Sequences and Double Series

Double Sequences and Double Series Eissa D. Habil Islamic University of Gaza P.O. Box 108, Gaza, Palestine E-mail: habil@iugaza.edu Abstract This research considers two traditional important questions,

Adaptive Online Gradient Descent Peter L Bartlett Division of Computer Science Department of Statistics UC Berkeley Berkeley, CA 94709 bartlett@csberkeleyedu Elad Hazan IBM Almaden Research Center 650

### Notes on Determinant

ENGG2012B Advanced Engineering Mathematics Notes on Determinant Lecturer: Kenneth Shum Lecture 9-18/02/2013 The determinant of a system of linear equations determines whether the solution is unique, without

### Stationary random graphs on Z with prescribed iid degrees and finite mean connections

Stationary random graphs on Z with prescribed iid degrees and finite mean connections Maria Deijfen Johan Jonasson February 2006 Abstract Let F be a probability distribution with support on the non-negative

### A QUICK GUIDE TO THE FORMULAS OF MULTIVARIABLE CALCULUS

A QUIK GUIDE TO THE FOMULAS OF MULTIVAIABLE ALULUS ontents 1. Analytic Geometry 2 1.1. Definition of a Vector 2 1.2. Scalar Product 2 1.3. Properties of the Scalar Product 2 1.4. Length and Unit Vectors

### COUNTING INDEPENDENT SETS IN SOME CLASSES OF (ALMOST) REGULAR GRAPHS

COUNTING INDEPENDENT SETS IN SOME CLASSES OF (ALMOST) REGULAR GRAPHS Alexander Burstein Department of Mathematics Howard University Washington, DC 259, USA aburstein@howard.edu Sergey Kitaev Mathematics

### RIGIDITY OF HOLOMORPHIC MAPS BETWEEN FIBER SPACES

RIGIDITY OF HOLOMORPHIC MAPS BETWEEN FIBER SPACES GAUTAM BHARALI AND INDRANIL BISWAS Abstract. In the study of holomorphic maps, the term rigidity refers to certain types of results that give us very specific

### Modern Optimization Methods for Big Data Problems MATH11146 The University of Edinburgh

Modern Optimization Methods for Big Data Problems MATH11146 The University of Edinburgh Peter Richtárik Week 3 Randomized Coordinate Descent With Arbitrary Sampling January 27, 2016 1 / 30 The Problem