Chapter 3. Principles and Spin. There are many axiomatic presentations of quantum mechanics that aim at



Similar documents
Particle Physics. Michaelmas Term 2011 Prof Mark Thomson. Handout 7 : Symmetries and the Quark Model. Introduction/Aims

Lecture L3 - Vectors, Matrices and Coordinate Transformations

Au = = = 3u. Aw = = = 2w. so the action of A on u and w is very easy to picture: it simply amounts to a stretching by 3 and 2, respectively.

MATH10212 Linear Algebra. Systems of Linear Equations. Definition. An n-dimensional vector is a row or a column of n numbers (or letters): a 1.

PHY4604 Introduction to Quantum Mechanics Fall 2004 Practice Test 3 November 22, 2004

Till now, almost all attention has been focussed on discussing the state of a quantum system.

Solution to Homework 2

Chapter 17. Orthogonal Matrices and Symmetries of Space

Theory of electrons and positrons

13 MATH FACTS a = The elements of a vector have a graphical interpretation, which is particularly easy to see in two or three dimensions.

Linear Algebra Review. Vectors

Introduction to Matrix Algebra

Mechanics 1: Conservation of Energy and Momentum

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS

3.1 State Space Models

3. Mathematical Induction

Lecture 1: Schur s Unitary Triangularization Theorem

8 Square matrices continued: Determinants

Vector and Matrix Norms

Unified Lecture # 4 Vectors

x = + x 2 + x

Lecture 3: Finding integer solutions to systems of linear equations

Notes on Orthogonal and Symmetric Matrices MENU, Winter 2013

8.04: Quantum Mechanics Professor Allan Adams Massachusetts Institute of Technology. Problem Set 5

Quantum Mechanics and Representation Theory

[1] Diagonal factorization

1.2 Solving a System of Linear Equations

Orthogonal Projections

Chapter 9 Unitary Groups and SU(N)

NOTES ON LINEAR TRANSFORMATIONS

Chapter 6. Orthogonality

University of Lille I PC first year list of exercises n 7. Review

Bits Superposition Quantum Parallelism

Continued Fractions and the Euclidean Algorithm

CITY UNIVERSITY LONDON. BEng Degree in Computer Systems Engineering Part II BSc Degree in Computer Systems Engineering Part III PART 2 EXAMINATION

SYSTEMS OF EQUATIONS AND MATRICES WITH THE TI-89. by Joseph Collison

7 Gaussian Elimination and LU Factorization

Similarity and Diagonalization. Similar Matrices

Section Inner Products and Norms

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS. + + x 2. x n. a 11 a 12 a 1n b 1 a 21 a 22 a 2n b 2 a 31 a 32 a 3n b 3. a m1 a m2 a mn b m

Chemical Kinetics. 2. Using the kinetics of a given reaction a possible reaction mechanism

The Quantum Harmonic Oscillator Stephen Webb

Vieta s Formulas and the Identity Theorem

Matrices 2. Solving Square Systems of Linear Equations; Inverse Matrices

Review Jeopardy. Blue vs. Orange. Review Jeopardy

Quantum Physics II (8.05) Fall 2013 Assignment 4

Mathematics Course 111: Algebra I Part IV: Vector Spaces

160 CHAPTER 4. VECTOR SPACES

Inner Product Spaces

Solving Systems of Linear Equations

December 4, 2013 MATH 171 BASIC LINEAR ALGEBRA B. KITCHENS

Chapter 21: The Discounted Utility Model

Chapter 12 Modal Decomposition of State-Space Models 12.1 Introduction The solutions obtained in previous chapters, whether in time domain or transfor

Data Mining: Algorithms and Applications Matrix Math Review

MATH APPLIED MATRIX THEORY

BANACH AND HILBERT SPACE REVIEW

α = u v. In other words, Orthogonal Projection

arxiv:quant-ph/ v1 11 Jul 1996

Linear Algebra: Vectors

MATHEMATICS FOR ENGINEERS BASIC MATRIX THEORY TUTORIAL 2

3 Some Integer Functions

Force on Moving Charges in a Magnetic Field

Math 4310 Handout - Quotient Vector Spaces

1 Symmetries of regular polyhedra

MATHEMATICS FOR ENGINEERING BASIC ALGEBRA

DIFFERENTIABILITY OF COMPLEX FUNCTIONS. Contents

The Characteristic Polynomial

ECON 459 Game Theory. Lecture Notes Auctions. Luca Anderlini Spring 2015

Problem Set V Solutions

Sensitivity Analysis 3.1 AN EXAMPLE FOR ANALYSIS

LS.6 Solution Matrices

5. Orthogonal matrices

Part 1 Expressions, Equations, and Inequalities: Simplifying and Solving

A linear combination is a sum of scalars times quantities. Such expressions arise quite frequently and have the form

Quantum Mechanics: Postulates

a 11 x 1 + a 12 x a 1n x n = b 1 a 21 x 1 + a 22 x a 2n x n = b 2.

3 Orthogonal Vectors and Matrices

Conservation of Momentum and Energy

Elasticity Theory Basics

1 Inner Products and Norms on Real Vector Spaces

Solving Simultaneous Equations and Matrices

2 Session Two - Complex Numbers and Vectors

Solving simultaneous equations using the inverse matrix

Concepts in Theoretical Physics

AK 4 SLUTSKY COMPENSATION

THE DIMENSION OF A VECTOR SPACE

Quotient Rings and Field Extensions

Let s first see how precession works in quantitative detail. The system is illustrated below: ...

1 Sets and Set Notation.

1 Determinants and the Solvability of Linear Systems

9. Momentum and Collisions in One Dimension*

5.61 Physical Chemistry 25 Helium Atom page 1 HELIUM ATOM

Notes on Determinant

by the matrix A results in a vector which is a reflection of the given

Inflation. Chapter Money Supply and Demand

Time dependence in quantum mechanics Notes on Quantum Mechanics

1 Introduction to Matrices

Section 1.1. Introduction to R n

1.7 Graphs of Functions

Chapter 4 One Dimensional Kinematics

Transcription:

Chapter 3 Principles and Spin 3. Four Principles of Quantum Mechanics There are many axiomatic presentations of quantum mechanics that aim at conceptual economy and rigor. However, it would be unwise for us to get into that, as our present aim is merely to learn how to use the mathematical machinery of quantum mechanics. For this reason, we might as well start with a run of the mill account that aims neither at intellectual economy nor at great rigor. With this in mind, we are now in a position to state four tenets of quantum mechanics that will allow us to study the simplest non-trivial quantum mechanical case, namely spin-half systems.. The state of a system (every physical object or collection of such objects) is described by the state vector Y.. An observable (a measurable property) O is represented by a Hermitian operator ˆ O. 3. Given an observable O represented by the Hermitian operator ˆ O, and a system in a state represented by Y = åc n y n, where y n are the orthonormal eigenvectors of n ˆ O (that is, ˆ O y n = l n y n, where l n is the eigenvalue) spanning the space, upon A run of the mill account is, of course, infected (if that is the word) by the standard interpretation, which we shall explicitly introduce at the end of chapter 5. However, the standard interpretations is, well, standard and therefore we might as well start, if somewhat surreptitiously, with something like it. 60

measurement of O, one always obtains one of the l n, and the probability of obtaining l n is c n, the square of the modulus of the expansion coefficient of y n. 4. Once the measurement of O returns l n, the state vector is instantaneously transformed from Y into y n, the eigenvector corresponding to the eigenvalue l n. Let us briefly comment on these principles. In order to be successful, a physical theory must give us predictions (which may be statistical) on what returns we shall obtain if we measure a property O (an observable, in quantum mechanical jargon) on physical systems in some quantum state or other. Consequently, quantum mechanics must contain the representation of the quantum state of any physical system on which we intend to perform a measurement and that of any observable we intend to measure. Principles () and () do just that. Principle () tells us that all the information that quantum mechanics has with respect to a system s physical state is encapsulated in a vector Y, exactly the mathematical object we just studied. For reasons that will become clear shortly Y must be normalized. Principle () states that the theoretical counterpart of an observable is a Hermitian operator. 3 For any expression x, x = x * x. That is, to obtain the squared modulus of an expression, one multiplies the expression times its complex conjugate. 3 It is worth noticing that although to every observable there corresponds a Hermitian operator, whether the converse is true is a different matter. For example, D Espagnat has argued that it not the case that to every Hermitian operator there corresponds an observable. See D Espagnat, B., (995): 98-9. 6

Principle (3) tells us what to expect if we perform a measurement of an observable O on a system in a state represented by Y. Since an observable is represented by a Hermitian operator, we can take the orthonormal eigenvectors y n of ˆ O as basis vectors of the vector space. Consequently, we can express Y as a linear combination of O ˆ s eigenvectors so that Y = åc n y n n. The measurement return is always an eigenvalue l n of ˆ O with respect to a y n ; the probability of obtaining l n is c n, the square of the modulus of the expansion coefficient of y n. 4 For example, suppose that ˆ O has only two eigenvectors, so that ˆ O y = l y and ˆ O y = l y. Then Y = c y + c y, and upon measuring O, we shall obtain l with probability c and l with probability c. 5 Since upon measuring O we must obtain one of the l n, it must be the case that 4 In reality, because of a quantum mechanical result (the energy-time uncertainty, so called), that measurement returns are always eigenvalues is not quite true. However, statistically they oscillate in predictable ways around eigenvalues; see appendix 6. Note that it follows from (.9.6) that c n = y n Y, where y n is the bra of the ket y n. That is, the probability of obtaining l n is the square of the magnitude of the modulus of the inner product between y n, the eigenvector for l n with respect to ˆ O, and the premeasurement state vector Y. 5 At the cost of being pedantic, we should note that principle (3) does not ask us to apply O ˆ to Y but to decompose Y into O ˆ s eigenvectors, something that can always be done because ˆ O is Hermitian. 6

å c n =. (3..) n In other words, the sum of the probabilities of all the possible measurement returns must be equal to one. Keeping in mind that the norm of a vector Y is Y Y, å where Y Y = c n, one can easily see that (3..) is satisfied if and only if Y is n normalized. Principle (4) embodies what is called the collapse or the reduction of the state vector. As we shall see, it is a most mysterious feature of the theory. Note that the combination of principles (3) and (4) guarantees that if a measurement of O returns l n, a new measurement of O, if performed sufficiently quickly (before the system has started evolving again), will return the same eigenvalue l n. For, principle (4) tells us that the first measurement results in the collapse of Y onto y n, so that Y is now an eigenvector of ˆ O, which entails that c n =, and consequently principle (3) tells us that the probability of obtaining l n upon re-measuring is c n =. 6 3. Spin We can decompose the total spin S of a quantum particle into its three components S x, S y, and S z, one for each dimension. As we noted, measuring one of them randomizes the measurement returns of the others, and for this reason S x, S y, and S z, are called incompatible observables. In quantum mechanics, this is expressed by the fact 6 Suppose we leave the system alone instead of performing a measurement on it. What will its temporal evolution be? The answer to this central question will concern us in the next chapter. For now, however, let us put it on the back burner. 63

that their corresponding operators ˆ S x, ˆ S y, and ˆ S z do not commute. In fact, the following relations hold: S ˆ [, S ˆ x y ]= ihs ˆ z ; S ˆ [ y, S ˆ z ]= ihs ˆ x ; S ˆ [, S ˆ z x ]= ihs ˆ y. 7 (3..) By contrast, S, the square of the total spin, is compatible with each of the spin components, and consequently ˆ S and ˆ S z commute. Since they are commuting Hermitian operators, their common set of eigenvectors spans the space, and therefore can be used as an orthonormal basis. When studying spin, it is customary to use exactly such basis, which is called the standard basis. (We shall see later how to apply the operators Ŝ x and Ŝ y when the state vector is expressed in the standard basis). If we measure a spin component, S z, for example, the possible measurement returns depend on a number s, the spin quantum number. It turns out that every species of elementary particle has an unchangeable spin quantum number, confusingly called the spin of that species. For example, protons, neutrons, and electrons, the particles constituting ordinary matter, have spin / (spin-half), while photons have spin. Spin-half is the simplest case of spin: ˆ S and ˆ S z have just two eigenvectors, namely, z, called z-spin down, and z, called z-spin up. The eigenvector equations are: S ˆ z = 3 4 h z, S ˆ z = 3 4 h z (3..) and S ˆ z z = h z, S ˆ z z = - h z. (3..3) 7 To obtain the other commutators, note that [ ˆ A, ˆ B ] = -[ ˆ B, ˆ A ]. 64

Since S ˆ and S ˆ z are Hermitian and commute, we can use { z, z } as a basis, and in this basis z = ö è 0ø, and z = 0 ö è. (3..4) ø Hence, the general state Y of a spin-half particle is a linear combination of the basis vectors: ö 0ö Y = c z + c z = c + c è 0 = c ö. (3..5) ø è ø è ø c Now let us find the matrix forms of ˆ S and ˆ S z in the standard basis. As they must be conformable with their eigenvectors, they are matrices. Moreover, they are diagonalized because expressed with respect to the basis { z, z } made up of their eigenvectors; in other words, all their elements will be zeros, with the exception of the ones making up the main diagonal, which will be the eigenvalues of the operator with respect to the basis vectors. Hence, 3 S ˆ = 4 h 0 ö 3 0 è 4 h = 3 4 h 0 ö. (3..6) è 0 ø ø Similarly, h ö 0 S ˆ z = 0 - h = h 0 ö. (3..7) è 0 -ø è ø Discovering the matrices of ˆ S x and ˆ S y in the standard basis is a bit more complicated; suffice it to say that they are 65

S ˆ x = h 0 ö (3..8) è 0ø and S ˆ y = h 0 -iö. (3..9) è i 0 ø Since we know the matrices for ˆ S x, ˆ S y, ˆ S z, and ˆ S, we can now perform a few calculations. 3.3 Measuring Spin Consider a spin-half particle in state Y = c z + c z. 8 Suppose now that we want to predict the results of a measurement of S z on the particle. As we know from principle (3), there are only two possible returns, h/ (the eigenvalue associated with z ) and -h/ (the eigenvalue associated with z ). Moreover, the probability of getting h/ is c and that of getting -h/ is c. EXAMPLE 3.3. A spin-half particle is in state Y = - iö. We can easily infer that upon 3 è ø measuring S z the probability of obtaining h/ is - i 3 =, while the probability of 3 getting -h/ is = 3 3.9 Suppose we have obtained h/. What shall we get if we 8 Every time we say that particle is in some state, we assume that the corresponding vector is normalized. 9 Remember that a + b = (a + b) * (a + b). Note also that the sum of the two probabilities is one, as it should be. 66

immediately measure S z again? As we know from the combination of principles (3) and (4), we shall surely get h/, as Y has collapsed onto z. Suppose now that the particle is in the generic state Y = c z + c z = c ö, and è ø we measure S x. What shall we get? Finding the answer is relatively simple, albeit somewhat laborious. Basically, we need to obtain the information necessary to apply principle (3). Hence, first we must determine the (normalized) eigenvectors x, x, and the eigenvalues l, l of S x. Since S x is Hermitian, its eigenvectors span the space. Hence, we can expand Y into a linear combination of the normalized eigenvectors of S x, so that Y = a x + b x. Consequently, upon measuring S x, we shall get l with c probability a, and l with probability b. So, let us proceed. Since S ˆ x = h 0 ö, if we take x ö as the generic eigenvector and l as the è 0ø è ø generic eigenvalue, we have x 0 h è hö 0 è ø x x ö = l x ö, (3.3.) ø è ø x that is, a system of two simultaneous equations ì h ï x - lx = 0 í ï h x - lx = 0 î (3.3.) The characteristic equation is 67

-l 0 = h h -l = l - h 4, (3.3.3) which yields two eigenvalues, l = -h/, and l = h/. By plugging the former into either of (3.3.), we get x = -x. By taking x =, we obtain the eigenvector for S x with eigenvalue -h/, namely ö. Similarly, by plugging l = h/ into either (3.3.) we è -ø obtain ö. è ø However, the two eigenvectors must be normalized in order for them to represent real spin states. As we know, to normalize a vector A, we must divide it by its norm A A, and in an orthonormal basis A A = a + a +...+ a n, where a...,a n are the components of the vector. Applying this to vector ö, we obtain è -ø x = ö = ö. (3.3.4) + - è -ø è -ø Similarly, x = ö = ö. (3.3.5) + è ø è ø Now let us apply the equation c n = y n Y, introduced in note 4, p. 59. As x = ö, the corresponding bra is è ø ( ). Hence, the probability of obtaining S x = h/ is 68

Pr( S x = h/)= è ( ) c c ö ø and that of obtaining S x = -h/ is Pr( S x = -h/)= - è ( ) c c = c + c ö ø = c - c, (3.3.6). (3.3.7) The eigenvalues, eigenvectors, and related probabilities for ˆ S y can be found in a similar fashion. The following table gives the operators, eigenvalues, eigenvectors, and the related probability formulas for ˆ S x, ˆ S y, and ˆ S z in the standard basis. 0 Operator Eigenvalue Eigenvector Pr(eigenvalue) S ˆ x = h 0 ö = h è 0ø s h x x = ö c + c è ø S ˆ x = h 0 ö = h è 0ø s - h x x = ö c - c è -ø S ˆ y = h 0 -iö = h è i 0 ø s h y y = ö c - ic è iø S ˆ y = h 0 -iö = h è i 0 ø s - h y y = ö c + ic è -iø S ˆ z = h 0 ö = h è 0 -ø s h z z = ö è 0ø c S ˆ z = h 0 ö = h è 0 -ø s - h z z = 0 ö è ø c Note that every time we measure one of the components of spin of a particle of spin-half, we obtain h/ or -h/. 0 s x = 0 ö, s è 0 y = 0 -i ö, and s ø è i 0 z = 0 ö are the famous Pauli operators. Note that ø è 0 -ø if we use h/ as unit of measure when measuring spin, then the spin operators reduce to their Pauli operators. 69

EXAMPLE 3.3. Let a particle be in state Y = - iö, and suppose we make just one 9 è ø measurement. The table above contains all the information needed to determine what returns we may expect and their associated probabilities. If we measure S z, we shall get h/ with probability Pr( S z = h /)= - i 9 and -h/ with probability = 5 9, (3.3.8) Pr S z = - h ö = è ø 9 = 4 9. (3.3.9) If we measure S x, we shall obtain h/ with probability Pr( S x = h/)= - i 9 + = 9 4 - i 9 = 7 8, (3.3.0) and -h/ with probability Pr S x = - h ö = è ø - i 9 - = -i 9 9 = 8. (3.3.) If we measure S y, we shall get h/ with probability P( S y = h/)= - i 9 - i = 9-3i 9 = 3 8, (3.3.) and -h/ with probability Pr S y = - h ö = è ø - i 9 + i = 9 - i 9 = 5 8. (3.3.3) What happens if we perform two quick successive measurements of the same observable? As we know, once we have carried out a measurement the state vector 70

collapses, and consequently if the system did not have time to evolve, a quick new measurement will return the same value. What happens if we measure first S x and quickly after S z? Upon measuring S x, we shall get h/ with probability 7/8, and -h/ with probability /8. Suppose we get h/. Then the state vector will collapse onto x = ö. Hence, a measurement of S z will yield h/ or -h/, both with probability è ø /. EXAMPLE 3.3.3 We can now start to make sense of some of the peculiar results of the spin-half experiment discussed in chapter one. Since z = ( x + x ), half the particles will exit the SGX device (chapter, figure 4) in state x and half in state x. In addition, when we measure S x on either path, z collapses onto x or x ; however, x = ( z + z ) and x = ( z - z ), and therefore in either case when we measure S z on electrons on path C we obtain h/ or -h/, both with probability /. Similarly, if we block path A, only x electrons will reach path C, and upon measuring S z we obtain the same result. 3.4 Series of Stern-Gerlach Devices As we noted before, measuring S x randomizes the return values for S z. As we shall see, this is a manifestation of the Generalized Uncertainty Principle. We still do not know why we obtain the same results if we measure position instead of spin before the electrons reach C. We shall address this problem later. 7

Let us look at some examples in which Stern-Gerlach devices are in series. EXAMPLE 3.4. Now let us consider the following arrangement of Stern-Gerlach devices, and let us shoot a beam of spin-half particles in state x through it (Fig. ). z x SGZ x SGZ z SGX x SGX z z x Figure Half the particles will exit the first SGZ in state z and half in state z. So, only a probabilistic prediction can be given for the behavior of any given particle. Let us now block the spin down particles and send the spin up particles into a Stern-Gerlach apparatus oriented to measure the x-component of spin (SGX). A quick look at the table shows that half of the particles will exit in state x and half in state x. Again, only a probabilistic prediction can be given for the behavior of any given particle. If we send the x particles through another SGX apparatus, all of them will exit with x-spin down. Here, we can actually predict the behavior of any particle because each is in the eigenstate x. If we send the x particles through a SGZ apparatus, half of them will come out with z-spin up and half with z-spin down; again, only a probabilistic prediction can be given for the behavior of any given particle. EXAMPLE 3.4. Consider now the following setting with a stream of spin-half particles in state x (Fig. ). 7

z x X SGZ SGZ x x z z Figure Here the device marked as X is not quite a SGX device because it lacks the detector. In other words, as the particles go though it, no measurement takes place. If we want, we can expand the state of the particles entering X as z = ( x + x ). (3.4.) However, as there is no measurement, there is no collapse of the state function that, therefore, remains unaltered as the particles enter the second SGZ device. To determine what we are going to observe, we need to express the state vector of the particles in the standard basis, so that we obtain ( x + x )= z. (3.4.) Hence, all the particles will exit the second SGZ device in state z and therefore upon measurement, we obtain S z = h/. 3 3 Although we can manipulate the relevant mathematical apparatus successfully and make correct predictions, one can fairly say that nobody really knows much about the nature of spin. To be sure, the existence of spin can be inferred from experiment and from relativistic quantum mechanical considerations based on the assumption that angular momentum is conserved. However, we have no concrete picture of it since a classically rotating electron would have a surface linear velocity far exceeding the speed of light. There is general agreement that spin is an intrinsic property of quantum particles, but its 73

3.5 Expectation Values How do we test that quantum mechanics handles spin-half correctly? Obviously, not by experimenting on one particle. The reason is not merely that in general it is a good idea to perform many experiments to make sure that our measurements are accurate, but that in order to test a probabilistic prediction we need a very large number of identical particles all in the very same state Y on which we perform the exact same measurement. Such a collection of particles is called an ensemble. For example, if an electron is in state Y = - iö (3.5.) 9 è ø and we measure S z, quantum mechanics predicts that we shall get h/ with probability 5/9 and -h/ with probability 4/9. Hence, if we perform the measurement on an ensemble of 900,000 electrons in that state and we obtain h/ 500,000 times and -h/ 400,000 times, then we may consider the prediction verified. A way of finding out whether these proportions are in fact borne out experimentally is by checking out the average of the measurement returns for S z, and for that we need a brief statistical detour. Suppose we have a cart with seven boxes in it. Two boxes weigh 5kg each; three weigh 0kg each; one weighs 40kg, and one 60kg. Let us denote the number of boxes with N and the number of boxes of weight w with N(w ). Obviously, cause is unknown. Perhaps spin originates from the internal structure of quantum particles, although electrons seem to have no components, or perhaps it is a fundamental property not amenable to any explanation; its nature remains mysterious. 74

N = å N(w), (3.5.) w that is, 7 = N(5kg) + N(0kg) + N(40kg) + N(60kg). Suppose now that we pick a box randomly out of the cart. Let us denote the probability of picking a box of weight w with Pr(w). Then, Pr(w) = N(w) N. (3.5.3) For example, since there are 3 boxes weighing 0kg and 7 boxes in all, Pr(0kg) = 3/7. Notice now that Pr(5kg) + Pr(0kg) + Pr(40kg) + Pr(60kg) =, (3.5.4) and in general, å Pr(w) =, (3.5.5) w which is another way of saying that the probability of picking a box of any weight out of the cart is one. Suppose now that we want to determine <w>, the average or mean weight of the boxes in the cart. Obviously, we add the weights of all the boxes, and divide by the number of boxes. So, < w >= å w For example, < w >= N wn(w) N. (3.5.6) ( 5 +0 3+ 40 + 60 )kg = 0kg, (3.5.7) and therefore the average weight of a box is 0kg. 4 Let us now notice that 4 Notice, by the way, that no box weighs 0kg! 75

å w wn(w) = å w N(w) N N. (3.5.8) w Consequently, by plugging (3.5.8) into (3.5.6) and using (3.5.3), we obtain < w >= å wpr( w). 5 (3.5.9) w If we apply (3.5.6) to the previous case, we obtain: < w >= 5 7 +0 3 7 + 40 7 + 60 ö kg = 0kg. (3.5.0) è 7ø In quantum mechanics, the average is generally the quantity of interest, and it is called the expectation value. It is a very misleading, but ingrained, terminology because it strongly suggests that this quantity is the most likely outcome of a single measurement. But that is the most probable value, not the average value. For example, the most probable outcome of weighing (measuring) a box randomly picked from the cart is 0kg, since P(0kg) is the highest. By contrast, the expectation value is the average value, that is, 0kg. We can now go back to our quantum mechanical problem. Recall that we wanted to determine the predicted expectation value for Sz for a particle in the state given by (3.5.). By applying (3.5.8) we obtain < S z >= h 5 9 - h 4 9 = h 8. (3.5.) 5 This is a special case of the important general law < f (w) >= å f (w)pr(w), where f (w) is any function of w. w= 0 76

Hence, if the actual measurement returns satisfy (3.5.), the prediction will be borne out. There is a nice formula for determining the expectation value of an observable O, namely, < O >= Y ˆ O Y ; (3.5.) in other words, we sandwich the relevant operator in an inner product between the vector states of the system. We leave the proof of (3.5.) as an exercise. EXAMPLE 3.5. Consider again an ensemble of particles in state Y = - iö. Let us calculate 9 è ø the expectation value for S z. By sandwiching, we obtain < S z >= Y 0 ö Y = + i è 0 -ø è 3 - iö ö h 0 ö 3 3ø è 0 -ø, (3.5.3) è 3 ø once we remember that a bra is the complex conjugate transpose of its ket. Then, < S z >= h + i è 3 - ö 3ø è - i 3 3 ö = h 5 9-4 ö = h è 9ø 8. (3.5.4) ø We can easily verify that this result jibes with the results obtained in the previous example. 77

Exercises Exercise 3.. Let Y = i ö be the state of a spin-half particle. Determine: a) the è 3i +ø probability of obtaining h/ upon measuring S x ; b) the probability of obtaining h/ upon measuring S z.. Let Y = 3+ iö be the state of a spin-half particle. Determine: a) the 5 è - iø probability of obtaining -h/ upon measuring S y ; b) the probability of obtaining h/ upon measuring S z. 3. Determine the eigenvalues, the normalized eigenvectors, and the related probabilities for ˆ S y. Exercise 3.. Consider an ensemble of particles in state Y = 3+ iö. Determine the 5 è - iø expectation value for S z.. Consider an ensemble of particles in state Y = 3iö. Determine the expectation 3 è ø value for S x Exercise 3.3. A stream of spin-half particles in state Y = 3+ iö enters a SGX device. 5 è - iø Provide the probabilities and states of the out-coming particle streams. 78

. A stream of spin-half particles in state Y = 3iö enters a SGY device. Provide 3 è ø the probabilities and states of the out-coming particle streams. 3. Prove that < O >= Y ˆ O Y for a two-dimensional vector space. [Hint: ˆ O is Hermitian; therefore Y ˆ O Y = eigenvectors.] O ˆ Y Y and Y can be expressed in terms of O ˆ s 79

Answers to the Exercises Exercise 3. a:pr S x = h ö = è ø i + 3i + = ( 4i +) (-4i +)= 7. b:pr S z = h ö = ( è ø i )(-i )=. a:pr S y = - h ö = è ø 5 b:pr S z = h ö = ( è ø 5 3+ i )( 3- i)= 3. 4 + 3i = 30 4 + 3i ( ) 4-3i ( )= 5 6. 3. We start by determining eigenvalues and eigenvectors of S ˆ y = h 0 -iö. è -i 0 ø ì h 0 -iö x ö = l x ö ï lx + hi Þ x = 0 í è -i 0 ø è x ø è x ø ï - hi x - lx = 0 î Þ 0 = l - hi hi -l = -l + hö. Hence, è ø l = -h/, and l = h/. For l, we obtain x = -ix, and therefore the eigenvector ö X l = x ; since X l X l =+=, the normalized eigenvector is è -iø ö. For l è -iø ö we obtain x = ix, and therefore the eigenvector X l = x ; since X l X l =+=, è iø the normalized eigenvector is ö. Since S ˆ y is Hermitian, its eigenvectors span the è iø space, and therefore the standard sm vector can be expressed as a linear combination of aö them. Hence, = c è b y + c y = c ö + cö. This is equivalent to the ø è -ic ø è ic ø 80

following two simultaneous equations: ì a = c + c ï í b = i(c. The top equation yields ï - c ) î c = a - c, which we plug into the bottom equation to obtain c = b + ai i = a - ib. Hence, c = a - a + ib = a + ib. Consequently, we have Y = a + ib ö + a - ib ö, the state vector expressed in terms of the eigenvectors of S ˆ y. è -iø è iø So, the probability of obtaining l = -h/ is a + ib, and that of obtaining l = h/ is a - ib. Exercise 3.. < S z >= h 3- i è 5 3+ iö + iö 0 ö 5 5 ø è 0 -ø - i = h 3- i è 5 è 5 ø 3+ iö - + i ö 5 5 ø - i = h 6. è 5 ø. < S x >= h è -3i 3 ö 0 ö 3ø è 0ø è 3i ö 3 = h è 3ø 3-3i ö 3ø è 3i ö 3 = 0. 3ø Exercise 3.3. Two particle streams come out. The first is composed of particles in state x with spin S x = h/. By using the table, we find that the probability that a particle belongs to this stream is 5 3+ i + - i = 5. The second particle stream is composed of particles 6 8

in state x with spin S x = -h/. By using the table, we find that the probability that a particle belongs to this stream is 5 3+ i - + i = 6. Note that, of course, 5 6 + 6 =.. Two particle streams come out. The first is composed of particles in state y with spin S y = h /. By using the table, we find that the probability that a particle belongs to this stream is 3i - i =. The second particle stream is composed of particles in 3 6 state y with spin S y = -h /. By using the table, we find that the probability that a particle belongs to this stream is 3i - i = 5 3 6. 3. Let Y = c e + c e, where{ e, e } is the basis made up of the eigenvectors of O ˆ. Since ˆ O is Hermitian, Y ˆ O Y = ˆ O Y Y = where l and l are ˆ O s eigenvalues. But O ˆ ( c e + c e ) Y = ( c l e + c l e ) Y, ( c l e + c l e ) Y = c * l e Y + c * l e Y = c * l c + c * l c = l c + l c. However, l c + l c = l Pr( l )+ l Pr( l ) =< O >. The extension to higher dimensional spaces is immediate. 8