The formalism of Quantum Mechanics

I. INTRODUCTION

The formalism of quantum mechanics is discussed in chapter 3 in Griffiths. There is also an Appendix on Linear Algebra. You might read these sections in addition to these notes, and in a few places I will include comments on Griffiths' presentation. Although these notes are self-contained, they cover a more general case than Griffiths, so I recommend that you study section 3.4 on the statistical interpretation of the wave function. Section 3.5 on the generalized uncertainty relation is not covered in these notes. Here you will directly be introduced to the Dirac formalism, which Griffiths treats in section 3.6 but never uses fully. It is a very powerful formalism and not difficult to learn. It brings out the structure of quantum mechanics in a simple way and has also become increasingly important in many modern applications of quantum mechanics, such as quantum information.

The mathematics of quantum mechanics is linear algebra. However, there are a couple of differences compared to the linear algebra that you have learned in your mathematics courses. These are:

1. A vector is denoted as $|\alpha\rangle$ (or $|\beta\rangle$). This is a trivial change of notation, but it is surprisingly useful.

2. The vectors are complex, i.e., they have complex components. This is a simple generalization. Just some conjugation stars ($^*$'s) here and there (well...).

3. The vectors may have an infinite number of components, $a_i$, $i = 1, 2, 3, \ldots$, and, more importantly, they may have a continuum of components, $a(x)$, $-\infty < x < \infty$; note that here $x$ is a real number. The set of components is in this case a function of the real variable $x$ (and we use the standard notation for functions, $a(x)$, instead of $a_x$, which would be more analogous to $a_i$).

Item 3 above is in general a highly non-trivial generalization of linear algebra, and some of the properties that hold in finite dimensional vector spaces may no longer be true. Fortunately, however, for the vectors that occur in quantum mechanics this does not happen and the

generalization is simple! Thus we need not worry about the things a mathematician would start to think about. We will just generalize the standard linear algebra to vectors whose components are functions.

Why is it that the vectors of quantum mechanics have this nice property? Well, it is really something required by physics. The properties in question are namely related to the Born interpretation of the wave function as a probability amplitude, and what can go wrong is that probabilities would not add up to one. This is of course not acceptable for a physical theory, so in quantum mechanics we only use those vector spaces for which this holds true. It might then be a tricky mathematical problem to prove these properties, i.e. to prove that we are using a consistent mathematical model of a physical system. For the cases you will encounter in this course you can rest assured that these problems have been sorted out.[1]

How linear algebra enters quantum mechanics will be explored and explained later, but it might be good to know from the start what to look for. Here is a brief preview and an illustrative example:

1. The state of a system is described by a vector.

2. Observables, i.e. things that can be measured, correspond to operators that implement linear transformations of the vectors.

3. A crucial feature of quantum mechanics is the principle of superposition: the linear combination $a\psi(x) + b\phi(x)$ of two wave functions is a possible wave function for the system. In linear algebra this is just the basic property of a vector space, which says that a linear combination of vectors is a new vector in the space (see (6) below).

Example: You have already encountered the eigenfunctions $\psi_n(x)$ and energy eigenvalues $E_n = \hbar\omega(n + \frac{1}{2})$ for the harmonic oscillator hamiltonian. It turned out to be very useful to introduce the ladder operators $a$ and $a^\dagger$, satisfying the commutation relation

$$[a, a^\dagger] = 1. \qquad (1)$$

Their action on the normalized eigenfunctions is

$$a|n\rangle = \sqrt{n}\,|n-1\rangle, \qquad a^\dagger|n\rangle = \sqrt{n+1}\,|n+1\rangle. \qquad (2)$$

From these we formed the number operator $\hat N = a^\dagger a$. Note that all these operators are differential operators acting on wave functions $\psi(x)$, and that the origin of the commutation relation (1) is the operator identity $\frac{d}{dx}\, x = x\, \frac{d}{dx} + 1$.

Now, there is another type of object that in general does not commute, namely matrices. So it is quite natural to ask whether the commutation relation (1) could be represented by matrices. It is not hard to prove that this is in fact not possible using finite matrices. However, if we allow for infinite matrices and define

$$a = \begin{pmatrix} 0 & \sqrt{1} & 0 & 0 & \cdots \\ 0 & 0 & \sqrt{2} & 0 & \cdots \\ 0 & 0 & 0 & \sqrt{3} & \cdots \\ \vdots & & & & \ddots \end{pmatrix}, \qquad a^\dagger = \begin{pmatrix} 0 & 0 & 0 & \cdots \\ \sqrt{1} & 0 & 0 & \cdots \\ 0 & \sqrt{2} & 0 & \cdots \\ 0 & 0 & \sqrt{3} & \cdots \\ \vdots & & & \ddots \end{pmatrix}, \qquad (3)$$

it is clear (just do the multiplication!) that (1) is satisfied, if the 1 on the RHS is interpreted as the infinite diagonal unit matrix, $1 = \mathrm{diag}(1, 1, \ldots)$. The number operator takes the form

$$\hat N = \mathrm{diag}(0, 1, 2, 3, \ldots). \qquad (4)$$

If we now let these matrices act on the column vectors

$$\psi_0 = \begin{pmatrix} 1 \\ 0 \\ 0 \\ \vdots \end{pmatrix}, \qquad \psi_1 = \begin{pmatrix} 0 \\ 1 \\ 0 \\ \vdots \end{pmatrix}, \qquad \psi_2 = \begin{pmatrix} 0 \\ 0 \\ 1 \\ \vdots \end{pmatrix}, \qquad \text{etc.,} \qquad (5)$$

the relations (2) are fulfilled. Since the hamiltonian is given by the operator relation $H = \hbar\omega(\hat N + \frac{1}{2})$, we see that the energy eigenvalue for the state $\psi_n$ is $E_n = \hbar\omega(n + \frac{1}{2})$, as expected.
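The "just do the multiplication!" check is easy to carry out numerically. Below is a minimal numpy sketch (not part of the original notes) that truncates the infinite matrices (3) to a finite block; the commutator then equals the unit matrix everywhere except in the last row and column, which is an artifact of the truncation:

```python
import numpy as np

# Truncate the infinite matrices (3) to an N x N block.
N = 8
a = np.diag(np.sqrt(np.arange(1, N)), k=1)  # entries sqrt(1), sqrt(2), ... above the diagonal
adag = a.conj().T                           # hermitian conjugate (just the transpose here)

# [a, a^dagger] = 1 holds exactly away from the truncation edge.
comm = a @ adag - adag @ a
print(np.allclose(comm[:-1, :-1], np.eye(N - 1)))                  # True

# The number operator (4) is diagonal with entries 0, 1, 2, ...
print(np.allclose(adag @ a, np.diag(np.arange(N, dtype=float))))   # True
```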

We thus have two alternative descriptions of the harmonic oscillator states: either in terms of wave functions $\psi_n(x)$ and differential operators acting on these functions, or in terms of (infinite) vectors $\psi_n$ and matrices acting upon them. We say that these two descriptions are different representations of the quantum mechanical system. Mathematically, the function $\psi_n(x)$ and the infinite column vector $\psi_n$ are just the components of the same abstract state vector $|n\rangle$ in different bases. The first corresponds to the position basis $|x\rangle$ and the second to the energy basis $|E\rangle$. All this will be explained in detail in the following.

II. DIRAC'S NOTATION

A vector is denoted as $|\alpha\rangle$ (called a ket-vector or simply a ket), or as $\langle\beta|$ (called a bra-vector or simply a bra). This is standard quantum mechanics notation introduced by Dirac.[2] Note that a bra and a ket together give a bracket $\langle\alpha|\beta\rangle$ (this is a physicist's sense of a joke). $\langle\alpha|\beta\rangle$ denotes the scalar product of the vectors $|\alpha\rangle$ and $|\beta\rangle$.

III. COMPLEX VECTORS

A vector space consists of vectors and scalars. Two vectors $|\alpha\rangle$, $|\beta\rangle$ can be added and multiplied by scalars $a$, $b$, and this gives a new vector $|\gamma\rangle$,

$$|\gamma\rangle = a|\alpha\rangle + b|\beta\rangle, \qquad (6)$$

in the vector space. As noted above, this is the mathematical formulation of the quantum mechanical superposition principle. A vector can be expanded in a set of linearly independent basis vectors $|e_i\rangle$, $i = 1, 2, \ldots, n$:

$$|\alpha\rangle = \sum_{i=1}^{n} a_i |e_i\rangle. \qquad (7)$$

The coefficients $a_i$ are complex scalars and are called the components of the vector; they describe the vector completely.[3][4] $n$ is the dimension of the vector space. For now we shall take $n$ to be finite, but later we shall also discuss vector spaces with infinitely many components, such as the one considered in the example in section I.

You are probably used to the scalars being real numbers. In the vector spaces that occur in quantum mechanics the scalars are, in general, complex (they must be, since the wave function is complex, and the wave function is an example of components of a vector). This leads to some simple changes.

The scalar product of two vectors $|\alpha\rangle$, $|\beta\rangle$ is denoted $\langle\alpha|\beta\rangle$ and has the properties

$$\langle\beta|\alpha\rangle = \langle\alpha|\beta\rangle^* \qquad (8)$$

$$\langle\alpha|\alpha\rangle \geq 0, \quad \text{and} \quad \langle\alpha|\alpha\rangle = 0 \ \text{if and only if} \ |\alpha\rangle = 0 \qquad (9)$$

$$\langle\gamma|\,(a|\alpha\rangle + b|\beta\rangle) = a\langle\gamma|\alpha\rangle + b\langle\gamma|\beta\rangle \qquad (10)$$

Note the complex conjugation that enters in the first relation. Let $|e_i\rangle$ be an orthonormal basis,

$$\langle e_i|e_j\rangle = \delta_{ij}. \qquad (11)$$

The components of $|\alpha\rangle = \sum_{i=1}^{n} a_i|e_i\rangle$ are then

$$a_i = \langle e_i|\alpha\rangle \qquad (12)$$

and the scalar product of $|\alpha\rangle$ and $|\beta\rangle = \sum_{i=1}^{n} b_i|e_i\rangle$ is

$$\langle\alpha|\beta\rangle = \sum_{i=1}^{n} a_i^* b_i, \qquad (13)$$

and the square of the norm of the vector $|\alpha\rangle$ is

$$\langle\alpha|\alpha\rangle = \sum_{i=1}^{n} a_i^* a_i. \qquad (14)$$

To each ket $|\alpha\rangle = \sum_{i=1}^{n} a_i|e_i\rangle$, there is a corresponding bra

$$\langle\alpha| = \sum_{i=1}^{n} \langle e_i|\, a_i^*. \qquad (15)$$

Note the complex conjugation that enters on the components of $\langle\alpha|$; this follows from (8) and makes the norm, i.e. the length of the vector, real and non-negative. I will frequently use the simplified notation $|i\rangle = |e_i\rangle$.
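As a quick numerical illustration of (8), (13) and (14), here is a small sketch (with arbitrary made-up components): numpy's vdot conjugates its first argument, which is exactly the convention of (13).

```python
import numpy as np

# Arbitrary complex components a_i, b_i in an orthonormal basis.
a = np.array([1 + 2j, 0.5j, -1.0])
b = np.array([0.3, 1 - 1j, 2j])

# np.vdot conjugates its first argument: <alpha|beta> = sum_i a_i^* b_i, eq. (13).
print(np.isclose(np.vdot(b, a), np.vdot(a, b).conjugate()))  # property (8)

# The squared norm (14) is real and non-negative because of the conjugation.
norm2 = np.vdot(a, a)
print(np.isclose(norm2.imag, 0.0) and norm2.real >= 0)       # True
```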

IV. LINEAR TRANSFORMATIONS

A linear transformation $\hat T$ takes a vector $|\alpha\rangle$ and transforms it into another vector $|\alpha'\rangle = \hat T|\alpha\rangle$ in such a way that the linearity condition

$$\hat T(a|\alpha\rangle + b|\beta\rangle) = a\,\hat T|\alpha\rangle + b\,\hat T|\beta\rangle \qquad (16)$$

is fulfilled. $\hat T$ is called an operator. Because of this linearity relation, the action of the operator $\hat T$ on an arbitrary vector can be obtained from its action on the basis vectors $|i\rangle$:

$$\hat T|j\rangle = \sum_{i=1}^{n} T_{ij}|i\rangle \qquad \text{Note order of indices!} \qquad (17)$$

$$T_{ij} = \langle i|\hat T|j\rangle. \qquad (18)$$

$T_{ij}$ are the components of the operator $\hat T$; they form an $n \times n$ matrix $T$. If $|\alpha\rangle = \sum_i a_i|i\rangle$ and $|\alpha'\rangle = \sum_i a'_i|i\rangle$, we introduce the notation $a$ and $a'$ for the column vectors (i.e. the $n \times 1$ matrices) with components $a_i$ and $a'_i$ respectively. Then we can write $|\alpha'\rangle = \hat T|\alpha\rangle$ in matrix language as

$$\sum_i a'_i|i\rangle = \hat T \sum_j a_j|j\rangle = \sum_{i,j} a_j T_{ij}|i\rangle \qquad (19)$$

$$a'_i = \sum_j T_{ij} a_j \qquad (20)$$

$$a' = T a. \qquad (21)$$

Note that matrix equations, such as (19), depend on the particular basis $|i\rangle$ used.

V. HERMITIAN TRANSFORMATIONS

If $\hat T$ is an operator, then one defines the hermitian conjugate operator $\hat T^\dagger$ as follows:

$$\langle\alpha|\hat T^\dagger|\beta\rangle = \langle\beta|\hat T|\alpha\rangle^* \qquad (22)$$

(for all vectors $|\alpha\rangle$, $|\beta\rangle$).

[The definition (22) is equivalent to Griffiths' [A.87], $\langle\hat T\alpha|\beta\rangle = \langle\alpha|\hat T^\dagger\beta\rangle$ ([3.83] in the first edition). His notation is confusing and I will try to avoid it, but you need to know it in order to read and understand the textbook. What stands inside $|\ldots\rangle$ is just a name for the vector, and a natural notation is then $|\hat T\beta\rangle = \hat T|\beta\rangle$. Using this notation one finds

$$\langle\hat T\alpha|\beta\rangle = \langle\beta|\hat T\alpha\rangle^* = \langle\beta|\hat T|\alpha\rangle^*, \qquad \langle\alpha|\hat T^\dagger\beta\rangle = \langle\alpha|\hat T^\dagger|\beta\rangle, \qquad (23)$$

which, when inserted in Griffiths' eq. [A.87], gives (the complex conjugate of) our definition (22). Note that $\langle\hat T\alpha|$ is not equal to $\hat T\langle\alpha|$, whereas $|\hat T\alpha\rangle = \hat T|\alpha\rangle$.]
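In components, (22) says that $T^\dagger$ is the conjugate transpose of the matrix $T$. A small numerical check, with randomly generated components standing in for a generic operator (this sketch is not from the notes):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 4
T = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))   # a generic operator
alpha = rng.normal(size=n) + 1j * rng.normal(size=n)
beta = rng.normal(size=n) + 1j * rng.normal(size=n)

Tdag = T.conj().T   # hermitian conjugate = conjugate transpose

# Definition (22): <alpha|T^dagger|beta> = <beta|T|alpha>^*
print(np.isclose(np.vdot(alpha, Tdag @ beta),
                 np.vdot(beta, T @ alpha).conjugate()))   # True
```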

A very important type of operator in quantum mechanics is one whose hermitian conjugate equals the operator itself:

$$\hat T^\dagger = \hat T. \qquad (24)$$

Such an operator is said to be hermitian or self-adjoint. It follows from (22) that the components of a hermitian operator obey $(T_{ji})^* = T_{ij}$. The matrix $T$ is then said to be hermitian, and this condition is written as $T^\dagger = T$.

For a product of operators one has the useful rule

$$(\hat S\hat T)^\dagger = \hat T^\dagger \hat S^\dagger. \qquad (25)$$

This can be proven using (22), or by using components, i.e. matrix notation. Note the change of order of the operators.

Hermitian operators are of crucial importance in quantum mechanics, since any observable, such as e.g. position, momentum, angular momentum, energy, spin, etc., for any physical system corresponds to a hermitian operator.[5]

VI. EIGENVALUES AND EIGENVECTORS

Given an operator $\hat T$, consider the equation

$$\hat T|\alpha\rangle = \lambda|\alpha\rangle, \qquad (26)$$

where $\lambda$ is a scalar. This is an eigenvalue equation. Vectors $|\alpha\rangle$ that solve this equation are called eigenvectors of the operator $\hat T$, since they are mapped into themselves by $\hat T$, and the $\lambda$ are called eigenvalues. In matrix notation this equation becomes

$$Ta = \lambda a \quad \text{or, in components,} \quad \sum_{j=1}^{n} T_{ij} a_j = \lambda a_i. \qquad (27)$$

You have encountered this equation in your linear algebra courses. Rewriting it as $(T - \lambda 1)a = 0$, one sees that it has nontrivial solutions $a \neq 0$ only if $\det(T - \lambda 1) = 0$. Solving this characteristic equation (the "secular equation"), which is an $n$th order polynomial equation in $\lambda$, determines the $n$ eigenvalues $\lambda_i$, $i = 1 \ldots n$. Having obtained the eigenvalues, one can determine the corresponding eigenvectors by solving (26). If $T$ happened to be a diagonal matrix then, obviously, the eigenvalues are simply the diagonal elements. In general, to find the

eigenvalues is equivalent to diagonalizing the matrix $T$ by a change of basis vectors. $T$ is diagonal when the basis vectors are the eigenvectors.

The set of all eigenvalues $\{\lambda_i\}$ and eigenvectors $\{|i, s\rangle\}$ of an operator is called the spectrum of the operator. If there is only one eigenvector $|i\rangle$ with a particular eigenvalue $\lambda_i$, then this eigenvalue is said to be non-degenerate; if there are $d$ linearly independent eigenvectors $|i, s\rangle$, $s = 1 \ldots d$, with eigenvalue $\lambda_i$, this eigenvalue is said to be $d$-fold degenerate. From now on we shall consider the non-degenerate case.

If $\hat T$ is a hermitian operator, then the following holds:

1. All eigenvalues are real.

2. Eigenvectors corresponding to different eigenvalues $\lambda \neq \lambda'$ are orthogonal.

3. The eigenvectors span the vector space.

Item 3 means that the eigenvectors of a hermitian operator can be chosen as a basis $|i\rangle$, $i = 1, 2, \ldots, n$, in the vector space. If all eigenvalues are non-degenerate, the corresponding eigenvectors are automatically orthogonal because of 2. For a degenerate eigenvalue, one may have to take linear combinations of the obtained eigenvectors to obtain orthonormal basis vectors: $\langle i|j\rangle = \delta_{ij}$. Properties 1 and 2 are simple to prove, see Griffiths. Property 3 is obvious if all eigenvalues are different, and can be shown to hold in general.

In quantum mechanics, an observable, e.g. the energy, is represented by a hermitian operator, and the possible results of a measurement of this observable are the eigenvalues of this operator. These must be real, and they are, since the operator is hermitian. After one has measured the observable and obtained one of the eigenvalues, the state of the system is described by the corresponding eigenvector; this is the so-called collapse of the wave function.
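Properties 1 and 2 are easy to check numerically with numpy's eigh, which solves the eigenvalue problem (26)-(27) for hermitian matrices; the 2x2 matrix below is an arbitrary hermitian example, chosen only for illustration:

```python
import numpy as np

T = np.array([[2.0, 1 - 1j],
              [1 + 1j, 3.0]])          # T = T^dagger, cf. (24)
assert np.allclose(T, T.conj().T)

lam, v = np.linalg.eigh(T)             # eigenvalues and eigenvectors (as columns)
print(lam)                             # real, property 1
print(np.isclose(np.vdot(v[:, 0], v[:, 1]), 0))   # orthogonal, property 2
# Each eigenvalue solves the characteristic equation det(T - lambda*1) = 0:
print(np.allclose([np.linalg.det(T - l * np.eye(2)) for l in lam], 0))
```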

VII. SUMMARY OF RESULTS AND THE RESOLUTION OF THE IDENTITY

For a hermitian operator we have from the above

$$\hat T^\dagger = \hat T \qquad (28)$$

$$\hat T|i\rangle = \lambda_i|i\rangle \qquad (29)$$

$$|i\rangle, \ i = 1, 2, \ldots, n \quad \text{basis of eigenvectors,} \quad \lambda_i \ \text{real eigenvalues} \qquad (30)$$

$$\langle i|j\rangle = \delta_{ij} \quad \text{orthonormal basis} \qquad (31)$$

$$|\psi\rangle = \sum_{i=1}^{n} \psi_i|i\rangle, \qquad \psi_i = \langle i|\psi\rangle \qquad (32)$$

$$|\psi\rangle = \sum_{i=1}^{n} |i\rangle\langle i|\psi\rangle = \Big(\sum_{i=1}^{n} |i\rangle\langle i|\Big)|\psi\rangle. \qquad (33)$$

From the last equation we read off the operator identity (called the "resolution of the identity")

$$1 = \sum_{i=1}^{n} |i\rangle\langle i|. \qquad (34)$$

Note that here 1 is an operator (and should perhaps be denoted by $\hat 1$). The identity (34) is a consequence of the vectors $|i\rangle$ forming a basis. This identity is very useful; it can be inserted anywhere in an equation, since it is just the identity operator, and many useful formulae can be derived almost without effort, for example the components of $|\psi\rangle$ above. As a further example we derive the components of an arbitrary operator $\hat A$ in the basis $\{|i\rangle\}$ defined by the hermitian operator $\hat T$:

$$\hat A = 1\hat A 1 = \sum_{i,j} |i\rangle\langle i|\hat A|j\rangle\langle j| = \sum_{i,j} |i\rangle A_{ij}\langle j|, \qquad (35)$$

$$A_{ij} = \langle i|\hat A|j\rangle.$$

Note that $\hat A$ does not have to be hermitian. In particular, if we calculate the components of the operator $\hat T$ itself in its own basis we get

$$T_{ij} = \langle i|\hat T|j\rangle = \lambda_i\langle i|j\rangle = \lambda_i\delta_{ij}, \qquad (36)$$

which just expresses the trivial fact that the operator $\hat T$ is diagonal in the basis defined by its own eigenvectors.
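The resolution of the identity (34) and the diagonal form (36) are also easy to verify numerically; a brief sketch, reusing the eigenvectors returned by eigh as the orthonormal basis $|i\rangle$:

```python
import numpy as np

T = np.array([[2.0, 1 - 1j],
              [1 + 1j, 3.0]])                  # hermitian operator
lam, v = np.linalg.eigh(T)                     # columns of v are the basis |i>

# Resolution of the identity (34): sum_i |i><i| = 1.
ident = sum(np.outer(v[:, i], v[:, i].conj()) for i in range(2))
print(np.allclose(ident, np.eye(2)))           # True

# Components of T in its own eigenbasis, eq. (36): T_ij = lambda_i delta_ij.
print(np.allclose(v.conj().T @ T @ v, np.diag(lam)))   # True
```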

So far everything was simple: every hermitian operator in a finite dimensional vector space has a complete set of eigenvectors that can be chosen as basis vectors (and the eigenvalues are real). As mentioned above, the only new thing compared to the linear algebra you have met in your mathematics courses is that the vector space is complex, but this was easily taken care of.

We now want to extend the theory of linear algebra to infinite dimensional vector spaces. The vector index $i$ may run from 1 to $\infty$ (i.e. $n \to \infty$ in the formulae above), but it may also be that the vector index becomes a continuous variable $x$, which takes, for example, all real values. The components $\psi_i$ of a vector then become a function of $x$, $\psi(x)$; thus functions can be thought of as components of vectors.

Formally, the generalization to infinite dimensional vector spaces is simple. However, mathematical difficulties may arise. For example, it may happen that the scalar product of two vectors is infinite. Moreover, the theorems that hold for a hermitian operator in a finite dimensional vector space (that the eigenvectors span the space) do not in general hold for a hermitian operator in an infinite dimensional vector space. Thus infinite dimensional vector spaces are mathematically much trickier than their finite dimensional counterparts. See Griffiths for a discussion of these problems.

However, it so happens that the infinite dimensional vector spaces that occur in quantum mechanics are simple and behave exactly as their finite dimensional brethren! In particular, the hermitian operators that occur in quantum mechanics do have eigenvectors that span the vector space and hence can be used as a basis. These eigenvectors may have an infinite norm; however, this infinity is of the same fairly innocent type as we encountered for the free particle. Thus, for the purposes of quantum mechanics, one can generalize linear algebra to infinite dimensional vector spaces almost trivially, without worrying about whether things are well-defined or not. Quantum mechanics guarantees that this is the case. By using the bra and ket notation of Dirac, the formulae and the calculations will look almost the same also when there is a continuum of components.

In my opinion, Griffiths puts too much emphasis on the potential mathematical difficulties in infinite dimensional vector spaces, and he also stops using the Dirac notation that he has introduced. This obscures the close analogy with the finite dimensional case. Here, we will use Dirac's vector notation and trust that the mathematics works just as in the finite dimensional case, as is required for the consistency of quantum mechanics.

VIII. THE INFINITE DIMENSIONAL DISCRETE SPECTRUM: $i = 1, 2, \ldots$

As noted above, this generalization is trivial: just make the replacement $n \to \infty$ in the formulae above.

IX. THE CONTINUOUS SPECTRUM: $i \to x$

There are two new things: the sum is replaced by an integral and the Kronecker delta is replaced by Dirac's delta function[6]

$$\sum_{i=1}^{n} \to \int dx \qquad (37)$$

$$\delta_{ij} \to \delta(x - x'). \qquad (38)$$

The equations (28-33) become

$$\hat T^\dagger = \hat T \qquad (39)$$

$$\hat T|x\rangle = \lambda_x|x\rangle \qquad (40)$$

$$|x\rangle, \ -\infty < x < \infty \quad \text{basis of eigenvectors,} \quad \lambda_x \ \text{real eigenvalues} \qquad (41)$$

$$\langle x|x'\rangle = \delta(x - x') \quad \text{orthonormal basis} \qquad (42)$$

$$|\psi\rangle = \int dx\, \psi(x)|x\rangle, \qquad \psi(x) = \langle x|\psi\rangle \qquad (43)$$

$$|\psi\rangle = \int dx\, |x\rangle\langle x|\psi\rangle = \Big(\int dx\, |x\rangle\langle x|\Big)|\psi\rangle. \qquad (44)$$

$x$ here can be the position of a particle, but can also be any other continuum variable, such as the momentum or the energy for the scattering states.[7] From the last equation we read off the operator identity (called the "resolution of the identity")

$$1 = \int dx\, |x\rangle\langle x|. \qquad (45)$$

Compare this to the finite dimensional case above! The resolution of the identity can be used to derive results just as above, using the defining property of Dirac's delta function

$$\int dx'\, \delta(x - x')f(x') = f(x). \qquad (46)$$

Note that Dirac's delta function vanishes when $x \neq x'$ and is infinite when $x = x'$. Thus eigenvectors corresponding to different eigenvalues are orthogonal. However, the norm of the eigenvectors is infinite. (Compare the notes on the free particle.)

Comment: It is also possible to have an operator whose spectrum (i.e. set of eigenvalues) consists of both a discrete part and a continuous part; the Hamiltonian for the hydrogen atom is an example of this, and so is the delta-potential well. In these cases, one will have both an integral and a sum in the formulae.

X. THE SCALAR PRODUCT

It is straightforward to derive the expression for the scalar product in terms of the components of the vectors, Eq. [3.6] in Griffiths:

$$|\psi\rangle = \int dx\, |x\rangle\psi(x), \qquad |\phi\rangle = \int dx\, |x\rangle\phi(x) \qquad (47)$$

$$\langle\phi|\psi\rangle = \Big(\int dx\, \phi^*(x)\langle x|\Big)\Big(\int dx'\, |x'\rangle\psi(x')\Big) \qquad (48)$$

$$= \int dx \int dx'\, \phi^*(x)\psi(x')\langle x|x'\rangle; \qquad (49)$$

using $\langle x|x'\rangle = \delta(x - x')$ and the defining property of the delta function (46), we arrive at the important result

$$\langle\phi|\psi\rangle = \int dx\, \phi^*(x)\psi(x). \qquad (50)$$

Compare this expression to the finite dimensional one, eq. (13).

Note that Griffiths takes the expression (50), in terms of the components $\psi(x)$ of the vector $|\psi\rangle$, as the definition of the scalar product. This is possible, but obscures the general structure and the similarities to the finite dimensional case. The integral in (50) must exist; this leads to restrictions on the allowed functions $\psi(x)$, see Griffiths.

XI. CHANGE OF BASIS

Using the resolution of the identity it is very easy to perform a change of basis. We illustrate this here with the discrete case, but the formulae translate immediately to the general case. Let $|e_i\rangle$, $i = 1, \ldots, n$ and $|f_i\rangle$, $i = 1, \ldots, n$ be two sets of orthonormal basis vectors (they may, for example, be the eigenvectors of two operators corresponding to two physical observables).

There are now two resolutions of the identity:

$$1 = \sum_i |e_i\rangle\langle e_i| = \sum_i |f_i\rangle\langle f_i|. \qquad (51)$$

Using these, one finds immediately the linear transformation that relates the two sets of basis vectors,

$$|e_i\rangle = \sum_j |f_j\rangle\langle f_j|e_i\rangle. \qquad (52)$$

A vector $|\psi\rangle$ can be expanded as

$$|\psi\rangle = \sum_i |e_i\rangle\langle e_i|\psi\rangle = \sum_i |f_i\rangle\langle f_i|\psi\rangle \qquad (53)$$

and its components are related as

$$\langle e_i|\psi\rangle = \sum_j \langle e_i|f_j\rangle\langle f_j|\psi\rangle, \qquad (54)$$

again using the resolution of the identity. Defining the matrix $S$ with components $S_{ij} = \langle e_i|f_j\rangle$, this can be written in matrix notation as $\psi^{(e)} = S\psi^{(f)}$, where $\psi^{(e/f)}$ is the column vector with elements $\langle e_i/f_i|\psi\rangle$. For the components of an operator $\hat T$ one finds

$$\langle e_i|\hat T|e_j\rangle = \sum_{k,m} \langle e_i|f_k\rangle\langle f_k|\hat T|f_m\rangle\langle f_m|e_j\rangle. \qquad (55)$$

In matrix notation this becomes

$$T^{(e)} = S\, T^{(f)} S^{-1}, \qquad (56)$$

where $S^{-1}$ is the matrix with components $(S^{-1})_{ij} = \langle f_i|e_j\rangle$. Using the resolution of the identity one verifies that $S^{-1}S = 1$; hence $S^{-1}$ is the inverse of the matrix $S$. The transformation (56) is called a similarity transformation. Diagonalizing the matrix $T^{(f)}$ by finding a similarity transformation $S$ such that $T^{(e)}$ is diagonal is equivalent to transforming to a basis consisting of the eigenvectors of $\hat T$. (We assume that $\hat T$ corresponds to a physical observable, so that a basis of eigenvectors exists.)
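Here is a numerical sketch of the similarity transformation (56), with two randomly generated orthonormal bases (obtained from QR decompositions of random matrices, an implementation detail not in the notes):

```python
import numpy as np

rng = np.random.default_rng(1)
n = 3
# Two random orthonormal bases: columns of E are |e_i>, columns of F are |f_i>.
E, _ = np.linalg.qr(rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n)))
F, _ = np.linalg.qr(rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n)))

S = E.conj().T @ F          # S_ij = <e_i|f_j>
Sinv = F.conj().T @ E       # (S^-1)_ij = <f_i|e_j>
print(np.allclose(Sinv @ S, np.eye(n)))    # S^-1 S = 1

# A generic operator, expressed by its components in each basis:
T = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
Te = E.conj().T @ T @ E     # T^(e)_ij = <e_i|T|e_j>
Tf = F.conj().T @ T @ F     # T^(f)_ij = <f_i|T|f_j>
print(np.allclose(Te, S @ Tf @ Sinv))      # the similarity transformation (56)
```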

XII. THE SCHRÖDINGER EQUATION

The time dependent Schrödinger equation in Dirac notation reads

$$\hat H|\Psi\rangle = i\hbar\frac{\partial}{\partial t}|\Psi\rangle, \qquad (57)$$

where $\hat H$ is the Hamiltonian describing the system, and $|\Psi(t)\rangle$ is the state vector that contains all information about the system. Written in this abstract vector form it applies to any physical system: a particle moving along the x-axis, a particle moving in three dimensions, a spin, several particles, a molecule, a solid, a superconductor... you name it!

Just as we did before, we can derive a time-independent equation by separating variables (we assume that $\hat H$ has no explicit $t$ dependence). Making the ansatz

$$|\Psi(t)\rangle = e^{-iEt/\hbar}|\psi\rangle, \qquad (58)$$

where $|\psi\rangle$ is $t$-independent, we find the time-independent Schrödinger equation

$$\hat H|\psi\rangle = E|\psi\rangle. \qquad (59)$$

The above is the Schrödinger equation in vector form; we can obtain it in component form by inserting the resolution of the identity in suitable places. In the discrete case, with $\sum_i |i\rangle\langle i| = 1$, we obtain

$$\sum_{i,j} |i\rangle\langle i|\hat H|j\rangle\langle j|\psi\rangle = E\sum_i |i\rangle\langle i|\psi\rangle, \qquad (60)$$

which gives the component equation

$$\sum_j H_{ij}\psi_j = E\psi_i, \qquad (61)$$

where we have defined, in the standard way,

$$H_{ij} = \langle i|\hat H|j\rangle, \qquad \psi_i = \langle i|\psi\rangle. \qquad (62)$$

The Schrödinger equation for, for example, a spin is of this form. In the continuum case, with $\int dx\, |x\rangle\langle x| = 1$, we obtain instead

$$\int dx \int dy\, |x\rangle\langle x|\hat H|y\rangle\langle y|\psi\rangle = E\int dx\, |x\rangle\langle x|\psi\rangle, \qquad (63)$$

which gives the component equation

$$\int dy\, H(x, y)\psi(y) = E\psi(x), \qquad (64)$$

where

$$H(x, y) = \langle x|\hat H|y\rangle, \qquad \psi(x) = \langle x|\psi\rangle. \qquad (65)$$
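To see (61) in action, here is a hedged numerical sketch: discretizing the x-axis on a grid turns the continuum equation (64) into the matrix problem (61), where the kinetic part of $H_{ij}$ is built from the standard three-point finite-difference second derivative (an ingredient assumed here, not derived in the notes). With a harmonic potential and units $\hbar = m = \omega = 1$, the lowest eigenvalues come out close to $E_n = n + \frac{1}{2}$:

```python
import numpy as np

# Grid for the x-axis; units hbar = m = omega = 1 (an assumption for simplicity).
N, L = 1000, 20.0
x, dx = np.linspace(-L / 2, L / 2, N, retstep=True)

# H_ij: three-point finite-difference kinetic term plus diagonal potential V = x^2/2.
H = (np.diag(1.0 / dx**2 + 0.5 * x**2)
     + np.diag(-0.5 / dx**2 * np.ones(N - 1), 1)
     + np.diag(-0.5 / dx**2 * np.ones(N - 1), -1))

print(np.linalg.eigvalsh(H)[:4])   # approximately 0.5, 1.5, 2.5, 3.5
```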

The Schrödinger equation

$$\Big[-\frac{\hbar^2}{2m}\frac{d^2}{dx^2} + V(x)\Big]\psi(x) = E\psi(x)$$

that describes the motion of a particle along the x-axis is of the form (64), i.e., it is given in component form. We see immediately that the right hand sides of the two equations agree. The left hand sides will also agree if we make the identification

$$H(x, y) = \delta(x - y)\Big[-\frac{\hbar^2}{2m}\frac{d^2}{dy^2} + V(y)\Big].$$

The variable $x$ in the continuum case need not be the position of a particle moving along the x-axis, but can be any continuous observable, such as the momentum. The formulae can also easily be generalized to the case of several continuous variables, which is needed for, for example, one particle's motion in three dimensional space, or several particles, or... All one has to do is to replace $x$ above by a set of variables and the integral $\int dx$ by a multiple integral. Formally, everything will be the same.

XIII. THE OPERATORS $\hat x$ AND $\hat p$

Here we will discuss the position, $\hat x$, and momentum, $\hat p$, operators for a particle's motion. Consider the eigenvalue equations

$$\hat x|x\rangle = x|x\rangle \qquad (66)$$

$$\hat p|p\rangle = p|p\rangle. \qquad (67)$$

Here, $\hat x$, $\hat p$ are operators, $x$, $p$ are eigenvalues, and $|x\rangle$, $|p\rangle$ are eigenvectors. There is a unique eigenvector $|x\rangle$ ($|p\rangle$) for each real $x$ ($p$), and they are complete:

$$\int dx\, |x\rangle\langle x| = 1 \qquad (68)$$

$$\int dp\, |p\rangle\langle p| = 1. \qquad (69)$$

This is always true for operators in quantum mechanics that correspond to physical observables. Sometimes it can be proven, but in general we simply assume this to be true, without proof, for any quantum mechanical operator. The completeness is intimately tied to the foundations of quantum mechanics, as we will see when discussing the statistical interpretation below. A general state (describing a particle's motion in one dimension) can be expanded in either the $x$ or the $p$ basis:

$$|\psi\rangle = \int dx\, |x\rangle\langle x|\psi\rangle = \int dp\, |p\rangle\langle p|\psi\rangle. \qquad (70)$$

This gives two wave functions that describe the system: $\psi(x) = \langle x|\psi\rangle$ and $\psi(p) = \langle p|\psi\rangle$. These are just components of the same vector $|\psi\rangle$ in different bases, i.e. they are related by a change of basis; see the section above on this topic. $\psi(x)$ is the wave function in position space; this is the usual wave function, which enters the usual Schrödinger equation and which gives the probability of finding the particle at position $x$. $\psi(p)$, on the other hand, is the wave function in momentum space and gives the probability to find the particle with momentum $p$. We will see below that $\psi(x)$ and $\psi(p)$ are related by a Fourier transformation.

Let us determine the components of $|x\rangle$ and $|p\rangle$ in the x-basis. For $|x\rangle$, we have

$$|y\rangle = \int dx\, |x\rangle\langle x|y\rangle = \int dx\, |x\rangle\,\delta(x - y), \qquad (71)$$

hence,

$$\psi_y(x) \equiv \langle x|y\rangle = \delta(x - y). \qquad (72)$$

Thus, the wave function in x-space for the state $|y\rangle$ is Dirac's delta function. This makes sense since, according to the probability interpretation of the wave function, it means that the particle can only be found at $x = y$. The operator $\hat x$ becomes, in the x-basis,

$$\hat x = \int dx \int dy\, |x\rangle\langle x|\hat x|y\rangle\langle y| = \int dx \int dy\, |x\rangle\, x\,\delta(x - y)\,\langle y| = \int dx\, |x\rangle\, x\, \langle x|, \qquad (73)$$

where we have used (66), (72) and the properties of the $\delta$-function. Notice the similarity with the finite dimensional expression, eq. (35), and that in its own eigenbasis $\hat x$ is represented by a diagonal "matrix" $x\,\delta(x - y)$, just as in eq. (36).

We now turn to the components of $|p\rangle = \int dx\, |x\rangle\langle x|p\rangle$. Here $\psi_p(x) \equiv \langle x|p\rangle$ is the wave function for the $|p\rangle$ state. To proceed we need to know what $\hat p$ is. We defined this operator in the x-basis as

$$\hat p_x \psi(x) = -i\hbar\frac{\partial}{\partial x}\psi(x). \qquad (74)$$

(Here, we use $\hat p$ for the abstract operator, e.g. the one in (67), whereas $\hat p_x$ denotes the representation of this operator in the x-basis, $\hat p_x = -i\hbar\,\partial/\partial x$. Usually, we use $\hat p$ also for the latter operator.) Let us recall the solution of the eigenvalue equation for $\hat p$ in the

x-representation:

$$\hat p_x \psi_p(x) = p\,\psi_p(x) \qquad (75)$$

$$-i\hbar\frac{\partial}{\partial x}\psi_p(x) = p\,\psi_p(x) \qquad (76)$$

$$\psi_p(x) = A\, e^{ipx/\hbar}, \qquad p \ \text{real}. \qquad (77)$$

Thus we have

$$|p\rangle = \frac{1}{\sqrt{2\pi\hbar}}\int dx\, |x\rangle\, e^{ipx/\hbar}, \qquad \psi_p(x) \equiv \langle x|p\rangle = \frac{1}{\sqrt{2\pi\hbar}}\, e^{ipx/\hbar}, \qquad (78)$$

where we have chosen $A = 1/\sqrt{2\pi\hbar}$. We can now evaluate the scalar product of $|p\rangle$ states:

$$\langle p'|p\rangle = \frac{1}{2\pi\hbar}\int dx \int dx'\, \langle x'|x\rangle\, e^{i(px - p'x')/\hbar} = \frac{1}{2\pi\hbar}\int dx\, e^{ix(p - p')/\hbar} = \delta(p' - p). \qquad (79)$$

In the last step we used the useful relation (see Problem 2.26 in Griffiths)

$$\frac{1}{2\pi}\int dk\, e^{ikx} = \delta(x). \qquad (80)$$

This is proven as follows. Let $F(k)$ be the Fourier transform of $f(x)$:

$$F(k) = \frac{1}{\sqrt{2\pi}}\int dx\, f(x)\, e^{-ikx} \qquad (81)$$

$$f(x) = \frac{1}{\sqrt{2\pi}}\int dk\, F(k)\, e^{ikx}. \qquad (82)$$

One then finds

$$\int dx\, f(x)\Big[\frac{1}{2\pi}\int dk\, e^{-ikx}\Big] = \frac{1}{\sqrt{2\pi}}\int dk\, F(k) = f(0), \qquad (83)$$

from which (80) follows (the delta function is even in its argument).

We summarize some of the important results of this section:

$$|y\rangle = \int dx\, \delta(y - x)\, |x\rangle \qquad (84)$$

$$\langle x|y\rangle = \delta(x - y) \qquad (85)$$

$$|p\rangle = \frac{1}{\sqrt{2\pi\hbar}}\int dx\, e^{ipx/\hbar}\, |x\rangle \qquad (86)$$

$$\langle x|p\rangle = \frac{1}{\sqrt{2\pi\hbar}}\, e^{ipx/\hbar} \qquad (87)$$

$$\langle p'|p\rangle = \delta(p' - p). \qquad (88)$$
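Equation (87) says that going to the momentum-space wave function is a Fourier transform (see also (89) below). A numerical sketch with $\hbar = 1$ and a Gaussian test state (the grid and the state are arbitrary choices, made only for illustration):

```python
import numpy as np

# Position-space grid and a normalized Gaussian state (hbar = 1).
x, dx = np.linspace(-10, 10, 4001, retstep=True)
psi_x = np.pi**-0.25 * np.exp(-x**2 / 2)

# psi(p) = <p|psi> = (1/sqrt(2 pi)) * integral dx e^{-ipx} psi(x), cf. (87).
p = np.linspace(-5, 5, 201)
psi_p = (np.exp(-1j * np.outer(p, x)) * psi_x).sum(axis=1) * dx / np.sqrt(2 * np.pi)

# For this Gaussian, the momentum-space wave function is again the same Gaussian:
print(np.allclose(psi_p, np.pi**-0.25 * np.exp(-p**2 / 2), atol=1e-6))       # True
# Probabilities add up to one in the p basis as well:
print(np.isclose((np.abs(psi_p)**2).sum() * (p[1] - p[0]), 1.0, atol=1e-6))  # True
```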

Comments:

$|p\rangle$ (as well as $|x\rangle$) are non-normalizable, but note that they are orthogonal and "almost normalized" in the sense that $\delta_{ij} \to \delta(x - y)$: $\langle x|y\rangle = \delta(x - y)$ and $\langle p'|p\rangle = \delta(p' - p)$ (Griffiths calls this "delta-function normalized"), and this is good enough; see the notes on the free particle.

The wave functions in x and p space, $\psi(x) = \langle x|\psi\rangle$ and $\psi(p) = \langle p|\psi\rangle$, are related by a Fourier transform:

$$\langle x|\psi\rangle = \int dp\, \langle x|p\rangle\langle p|\psi\rangle = \frac{1}{\sqrt{2\pi\hbar}}\int dp\, e^{ipx/\hbar}\,\langle p|\psi\rangle. \qquad (89)$$

We end this section by proving that the operators $\hat x$ and $\hat p$ are hermitian. For $\hat x$ we have:

$$\langle\phi|\hat x|\psi\rangle = \int dx \int dy\, \langle\phi|x\rangle\langle x|\hat x|y\rangle\langle y|\psi\rangle = \int dx\, \phi^*(x)\, x\, \psi(x) \qquad (90)$$

$$\langle\phi|\hat x^\dagger|\psi\rangle \equiv \langle\psi|\hat x|\phi\rangle^* = \Big(\int dx\, \psi^*\, x\, \phi\Big)^* = \int dx\, \psi\, x\, \phi^* = \langle\phi|\hat x|\psi\rangle, \qquad (91)$$

hence $\hat x$ is hermitian, see (22). For $\hat p$, we find

$$\langle\phi|\hat p|\psi\rangle = \int dx\, \phi^*(x)\Big(-i\hbar\frac{d}{dx}\Big)\psi(x) \qquad (92)$$

$$\langle\phi|\hat p^\dagger|\psi\rangle \equiv \langle\psi|\hat p|\phi\rangle^* = \Big(\int dx\, \psi^*(x)\Big(-i\hbar\frac{d}{dx}\Big)\phi(x)\Big)^* = \int dx\, \psi(x)\Big(i\hbar\frac{d}{dx}\Big)\phi^*(x)$$

$$= \Big[i\hbar\,\psi\,\phi^*\Big]_{-\infty}^{\infty} + \int dx\, \phi^*(x)\Big(-i\hbar\frac{d}{dx}\Big)\psi(x) = \langle\phi|\hat p|\psi\rangle, \qquad (93)$$

provided the wave functions go to zero fast enough at infinity, so that the surface term vanishes (this is the case for normalizable wave functions). This shows that $\hat p$ is hermitian, cf. (22).
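The integration-by-parts argument in (93) has a transparent discrete analogue, sketched below: replace $d/dx$ by the central-difference matrix on a grid with vanishing boundary values (the discrete counterpart of the vanishing surface term); the difference matrix is real and antisymmetric, so $-i\hbar$ times it is hermitian.

```python
import numpy as np

# Central-difference d/dx on a grid, with wave functions vanishing at the ends
# (the discrete analogue of the surface term in (93) vanishing); hbar = 1.
N, dx = 200, 0.1
D = (np.diag(np.ones(N - 1), 1) - np.diag(np.ones(N - 1), -1)) / (2 * dx)

p = -1j * D                        # discretized p = -i d/dx
print(np.allclose(p, p.conj().T))  # True: the matrix is hermitian
```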

XIV. THE HILBERT SPACE

The set of possible vectors $|\psi\rangle$ describing the quantum mechanical state of a particular system forms a Hilbert space. Different systems have different Hilbert spaces. We here discuss briefly the properties of such a space.

A space is said to be complete if all convergent series of elements in the space also belong to the space. (Note that this has nothing to do with the completeness of the eigenvectors of an operator discussed above.) A Hilbert space $\mathcal{H}$ is a vector space, with a scalar product, that is complete:

$$|\alpha\rangle = \sum_{j=1}^{\infty} a_j|\alpha_j\rangle \qquad (94)$$

is an element in $\mathcal{H}$ provided the $|\alpha_j\rangle$ are elements in $\mathcal{H}$ and the series is convergent,

$$\langle\alpha|\alpha\rangle = \sum_{j=1}^{\infty} |a_j|^2 < \infty. \qquad (95)$$

For a particle moving along the x-axis, the Hilbert space is $L^2(-\infty, \infty)$; this is the space of all functions $\psi(x)$ such that

$$\int dx\, |\psi(x)|^2 < \infty. \qquad (96)$$

These functions are called square-integrable. (Note that the $\psi(x)$ are the components of the vector, and that the Hilbert space here is defined in terms of the components.)

For one spin 1/2, the Hilbert space is a two dimensional vector space; for spin 1 it is a three dimensional vector space; for $N$ spin 1/2 it is a $2^N$ dimensional vector space, etc.

XV. THE STATISTICAL INTERPRETATION

Griffiths' discussion of the statistical interpretation in Section 3.4 is restricted to the motion of one particle in one dimension. Here we consider the general case.

1. The system is whatever we describe by quantum mechanics. It can be a particle moving in one dimension, in three dimensions, several particles, a spin, particles with spin, a molecule, a solid, etc.

2. The system is completely described by a vector $|\psi\rangle$ in a Hilbert space $\mathcal{H}$. Each vector in $\mathcal{H}$ describes a possible state for the system. Note that the principle of superposition is implied by this. A vector in $\mathcal{H}$ is called a state vector, since it describes a quantum state.

3. Each observable (i.e. each measurable quantity) $Q$ corresponds to a hermitian operator $\hat Q$ acting on the vectors in $\mathcal{H}$. For the particle: $Q(x, p, t) \to \hat Q(\hat x, \hat p, t)$.

4. The expectation value of $Q$ is:

$$\langle Q\rangle = \frac{\langle\psi|\hat Q|\psi\rangle}{\langle\psi|\psi\rangle} \qquad (97)$$

$$= \langle\psi|\hat Q|\psi\rangle, \quad \text{if } \langle\psi|\psi\rangle = 1. \qquad (98)$$

Repeated measurements on identically prepared systems (i.e. each one of them in state $|\psi\rangle$) give this mean value.

Let $\hat Q$ have eigenvectors $|e_i\rangle$ with eigenvalues $\lambda_i$ (we assume here that the spectrum $\{\lambda_i\}$ is discrete; for the generalization, see below):

$$\hat Q|e_i\rangle = \lambda_i|e_i\rangle, \qquad i = 1, 2, \ldots \qquad (99)$$

The $\lambda_i$ are real (since $\hat Q$ is hermitian). $\{|e_i\rangle\}$ is a basis: $\langle e_i|e_j\rangle = \delta_{ij}$, $\sum_i |e_i\rangle\langle e_i| = 1$.

5. A measurement of the observable $Q$ gives one of the eigenvalues $\lambda_i$. If $|\psi\rangle = \sum_i |e_i\rangle\langle e_i|\psi\rangle$, then the probability $P_i$ to get $\lambda_i$ as a result of the measurement is

$$P_i = \frac{|\langle e_i|\psi\rangle|^2}{\langle\psi|\psi\rangle} \qquad (100)$$

$$= |\langle e_i|\psi\rangle|^2 \quad \text{if } \langle\psi|\psi\rangle = 1. \qquad (101)$$

Compare this to the statistical interpretation of $\psi(x) = \langle x|\psi\rangle$. The probability for the system to be found in the general (normalized) state $|\chi\rangle$ is obtained by replacing $|e_i\rangle$ by $|\chi\rangle$ above.

6. Immediately after a measurement of the observable $Q$ that resulted in $\lambda_i$, the state of the system is $|\psi\rangle_{\text{after}} = |e_i\rangle$. This is the collapse of the wave function (or rather of the state vector) $|\psi\rangle$. Repeated measurement of $Q$ immediately after the first measurement gives the same result $\lambda_i$ with probability one. (A numerical illustration of points 4 and 5 is sketched below.)
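Here is the promised sketch of points 4 and 5 (the 2x2 hermitian matrix below is a made-up observable, chosen only for illustration):

```python
import numpy as np

# A made-up hermitian observable Q and an arbitrary state |psi>.
Q = np.array([[1.0, 0.5j],
              [-0.5j, 2.0]])
lam, e = np.linalg.eigh(Q)        # eigenvalues lambda_i, eigenvectors as columns

psi = np.array([1.0, 1.0j])
psi = psi / np.sqrt(np.vdot(psi, psi).real)   # normalize: <psi|psi> = 1

P = np.abs(e.conj().T @ psi)**2   # P_i = |<e_i|psi>|^2, eq. (101)
print(np.isclose(P.sum(), 1.0))   # probabilities add up to one
# The expectation value (98) equals the probability-weighted eigenvalues:
print(np.isclose(np.vdot(psi, Q @ psi).real, np.sum(P * lam)))   # True
```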

Continuous spectrum: For a general hermitian operator $\hat Q$ with a continuous spectrum one is, in general, not guaranteed that the eigenvectors span the space. From the above we infer that this property is of fundamental importance in quantum mechanics; it is closely tied to the statistical interpretation of the state vector. One assumes that all hermitian operators that correspond to physical observables have the property that their eigenvectors span the Hilbert space. In some cases this can be proven; in other cases one simply takes it as an assumption. In the discussion above we assumed a discrete spectrum, but 1-6 hold also when the spectrum is continuous, provided one makes the standard replacements $\sum_i \to \int dx$ and $\delta_{ij} \to \delta(x - y)$.[8]

[1] For the mathematically interested, more advanced texts on quantum mechanics will contain some discussion of these points.

[2] Dirac was one of the most famous physicists of the twentieth century. His perhaps most important contribution was the Dirac equation, which combines quantum mechanics and special relativity to give a very accurate description of electrons. His 1930 textbook, The Principles of Quantum Mechanics, is still recommended reading for the serious student of quantum mechanics.

[3] The vector space contains a null vector, $0$, i.e. a vector with the property that $|\alpha\rangle + 0 = |\alpha\rangle$ for any $|\alpha\rangle$. Obviously, this is the vector with all components $0$; I will denote this vector simply by $0$. Griffiths uses the notation $|0\rangle$ for this $0$-vector, see [A.4]; this is very bad notation. Normally, in physics texts $|0\rangle$ is used to mean a vector that is not the $0$-vector. Avoid using $|0\rangle$ for the $0$-vector!

[4] Griffiths sometimes puts the vector equal to its components. This is OK if you know what it means, but I will try to avoid this.

[5] Another important type of operator in quantum mechanics is the operators that obey $\hat U^\dagger\hat U = 1$ (unitary operators). This means that the hermitian conjugate operator is equal to the inverse operator, $\hat U^\dagger = \hat U^{-1}$, where the inverse is defined as the operator that obeys $\hat U\hat U^{-1} = 1$, where 1 is the identity operator. In matrix language this becomes $U^\dagger U = 1$.

[6] Sometimes, as in the infinite square well, we are interested in wave functions defined on a finite segment of the line; then the correspondence becomes $\sum_{i=1}^n \to \int_a^b dx$, and to define the theory one must supply proper boundary conditions.

[7] It can also easily be generalized to the case of several continuous variables, which is necessary, e.g., for describing a particle in three dimensional space, or several particles. All one has to do is to replace $x$ by a set of variables $\{x_1, x_2, \ldots\}$ and $\int dx$ by a multiple integral over these variables, $\int dx_1 \int dx_2 \cdots$.

[8] There are also operators where the spectrum has both a discrete part and a continuous part; the energy for the hydrogen atom is an example. This can also be accommodated in the formalism here with minor changes.