Quantum Field Theory I

Size: px
Start display at page:

Download "Quantum Field Theory I"

Transcription

1 October 12, 2011 Quantum Field Theory I Ulrich Haisch Rudolf Peierls Centre for Theoretical Physics, University of Oxford OX1 3PN Oxford, United Kingdom

2 Abstract This course deals with modern applications of quantum field theory with emphasize on the quantization of theories involving scalar and spinor fields.

3 Recommended Books and Resources There is a vast array of quantum field theory texts, many of them with redeeming features. Here I mention a few of them, mostly the ones that I used or looked at when preparing this course. To a large extent, I will follow the first section of M. Peskin and D. Schroeder, An Introduction to Quantum Field Theory This is a very clear and comprehensive book, covering essentially everything in this course as well as many advanced aspects of quantum field theory that go (far) beyond the scope of this lecture. S. Weinberg, The Quantum Theory of Fields: Volume 1, Foundations This is the first in a three volume series by one of the masters of quantum field theory. It takes a unique route through the subject, focussing initially on particles rather than fields. Since it has a very particular viewpoint, it is difficult to digest, but certainly worth reading. L. Ryder, Quantum Field Theory This elementary text has a nice discussion of much of the material in this course. It is good for a first reading. A. Zee, Quantum Field Theory in a Nutshell This is a charming book, where emphasis is placed on physical understanding and the author isn t afraid to hide the ugly truth when necessary. It contains many gems. By browsing the web, I also found interesting material. Nice introductions to quantum field theory (of different length and viewpoint) have been written by C. Anastasiou and D. Tong. The corresponding scripts can be found at: babis/teaching/qfti/qft1.pdf Other links to useful resources can be found on the web page of D. Tong: For completeness, I will also give relevant references at the end of each section of this script. The interested reader can consult them for further details on the discussed topics. 1

4 Contents 1 Introduction Why QFT? Scales and Units Elements of Classical Field Theory Dynamics of Fields Noether s Theorem Example: Electrodynamics Space-Time Symmetries Problems Klein-Gordon Theory Klein-Gordon Field as Harmonic Oscillators Structure of Vacuum Particle States Two Real Klein-Gordon Fields Complex Klein-Gordon Field Heisenberg Picture Klein-Gordon Correlators Non-Relativistic Limit Problems Interacting Fields Classification of Interactions Interaction Picture First Look at Scattering Processes Wick s Theorem Second Look at Scattering Processes Feynman Diagrams Third Look at Scattering Processes Yukawa Potential Connected and Amputated Feynman Diagrams From Correlation Functions to Scattering Matrix Elements Decay Widths and Cross Sections Problems Dirac Theory Spinor Representation Discrete Symmetries of Dirac Theory Continuous Symmetries of Dirac Theory Solutions to Dirac Equation Quantization of Dirac Theory Problems

5 1 Introduction As the term quantum field theory (QFT) suggests, QFT is the application of quantum mechanics (QM) to dynamical systems of fields, in the same sense that QM is concerned mainly with the quantization of dynamical systems of particles. QFT is not only a subject that is absolutely essential to understand the current state of elementary particle physics as well as modern aspects of cosmology, but also plays a crucial role in many active areas of research, ranging from atomic over nuclear and condensed-matter physics to pure mathematics. Since the ultimate goal of this course is to gain a basic understanding of the fundamental laws of nature, we will in the following focus mainly on the physics of elementary particles and hence deal mostly with relativistic fields. 1.1 Why QFT? The primary reason for introducing the concept of fields in classical physics is to construct laws of nature that are local. The old laws of Newton (Coulomb) involve action at a distance. This means that the force felt by a planet (an electron) changes immediately if a distant star (proton) moves. The laws of Newton and Coulomb thus feature non-local interactions. The field theories of Einstein (general relativity) and Maxwell (electrodynamics) remedied the situation, with all interactions mediated in a local fashion by fields. The requirement of locality remains a strong motivation for studying QFTs. However, there are further good reasons to treat the quantum field (and not the particle) as fundamental (or as Steven Weinberg puts it in [1]: Quantum fields are the basic ingredients of the universe, and particles are just bundles of energy and momentum made out of them. ). QM and Special Relativity A first reason is that the combination of QM and special relativity implies that particle number is not conserved. Consider a particle of mass m trapped in a box of size L. Heisenberg s uncertainty principle tells us that the uncertainty in the momentum of our particle is p /L. In the relativistic limit, momentum and energy can be treated on equivalent footing, and one has an uncertainty in the energy of order E c/l. Yet, if E = 2mc 2, there is enough energy available to create a virtual particle-antiparticle pair from the vacuum (Dirac sea). This little exercise shows that when a particle with mass m is localized within a distance λ Compton = /(mc), talking about a single particle loses its sense. For distances smaller than this Compton wavelength there is a high probability that we will detect particle-antiparticle pairs swarming around the single particle that we initially put into the box. Notice that λ Compton is always smaller than the de Broglie wavelength given by λ de Broglie = / p. 1 If you like, the de Broglie wavelength is the distance at which the wavelike nature of particles becomes apparent, while the Compton wavelength is the distance at which the concept of a single pointlike particle breaks down and one has to start thinking about how to describe multiparticle states. 1 Throughout this course we will use boldface type (ordinary italic type) to denote 3-vectors (4-vectors). 3

6 The presence of a multitude of particles and antiparticles at short distances (or high energies) tells us that any attempt to write down a relativistic version of the one-particle Schrödinger equation is doomed to fail. There is no mechanism in standard non-relativistic QM to deal with changes in the particle number. Indeed, any attempt to naively write down a relativistic version of the one-particle Schrödinger equation meets serious problems: negative probabilities, infinite towers of negative energy states, or a breakdown of causality are the common issues that arise. QM and Causality Let us have a closer look at the issue of breakdown of causality. Consider the amplitude A(t) = y e iet/ x, (1.1) that describes the propagation of a free particle from the point x to y. In non-relativistic QM one has E = p 2 /(2m) and hence 2 A(t) = y [ ( exp i p 2 /(2m) ) t/ ] x d 3 p [ ( = y exp i p 2 /(2m) ) t/ ] p p x (2π ) 3 d 3 p = (2π ) exp [ i ( p 2 /(2m) ) t/ ] exp [ ip (y x)/ ] 3 ( m ) 3/2 [ = exp im (y x) 2 /(2 2 t) ]. 2πi t Here we have made use of the completeness d 3 p/(2π ) 3 p p = 1 of p and a little bit of algebra. The expression (1.2) is non-zero for all y and t, indicating that a particle can propagate between any two points in an arbitrarily short time. In a relativistic theory, this conclusion would signal a violation of causality. One might hope that using the relativistic expression E = p 2 c 2 + m 2 c 4 for the energy would cure the problem, but it does not. In fact, in the relativistic case one has A(t) = y [ exp it/ p2 c 2 + m 2 c ] 4 x d 3 p = (2π ) exp [ it/ p 2 c 2 + m 2 c ] 4 exp [ ip (y x)/ ] 3 (1.3) = 1 2π 2 2 y x 0 dp p sin (p y x / ) exp [ it/ p 2 c 2 + m 2 c 4 ]. (1.2) This integral can be evaluated explicitly in terms of Bessel functions, but for our purposes it is sufficient to consider its asymptotic behavior for L 2 = y x 2 c 2 t 2, i.e., separations well outside the light-cone. We use the method of stationary phase. The relevant phase function 2 The symbol p denotes the momentum operator, which in many QM books is indicated by a. To avoid clutter, I will not use the latter notation, but simply write p. 4

7 pl t p 2 c 2 + m 2 c 4 has a stationary point p = imcl/ L 2 c 2 t 2. Plugging this value into (1.3), we find that (up to a rational function of L and t), A(t) exp [ m/ ] L 2 c 2 t 2. (1.4) This expression is small but non-zero outside the light-cone and causality is again violated. In both cases, the observed failure is telling us that we need a new formalism to preserve causality. This formalism is QFT. It solves the causality problem in a miraculous way. We will see later that in QFT the propagation of a particle across a space-like interval is indistinguishable from the propagation of an antiparticle in the opposite direction. When we ask whether an observation made at point x can affect an observation made at point y, we will find that the amplitudes for particle and antiparticle propagation cancel in such a way that causality is preserved. What else is QFT good for? Besides solving the causality problem, QFT also provides an elegant framework to describe transitions between states of different particle number and type. An example physical processes, exhaustivelly studied (from 1989 until 2000) at the Large Electron Positron (LEP) collider in Geneva, is the production of a muon (µ ) and its antiparticle (µ + ) out of the annihilation of an electron (e ) and its antiparticle (the positron e + ): e + e + µ + µ +. (1.5) The experimental confirmation of the QFT predictions for processes such as (1.5), often to an unprecedented level of accuracy, is our real reason for studying QFT. But the power of QFT does not end here. In traditional QM the relation between spin and statistics has to be put in by hand. To agree with experiment, one should choose Bose statistics (no minus sign if one exchanges two identical particles) for integer spin particles, and Fermi statistics (minus sign if one exchanges two identical particles) for half-integer spin particles. On the other hand, in QFT the relationship between spin and statistics is a consequence of the framework, following from the commutation quantization conditions for boson fields and anticommutation quantization conditions for fermion fields. 1.2 Scales and Units There are three fundamental dimensionful constants in nature: the speed of light c, Planck s constant (divided by 2π), and Newton s constant G N. Their dimensions are [c] = length time 1, [ ] = length 2 mass time 1, (1.6) [G N ] = length 3 mass 1 time 2. 5

8 In order to avoid unnecessary clutter, we will work throughout this course in natural units, defined by c = = 1. 3 This allows us to express all dimensionful quantities in terms of a single scale which we choose to be mass or, equivalently, energy (since E = mc 2 has become E = m). Energies will be given in units of ev (the electron volt) or more often GeV = 10 9 ev or TeV = ev, since we are typically dealing with high energies. To convert the unit of energy back to units of length or time, we have to insert the relevant powers of c and. E.g., the length scale λ associated to a mass m is λ = h/(mc). Remembering that hc ev m, (1.7) one finds that the length scale corresponding to the electron with mass m e 511 kev is λ e m. Throughout this course we will refer to the dimension of a quantity, meaning the mass dimension. Newton s constant, e.g., has [G N ] = 2 and defines a mass scale G N = M 2 P, (1.8) where M P GeV is the Planck scale. This energy corresponds to a length scale L P m the Planck length. The Planck length is believed to be the smallest length scale that makes sense: beyond this scale quantum gravity effects are likely to become important and its no longer clear that the concept of space-time can be applied. The largest length scale we can talk of is the size of the cosmological horizon, roughly L P. A number for particle physics and cosmology relevant masses and the corresponding length scales are shown in Table 1. Let me go through the list and spend some words on the most important quantities. After the size of the observable universe, the first scale we encounter is the cosmological constant (Λ) measured to be around 10 3 ev. Since nobody can really explain why the cosmological constant has this particular value, let s forget about it real quick and turn our attention to the masses of the known elementary particles. These range from less than 1 ev for the neutrinos (ν s) to around 175 GeV for the top quark (t). The (in)famous Higgs boson (h), which is the only not yet observed ingredient of the standard model (SM) of elementary particle physics, is believed to weigh in at about 100 to 200 GeV. For scales around 1 TeV, i.e., the terascale, the predictive power of the SM is expected to break down. This is precisely the energy regime that the Large Hadron Collider (LHC) at CERN in Geneva has started to explore, having a design center-of-mass energy of 14 TeV. Beyond the electroweak scale (v) of around 250 GeV, again nobody knows with certainty what is going on. One could find a plethora of new (elementary) particles or a great desert. There are experimental hints that the coupling constants of electromagnetism, and the weak and strong forces unify at around M GUT = GeV, i.e., the grand unification scale (GUT). Everything is topped off at the Planck scale where a QFT description might no longer be possible and a quantum theory including the effects of gravity is needed to describe the physics of fundamental interactions. The most likely possibility for such a theory seems to be some kind of string theory. But also many other ideas such as loop quantum gravity, Hoŕava-Lifshitz gravity, etc. exist. In fact, the theory of everything (TOE) could also be a QFT, but one in which the finite or 3 The whole point of units is that you can choose whatever units are most convenient! 6

9 Quantity Mass Length Observable universe ev m ly Cosmological constant (Λ) 10 3 ev 10 3 m Neutrinos (ν s) 1 ev 10 6 m Electron (e ) 511 kev m Muon (µ ) 106 MeV m Charm quark (c) 1.3 GeV m Tau (τ ) 1.78 GeV m Bottom quark (b) 4.6 GeV m Top quark (t) 175 GeV m Higgs boson (h) [100, 200] GeV [6, 12] m Electroweak scale (v) 250 GeV m LHC energy 14 TeV m GUT scale (M GUT ) GeV m Planck scale (M P ) GeV m Table 1: An assortment of masses and corresponding lengths scales that appear in the context of particle physics and cosmology. infinite number of renormalized couplings do not run off to infinity with increasing energy, but hit a fixed point of the renormalization group equation. This possibility goes by the name of asymptotic safety. Don t worry if haven t understood a single word of what I have mumbled about possible TOEs. All this is way too advanced to be covered in this course. I only mentioned it, to make propaganda for the research of Joe Conlon (string theory), Andre Lukas (string theory), and John Wheater (quantum gravity), which work on such theories here in Oxford. Ask them if you want to know more about it. References [1] S. Weinberg, What is quantum field theory, and what did we think it was?, arxiv:hepth/ [2] S. Weinberg, The Search for Unity: Notes for a History of Quantum Field Theory, Daedalus, Vol. 106, No. 4, Discoveries and Interpretations: Studies in Contemporary Scholarship, Volume II (1977), 17 p. [3] Chapter 1 of S. Weinberg, The Quantum theory of fields. Vol. 1: Foundations, Cambridge, UK, Univ. Pr. (1995), 609 p. [4] F. Wilczek, Rev. Mod. Phys. 71, S85 (1999) [arxiv:hep-th/ ]. 7

10 2 Elements of Classical Field Theory In this second section we will discuss various aspects of classical fields. We will cover only the bare minimum ground necessary before turning to the quantum theory, and will return to classical field theory at several later stages in this course when we need to introduce new concepts or ideas. 2.1 Dynamics of Fields A field is a quantity defined at every space-time point x = (t, x). While classical particle mechanics deals with a finite number of generalized coordinates q a (t), indexed by a label a, in field theory we are interested in the dynamics of fields φ a (t, x), (2.1) where both a and x are considered as labels. We are hence dealing with an infinite number of degrees of freedom (dofs), at least one for each point x in space. Notice that the concept of position has been relegated from a dynamical variable in particle mechanics to a mere label in field theory. Lagrangian and Action The dynamics of the fields is governed by the Lagrangian. In all the systems we will study in this course, the Lagrangian is a function of the fields φ a and their derivatives µ φ a, 4 and given by L(t) = d 3 x L(φ a, µ φ a ), (2.2) where the official name for L is Lagrangian density. Like everybody else we will, however, simply call it Lagrangian from now on. For any time interval t [t 1, t 2 ], the action corresponding to (2.2) reads t2 S = dt d 3 x L = d 4 x L. (2.3) t 1 Recall that in classical mechanics L depends only on q a and q a, but not on the second time derivatives of the generalized coordinates. In field theory we similarly restrict to Lagrangians L depending on φ a and φ a. Furthermore, with an eye on Lorentz invariance, we will only consider Lagrangians depending on φ a and not higher derivatives. Notice that since we have set = 1, using the convention described in Section 1.2, the dimension of the action is [S] = 0. With (2.3) and [d 4 x] = 4, it follows that the Lagrangian must necessarily have [L] = 4. Other objects that we will use frequently to construct Lagrangians are derivatives, masses, couplings, and most importantly fields. The dimensions of the former two objects are [ µ ] = 1 and [m] = 1, while the dimensions of the latter two quantities depend on the specific type of coupling and field one considers. We therefore postpone 4 If there is no (or only little) room for confusion, we will often drop the arguments of functions and write φ a = φ a (x) etc. to keep the notation short. 8

11 the discussion of the mass dimension of couplings and fields to the point when we meet the relevant building blocks. Principle of Least Action The dynamical behavior of fields can be determined by the principle of least action. This principle states that when a system evolves from one given configuration to another between times t 1 and t 2 it does so along the path in cofiguration space for which the action is an extremum (usually a minimum) and hence satisfies δs = 0. This condition can be rewritten, using partial integration, as follows { L δs = d 4 x L } = d 4 x δφ a + φ a ( µ φ a ) δ( µφ a ) {[ ( L L µ φ a ( µ φ a ) )] δφ a + µ ( )} L ( µ φ a ) δφ a = 0. The last term is a total derivative and vanishes for any δφ a that decays at spatial infinity and obeys δφ a (t 1, x) = δφ a (t 2, x) = 0. For all such paths, we obtain the Euler-Lagrange equations of motion (EOMs) for the fields φ a, namely ( ) L µ L = 0. (2.5) ( µ φ a ) φ a Hamiltonian Formalism (2.4) The link between the Lagrangian formalism and the quantum theory goes via the path integral. While this is a powerful formalism, we will for the time being use canonical quantization, since it makes the transition to QM easier. For this we need the Hamiltonian formalism of field theory. We start by defining the momentum density π a (x) conjugate to φ a (x), In terms of π a, φa, and L the Hamiltonian density is given by π a = L φ a. (2.6) H = π a φa L, (2.7) where, as in classical mechanics, we have eliminated φ a in favor of π a everywhere in H. The Hamiltonian then simply takes the form H = d 3 x H. (2.8) 2.2 Noether s Theorem The role of symmetries in field theory is possibly even more important than in particle mechanics. There are Lorentz symmetry, internal symmetries, gauge symmetries, supersymmetries, etc. We start here by recasting Noether s theorem in a field theoretic framework. 9

12 Currents and Charges Noether s theorem states that every continuous symmetry of the Lagrangian gives rise to a conserved current J µ (x), so that the EOMs (2.5) imply µ J µ = 0, (2.9) or in components dj 0 /dt + J = 0. To every conserved current there exists also a conserved (global) charge Q, i.e., a physical quantity which stays the same value at all times, defined as Q = d 3 x J 0. (2.10) R 3 The latter statement is readily shown by taking the time derivative of Q, dq R3 dt = d 3 x dj 0 = d 3 x J, (2.11) dt R 3 which is zero, if one assumes that J falls off sufficiently fast as x. Notice, however, that the existence of the conserved current J is much stronger than the existence of the (global) charge Q, because it implies that charge is in fact conserved locally. To see this, we define the charge in a finite volume V by Q V = d 3 x J 0. (2.12) Repeating the above analysis, we find dq V = dt V V d 3 x J = ds J, (2.13) S where S denotes the area bounding V, ds is a shorthand for n ds with n being the outward pointing unit normal vector of the boundary S, and we have used Gauss theorem. In physical terms the result means that any charge leaving V must be accounted for by a flow of the current 3-vector J out of the volume. This kind of local conservation law of charge holds in any local field theory. Proof of Theorem In order to prove Noether s theorem, we ll consider infinitesimal transformations. This is always possible in the case of a continuous symmetry. We say that δφ a is a symmetry of the theory, if the Lagrangian changes by a total derivative δl(φ a ) = µ J µ (φ a ), (2.14) for a set of functions J µ. We then consider the transformation of L under an arbitrary change of field δφ a. Glancing at (2.4) tells us that in this case [ ( )] ( ) L L L δl = µ δφ a + µ φ a ( µ φ a ) ( µ φ a ) δφ a. (2.15) 10

13 When the EOMs are satisfied than the term in square bracket vanishes so that we are simply left with the total derivative term. For a symmetry transformation satisfying (2.13) and (2.14), the relation (2.15) hence takes the form ( ) L µ J µ = δl = µ ( µ φ a ) δφ a, (2.16) or simply µ J µ = 0 with J µ = L ( µ φ a ) δφ a J µ, (2.17) which completes the proof. Notice that if the Lagrangian is invariant under the infinitesimal transformation δφ a, i.e., δl = 0, then J µ = 0 and J µ contains only the first term on the right-hand side of (2.17). We stress that that our proof only goes through for continuous transformations for which there exists a choice of the transformation parameters resulting in a unit transformation, i.e., no transformation. An example is a Lorentz boost with some velocity v, where for v = 0 the coordinates x remain unchanged. There are examples of symmetry transformations where this does not occur. E.g., a parity transformation does not have this property, and Noether s theorem is not applicable then. Energy-Momentum Tensor Recall that in classic particle mechanics, spatial translation invariance gives rise to the conservation of momentum, while invariance under time translations is responsible for the conservation of energy. What happens in classical field theory? To figure it out, let s have a look at infinitesimal translations x ν x ν ɛ ν = φ a (x) φ a (x + ɛ) = φ a (x) + ɛ ν ν φ a (x), (2.18) where the sign in the field transformation is plus, instead of minus, because we are doing an active, as opposed to passive, transformation. If the Lagrangian does not explicitly depend on x but only through φ a (x) (which will always be the case in the Lagrangians discussed in this course), the Lagrangian transforms under the infinitesimal translation as L L + ɛ ν ν L. (2.19) Since the change in L is a total derivative, we can invoke Noether s theorem which gives us four conserved currents T µ ν = (J µ ) ν one for each of the translations ɛ ν (ν = 0, 1, 2, 3). From (2.18) we readily read off the explicit expressions for T µ ν, T µ ν = L ( µ φ a ) νφ a δ µ νl. (2.20) This quantity is called the energy-momentum (or stress-energy) tensor. It has dimension [T µ ν] = 4 and satisfies µ T µ ν = 0. (2.21) 11

14 The four conserved charges are (µ = 0, 1, 2, 3) P µ = d 3 x T 0µ, (2.22) Specifically, the time component of P µ is P 0 = d 3 x T 00 = ( ) d 3 x π a φa L, (2.23) which (looking at (2.7) and (2.8)) is nothing but the Hamiltonian H. We thus conclude that the charge P 0 is the total energy of the field configuration, and it is conserved. In fields theory, energy conservation is thus a pure consequence of time translation symmetry, like it was in particle mechanics. Similarly, we can identify the charges P i (i = 1, 2, 3), P i = d 3 x T 0i = d 3 x π a i φ a, (2.24) as the momentum components of the field configuration in the three space directions, and they are of course also conserved. 2.3 Example: Electrodynamics As a simple application of the formalism we have developed so far in this section, let us try to derive Maxwell s equations using the field theory formulation. In terms of the electric and magnetic fields E and B and the charge density ρ and 3-vector current j, these equations take the well-known form B = 0, (2.25) E + B t = 0, (2.26) E = ρ, (2.27) B E t = j. (2.28) The E and B fields are spatial 3-vectors and can be expressed in terms of the components of the 4-vector field A µ = (φ, A) by E = φ A t, B = A. (2.29) This definition ensures that the first two homogeneous Maxwell equations (2.25) and (2.26) are automatically satisfied, ( A) = ɛ ijk i j A k = 0, (2.30) ( φ A ) + t t ( A) = ( φ) = ɛ ijk j k φ = 0. (2.31) 12

15 Here ɛ ijk is the fully antisymmetric Levi-Civita tensor with ɛ 123 = ɛ 123 = +1 and the indices i, j, k = 1, 2, 3 are summed over if they appear twice. The remaining two inhomogeneous Maxwell equations (2.27) and (2.28) follow from the Lagrangian L = 1 2 ( µa ν ) ( µ A ν ) ( µa µ ) 2 A µ J µ, (2.32) with J µ = (ρ, j). From the rules presented in Section 2.1, we gather that the dimension of the vector field and current is [A µ ] = 1 and [J µ ] = 3, respectively. The funny minus sign of the first term on the right-hand side is required to ensure that the kinetic term 1/2A 2 i is positive using the Minkowski metric. Notice also that the Lagrangian (2.32) has no kinetic term 1/2A 2 0 and hence A 0 is not dynamical. Why this is and necessarily has to be the case will only become fully clear if you attend the advanced QFT course. Yet, we can already get an idea what is going on by remembering that the photon (the quantum of electrodynamics) has only two polarization states, i.e., two physical dofs, while the massless vector field A µ has obviously four dofs. The fact that time component A 0 is not dynamical reduces the number of independent dofs in A µ from four to three. But this is still one too many. The last unwanted dof can be gauged away using the gauge symmetry of the quantum version of electromagnetism aka quantum electrodynamics (QED). Enough said, let s do serious business and compute something. To see that the statement made before (2.32) is indeed correct, we first evaluate L A ν = J ν, L ( µ A ν ) = µ A ν + ( ρ A ρ ) η µν, (2.33) from which we derive the EOMs, ( ) L 0 = µ L = [ 2 A ν + ν ( ρ A ρ ) ] + J ν = µ ( µ A ν ν A µ ) + J ν. (2.34) ( µ A ν ) A ν Introducing now the field-strength tensor we can write (2.32) and (2.34) quite compact, F µν = µ A ν ν A µ, (2.35) L = 1 4 F µνf µν J µ A µ. (2.36) µ F µν = J ν, (2.37) Does this look familiar? I hope so. Notice that [F µν ] = 2. In order to see that (2.37) indeed captures the physics of (2.27) and (2.28), we compute the components of F µν. We find F 0i = F i0 = 0 A i i A 0 = F ij = F ji = i A j j A i = ɛ ijk B k, 13 ( φ + A ) i = E i, t (2.38)

16 while all other components are zero. With this in hand, we then obtain from µ F µ0 = ρ and µ F µ1 = j 1, µ F µ0 = 0 F 00 + i F i0 = E = ρ, µ F µ1 = 0 F 01 + i F i1 = E1 t + B3 x 2 B2 x 3 = ( B E ) 1 = j 1. t (2.39) Similar relations hold for the remaining components i = 2, 3. Taken together this proves the second inhomogeneous Maxwell equation (2.28). Let me also derive the energy-momentum tensor T µν of electrodynamics, ignoring for the moment the source term A µ J µ. Using (2.33) one finds T µν = ( ν A µ )( ρ A ρ ) ( µ A ρ )( ν A ρ ) ηµν F ρσ F ρσ. (2.40) Notice that the first term in (2.40) is not symmetric, which implies that T µν T νµ. In fact, this is not really surprising since the definition of the energy-momentum tensor (2.20) does not exhibit an explicit symmetry in the indices µ and ν. Nevertheless, there is typically a way to massage the energy-momentum tensor of any theory into a symmetric form. 5 To learn how this can be done in the case under consideration is the objective of a homework problem. 2.4 Space-Time Symmetries One of the main motivations to develop QFT is to reconcile QM with special relativity. We thus want to construct field theories in which space and time are placed on an equal footing and the theory is invariant under Lorentz transformations, with x µ (x ) µ = Λ µ νx ν, (2.41) η µν = η ρσ Λ µ ρλ ν σ, (2.42) so that the distance ds 2 = η µν dx µ dx ν is preserved. Here η µν = η µν = diag (1, 1, 1, 1) denotes the Minkowski metric. E.g., a rotation by the angle θ about the z-axis, and a boost by v < 1 along the x-axis are respectively described by the following Lorentz transformations γ γv 0 0 Λ µ 0 cos θ sin θ 0 ν = 0 sin θ cos θ 0, γv γ 0 0 Λµ ν = , (2.43) with γ = 1/ 1 v 2. The Lorentz transformations form a Lie group under matrix multiplication. You can learn more about this if you attend the lecture course on group theory held 5 One (but not the only) reason that you might want to have a symmetric energy-momentum tensor T µν is to make contact with general relativity, since such an object sits on the right-hand side of Einstein s field equations. 14

17 by Andre Lukas. Alternatively, you can study the group theory crash course written by Martin Bauer (a PhD student at Mainz University). It can be found on my Oxford homepage. The various fields belong to different representations of the Lorentz group. The simplest example is the scalar field φ, which under the Lorentz transformation x Λx, 6 transforms as φ(x) φ (x) = φ(λ 1 x). (2.44) The inverse Λ 1 appears in the argument because we are dealing with an active transformation, in which the field is truly shifted. To see why this means that the inverse appears, it will suffice to consider a non-relativistic example such as a temperature field. Suppose we start with an initial field φ(x) which has a hotspot at, say, x = (1, 0, 0). Let s now make a rotation x Rx about the z-axis so that the hotspot ends up at x = (0, 1, 0). If we want to express the new field φ (x) in terms of the old field φ(x), we have to place ourselves at x = (0, 1, 0) and ask what the old field looked like at the point R 1 x = (1, 0, 0) we came from. This R 1 is the origin of the Λ 1 factor in the argument of the transformed field in (2.44). The Lagrangian formulation of field theory makes it especially easy to discuss Lorentz invariance, since an EOM is automatically Lorentz invariant if it follows from a Lagrangian that is a Lorentz scalar. This is an immediate consequence of the principle of least action. If a Lorentz transformation leaves the Lagrangian unchanged, the transformation of an extremum in the action will be another extremum. To give an example, let s look at the following Lagrangian L = 1 2 ( µφ) m2 φ 2. (2.45) where φ is a real scalar and, as we will see later, m is the mass of φ (for now on just think about m as a parameter). Obviously, the dimension of the field is [φ] = 1. You will show in a homework assignment that the EOM corresponding to (2.45) takes the form ( µ µ + m 2) φ = 0. (2.46) This equation is the famous Klein-Gordon equation. The Laplacian in Minkowski space is sometimes denoted by. In this notation, the Klein-Gordon equation reads ( + m 2 )φ = 0. Let us first check that a Lorentz transformation Λ leaves the Lagrangian (2.45) and its action invariant. According to (2.44), the mass term transforms as 1/2 m 2 φ 2 (x) 1/2 m 2 φ 2 (x ) with x = Λ 1 x. The transformation of µ φ is µ φ(x) µ (φ(x )) = (Λ 1 ) ν µ ( νφ)(x ). (2.47) Using (2.43) we thus find that the derivative term in the Klein-Gordon Lagrangian behaves as 1 2 ( µφ(x)) (Λ 1 ) ρ µ ( ρφ)(x )(Λ 1 ) σ ν ( σφ)(x ) η µν = 1 2 ( ρφ)(x )( σ φ)(x ) η ρσ (2.48) = 1 2 ( µφ(x )) 2, 6 To shorten the notation we will often use matrix notation and drop the indices µ, etc. 15

18 under the Lorentz transformation Λ. Putting things together, we find that the action of the Klein-Gordon theory is indeed Lorentz invariant, S = d 4 x L(x) d 4 x L(x ) = d 4 x L(x ) = S. (2.49) Notice that changing the integration variables from d 4 x to d 4 x, in principle introduces an Jacobian factor det (Λ). This factor is, however, equal to 1 for Lorentz transformation connected to the identity, that we are dealing with. A similar calculation also shows that, as promised, also the EOM of the Klein-Gordon field φ is invariant, ( 2 + m 2) φ(x) ( 2 + m 2) φ(x ) = [(Λ 1 ) ν µ ν(λ 1 ) ρµ ρ + m 2 ] φ(x ) (2.50) = ( η νρ ν ρ + m 2) φ(x ) = 0. In the case of the Klein-Gordon theory, we hence conclude that the statements made before (2.45) are indeed correct. Representations of Lorentz Group The transformation law (2.44) is the simplest possible transformation law for a field. In fact, it is the only possibility for a one-component field aka a real scalar. Yet, it is also clear that in order to describe nature (think only about electromagnetism) we need multicomponent fields, which have more complicated transformation properties. The most familiar case is that of a vector field, such as the vector potential A µ, which we have already met in Section 2.3. In this case the quantity that is distributed in space-time also carries an orientation which must be rotated and/or boosted. In fact, we will learn in this course that the Lorentz group has a variety of representations, corresponding to particles with integer (bosons) and half-integer spins (fermions) in QFT. These representations are normally constructed out of spinors. To start this general (and somewhat formal) discussion, let me examine the allowed possibilities for linear field transformations φ a (x) φ a(x ) = M(Λ) ab φ b (x), (2.51) under (2.41). The first important point to notice is that the Lorentz transformations form a group. This means that two successive Lorentz transformations, can also be described in terms of a single one x x = Λx, x x = Λ x, (2.52) x x = Λ x, (2.53) with Λ = Λ Λ. (2.54) 16

19 What happens to (2.51) under this set of Lorentz transformations? For x x = Λ x, we have (in matrix notation) φ(x) φ (x ) = M(Λ )φ(x). (2.55) On the other hand, for x x = Λx x = Λ Λx, we get φ(x) φ (x ) = M(Λ )φ (x ) = M(Λ )M(Λ)φ(x). (2.56) In order for the last two equations to be consistent with each other, the field transformations M must obviously fulfill M(Λ Λ) = M(Λ )M(Λ). (2.57) In group theory terminology, this means that the matrices M furnish a representation of the Lorentz group. Field Lorentz transformations are therefore not random, but they can be found if we find all (finite dimensional) representations of the Lorentz group. So how do the common representation of the Lorentz group look like and how do we get all of them? While both questions will be answered in this lecture, I believe it is best to do it case-by-case whenever we will meet a new type of (quantum) field. Since we already talked about the real scalar φ (Klein-Gordon field) and the vector A µ (potential in electrodynamics), it makes nevertheless sense to give the representations for these two types of fields already at this point. Since a scalar field by definition does not change under Lorentz transformations, φ(x) φ (x ) = φ(x), the scalar representation of the Lorentz group is simply M(Λ) = 1. (2.58) This was easy! The representation of the vector A µ is also not difficult to figure out. Let me for the time being only state the result. One finds M(Λ) = Λ, (2.59) which means that a vector field A µ transforms under a Lorentz transformation as (restoring indices) A µ (x) (A ) µ (x ) = Λ µ νa ν (x ). (2.60) It is important to notice that the latter transformation property implies that any term build out of A µ and µ, where all Lorentz indices are contracted is invariant under Lorentz transformations. As an exercise you are supposed to show this explicitly for terms like µ A µ, etc. Angular Momentum In classical particle mechanics, rotational invariance gives rise to conservation of angular momentum. What is the analogy in field theory? Moreover, we now have further Lorentz transformations, namely boosts. What conserved quantity do they correspond to? In order to address these questions, we first need the infinitesimal form of the Lorentz transformations Λ µ ν = δ µ ν + ω µ ν, (2.61) 17

20 where ω µ ν is infinitesimal. The condition (2.42) for Λ to be a Lorentz transformation becomes in infinitesimal form η µν = η ρσ (δ µ ρ + ω µ ρ) (δ ν σ + ω ν σ) = η µν + ω µν + ω νµ + O(ω 2 ), (2.62) which implies that ω µν must be an antisymmetric matrix, ω µν = ω νµ. (2.63) Notice that an antisymmetric 4 4 matrix has six independent parameters, which agrees with the number of different Lorentz transformations, i.e., three rotations and three boosts. Applying the infinitesimal Lorentz transformation to our real scalar field φ, we have φ(x) φ(x ωx) = φ(x) ω µ νx ν µ φ(x), (2.64) where the minus sign arises from the factor Λ 1 in (2.43). The variation of the field φ under an infinitesimal Lorentz transformation is hence given by δφ = ω µ ν x ν µ φ. (2.65) By the same line of reasoning, one shows that the variation of the Lagrangian is δl = ω µ ν x ν µ L = µ (ω µ ν x ν L), (2.66) where in the last step we used the fact that ω µ µ = 0 due to its antisymmetry. The Lagrangian changes by a total derivative, so we can apply Noether s theorem (2.17) with J µ = ω µ ν x ν L to find the conserved current, J µ = L ( µ φ) ωρ ν x ν ρ φ + ω µ ν x ν L [ ] L = ω ρ ν ( µ φ) ρφ δ µ ρl x ν = ω ρ νt µ ρx ν. (2.67) Stripping off ω ρ ν, we obtain six different currents, which we write as These currents satisfy (J λ ) µν = x µ T λν x ν T λµ. (2.68) λ (J λ ) µν = 0, (2.69) and give (as usual) rise to six conserved charges. For µ, ν 0, the Lorentz transformation is a rotation and the three conserved charges give the total angular momentum of the field (i, j = 1, 2, 3): Q ij = d 3 x ( x i T 0j x j T 0i). (2.70) What s about the boosts? In this case, the conserved charges are Q 0i = d 3 x ( x 0 T 0i x i T 00). (2.71) 18

21 The fact that these are conserved tells us that 0 = dq0i = d 3 x T 0i + t d 3 0i dt x d dt dt dt = P i + t dp i dt d d 3 x x i T 00. dt d 3 x x i T 00 (2.72) Yet, also the momentum P i is conserved, i.e., dp i /dt = 0, and we conclude that d d 3 x x i T 00 = const.. (2.73) dt This is the statement that the center of energy of the field travels with a constant velocity. In a sense it s a field theoretical version of Newton s first law but, rather surprisingly, appearing here as a conservation law. Notice that after restoring the label a our results for (J λ ) µν etc. also apply in the case of multicomponent fields. Poincaré Invariance We now require that a physical system possesses both space-time translation (2.18) and Lorentz transformation symmetry (2.41). The symmetry group that includes both transformations is called the Poincaré group. Notice that for any Poincaré-invariant theory the two charge conservation equations (2.21) and (2.69) should hold. This is only possible if the energymomentum tensor T µν is symmetric. Indeed, 0 = λ (J λ ) µν = λ ( x µ T λν x ν T λµ) = x µ λ T λν + T λν λ x µ x ν λ T λµ T λµ λ x ν (2.74) = T λν δ λ µ T λµ δ λ ν = T µν T νµ. Since Maxwell s theory is Poincaré invariant, this general result tells us that the expression of the energy-momentum tensor in (2.40) can be made symmetric without changing physics. The key to actually do it, lies in making use of the conservation law (2.21) in an appropriate way. 2.5 Problems i) Suppose that a no further specified Lagrangian L depends not only on φ and µ φ but also on the second derivatives of the fields: 7 L = L(φ, µ φ, µ ν φ). (2.75) For the case that the variations δφ vanish at the endpoints and that δ( µ1... µn φ) = µ1... µn (δφ) holds, derive the Euler-Lagrange EOMs for such a theory. 7 For the sake of brevity, we have omitted the subscript a labelling the different fields. 19

22 Apply your result to obtain the EOMs for the field φ with Lagrangian L = 1 2 ( tφ) ( x φ) + α 6 ( xφ) 3 ν 2 ( ν µ φ) 2. (2.76) ii) Let us study the dynamics of acoustic waves in an elastic medium (e.g. air), as described by the Lagrangian L = 1 ( ) 2 y 2 ρ 1 t 2 ρv2 sound ( y) 2, (2.77) with ρ the density of the medium and v sound the speed of sound. Find the Euler-Lagrange EOMs for the system and their solutions. What do they describe? Calculate the Hamiltonian H. iii) Consider the Klein-Gordon Lagrangian (2.45). Derive the kinetic and potential energy (T and V with L = T V ) as well as the Euler-Lagrange EOMs for the field φ. Write down the energy-momentum tensor T µν and show that it indeed satisfies µ T µν = 0. Give the expressions for the conserved energy E and momentum P i. iv) Using (2.60) show that the terms µ A µ, ( µ A ν ) 2, and ( µ A ν )( ν A µ ) are Lorentz invariant. What are the dimensions of these terms? v) We saw that in the case of electrodynamics in vacuum using (2.20) leads to an energymomentum tensor T µν that is not symmetric. To remedy that, one can add to T µν a term of the form λ Γ λµν, where Γ λµν is antisymmetric in its first two indices, i.e., Γ λµν = Γ µλν. Show that such an object is automatically divergenceless, i.e., it obeys µ λ Γ λµν = 0. This feature implies that instead of T µν one can also use Θ µν = T µν + λ Γ λµν, (2.78) without changing the physics, since Θ µν momentum as T µν. Show that this construction, with has the same globally conserved energy and Γ λµν = F µλ A ν, (2.79) leads to an energy-momentum tensor Θ µν that is symmetric and yields the standard formulas for the electromagnetic energy and momentum densities: E = 1 ( E 2 + B 2), S = E B. (2.80) 2 20

23 References [1] Chapter 4 of L. D. Landau and E. M. Lifshitz, The Classical Theory of Fields, Fourth Edition: Vol. 2 (Course of Theoretical Physics Series), Butterworth-Heinemann (1975), 481 p. [2] Chapter 5 of B. Thidé, Electromagnetic Field Theory, revised and extended 2nd edition, 21

24 3 Klein-Gordon Theory In QM, canonical quantization is a recipe that takes us from the Hamiltonian formalism of classical dynamics to the quantum theory. The recipe tells us to take the generalized coordinates q a and their conjugate momenta p a = L/ q a and promote them to operators. The Poisson bracket structure of classical mechanics descends to the structure of commutation relations between operators, namely [q a, q b ] = [p a, p b ] = 0, [q a, p b ] = iδ a b, (3.1) where [a, b] = ab ba is the usual commutator. If one wants to construct a QFT, one can proceed in a similar fashion. The idea is to start with the classical field theory and then to quantize it, i.e., reinterpret the dynamical variables as operators that obey canonical commutation relations, 8 [φ a (x), φ b (y)] = [π a (x), π b (y)] = 0, [φ a (x), π b (y)] = iδ (3) (x y)δ a b. (3.2) Here φ a (x) are field operators and the Kronecker delta in (3.1) has been replaced by a delta function since the momentum conjugates π a (x) are densities. Notice that for now, we are working in the Schrödinger picture which means that the operators φ a (x) and π a (x) do only depend on the spatial coordinates but not on time. The time dependence sits in the states ψ which obey the usual Schödinger equation i d ψ = H ψ. (3.3) dt While all this looks pretty much the same as good old QM there is an important difference. The wavefunction ψ in QFT, is a functional, i.e., a function of every possible configuration of the field φ a, and not a simple function. 9 So things are more complicated in QFT than in QM after all. The Hamiltonian H, being a function of φ a and π a, also becomes an operator in QFT. In order to solve the theory, one task is to find the spectrum, i.e., the eigenvalues and eigenstates of H. This is usually very difficult, since there is an infinite number of dofs within QFT, at least one for each point x in space. However, for certain theories, called free theories, one can find a way to write the dynamics such that each dof evolves independently from all the others. Free field theories typically have Lagrangians which are quadratic in the fields, so that the EOMs are linear. 8 This procedure is sometimes referred to as second quantization. We will not use this terminology here. 9 In functional analysis, a functional is a map from a vector space to the field underlying the vector space, which is usually the real numbers. In other words, it is a function that takes a vector as its argument or input and returns a scalar. Commonly, the vector space is a space of functions, so the functional takes a function as its argument, and so it is sometimes referred to as a function of a function. The use of functionals goes back to the calculus of variations where one searches for a function which minimizes a certain functional. A particularly important application in physics is to search for a state of a system which minimizes the energy functional. 22

25 3.1 Klein-Gordon Field as Harmonic Oscillators So far the discussion in this section was rather general. Let us be more specific and consider the simplest relativistic free theory as a practical example. It is provided by the classical Klein-Gordon equation (2.45). To exhibit the coordinates in which the dofs decouple from each other, we only have to Fourier transform the field φ, d 3 p φ(t, x) = (2π) 3 ei p x φ(t, p). (3.4) In momentum space (2.45) simply reads [ 2 t + ( p 2 + m 2) ] φ(t, p) = 0, (3.5) 2 which tells us that for each value of p, the Fourier transform φ(t, p) solves the equation of a harmonic oscillator with frequency ω p = p 2 + m 2. (3.6) We see that the most general solution of the classical Klein-Gordon equation is a linear superposition of simple harmonic oscillators, each vibrating at a different frequency with a different amplitude. In order to quantize the field φ, we must hence only quantize this infinite number of harmonic oscillators (as Sidney Coleman once said [1]: The career of a young theoretical physicist consists of treating the harmonic oscillator in ever-increasing levels of abstraction. ). Let s recall how to do it in QM. Harmonic Oscillator in QM Consider the QM Hamiltonian H = 1 2 p ω2 q 2, (3.7) with the canonical commutation relations [q, p] = i. In order to find the spectrum of the system, we define annihilation and creation operators (also known as lowering and raising or ladder operators) ω a = 2 q + i ω p, a = 2ω 2 q i p. (3.8) 2ω Expressing q and p through a and a gives q = 1 2ω (a + a ), ω p = i 2 (a a ). (3.9) The commutator of the operators introduced in (3.8) is readily computed. One finds [a, a ] = 1. Expressing the Hamiltonian (3.7) through a and a gives H = ω ( aa + a a ) ( = ω a a + 1 ). (3.10)

26 It is also easy to show that the commutator of H with a and a takes the form [H, a] = ωa, [H, a ] = ωa. (3.11) These relations imply that if ψ is an eigenstate of H with energy E, i.e., H ψ = E ψ, then we can construct other eigenstates by acting with the operators a and a on ψ : Ha ψ = (E ω) a ψ, Ha ψ = (E + ω) a ψ, (3.12) This feature explains why a (a ) is called annihilation (creation) operator. From the latter equation it is also clear that the spectrum of (3.7) has a ladder structure,..., E 2ω, E ω, E, E +ω, E +2ω,.... If the energy is bounded from below, there must be a ground state 0, which satisfies a 0 = 0. This state has the ground state or zero-point energy H 0 = ω/2 0. Excited states n are then created by the repeated action of a, and satisfy (a ) n 0 = n (n 1)... 1 n = n! n, (3.13) ( H n = ω N + 1 ) ( n = ω n + 1 ) n, (3.14) 2 2 where N = a a is the number operator with N n = n n. The prefactor on the right-hand side of (3.13) is needed to guarantee that the states n are normalized to 1, i.e., n n = 1. Quantization of Real Klein-Gordon Field If we treat each Fourier mode of the field φ as an independent harmonic oscillator, we can apply canonical quantization to the real Klein-Gordon theory, and in this way find the spectrum of the corresponding Hamiltonian. In analogy to (3.9), we write φ and π as a linear sum of an infinite number of operators a p and a p, labelled by the 3-momentum p, d 3 p 1 [ ] φ(x) = a (2π) 3 p e i p x + a p e i p x, 2ωp π(x) = d 3 p (2π) 3 ( i) The commutation relations (3.2) become ωp 2 [a p, a q ] = [a p, a q] = 0, [a p, a q] = (2π) 3 δ (3) (p q). [a p e i p x a p e i p x ]. Let us assume that the latter equations hold, it then follows that d 3 p d 3 q i ωq ( ) [φ(x), π(y)] = [a (2π) 6 p, a 2 ω q] e i p x i q y + [a p, a q ] e i p x+i q y p d 3 p d 3 q i ωq ( ) = (2π) 3 δ (3) (p q) e i p x i q y e i p x+i q y (2π) 6 2 ω p d 3 p i ( ) = e i p (x y) e i p (x y) = iδ (3) (x y), (2π) (3.15) (3.16) (3.17)

27 where we have dropped terms [a p, a q ] = [a p, a q] = 0 from the very beginning. To show that [φ(x), φ(y)] = [π(x), π(y)] = 0 is left as an exercise. In terms of the ladder operators a p and a p the Hamiltonian of the real Klein-Gordon theory takes the form H = 1 d 3 x [ π 2 + ( φ) 2 + m 2 φ 2] 2 = 1 [ d 3 x d 3 p d 3 q ωp ω q ( ap e i p x a 2 (2π) 6 p e i p x) ( a q e i q x a i q e q x) 2 = ( ip ap e i p x ip a p e i p x) (iq a q e i q x iq a i q e q x) ω p ω q + m2 2 ( ap e i p x + a p e i p x) ( a q e i q x + a i q e q x) ω p ω q d 3 p 1 [ ] ( ω (2π) 3 p 2 + p 2 + m 2 )(a p a p + a ω pa p) + (ωp 2 + p 2 + m 2 )(a p a p + a pa p ), p ] (3.18) where we have first used the expressions for φ and π given in (3.15) and then integrated over d 3 x to get delta functions δ (3) (p ± q), which, in turn, allows us to perform the d 3 q integral. Inserting finally the expression (3.6) for the frequency, the first term in (3.18) vanishes and we are left with H = 1 2 = d 3 p (2π) ω 3 p (a p a p + a pa ) p = d 3 p (2π) 3 ω p ( a pa p (2π)3 δ (3) (0) d 3 p (2π) ω 3 p ). ( a pa p + 1 ) 2 [a p, a p] (3.19) We see that the result contains a delta function, evaluated at zero where it has an infinite spike. This contribution arises from the infinite sum over all modes vibrating with the zeropoint energy ω p /2. Moreover, the integral over ω p diverges at large momenta p. To better understand what is going on let us have a look at the ground state 0 where the former infinity first becomes apparent. 3.2 Structure of Vacuum As in the case of the harmonic oscillator in QM, we define the vacuum 0 through the condition that it is annihilated by the action of all a p, a p 0 = 0, p. (3.20) With this definition the energy E 0 of the vacuum comes entirely from the second term in the last line of (3.19), ( ) d 3 p ω p H 0 = E 0 0 = (2π) 3 2 (2π)3 δ (3) (0) 0 = 0. (3.21) 25

28 In fact, the latter expression contains not only one but two infinities. The first arises because space is infinitely large. Infinities of this kind are often referred to as infrared (IR) divergences. To isolate this infinity, we put the theory into a box with sides of length L and impose periodic boundary conditions (BCs) on the field. Then, taking the limit L, we get (2π) 3 δ (3) (0) = lim L L/2 L/2 d 3 x e i p x p=0 = lim L L/2 L/2 d 3 x = V, (3.22) where V denotes the volume of the box. This result tells us that the delta function singularity arises because we try to compute the total energy E 0 of the system rather than its energy density E 0. The energy density is simply calculated from E 0 by dividing through the volume V. One finds E 0 = E 0 d 3 V = p ω p (2π) 3 2, (3.23) which is still divergent and resembles the sum of zero-point energies for each harmonic oscillator. Since E 0 in the limit p, i.e., high frequencies (or short distances), this singularity is an ultraviolet (UV) divergence. This divergence arises because we want too much. We have assumed that our theory is valid to arbitrarily short distance scales, corresponding to arbitrarily high energies. Recalling the discussion of energy scales in Section 1.2, this assumption is clearly absurd. The integral should be cut off at high momentum, reflecting the fact that our theory presumably breaks down at some point (most likely far below the GUT or Planck scale). Fortunately, the infinite energy shift in (3.19) is harmless if we want to measure the energy difference of the energy eigenstates from the vacuum. We can therefore recalibrate our energy levels (by an infinite constant) removing from the Hamiltonian operator the energy of the vacuum, :H : = H E 0 = H 0 H 0. (3.24) With this definition one has :H : 0 = 0. In fact, the difference between the latter Hamiltonian and the previous one is merely an ordering ambiguity in moving from the classical to the quantum theory. E.g., if we would have defined our Hamiltonian to take the form H = 1 (ωq ip) (ωq + ip), (3.25) 2 which is classically the same as our original definition (3.7), then after quantization instead of (3.10), we would have gotten H = ωa a. (3.26) This type of ordering ambiguity arises often in field theories. The method that we have used above to deal with it is called normal ordering. In practice, normal ordering works by placing all annihilation operators a p in products of field operators to the right. Applied to the Hamiltonian of the real Klein-Gordon theory this prescription leads to :H : = d 3 p (2π) 3 ω p a pa p. (3.27) In the remainder of this section, we will normal order all operators in this manner (dropping the : : for simplicity). 26

29 Cosmological Constant Above we concluded that as long as we are interested in the differences between energy levels the infinite total energy E 0 of the vacuum does not matter (which effectively means that E 0 has no effect on particle physics phenomenology). So is the value of E 0 unobservable then? No, in fact, not at all, since gravity is supposed to see all energy densities. In particular, the sum of all the zero-point energies should contribute to Einstein s equations, R µν R 2 g µν + Λg µν = 8πG N T µν, (3.28) in the form of a cosmological constant Λ = E 0 /V. Here R µν is the Ricci curvature tensor, R the scalar curvature (for their definitions please consult a text on general relativity), g µν is the metric tensor (not to be mixed up with the Minkowski metric η µν ), G N denotes Newton s constant, which we have already met in (1.8), and T µν is the energy-momentum tensor in its symmetric form. Unfortunately, I do not have time to explain (3.28) in detail. If you want to learn more about Einstein s equation, I suggest that you attend Andrew Steane s course on general relativity. In order to be able to follow this lecture, it is sufficient to know that these equations contain a term proportional to E 0 /V. An assortment of observations (cosmic microwave background, type-ia supernovae, baryon acoustic oscillations, etc.) tells us that 74% of the energy density in the universe has the properties of a cosmological constant. This constant energy density filling space homogeneously is one form of dark energy. Another possibility of dark energy would be a scalar field such as quintessence, a dynamic quantity whose energy density can vary in space. The rest of the composition of today s cosmos is made up by dark matter, amounting to 22%, and visible matter (atoms, etc.), giving the missing 4%. Dark matter is dark in the sense that it is inferred to exist from gravitational effects on visible matter and background radiation, but is undetectable by emitted or scattered electromagnetic radiation. So in conclusion, fully 96% of the universe seems to be composed of stuff we ve never seen directly on earth. But our lack of understanding does not end there. In the last subsection, we have argued that integrating in (3.23) up to infinity is not the right thing to do, but that one should only consider modes up to a certain UV cut-off Λ UV, where one stops trusting the underlying theory. The resulting energy density E 0 then scales like Λ 4 UV. While it is not clear which precise value we should take for Λ UV, let us be not very ambitious and take a value for this scale, up to which we truly believe that we understand the physics of fundamental interactions. The electron mass m e = 511 kev could be such a choice. In consequence, E predicted 0 (511 kev) ev, (3.29) where the superscript predicted should probably better read guessed. Glancing at Table 1, we see that the observed value of E 0 is E observed 0 (10 3 ev) ev, (3.30) so it is clearly non-zero but unfortunately also roughly 34 orders of magnitude smaller than our prediction. Notice that the choice Λ UV = m e that lead to (3.29) was, in fact, a conservative 27

30 one, because other educated guesses such as Λ UV = v, M GUT, etc. would have lead to a much bigger disagreement of up to 120 orders of magnitude for the choice Λ UV = M P. From the point of view of QFT, the net cosmological constant, is the sum of a number of apparently disparate contributions, including zero-point fluctuations of each field theory dof and potential energies from scalar fields, as well as a bare cosmological constant. There is no obstacle to imagining that all of the large and apparently unrelated contributions add together, with different signs, to produce a net cosmological constant consistent with the limit (3.30), other than the fact that it seems ridiculous. We know of no special symmetry which could enforce a vanishing vacuum energy while remaining consistent with the known laws of physics. This conundrum is the cosmological constant problem. While no widely accepted solution to this problem exists, there are many proposed ones ranging from the anthropic principle to the string-theory landscape. Don t bother if you have never even heard of any of them, it is not important at all for what follows. Casimir Effect Using the normal ordering prescription we can happily set E 0 = 0, while chanting the mantra that only energy differences can be measured. However, it should be possible to see that the vacuum energy is different if, for a reason, the fields vanish in some region of the space-volume or if some frequencies ω p do not contribute to the vacuum energy. Such a set-up can be realized, by forcing the real Klein-Gordon field φ to satisfy appropriate BCs. Let us assume, that φ vanishes on the planes with x = 0 and x = L, φ(0, y, z) = φ(l, y, z) = 0, (3.31) The presence of these BCs affects the Fourier decomposition of the field and, in particular, leads to a quantization of the momentum of the field inside the planes (k Z + ), ( ) kπ p = L, p y, p z. (3.32) For simplicity let us consider a massless real scalar field. In this case the ground-state energy per unit area S between the planes is given by the following expression E 0 (L) dp 2 (kπ ) 2 = 1 + p 2 S (2π) 2 2 L. (3.33) k=1 Notice that we only integrate over the perpendicular directions p = (p y, p z ), since the momentum p x is discretized. Consequently, the volume integral has to be replaced by a surface integral of the planes. In analogy to (3.22), this gives a factor S/(2π) 2 instead of V/(2π) 3. Let us see if we are able to calculate (3.33). We first switch to polar coordinates, E 0 (L) S = k=1 0 dp p 2π 1 2 (kπ L ) 2 + p 2. (3.34) 28

31 As it stands this integral is divergent in the limit p. We can regulate this singularity in a number of different ways. One way to do it, is to introduce a UV cut-off a L, so that modes of momentum much bigger than a 1 are removed. E.g., multiplying the integrand in (3.34) by the factor exp [ a ((kπ/l) 2 +p 2 )1/2 ] would do the job, since the resulting expression has the property that as a 0, one regains the full, infinite result (3.33). The drawback of this method is that the new integral is quite difficult to perform (though doable), so let s see if we find an easier way. The trick is to consider (3.33) not in d = 4 dimensions, but to work in less dimensions, say, d = 4 2ɛ with ɛ > 0. While this looks very weird at first sight, let me mention that in general there exists a value of ɛ for which the integral is well-defined. We shall perform our calculation for such a value, and then try to analytically continue the result to ɛ = 0. In d = 4 2ɛ dimensions the integral (3.34) takes the form E 0 (L) S = k=1 0 dp p 1 2ɛ 2π 1 2 (kπ L ) 2 + p 2. (3.35) To evaluate this expression, we first change variables p kπ/l l. We then obtain ( E 0 (L) = 1 ( ) π 3 2ɛ k S 4π L) 3 2ɛ dl l 1 2ɛ 1 + l 2 k=1 0 (3.36) = 1 ( π 3 2ɛ ζ(2ɛ 3) dl 8π L) 2 (l ) 2 ɛ 1 + l 2, 0 where we have identified the infinite sum with a Riemann zeta function, employing 1 = ζ(a). (3.37) ka k=1 Performing now the change of variables l 2 x/(1 x), we arrive at 1 dl 2 (l ) 2 ɛ 1 + l 2 = dx x ɛ (1 x) ɛ 5/2 = B(1 ɛ, ɛ 3/2), (3.38) 0 where in the last step we have used the definition of the Euler beta function, B(a, b) = Γ(a)Γ(b) Γ(a + b) = dx x a 1 (1 x) b 1. (3.39) Putting everything together, the final result in d = 4 2ɛ dimensions reads E 0 (L) = 1 ( π ) 3 2ɛ ζ(2ɛ 3) B(1 ɛ, ɛ 3/2). (3.40) S 8π L Amazingly, 10 we can even take the limit ɛ 0. Using Γ(a + 1) = a Γ(a) with Γ(1) = 1 and Γ(1/2) = π, and recalling that ζ( 3) = 1/120, we arrive at the finite expression E 0 (L) S = π2 1440L 3. (3.41) 10 Many subtleties have been swept under the carpet in this calculation. E.g., the dimensions of the expressions in (3.35) to (3.40) are wrong by 2ɛ. All cheats will become clear when the method of dimensional regularization is properly introduced. 29

32 This result implies that the vacuum energy depends on the distance between the two planes, on which φ vanishes. Can we realize this in an experiment? Remember that the electromagnetic field is zero inside a conductor. If we place two uncharged conducting plates parallel to each other at a distance L, then we can reproduce the BCs of the set-up that we have just studied. While the quantization of the electromagnetic field is more complicated than the real Klein-Gordon field, which we have used to model the effect, this difference becomes (almost) immaterial as far as the vacuum energy is concerned. Our analysis, leads to an amazing prediction. Two electrically neutral metal plates attract each other. This is known as the Casimir-Polder force, first predicted in 1948 [4]. Notice, that the energy of the vacuum gets smaller when the conducting plates are closer, as indicated by the minus sign in (3.41). Therefore, there is an attractive force between them. This is an effect that has by now been verified experimentally with great precision. 11 In our example, the force per unit area (pressure or rather anti-pressure) between the two conductor plates is given by F = 1 E 0 (L) S L = π2 480L. (3.42) 4 In fact, the true Casimir-Polder force is twice as large as the latter result, due to the two polarization states of the photon. 3.3 Particle States After the discussion of the properties of the vacuum, we can now turn to the excitations of φ. It s easy to verify (and therefore left as an exercise) that, in full analogy to (3.11), the Hamiltonian and the ladder operators of the real Klein-Gordon theory obey the following commutation relations [H, a p ] = ω p a p, [H, a p] = ω p a p. (3.43) These relations imply that we can construct energy eigenstates by acting on the vacuum state 0 with a p (remember that they also imply that a p 0 = 0, p). We define This state has energy p = a p 0. (3.44) H p = E p p = ω p p, (3.45) with ω p given in (3.6), which is nothing but the relativistic energy of a particle with 3- momentum p and mass m. We thus interpret the state p as the momentum eigenstate of a single scalar particle of mass m. Let us check this interpretation by studying the other quantum numbers of p. We begin with the total momentum P introduced in (2.24). Turning this expression into an operator, we arrive, after normal ordering, at d P = d 3 3 p x π φ = (2π) p 3 a pa p. (3.46) 11 The first experimental test of the Casimir-Polder force was conducted by Marcus Sparnaay in 1958, in a delicate and difficult experiment with parallel plates. Due to the large experimental errors, his results could neither prove the theoretical prediction right nor wrong. 30

33 Acting with P on our state p gives d 3 q d P p = (2π) q 3 a q a q a 3 q p 0 = (2π) q 3 a q [(2π) 3 δ (3) (p q) + a pa ] q 0 = p p, (3.47) where we have employed the second line in (3.16) and used the fact that an annihilation operator acting on the vacuum is zero. The latter result tells us that the state p has momentum p. Another property of p that we can study is its angular momentum. Again we take the classical expression for the total angular momentum (2.67) and turn it into an operator, J i = ɛ ijk d 3 x (J 0 ) jk, (3.48) It is a good exercise to show that by acting with J i on the one-particle state with zero momentum one gets J i p = 0 = 0. (3.49) This result tells us that the particle carries no internal angular momentum. In other words, quantizing the real Klein-Gordon field gives rise to a spin-zero particle aka a scalar. Multiparticle States Acting multiple times with the creation operators on the vacuum we can create multiparticle states. We interpret the state p 1,..., p n = a p 1... a p n 0, (3.50) as an n-particle state. Since one has [a p i, a p j ] = 0, the state (3.50) is symmetric under exchange of any two particles. E.g., p, q = a pa q 0 = a q a q 0 = q, p. (3.51) This means that the particles corresponding to the real Klein-Gordon theory are bosons. We see that, as promised already in Section 1.1, the relationship between spin and statistics is, in fact, a consequence of the QFT framework, following, in the case at hand, from the commutation quantization conditions for boson fields (3.2). The full Hilbert space of our theory is spanned by acting on the vacuum with all possible combinations of creation operators, 0, a p 0, a pa q 0, a pa q a r 0,.... (3.52) This space is known as the Fock space and is simply the sum of the n-particle Hilbert spaces, for all n 0. Like in QM, there is also an operator which counts the number of particles in a given state in the Fock space. It is the number operator N = d 3 p (2π) 3 a pa p, (3.53) 31

34 which satisfies N p 1,..., p n = n p 1,..., p n. Notice that the number operator commutes with the Hamiltonian, i.e., [N, H] = 0, ensuring that particle number is conserved. This means that we can place ourselves in the n-particle sector, and will remain there. This is a property of free theories, but will no longer be true when we consider interactions. Interactions create and destroy particles, taking us between the different sectors in the Fock space. Operator-Valued Distributions We have referred to the states p as particles. Yet, this name is somewhat misleading, since these states are momentum eigenstates and therefore not localized in space. Recall that in QM both the position and momentum eigenstates are not good elements of the Hilbert space since they are not normalizable (they normalize to delta functions). Similarly, in QFT neither the operators φ(x), nor a p and a p are good operators acting on the Fock space. This is because these operators all produce states that are not normalizable: 0 φ(x)φ(x) 0 = δ (3) (0), 0 ap a p 0 = (2π) 3 δ (3) (0). (3.54) This feature implies that they are operator-valued distributions and not functions. In the case of φ(x) one has that although the field operator has a well-defined vacuum expectation value (VEV), 0 φ(x) 0 = 0, the fluctuations 0 φ(x)φ(x) 0 of the operator at a fixed point are infinite. We can construct well-defined operators by smearing these distributions over space. E.g., we can create a wavepacket ψ = d 3 p (2π) 3 e i p x ψ(p) p, (3.55) which is partially localized in both position and momentum space. A typical state might be described by the Gaussian ψ(p) = exp [ p 2 /(2m 2 )]. Relativistic Normalization The vacuum 0 is normalized as 0 0 = 1. The one-particle states p = a p 0 then satisfy p q = 0 [ ] ap a q 0 = 0 (2π) 3 δ (3) (p q) + a q a p 0 = (2π) 3 δ (3) (p q), (3.56) where we have made use of (3.16) and (3.20) to arrive at the final answer. Since the latter expression depends on 3-momenta, an immediate question that arises is whether it is Lorentz invariant. What could go wrong? Suppose we perform a Lorentz transformation p µ (p ) µ = Λ µ ν p ν, (3.57) such that p p. In our QFT it would be preferable, if the state p changes under this Lorentz transformation as p p = U(Λ) p, (3.58) with U(Λ) being unitary, i.e., U (Λ)U(Λ) = U(Λ)U (Λ) = 1. In such a case the normalization of p would remain unchanged p p p p = p U (Λ)U(Λ) p = p p. (3.59) 32

35 In order to find out whether or not the original and the Lorentz-transformed state, p and p, are related by an unitary transformation, we should look at an object which we know is Lorentz invariant. One such object is the identity operator (which is really the projection operator onto one-particle states). With the normalization (3.56) we know that it is given by 1 = d 3 p p p. (3.60) (2π) 3 This operator is Lorentz invariant, but it consists of two terms: the measure d 3 p and the projector p p. Are these two objects Lorentz invariant by themselves? In fact, they are not. In order to prove this statement, we start with the measure d 4 p which is obviously Lorentz invariant. The relativistic dispersion relation for a massive particle, i.e., p 2 = m 2, and hence p 2 0 = E 2 p = p 2 +m 2 is also Lorentz invariant. Solving for p 0, there are two branches of solutions, namely p 0 = ±E p. But the choice of branch is another Lorentz-invariant concept. Putting everything together tells us that d 4 p δ(p 2 0 p 2 m 2 ) p0 >0 = d 3 p 2p 0 = p0 =E p d 3 p 2E p, (3.61) is Lorentz invariant. From the latter result we can figure out everything else. E.g., the Lorentz-invariant delta function for 3-momenta is since 2E p δ (3) (p q), (3.62) d 3 p 2E p 2E p δ (3) (p q) = 1. (3.63) This finally tells us that the relativistically normalized momentum eigenstates are given by 12 and satisfy p = 2E p p = 2E p a p 0, (3.64) p q = (2π) 3 2E p δ (3) (p q). (3.65) We can also express the identity operator in terms of the p states. One has d 3 p 1 1 = p p. (3.66) (2π) 3 2E p We remark that some texts on QFT also define relativistically normalized annihilation (creation) operators by a(p) = 2E p a p ( a (p) = 2E p a p). In order to avoid (further) confusion, we won t make use of this notation here. 12 Our notation is rather subtle here, since the relativistically normalized momentum states p differ from p just by the fact that they are not set in boldface type. 33

36 3.4 Two Real Klein-Gordon Fields Our task is to describe all known particles and their interactions. It is then interesting to study the quantization of a system with more than one field. In order to keep things simple, let us try to describe a system of two real Klein-Gordon fields φ 1,2 which differ only in their mass parameters (m 1 m 2 ), L = [ 1 2 ( µφ i ) 2 1 ] 2 m2 i φ 2 i. (3.67) i=1,2 This Lagrangian leads to two independent Klein-Gordon equations, ( 2 + m 2 i ) φ i = 0. (3.68) The Hamiltonian, the total momentum, and the number operator of the system is given by H = H 1 + H 2, P = P 1 + P 2, N = N 1 + N 2, (3.69) where H i = d 3 p (2π) ω 3 i,p a i,p a i,p, P i = d 3 p (2π) p 3 a i,p a i,p, N i = d 3 p (2π) 3 a i,p a i,p, (3.70) with ω i,p = (p 2 + m 2 i ) 1/2. It should be clear, that we can construct particle states in the same fashion as we did with the Lagrangian of just a single real Klein-Gordon field. Products of a 1,p operators acting on 0 create relativistic particles with mass m 1, while a 2,p operators create particles with mass m 2. E.g., the states satisfy S 1 = a 1,p 0, S 2 = a 2,p 0, (3.71) H S i = ω i,p S i, P S i = p S i, N S i = 1 S i. (3.72) These relations tell us that the states S 1,2 are degenerate in the sense that they are singleparticle states with the same momentum p. However, they can be distinguished by measuring the energy of the particles as long as the masses m 1,2 are different (which we have assumed for the time being). Equal-Mass Case Admittedly the case of two real Klein-Gordon fields with different masses m 1,2 is pretty boring. Things get a little bit more interesting, if we consider the special case m 1 = m 2 = m. Why? Because in this case the system possesses an additional rotation symmetry in the space of fields φ 1,2. According to Noether s theorem this should lead to a new conserved charge. In order to be able to identify the additional charge, we first write the Lagrangian (3.67) in a form that exhibits the symmetry L = 1 2 ( µφ T )( µ φ) 1 2 m2 φ T φ. (3.73) 34

37 Here we have introduced the field vector φ = (φ 1, φ 2 ) T. Obviously, the latter Lagrangian is invariant under the orthogonal transformations (O(2) transformations or two-dimensional rotations), φ φ = Rφ, (3.74) with R T = R 1. To calculate the conserved current, we again consider infinitesimal symmetry transformations (i, j = 1, 2) R ij = δ ij + θ ij + O(θ 2 ). (3.75) The orthogonality of the matrix R, δ ij + θ ji = R T ij = R 1 ij = δ ij θ ij, (3.76) tells us that the matrix θ is antisymmetric. The infinitesimal transformation of the field φ 1 under (3.74) is φ 1 φ 1 = R 1i φ i = (δ 1i + θ 1i ) φ i = φ 1 + θ 11 φ 1 + θ 12 φ 2 = φ 1 + θ 12 φ 2, (3.77) which tells us that the variation of φ 1 is An analog calculation gives δφ 1 = θ 12 φ 2. (3.78) δφ 2 = θ 21 φ 1 = θ 12 φ 1. (3.79) Knowing the variations δφ 1,2 of the fields, the conserved current corresponding to (3.74) is readily written down, J µ = L ( µ φ i ) δφ i = θ 12 [ ( µ φ 1 ) φ 2 ( µ φ 2 ) φ 1 ], (3.80) so the conserved charge is Q = d 3 x [ ( 0 φ 1 ) φ2 ( 0 φ 2 ) φ1 ]. (3.81) Substituting in the above expression the physical solutions (3.15) for the fields φ 1,2, and performing the integration over the space coordinates, one obtains (the actual computation is part of an exercise) 13 Q = i d 3 p [ ] a (2π) 3 1,p a 2,p a 2,p a 1,p, (3.82) which is an hermitian operator, i.e., it satisfies Q = Q. There is an ambiguity worth noting, when applying Noethers theorem to find the conserved charge under the transformation (3.74). Obviously, if Q is conserved, then so is every other operator c 1 Q + c 2 with c 1,2 constant numbers. The expression for Q in (3.82) is therefore unique up to a multiplicative and an 13 This expression has not be normal ordered. 35

38 additive constant. The ambiguity on the additive constant is removed when we remove the contribution of the vacuum to the charge of particle states (as we have done for the energy). The normal-ordered charge operator :Q: = Q 0 Q 0, (3.83) is ambiguous only up to a multiplicative factor, which essentially denotes the units in which we measure the charge of a state. Notice that we have already used this ambiguity in (3.81) and simply ignored the factor θ 12. In the following, we will use the normalization (3.83) of Q, dropping as before the : : to avoid unnecessary clutter. So far so good. Next we would like to determine the spectrum of Q. This is most easily done using the technique of ladder operators. We first define the following linear combinations, a ±,p = 1 2 (a 1,p ± ia 2,p ), (3.84) of annihilation operators (an analog definition holds for the hermitian conjugate operators). It is left as a homework problem to show that these new operators satisfy the following commutation relations [Q, a ±,p ] = a ±,p, [Q, a ±,p] = ±a ±,p. (3.85) The latter relations imply that we can obtain states with charge q ± 1 from a state S of charge q, i.e., Q S = q S, by the action of a ±,p, ( ) ( ) Q a ±,p S = (q ± 1) a ±,p S. (3.86) In other words the operators a ±,p are ladder operators with respect to Q. Since a ±,p are linear combinations of a 1,p and a 2,p, which are ladder operators for the Hamiltonian H and the total momentum operator P, so are a ±,p. To find now all the common eigenstates of the charge operator Q, it is sufficient to start from a single common eigenstate and then to act with a ±,p on this state. It is not surprising that the vacuum 0 is also an eigenstate of Q, namely the one with zero charge 14 Q 0 = 0 0 = 0. (3.87) Repeated application of the ladder operators, n S ± n = a ±,p i 0, (3.88) i=1 then creates n-particle states with positive ( S + n ) and negative ( S n ) charge. Consequently, one has ( n ) ( n ) H S ± n = ω i,pi S ± n, P S ± n = p i S ± n, i=1 i=1 (3.89) N S ± n = n S ± n, Q S ± n = ±n S ± n, 14 Notice that the normal ordering (3.83) of Q plays an essential role here. 36

39 The main results of this subsection can be summarized as follows. The mass degeneracy of the Klein-Gordon fields φ 1,2 results in a new O(2) symmetry of the Lagrangian. This gives rise to a new conserved quantity, the charge Q. A particle state is then characterized by its mass (or equivalently its energy), its momentum, and its charge, which can be either positive or negative. States with the same energy and momentum, but opposite charge, can be interpreted as particles and antiparticles. Notice that for a single real Klein-Gordon field there is only a single type of particle, since a real scalar particle is its own antiparticle. 3.5 Complex Klein-Gordon Field We can gain further insight into the theory by rewriting the Lagrangian (3.73) a little bit, where L = ( µ ϕ )( µ ϕ) m 2 ϕ ϕ, (3.90) ϕ = 1 2 (φ 1 + iφ 2 ), (3.91) denotes the complex Klein-Gordon field. We could now compute the Hamiltonian and momentum operators directly in terms of ϕ and ϕ, arriving at the same expressions as in the representation with two real fields (if you don t believe me you are free to check this yourself). In order to compute the charge Q, we then need to identify the internal symmetry of the new Lagrangian. In fact, it is easy to see that (3.90) is invariant under a field phase-redefinition aka a global U(1) transformation, ϕ ϕ = e iα ϕ, ϕ (ϕ ) = e iα ϕ. (3.92) Notice that this transformation is the equivalent of the rotation symmetry transformation (3.74) that we have found earlier, in the real field representation. We verify this by using the explicit form of the matrix R in terms of sine and cosine of the rotation angle α, ( ) ( ) ( ) ( ) ( ) ( ) φ 1 cos α sin α φ 1 φ 1 cos α i sin α φ 1, =, φ 2 sin α cos α φ 2 iφ 2 i sin α cos α iφ 2 (3.93) = φ 1 + iφ 2 e iα (φ 1 + iφ 2 ), = ϕ e iα ϕ. So why should we bother about the complex Klein-Gordon Lagrangian if (3.73) and (3.90) are equivalent? The reason is that the complex field representation is more suggestive to the fact that we have both particle and antiparticle states. To see this we rederive the expression for the charge operator (3.82). The variations of the fields ϕ and ϕ (treated as independent) under (3.92) are δϕ = iαϕ, δϕ = iαϕ. (3.94) Now we can again use the machinery of Noether s theorem to calculate Q. I spare you the details of this computation and simply quote the final result after normal ordering. One finds 15 d 3 p [ ] Q = a (2π) 3 +,p a +,p a,p a,p = N + + N, (3.95) 15 You can obtain this expression by simply reexpressing (3.82) in terms of a ±,p and a ±,p using the inverse of (3.84) and its hermitian conjugate analog. 37

40 where in the last step we have introduced the number operators d 3 p N ± = (2π) 3 a ±,p a ±,p. (3.96) The expression (3.95) implies that Q counts the number of antiparticles (created by a +,p) minus the number of particles (created by a,p). Since [H, Q] = 0, this difference is a conserved quantity in our quantum theory. Of course, in a free field theory this isn t such a big deal because both N + and N, i.e., the numbers of positively and negatively charged states, are separately conserved. However, we will see soon that in interacting theories Q survives as a conserved quantity, while N ± individually do not. 3.6 Heisenberg Picture Although we started with a Lorentz-invariant Lagrangian, we slowly butchered it as we quantized the theory, introducing a preferred time coordinate t. Its not at all obvious that the theory is still Lorentz invariant after quantization. E.g., the various field operators φ(x) we met depend on space, but not on time. Yet, the one-particle states obey the Schrödinger s equation, i d p(t) = H p(t), (3.97) dt which means that they evolve in time according to p(t) = e iept p. (3.98) Things start to look better in the Heisenberg picture where the time dependence is assigned to the operators O, O H = e iht O S e iht, (3.99) so that do H dt = ( ) ( ) d dt eiht O S e iht + e iht O S ddt e iht = ih e iht O S e iht + e iht O S e iht ( ih) = i [H, O H ]. (3.100) Here the subscripts S and H tell us whether the operator is in the Schrödinger or Heisenberg picture. In QFT, we drop these subscripts and we will denote the picture by specifying whether the fields depend on space φ(x) (the Schrödinger picture) or space-time φ(t, x) = φ(x) (the Heisenberg picture). The operators in the two pictures agree at a fixed time, say, t = 0. The commutation relations (3.2) become equal-time commutation relations in the Heisenberg picture. In the case of the real Klein-Gordon theory (2.45), [φ(t, x), φ(t, y)] = [π(t, x), π(t, y)] = 0, [φ(t, x), π(t, y)] = iδ (3) (x y). (3.101) 38

41 Now that our operators depend on time, we can study how they evolve when the clock starts ticking. For the field operator φ, we have φ(x) = i [H, φ(x)] = i [ { d 3 y π 2 (y) + ( φ(y) ) } ] 2 + m 2 φ 2 (y), φ(x) 2 = i d 3 y π(y) ( i) δ (3) (x y) = π(x). Similarly, we get for the conjugate operator π, π(x) = i [H, π(x)] = i 2 = i 2 [ d 3 y { π 2 (y) + ( φ(y) ) } ] 2 + m 2 φ 2 (y), π(x) (3.102) { ( y d 3 y [φ(y), π(x)] ) φ(y) + ( φ(y) ) } y [φ(y), π(x)] + 2im 2 φ(y) δ (3) (x y) = ( 2 m 2) φ(x), (3.103) where we have included the subscript y on y when there may be some confusion about which argument the derivative is acting on. To reach the last line, we have simply integrated by parts. Putting (3.102) and (3.103), we then find that φ satisfies (as one could have guessed) the Klein-Gordon equation (2.46). Things start to look more relativistic. We can also write the Fourier expansion (3.15) of the field φ by using the definition of Heisenberg operators (3.99). We first note that (a p ) H = e iht a p e iht = ( [e iht, a p ] + a p e iht) e iht = ( e iept a p e iht a p e iht + a p e iht) e iht = e iept a p, (3.104) where have applied repeatedly H n a p = a p (H E p ) n, (3.105) which holds for any n and follows from the commutation relations (3.43), after expanding the exponential in a power series (this step is actually not shown). A similar relation (with replaced by + ) holds for a p. In the case of a p, we hence have ( a p ) H = eiht a p e iht = e iept a p. (3.106) Using (3.104) and (3.106) then gives, d 3 p 1 ( φ(x) = ap e ipx + a (2π) 3 p e ipx), (3.107) 2Ep which looks pretty much like (3.15) except that the exponentials are now written in terms of 4-vectors, px = E p t p x. Note also that the sign has flipped in the exponent due to 39

42 the Minkowski metric. It s a simple exercise to check that (3.107) indeed satisfies the Klein- Gordon equation (2.45), and is therefore left as a homework. For completeness let me also give the result for the conjugate field π in the Heisenberg picture. One finds, π(x) = d 3 p (2π) 3 ( i) Ep 2 [a p e ipx a p e ipx ], (3.108) as you might have guessed immediately from looking at (3.15) and (3.107). The equation (3.107) makes explicit the dual particle and wave interpretations of the quantum field φ. On the one hand, φ is written as an operator, which creates and destroys the particles that are the quanta of field excitation. On the other hand, φ is written as a linear combination of solutions (the exponentials) of the Klein-Gordon equation. Both signs of the time dependence, i.e., ±ip 0 t with p 0 > 0, appear in the exponential. If these were single-particle wavefunctions, they would correspond to states of positive and negative energy. Let us refer to them more generally as positive- and negative-frequency modes. The connection between the particle-creation operators and the waveforms displayed here is always valid for free quantum fields. A positive-frequency solution of the field equation has as its coefficient the operator that destroys a particle in that single-particle wavefunction, while a negativefrequency solution of the field equation (being the hermitian conjugate of a positive-frequency solution) has as its coefficient the operator that creates a particle in that positive-energy single-particle wavefunction. In this way, the fact that relativistic wave equations have both positive- and negative-frequency solutions is reconciled with the requirement that a sensible quantum theory should contain only positive excitation energies. Causality It looks like we are approaching something Lorentz invariant in the Heisenberg picture, where the field operator φ satisfies the Klein-Gordon equation. Yet, there is still a hint of non- Lorentz invariance because φ and π satisfy the equal-time commutation relations (3.101). The question that we thus have to address is, what happens for arbitrary space-time separations? In particular, for our theory to be causal, we must require that all space-like separated operators commute, [O 1 (x), O 2 (y)] = 0, (x y) 2 < 0. (3.109) This ensures that a measurement at x cannot affect a measurement at y, when x and y are not causally connected (outside the light-cone). A graphical representation of the latter equation is given in Figure 3.1. Does our theory satisfy the requirement (3.109)? To answer this question, we first define (x y) = [φ(x), φ(y)]. (3.110) While the objects of the right-hand side are operators, it is seen (after a short calculation) 40

43 t O 2 (y) x O 1 (x) Figure 3.1: Picture of space-like separated operators O 1 (x) and O 2 (y). that the left-hand side is simply a complex number, [ d 3 p 1 ( (x y) = ap e ipx + a (2π) 3 p e ipx), 2Ep d 3 p d 3 q 1 ( ) = (2π) 6 2 [a p, a q] e ipx+iq y + [a p, a q ] e ipx iq y E p E q d 3 p d 3 q 1 = (2π) 6 2 (2π) 3 δ (3) (p q) [ e ipx+iq y e E p E q d 3 p 1 [ = e ip (x y) e ip (x y)]. (2π) 3 2E p d 3 q 1 ( ] aq e iq y + a iq (2π) 3 q e y) 2Eq ipx iq y] (3.111) So what do we know about (x y)? First of all, it is Lorentz invariant thanks to the Lorentz-invariant measure d 3 p/(2e p ) that we have introduced in (3.61). Second, it does not vanish for time-like separations. E.g., taking x = (t, 0, 0, 0) and y = (0, 0, 0, 0) gives [φ(x), φ(y)] exp ( imt) exp (imt), where the exp ( imt) term arises from d 3 p 1 e iep t = 4π p 2 dp (2π) 3 2E p (2π) p 2 + m 2 e it p 2 +m 2 = 1 4π 2 = m 8πt m de E 2 m 2 e iet [ Y1 (mt) + ij 1 (mt) ] t e imt, (3.112) where to arrive at the second line, we have simply changed variables p (E 2 m 2 ) 1/2. In 41

44 order to obtain the final answer, one only needs to know that the Bessel functions of first and second kind, J 1 (x) and Y 1 (x), behave like J 1 (x)/x sin x cos x and Y 1 (x)/x sin x+cos x in the relevant limit x. An analog calculation gives the exp (imt) term. Third, it vanishes for space-like separations. This follows by realizing that (x y) = 0 at equal times for all (x y) 2 = (x y) 2 < 0, which can be seen explicitly by writing [φ(t, x), φ(t, y)] = d 3 p (2π) p 2 + m 2 [ e ip (x y) e ip (x y)] = 0. (3.113) Notice that in order to arrive at the final result, we have flipped the sign of p in the second exponent. This obviously does not change the result since p is an integration variable and (p 2 + m 2 ) 1/2 is invariant under such a change. But since (x y) is Lorentz invariant, it can only be a function of (x y) 2 and must hence vanish for all (x y) 2 < 0. Taken together the above findings imply that the real Klein-Gordon theory is indeed causal with commutators vanishing outside the light-cone. This property will continue to hold in the interacting theory. Indeed, it is usually given as one of the axioms of local QFTs. Let me mention, however, that the fact that [φ(x), φ(y)] is a complex function, rather than an operator, is a property of free fields only and does not hold in an interacting theory. 3.7 Klein-Gordon Correlators The causal structure of the real Klein-Gordon theory (2.45) can also be probed in a different way. Let s create a particle at the space-time point y. What is the amplitude to find it at point x? This question can be answered by calculating d 3 p d 3 q 1 D(x y) = 0 φ(x)φ(y) 0 = (2π) a p a q 0 e ipx+iq y E p E q d 3 p d 3 q 1 = (2π) [ap, a q] 0 e ipx+iq y E p E q (3.114) d 3 p d 3 q 1 = (2π) 6 2 (2π) 3 δ (3) (p q) e ipx+iq y E p E q d 3 p 1 = e ip(x y). (2π) 3 2E p The function D(x y) is called propagator and is a Lorentz-invariant 3-momentum integral. Let us now evaluate (3.113) for purely space-like separations, i.e., x y = (0, r). 16 The propagator is then d 3 p 1 D(x y) = e ip r = 2π dp p2 e ipr e ipr (2π) 3 2E p (2π) 3 0 2E p ipr = i (3.115) p e ipr dp 2(2π) 2 r p2 + m Notice that for purely time-like separations one would obtain the result (3.112). 42

45 Im p +im im Re p Figure 3.2: Branch cuts of the propagator D(x y) for a space-like transition. Here we have first introduced spherical coordinates, then performed the integration over the azimuthal and polar angles, and finally changed variables in the second term from p p in order to combine the result into one term. The integrand in (3.115), considered as a complex function of p, has branch cuts on the imaginary axis starting at ±im. In order to evaluate the integral we push the contour up to wrap around the upper branch cut. The chosen integration contour is shown in Figure 3.2. Defining ρ = ip, we then recast (3.115) into D(x y) = 1 4π 2 r m dρ ρe ρr ρ2 m = 1 2 4π 2 r mk 1(mr) r e mr, (3.116) where the modified Bessel function K 1 (x) scales like K 1 (x) = ( π/(2x)+o(x 3/2 ) ) e x in the limit of x. The latter equation tells us that the propagator (x y) decays exponentially quickly outside the light-cone but, nonetheless, it is non-vanishing. The quantum field appears to leak out of the causal region. Yet, we have just seen in (3.113) that space-like measurements commute and the theory is causal. How do we reconcile these two facts? We get a first clue of how this puzzle is resolved by realizing that the relation (3.113), expressed in terms of propagators, takes the form (x y) = [φ(x), φ(y)] = D(x y) D(y x) = 0. (3.117) What is the physical meaning of this result? It simply means that for (x y) 2 < 0, there is no Lorentz-invariant way to order events. If a particle can travel in a space-like direction from x to y, it can just as easily travel from y to x. 17 In any measurement, the amplitudes for these two possible events cancel, so that the underlying QFT is causal. 17 When x y is space-like, a continuous Lorentz transformation can take x y to y x. 43

46 Another way to think about the cancellation of the two contributions in (3.117) is in terms of amplitudes of particles and antiparticles. Let us first consider the case of a complex scalar field. If we look at the equation [ϕ(x), ϕ (y)] = 0 outside the light-cone, the physical interpretation of (3.117) (or better its analog) is that the amplitude for the particle to propagate from x to y cancels the amplitude for the antiparticle to travel from y and x. In fact, this interpretation also applies (maybe in a less obvious way) to the case of the real scalar field, because the particle is then its own antiparticle. Green s functions In fact, the statements made after (3.117) can be put on mathematical solid grounds. Let s see how this goes. We start by considering the amplitude d 3 p 1 [ 0 [φ(x), φ(y)] 0 = e ip(x y) e ip(x y)], (3.118) (2π) 3 2E p and assume for now that x 0 > y 0. In this case we can rewrite the 3-momentum integral on the right-hand side of (3.117) as a 4-momentum integral, { d 3 p 1 0 [φ(x), φ(y)] 0 = e ip(x y) p + 1 } e ip(x y) p (2π) 3 2E p 0 =E p 2E p 0 = E p d 3 p dp 0 1 = x 0 >y 0 (2π) 3 2πi p 2 m 2 e ip(x y) (3.119) d 4 p i = (2π) 4 p 2 m 2 e ip(x y). Notice that this is the first time in this course that we have integrated over 4-momentum. Until now, we integrated only over 3-momentum, with p 0 fixed by the mass-shell condition to be p 0 = E p. Barring possible typos, the calculation in (3.119) is certainly correct, but this fact might not be obvious to everybody in the audience right away. So let me do some reverse engineering. First notice that the denominator in the last line of (3.119) can be written as p 2 m 2 = (p 0 ) 2 p 2 m 2 = (p 0 ) 2 E 2 p = (p 0 E p )(p 0 + E p ), (3.120) which implies that, for each value of p, the denominator produces a pole in the integrand at p 0 = ±E p = ± (p 2 + m 2 ) 1/2. The 4-momentum integration is hence ill-defined and we need a prescription for avoiding the singularities on the real p 0 -axis. How do we have to choose the integration contour in order to arrive at (3.119)? It is not difficult to see that in the case x 0 > y 0 the contour has to be chosen as shown in Figure 3.3. Notice that closing the contour in the lower half-plane, where p 0 i, ensures that the integrand vanishes since exp ( ip 0 (x 0 y 0 )) 0. The integral over p 0 then picks up the residues at p 0 = ±E p which are 2πi/(p 0 ± E p ) p 0 =±E p = 2πi/(2E p ), where the relative minus sign arises because we take a clockwise contour. Combining these elements shows that the calculation that led to the final result in (3.119) is in fact correct. 44

47 Im p 0 Re p 0 E p +E p p 0 i Figure 3.3: Integration contour for the retarded Green s function D R (x y). In the following, we will call the last line of (3.119) together with the prescription for going around the pole retarded Green s function, D R (x y) = θ(x 0 y 0 ) 0 [φ(x), φ(y)] 0 = θ(x 0 y 0 ) [ D(x y) D(y x) ], (3.121) where the Heaviside step function θ(x) is defined as θ(x) = 0 for x < 0 and θ(x) = 1 for x > 0. It seldom matters what value is used for θ(0), since θ(x) is mostly used as a distribution (in the half-maximum convention one has θ(0) = 1/2). The name retarded Green s function is in fact the correct one for D R (x y), since this mathematical object obeys 18 ( 2 + m 2) D R (x y) = ( 2 θ(x 0 y 0 ) ) 0 [φ(x), φ(y)] ( µ θ(x 0 y 0 ) ) ( µ 0 [φ(x), φ(y)] 0 ) + θ(x 0 y 0 ) ( 2 + m 2) 0 [φ(x), φ(y)] 0 = δ(x 0 y 0 ) 0 [π(x), φ(y)] 0 + 2δ(x 0 y 0 ) 0 [π(x), φ(y)] (3.122) = iδ (4) (x y), and vanishes for x 0 < y 0 by definition. Here all derivatives are understood with respect to x. In order to obtain the second line we have used the two relations x θ(x) = δ(x) and ( 2 x θ(x) ) f(x) = δ(x) ( x f(x) ), the latter of which is shown easily by partial integration, and paid tribute to the fact that φ(x) obeys the Klein-Gordon equation. The last line then follows by employing the second equal-time commutation relation in (3.101). The retarded Green s function is useful in classical field theory if we know the initial value of some field configuration and want to figure out what it evolves into in the presence of 18 Notice that the same result is obtained by applying the differential operator ( 2 + m 2 ) directly to the expression in the last line of (3.119). 45

48 Im p 0 p 0 +i E p +E p Re p 0 Figure 3.4: Integration contour for the advanced Green s function D A (x y). a source, meaning that we want to know the solution to the inhomogeneous Klein-Gordon equation, ( 2 + m 2 )φ(x) = J(x) for some fixed background function J(x), acting as a static source. Similarly, one can define the advanced Green s function D A (x y) which vanishes when x 0 > y 0, which is useful if we know the end point of a field configuration and want to figure out where it came from. The integration contour corresponding to the advanced Green s function is shown in Figure 3.4. You will get more familiar with the advanced Green s function in an exercise. Feynman Propagator In fact, the most important quantity in interacting field theory is neither the retarded nor the advanced Green s function but the Feynman propagator, D F (x y) = 0 T φ(x)φ(y) 0 = θ(x 0 y 0 )D(x y) + θ(y 0 x 0 )D(y x), (3.123) where T stand for time ordering, i.e., placing all operators evaluated at later times to the left so that e.g., T φ(x)φ(y) = θ(x 0 y 0 )φ(x)φ(y) + θ(y 0 x 0 )φ(y)φ(x). (3.124) Given the similarity of (3.118) and (3.123), it is does not come as a surprise, that the Feynman propagator can be written as, d 4 p i D F (x y) = (2π) 4 p 2 m 2 e ip(x y). (3.125) Again we distinguish the cases x 0 > y 0 and y 0 > x 0. In the former case, we perform the p 0 integration following the contour shown in Figure 3.5, which encloses the pole at p 0 = +E p with residuum 2πi/(2E p ), where the minus sign arises again since the path has a clockwise 46

49 Im p 0 E p Re p 0 +E p p 0 i Figure 3.5: Integration contour for the Feynman propagator D F (x y) for x 0 > y 0. In the case y 0 > x 0, the integration contour is closed in the upper-half plane. orientation. Consequently, one obtains d 3 p 2πi D F (x y) = ie iep (x0 y0 )+ip (x y) (2π) 4 2E p d 3 p 1 = e ip(x y) = D(x y). (2π) 3 2E p (3.126) In contrast, in the case y 0 > x 0 one finds D F (x y) = = = d 3 p (2π) 4 2πi ( 2E p ) ieiep (x0 y0 )+ip (x y) d 3 p (2π) 3 1 2E p e iep (y0 x0 ) ip (y x) d 3 p 1 e ip(y x) = D(y x). (2π) 3 2E p (3.127) where the integration is chosen as in Figure 3.5, but the path is closed in the upper-half plane (due to the counter-clockwise orientation of the half-circle the residuum does not pick up a minus sign). To go from the second line in (3.127) to the third, we have flipped the sign of p which is valid since we integrate over d 3 p and all other quantities depend only on p 2. Taken together the latter two relations prove the equality of (3.123) and (3.125). Like D R (x y) and D A (x y), also the Feynman propagator is a Green s function of the Klein-Gordon equation, ( 2 + m 2) d 4 p D F (x y) = (2π) 4 = i i p 2 m 2 ( p2 + m 2 ) e ip(x y) d 4 p (2π) 4 e ip(x y) = iδ (4) (x y). (3.128) 47

50 +iɛ Im p 0 +E p Re p 0 E p iɛ p 0 i Figure 3.6: Schematic picture of the iɛ prescription for x 0 > y 0. In the case y 0 > x 0, the integration contour is closed in the upper-half plane. Notice that instead of specifying the contour, we may instead write the Feynman propagator as follows d 4 p i D F (x y) = (2π) 4 p 2 m 2 + iɛ e ip(x y), (3.129) with ɛ > 0 and infinitesimal. As shown in Figure 3.6, this has the effect of shifting the poles slightly off the real p 0 -axis, so that the integration along this axis is equivalent to the integration contour displayed in Figure 3.5. This way of writing D F (x y) is, for obvious reasons, called the iɛ prescription. 3.8 Non-Relativistic Limit In order to study the non-relativistic limit of our theory, we return to the classical complex Klein-Gordon field ϕ (for reasons that will become clear later on). We decompose it as 19 ϕ(x) = e imt ϕ(x), (3.130) to single out the large kinematical part of the momentum of ϕ. In terms of the new field ϕ, the Klein-Gordon equation reads ( 2 + m 2) ϕ = ( 2 t 2 + m 2) e imt ϕ = e imt [ ϕ 2im ϕ 2 ϕ] = 0, (3.131) where the explicit m 2 term cancelled against the time derivatives. The non-relativistic limit is m p, which after a Fourier transform is equivalent to saying that ϕ m ϕ. We are 19 The exponential factor removes the large frequency part from the x-dependence in ϕ. Consequently, the x- dependence of ϕ is only governed by the small residual momentum and derivatives acting on ϕ are suppressed by powers of 1/m. This way of decomposing a field is often the starting point for the construction of an effective field theory that entails the physics of the full theory in the kinematical limit m p. The most well-known example of such a theory in particle physics is heavy quark effective theory. 48

51 hence allowed to neglect the ϕ term in (3.131), so that the Klein-Gordon equation in the limit m becomes i d dt ϕ = 1 2m 2 ϕ. (3.132) This looks very similar to the Schrödinger equation for a non-relativistic free particle of mass m. Except it does not have any probability interpretation. It is simply a classical field evolving through an equation that s first order in time derivatives. It is also worthwhile to consider the Lagrangian of the complex scalar field ϕ itself and to investigate what happens to (3.90) in the non-relativistic limit. We again take the limit ϕ m ϕ, and obtain after a straightforward calculation (where in the last step we have divided by 2m), L = i ϕ 1 ϕ 2m ( ϕ ) ( ϕ). (3.133) This Lagrangian has a conserved current related to its invariance under the global phase transformation ϕ e iα ϕ. Employing Noether s theorem (2.17), we find that the conserved current takes the form ( ) J µ = ϕ i ϕ, 2m [ ϕ ϕ ϕ ϕ ]. (3.134) To get the Hamiltonian we compute the conjugate momentum π = L ϕ = i ϕ, (3.135) which does not contain a time derivative. This looks a little disconcerting, but its fully consistent for a theory which is first order in time derivatives. In order to determine the full trajectory of the field, we only need to specify initial conditions for ϕ and ϕ at some point in time, say t = 0 (knowing the time derivatives on the initial slice is not necessary). Since the Lagrangian (3.133) already contains a term p q (and not the usual 1/2 p q ), the time derivatives drop out when one computes the Hamiltonian, H = 1 2m ( ϕ ) ( ϕ). (3.136) In order to quantize the system, we impose in the Schrödinger picture, [ ϕ(x), ϕ(y)] = [ ϕ (x), ϕ (y)] = 0, [ ϕ(x), ϕ (y)] = δ (3) (x y), (3.137) and expand the field into its Fourier components, d 3 p ϕ(x) = (2π) a 3 p e ip x. (3.138) Inserting this into the commutation relations (3.137), leads to [a p, a q ] = (2π) 3 δ (3) (p q). (3.139) 49

52 where the trivial expressions have been skipped. As usual the vacuum satisfies a p 0 = 0, and the excitations are a p 1... a p n 0. The one-particle states p = a p 0, have energy H p = p2 p, (3.140) 2m which is the non-relativistic dispersion relation. From the above, we conclude that quantizing the first order Lagrangian (3.133) gives rise to non-relativistic particles of mass m. Some comments seem to be in order. Notice that we have a complex field but only a single type of particle. The antiparticle is not in the spectrum. The existence of antiparticles is a consequence of relativity. A related fact is that the conserved charge Q = d 3 x : ϕ ϕ : is the particle number. This remains conserved even if we include interactions in the Lagrangian of the form ( ϕ ϕ) 2 etc., which are invariant under a global phase rotation. So in non-relativistic theories, particle number is conserved. It is only with relativity, and the appearance of antiparticles, that particle number can change. Finally, there is no non-relativistic limit of a real scalar field. In the relativistic theory, the particles are their own antiparticles, and there is no way to construct a multiparticle theory that conserves particle number. Recovering QM In QM, we talk about the position and momentum operators X and P. On the other hand, as we saw below (2.1), in QFT position is relegated to a label. How do we get back to good old QM? We already have the operator for the total momentum of the field, namely (3.46). When acting on a single-particle state, it gives P p = p p. It is also not too difficult to write down the position operator X. Let s do it in the non-relativistic limit. In this case the operator d ϕ 3 p (x) = (2π) 3 a p e ip x, (3.141) creates a particle localized with a delta function at x. We hence write x = ϕ (x) 0. It is now natural to define the position operator X as X = d 3 x x ϕ (x) ϕ(x), (3.142) since it has the sought property, X x = d 3 y y ϕ (y) ϕ(y) ϕ (x) 0 = d 3 y y ϕ (y) [ δ (3) (y x) ϕ (x) ϕ(y) ] 0 = x x. (3.143) We can now construct a state χ by taking a superposition of the one-particle states x, χ = d 3 x χ(x) x. (3.144) 50

53 Notice that the weight function χ(x) is what we would usually call the Schrödinger wavefunction (in the position representation). Let s make sure that it indeed has the right properties. First, it is clear that for what concerns X it behaves correctly, namely X χ = d 3 x x χ(x) x. (3.145) What about the momentum operator P? A straightforward calculation gives, d 3 x d 3 p d P χ = p a (2π) pa 3 p χ(x) ϕ 3 x d 3 p (x) 0 = p a (2π) 3 p e ip x χ(x) 0 d 3 x d 3 p ( = ) (2π) 3 a p i e ip x d 3 x d 3 p χ(x) 0 = e ip x ( i χ(x)) a (2π) 3 p 0 = d 3 x ( i χ(x) ) x. (3.146) This tells us that P acts as the familiar derivative on wave functions χ(x). To obtain the final result in (3.146), we have used in a first step the relationship [a p, ϕ (x)] = e ip x which can be easily checked. We learn that when acting on one-particle states, the operators X and P act as position and momentum operators in QM, with [X i, P j ] χ = i δ ij χ and i, j = 1, 2, 3. But what about dynamics? In particular, how does our wavefunction χ(x) change with time? To address this question, we first express the Hamiltonian corresponding to the density (3.136) through ladder operators, which implies that H = d 3 x 1 2m ( ϕ ) ( ϕ) = d 3 p (2π) 3 p 2 2m a pa p, (3.147) i d dt χ = 1 2m 2 χ, (3.148) which formally looks exactly like the time evolution of the original field ϕ given in (3.132). Yet this time, it is really the Schrödinger equation, complete with the usual probabilistic interpretation for the wavefunction χ (and not just a first-order differential equation). Note in particular, that the conserved charge arising from the current (3.134) is Q = d 3 x χ(x) 2 which is the total probability. Historically, the fact that the equation for the classical field (3.132) and the one-particle wavefunction (3.148) coincide caused some confusion. It was thought, that perhaps one is quantizing the wavefunction itself and the resulting name second quantization is still sometimes used today meaning QFT. However, it is important to stress that, despite the name, nothing is quantized twice. One simply quantizes a classical field once. Nonetheless, it is good to know that, if one treats the one-particle Schrödinger equation as a quantum field, then it will give the correct generalization to multiparticle states. 51

54 3.9 Problems i) Consider the Klein-Gordon equation with the mass term set equal to zero and a dilatation transformation with parameter α, x µ x µ = e α x µ, φ(x) φ (x ) = φ(x)e d φ α. (3.149) Show that this transformation is a global symmetry of L if one chooses the scaling dimension d φ in an appropriate way. Compute the associated Noether current and verify that it is conserved. Is the symmetry preserved if you add a quartic λ/(4!) φ 4 to the Lagrangian? What happens if you add a mass term m 2 φ 2? ii) Consider the Lagrangian L = 1 2 ( µφ µ Φ M 2 Φ 2 ) ( µϕ µ ϕ m 2 ϕ) λ 2 Φϕ2, (3.150) with m M and derive the EOMs for the fields Φ and ϕ. Express the heavy field Φ through the light field ϕ and insert it back into the Lagrangian. Expand your result in 1/M 2. What has changed compared to the original Lagrangian? Up to which energy scale would you trust the predictions of this effective Lagrangian? iii) Show that the first relation in (3.2) is satisfied if [a p, a q ] = [a p, a q] = 0 holds. Prove the commutation relations (3.43). Calculate J i p = 0 with J i defined in (3.48). Show that the number operator N defined in (3.53) commutes with the Hamiltonian H of (3.27) and satisfies N p 1,..., p n = n p 1,..., p n, where p 1,..., p n denotes the n-particle state introduced in (3.50). iv) Consider a real scalar φ(t, x) field living on a two-dimensional space-time and defined on an interval x [0, L] with Dirichlet BCs φ(t, 0) = φ(t, L) = 0. Show that the (classical) positive- and negative-frequency solutions to the Klein-Gordon equation that also satisfy the BCs have the form φ (±) n (t, x) = 1 ωn L e±iωnt sin(k n x). (3.151) Give the expression for k n in terms of L. How is ω n related to k n? We now quantize the field φ(t, x), keeping in mind that momentum here is discretized, i.e., φ(t, x) = n=1 ( φ ( ) n (t, x) a n + φ (+) n (t, x) a n), (3.152) with the ladder operators satisfying [a n, a m ] = [a n, a m] = 0 and [a n, a m] = δ mn. Compute the VEV 0 H 0 of the Hamiltonian density H = 1 [ ] φ2 + ( x φ) 2 + m 2 φ 2. (3.153) 2 52

55 Integrating your result over the interval [0, L] and show that the total vacuum energy is E 0 (L) = 1 2 ω n. (3.154) Since this quantity is infinite, we need some form of regularization in order to handle the divergence. Let us introduce an exponentially damping function exp( δω n ) with δ > 0 in the sum, and consider for simplicity the case of a massless field. Prove that in this case the vacuum energy can be written as n=1 E 0 (L, δ) = π 8L sinh 2 ( ) δπ, (3.155) 2L Take the limit δ 0 and determine the vacuum energy for the case when no BCs are imposed. With all this at hand calculate the Casimir force. v) Derive the charge operator Q for the U(1) invariant Lagrangian (3.90) using infinitesimal transformations. Show that the result expressed through creation and annihilation operators takes the form of (3.95). Prove that Q satisfies (3.85). Verify that the charge is conserved (via [H, Q] = 0). Are the operators N ± as defined in (3.96) conserved as well? What happens if you add an interaction term L = λ 4! (ϕ ϕ) 2, (3.156) to the Lagrangian? What does this result imply for the case in which particles are their own antiparticles? vi) Consider a theory with two complex scalar fields ϕ 1 and ϕ 2. Write down all possible terms of the Lagrangian which are Lorentz invariant and renormalizable, i.e., have mass dimension of four and couplings with non-negative mass dimensions. Which terms survive, if there is an additional discrete symmetry and under which the Lagrangian remains invariant? (ϕ 1, ϕ 2 ) ( ϕ 1, ϕ 2 ), (3.157) (ϕ 1, ϕ 2 ) (ϕ 1, ϕ 2 ), (3.158) Assume further, that both fields have the same mass, m 1 = m 2, and that all dimensionless couplings are identical. You can now rewrite the Lagrangian in a more economic way if you introduce the scalar doublet ( ) Φ = ϕ 1 ϕ 2, (3.159) 53

56 and its hermitian conjugate. The theory at hand has four conserved global charges. One charge follows from the U(1) invariance, Φ e iα Φ, δφ = iαφ, (3.160) of the theory, that is already present in the case of a single complex scalar. The other three charges correspond to the mixing of the scalar fields under an SU(2) transformation, Φ e iα jτ j Φ, δφ = iα j τ j Φ, (3.161) where the indices j = 1, 2, 3 are summed over and τ j = σ j /2 with σ j being the usual Pauli matrices. Compute the four conserved charges using Noether s theorem. For the SU(2) charges you should find Q j = d 3 x i ( ) ϕ a (τ j ) ab πb π a (τ j ) ab ϕ b, (3.162) where a, b = 1, 2 are field labels. Show further, that the latter charges fulfill the SU(2) commutation relation, [Q j, Q k ] = i ɛ jkl Q l. (3.163) What symmetries survive if you allow for different masses and dimensionless couplings? vii) Consider the Lagrangian for a free complex scalar field (3.90) which is invariant under global U(1) transformations (3.92). Is this Lagrangian invariant, if the global gets promoted to a local symmetry, i.e., α eα(x), where e is just a universal constant and α(x) a function of space-time? If you now add a vector field A µ to the Lagrangian with a coupling L Aϕ = iλ [ ϕ ( µ ϕ) ϕ( µ ϕ ) ] A µ + λ 2 (A µ ϕ )(A µ ϕ), (3.164) how do the vector field and the coupling constant λ have to transform under the local U(1), if the Lagrangian L + L Aϕ should remain invariant under phase redefinitions? Compute the Noether current for this local symmetry. Add a kinetic term for the vector field to the Lagrangian, L A = 1 4 F µνf µν, (3.165) and derive the EOMs for the field A µ considering the full Lagrangian L ϕ + L Aϕ + L A. viii) Compute the advanced Green s function D A (x y) for the Klein-Gordon equation using the integration contour in Figure 3.4. Recall which initial conditions one assumes in electrodynamics and why they lead to the use of the retarded Green s function. Can you imagine physical BCs in which the advanced propagator would be the right choice? Prove and explain in this context the following two relations D R ( x) = D A (x), D F ( x) = D F (x). (3.166) 54

57 ix) Explicitly perform the steps that lead to the Lagrangian (3.133). Show that the corresponding EOM (vary with respect to ϕ ) is the Schrödinger equation. The Lagrangian has a global U(1) symmetry, ϕ e iα ϕ. Verify the correctness of the expression for the Noether current (3.134) and discuss the physical meaning of the conserved charge. Based on your findings, give a reason why there is no non-relativistic limit for a real scalar field? x) Prove that the position operator X given in (3.142) satisfies X x = x x. Furthermore, show that (3.144), (3.146), and (3.148) are correct. References [1] S. R. Coleman, Physics 253: Quantum Field Theory, Course given at Harvard University, 1975 and 1976, [2] N. Straumann, The history of the cosmological constant problem, arxiv:grqc/ [3] S. M. Carroll, The Cosmological Constant, Living Rev. Relativity 3, 1 (2001), [4] H. B. G. Casimir and D. Polder, The Influence of retardation on the London-van der Waals forces, Phys. Rev. 73, 360 (1948). [5] K. A. Milton, The Casimir effect: Recent controversies and progress, J. Phys. A 37, R209 (2004) [arxiv:hep-th/ ]. 55

58 4 Interacting Fields Often in QM, we are interested in particles moving in some fixed background potential V (x). This can be easily incorporated into field theory by working with a Lagrangian with explicit x dependence. E.g., in the case of our non-relativistic complex scalar field ϕ discussed in Section 3.8, we could simply add a term L = V (x) ϕ (x) ϕ(x), (4.1) to the Lagrangian (3.133). Since this interaction does not respect translational symmetry, we won t have the associated energy-momentum tensor. While such Lagrangians are useful in condensed matter physics, we rarely (or never) come across them in high-energy physics, where all equations obey translational (and Lorentz) invariance. One can of course also consider interactions between particles. Obviously, these are only important for n particle states with n 2. We therefore expect them to arise from additions to the Lagrangian (3.133) of the form L = ϕ (x) ϕ (x) ϕ(x) ϕ(x), (4.2) which, in QFT, is an operator which destroys two particles before creating two new ones. Such terms in the Lagrangian will indeed lead to inter-particle forces, both in the non-relativistic and relativistic setting. In the following, we will explore these types of interactions in detail for relativistic theories. 4.1 Classification of Interactions The free QFTs we have discussed so far are special. We can determine their spectrum, but they are dull since nothing happens as their name suggests. They have particle excitations, but these do not interact. To make things more interesting (i.e., more complicated) let us include interactions in our theory. These will take the form of higher-order terms in the Lagrangian. We start by asking what kind of small perturbations we can add to the theory. E.g., let us consider the Lagrangian for a real scalar field (2.45) and add the infinite tower of additional terms L = L n, n=3 L n = λ n n! φn, (4.3) to it. Here the coefficients λ n are called coupling constants. The first question that we have to address, is which restrictions the coupling constants have to satisfy in order for the additional terms to be small perturbations. Naively one would think that one simply has to require that λ n 1. But this turns out to be not quite right. In order to see why the naive guess is not correct, we perform a dimensional analysis. Applying the rules gathered in Section 2.1, we find that the dimensions of the coupling constants are [λ n ] = 4 n. (4.4) 56

59 This result makes clear why we cannot simply say λ n 1, because this statement is only sensible for dimensionless quantities, but not dimensionful ones. The interaction terms in (4.3) fall into three different categories. First, dimension-three operators with [λ 3 ] = 1. For such terms, we can define a dimensionless parameter λ 3 /E, where E has dimension of mass and represents the energy scale of the process of interest. This means that L 3 = λ 3 φ 3 /(3!) is a small perturbation for high energies, i.e., E λ 3, but a big one at low energies, i.e., E λ 3. Such terms are called relevant, because they become and are most relevant at low energies which, after all, is where most of the physics that we experience lies. In a relativistic QFT, we have E > m, which means that we can always make this sort of perturbations small by taking λ 3 m. Second, terms of dimension four with [λ 4 ] = 0. E.g., L 4 = λ 4 φ 4 /(4!). Such terms are small if λ 4 1 and are called marginal. Third, operators with dimension of higher than four, having [λ n ] < 0. In this case the appropriate dimensionless parameters is (λ n E n 4 ) and terms L n = λ n φ n /(n!) with n 5 are small (large) at low (high) energies. Such contributions are called irrelevant, since in daily life, meaning E n 4 λ n, these operators do not matter. As we will see later, it is typically impossible to avoid high-energy processes in QFT. We have already seen a glimpse of this feature when we were discussing the structure of the vacuum in Section 3.2, which involved the calculation of an integral over infinitely large frequencies of a harmonic oscillator. We hence might expect problems with irrelevant operators that become important at high energies. Indeed, these operators lead to non-renormalizable QFTs in which one cannot make sense of the infinities at arbitrarily high energies. This does not mean that these theories are useless, it just means that they become incomplete at some energy scale and need to be embedded into an appropriate complete theory aka an UV completion. Let me also add that the above naive assignment of relevant, marginal, and irrelevant operators is not always carved in stone, since quantum corrections can sometimes change the character of an operator. Low-Energy Description In typical applications of QFT only the relevant and marginal couplings are important. This is due to the fact that the irrelevant couplings become small at low energies, as we have seen above. In practice this saves us, since instead of considering the infinite number of interaction terms in (4.3), only a handful are actually needed. E.g., in the case of the real scalar field φ described earlier, we only have to take into account two operators, namely L 3 = λ 3 φ 3 /(3!) and L 4 = λ 4 φ 4 /(4!), in the low-energy limit. Let us have a closer look at this issue. Suppose that at some day we discover the true superduper theory aka the TOE that describes the world at very high energy scales, say the GUT scale, or, if you wish, even the Planck scale. Whatever this scale is, let s call it Λ. Since it is an energy scale, we obviously have [Λ] = 1. What we want to understand are the laws of physics at energy scales E that we can probe directly in a laboratory, which given today s standards, means E Λ. Let us further suppose that at energies of order E, the laws of physics are described by a real scalar field. 20 This scalar field will have some complicated 20 Of course, we know that this assumption is plain wrong, since the SM is a non-abelian gauge theory with chiral fermions, but the same argument applies in that case. 57

60 interaction terms (4.3), where the precise form is dictated by all the stuff that is going on in the TOE. Can we get an idea about the interactions? Well, we can write our dimensionful coupling constants λ n in terms of dimensionless couplings g n, multiplied by a suitable power of the relevant scale Λ, λ n = g n. (4.5) Λn 4 The exact values of the dimensionless couplings g n depend on the details of the TOE, 21 so we have to do some guesswork. Since the couplings g n are dimensionless, 1 looks like a pretty good and somehow a natural guess. Since we are not completely sure, let s say g n = O(1). This means that in a laboratory with E Λ the interaction terms L n = λ n φ n /(n!) of (4.3) will be suppressed by powers of (E/Λ) n 4 if n 5. Given the LHC energy of around 1 TeV, this is a suppression by many orders of magnitude. E.g., for Λ = M P one has E/Λ = It is this simple argument based on dimensional analysis that ensures that we need to focus only on the first few terms in the interaction, namely those that are relevant and marginal. It also means that if we only have access to low-energy experiments, it is going to be very difficult to figure out the precise nature of the TOE, because its effects are highly diluted except for the relevant and marginal interactions. Some people therefore call the superduper theory that everybody is looking for, not TOE, but TOENAIL, which stands for theory of everything not accessible in laboratories. The discussion given above is a poor man s version of the ideas of effective field theory and Wilson s renormalization group, about which you can learn much more by asking Matthias Neubert. Weakly Coupled Theories In this course we will only deal with weakly coupled QFTs, i.e., theories that can be truly considered as small perturbations of the free field theory at all energies. We will look in more detail at two specific examples. The first example of a weakly coupled QFT we will study is the φ 4 theory, L = 1 2 ( µφ) m2 φ 2 λ 4! φ4, (4.6) where φ is our well-known real scalar field. For (4.6) to be weakly-coupled we have to require λ 1. We can get a hint for what the effects of the additional φ 4 term will be. Expanding it in terms of ladder operators, we find terms like a pa pa pa p, a pa pa pa p, (4.7) etc., which create and destroy particles. This signals that the φ 4 Lagrangian (4.6) describes a theory in which particle number is not conserved. In fact, it is not too difficult to check that the number operator N does not commute with the Hamiltonian, i.e., [H, N] 0. The second example we will look at is a scalar Yukawa theory. Its Lagrangian is given by L = ( µ ϕ )( µ ϕ) ( µφ) 2 M 2 ϕ ϕ 1 2 m2 φ 2 g ϕ ϕφ, (4.8) 21 If we would know the precise structure of the TOE we could, in fact, calculate the couplings g n. 58

61 with g M, m. This theory couples a complex scalar ϕ to a real scalar φ. In this theory the individual particle numbers for ϕ and φ are not conserved. Yet, the Lagrangian (4.8) is invariant under global phase rotations of ϕ, which ensures that there will be a conserved charge Q obeying [H, Q] = 0. In fact, we have met this charge already in (3.95). In consequence, in the scalar Yukawa theory the number of ϕ particles minus the number of ϕ antiparticles is conserved. Notice also that the potential in (4.8) has a stable minimum at ϕ = φ = 0, but it is unbounded from below, if gφ becomes too large. This means that we should not mess to much with the scalar Yukawa theory. 4.2 Interaction Picture In QM, there is a useful viewpoint called the interaction picture, which allows to deal with small perturbations to a well-understood Hamiltonian. Let me briefly recall how this works. In the Schrödinger picture, the states evolve as id/dt ψ S = H ψ S, while the operators O S are time independent. In contrast, in the Heisenberg picture the states do not evolve with time, but the operators change with time, namely one has ψ H = e iht ψ S and O H = e iht O S e iht. The interaction picture is a hybrid of the two. We split the Hamiltonian as H = H 0 + H int, (4.9) where in the interaction picture the time dependence of operators O I is governed by H 0, while the time dependence of the states ψ I is governed by H int. While this split is arbitrary, things are easiest if one is able to solve the Hamiltonian H 0, e.g., if H 0 is the Hamiltonian of a free theory. From what I have said so far, it follows that ψ I = e ih 0t ψ S, O I = e ih 0t O S e ih 0t. (4.10) Since the Hamiltonian is itself an operator, the latter equation also applies to the interaction Hamiltonian H int. In consequence, one has H I = (H int ) I = e ih 0t H int e ih 0t. (4.11) The Schrödiner equation in the interaction picture is readily derived starting from the Schrödinger picture, i d dt ψ S = H ψ S, = i d ( e ih 0 ) t ψ I = (H0 + H int ) e ih0t ψ I, dt = i d dt ψ I = e ih 0t H int e ih 0t ψ I, (4.12) = i d dt ψ I = H I ψ I. Dyson s Formula In order to solve the system described by the Hamiltonian (4.9), we have to find a way of how to find a solution to the Schrödinger equation in the interaction basis (4.12). Let us write the solution as ψ(t) I = U(t, t 0 ) ψ(t 0 ) I, (4.13) 59

62 where U(t, t 0 ) is an unitary time-evolution operator satisfying U(t, t) = 1, U(t 1, t 2 )U(t 2, t 3 ) = U(t 1, t 3 ), and U(t 1, t 3 ) [ U(t 2, t 3 ) ] = U(t1, t 2 ). Inserting (4.13) into the last line of (4.12) i d dt U(t, t 0) = H I (t)u(t, t 0 ). (4.14) If H I would be a function, the solution to the differential equation (4.14) would read ( t ) U(t, t 0 ) =? exp i dt H I (t ). (4.15) t 0 Yet, H I is not a function but an operator and this causes ordering issues. Let s have a closer look at the exponential to understand where the trouble comes from. The exponential is defined through its power expansion, ( t ) t ( t ) 2 exp i dt H I (t ) = 1 i dt H I (t ) + ( i)2 dt H I (t ) (4.16) t 0 t 0 2 t 0 When we differentiate this with respect to t, the third term on the right-hand side gives 1 ( t ) dt H I (t ) H I (t) 1 ( t ) 2 t 0 2 H I(t) dt H I (t ). (4.17) t 0 The second term of this expression looks good since it is part of H I (t)u(t, t 0 ) appearing on the right-hand side of (4.14), but the first term is no good, because the H I (t) sits on the wrong side of the integral, and we cannot commute it through, given that [H I (t ), H I (t)] 0 when t t. So what is the correct expression for U(t, t 0 ) then? The correct answer is provided by Dyson s formula, 22 which reads ( t ) U(t, t 0 ) = T exp i dt H I (t ). (4.18) t 0 Here T denotes time ordering as defined in (3.124). It is easy to prove the latter statement. We start by expanding out (4.18), which leads to t U(t, t 0 ) = 1 i dt H I (t ) t 0 [ t t + ( i)2 dt dt H I (t )H I (t ) + 2 t 0 t t t 0 dt t t 0 dt H I (t )H I (t ) ] (4.19) In fact, the terms in the last line are actually the same, since t t dt dt H I (t )H I (t ) = t 0 t = t t dt t 0 t t t 0 dt H I (t )H I (t ) dt dt H I (t )H I (t ), t 0 t 0 (4.20) 22 Essentially figured out by Paul Dirac, but in its compact notation due to Freeman Dyson. 60

63 where the range of integration in the first expression is over t t, while in the second expression one integrates over t t, which is, of course, the same thing. The final expression is simply obtained by relabelling t and t. In fact, it is not too difficult to show that one has t t 0 dt 1 t1 t 0 dt 2... tn 1 t 0 dt n H I (t 1 )... H I (t n ) = 1 n! t t 0 dt 1... dt n T (H I (t 1 )... H I (t n )). (4.21) Putting things together this means that the power expansion of (4.17) takes the form t t t U(t, t 0 ) = 1 i dt H I (t ) + ( i) 2 dt dt H I (t )H I (t ) (4.22) t 0 t 0 t 0 The proof of Dyson s formula is straightforward. First, observe that under the T operation, all operators commute, since their order is already fixed by time ordering. Thus, i d dt U(t, t 0) = i d [ ( t )] [ ( t )] T exp i dt H I (t ) = T H I (t) exp i dt H I (t ) dt t 0 t 0 ( t ) (4.23) = H I (t)t exp i dt H I (t ) = H I (t)u(t, t 0 ). t 0 Notice that since t, being the upper limit of the integral, is the latest time so that the factor H I (t) can be pulled out to the left. Before moving on, I have to say that Dyson s formula is rather formal. In practice, it turns out to be very difficult to compute the time-ordered exponential in (4.18). The power of (4.18) comes from the expansion (4.22) which is valid when H I is a small perturbation to H First Look at Scattering Processes Let us now try to apply the interaction picture to QFT, starting with an easy example, namely the interaction Hamiltonian of the Yukawa theory, H int = g d 3 x ϕ ϕφ. (4.24) Unlike the free theories discussed in Section 2, this interaction does not conserve the particle number of the individual fields, allowing particles of one type to morph into others. In order to see why this is the case, we look at the evolution of the state, i.e., ψ(t) = U(t, t 0 ) ψ(t 0 ), in the interaction picture. If g M, m, where M and m are the masses of ϕ and φ, respectively, the perturbation (4.24) is small and we can approximate the full time-evolution operator U(t, t 0 ) in (4.18) by (4.22). Notice that (4.22) is, in fact, an expansion in powers of H int. The interaction Hamiltonian H int contains ladder operators for each type of particle. In particular, glancing at (3.107) tells us that the φ field contains the operators a and a that create or destroy φ particles. 23 Let s call this particle mesons (M). On the other hand, from the discussion in Section 3.4 and 3.5, it follows that the field ϕ contains the operators a + and 23 The additional subscript p of the ladder operators a and a etc. is dropped hereafter in the text. 61

64 a, which implies that it creates ϕ antiparticles and destroys ϕ particle. We will call these particles nucleons (N). 24 Finally, the action of ϕ is to create nucleons through a and to destroy antinucleons via a +. While the individual particle number is not conserved, it is important to emphasize that Q = N + N as defined in (3.95) is conserved not only in the free theory, but also in the presence of H int. At first order in H int, one will have terms of the form a +a a which destroys a meson and creates a nucleon-antinucleon pair, M N N. At second order in H int, we have more complicated processes. E.g., the combination of ladder operators (a +a a)(a + a a ) gives rise to the scattering process N N M N N. The rest of this section is devoted to calculate the quantum amplitudes for such processes to occur. In order to calculate the amplitude, we have to make an important, but slightly dodgy, assumption. We require that the initial state i at t (final state f at t ) is an eigenstate of the free theory described by the Hamiltonian H 0. At some level, this sounds like a reasonable approximation. If at t the particles are well separated they do not feel the effects of each other. Moreover, we intuitively expect that the states i and f are eigenstates of the individual number operators N and N ±. These operators commute with H 0, but not with H int. As the particles approach each other, they interact briefly, before departing again, each going on its own way. The amplitude to go from i to f is given by lim f U(t +, t ) i = f S i, (4.25) t where the unitary operator S is known as the S-matrix. Needless to say, that the S in S-matrix stands for scattering. There are a number of reason why the assumption of non-interacting initial and final states i and f is shaky. First, one cannot describe bound states. E.g., naively this formalism cannot deal with the scattering of an e and proton (p) which collide, bind, and leave as a Hydrogen atom. It is possible to circumvent this objection, since it turns out that bound states show up as poles in the S-matrix. Second, and more importantly, a single particle, a long way from its neighbors, is never alone in field theory. This is true even in classical electrodynamics, where the electron sources the electromagnetic field from which it can never escape. In QED, a related fact is that there is a cloud of virtual photons surrounding the electron. This line of thought gets us into the issues of renormalization and you will hear more on this later. For the time being, let me simply use the assumption of non-interacting asymptotic states. After developing the basics of scattering theory, we will revisit the latter problem. Example: Meson Decay Let us consider the relativistically normalized initial and final states, i = 2E p1 a p 1 0, f = 4E q1 E q2 a +,q 1 a,q 2 0. (4.26) 24 Of course, in reality nucleons are spin-1/2 particles, and do not arise from the quantization of a scalar field. Our scalar Yukawa theory is therefore only a toy model for nucleons interacting with mesons. 62

65 The initial state contains a meson with momentum p 1, while the final state contains a nucleonantinucleon pair of momentum q 1 and q 2. In leading order in the interaction H int (4.24), the amplitude for the process M N N is given by f S i = ig f d 4 x ϕ I(x)ϕ I (x)φ I (x) i. (4.27) Let us calculate this matrix element step by step. We first express φ I in terms of a and a using (3.107). Notice that it is correct to apply the latter equation, since the φ I field in (4.27) is in the interaction picture, which is the same as the Heisenberg picture of the free theory. The annihilation operator in (3.107) will turn i into something proportional to 0, while the piece containing a creation operator will turn i into a two meson state. A two meson state has however no overlap with f, which is a N N state, and the ladder operator appearing in ϕ I and ϕ I cannot change this situation. So we have f S i = ig f = ig f = ig f d 4 x ϕ I(x)ϕ I (x) d 4 x ϕ I(x)ϕ I (x) d 3 k (2π) 3 d 3 k (2π) 3 d 4 x ϕ I(x)ϕ I (x) e ip 1x 0. 2Ep1 2Ek a k a p 1 e ikx 0 2Ep1 [ ] (2π) 3 δ (3) (p 1 k) a p 2Ek 1 a k e ikx 0 (4.28) Now we do the same for ϕ I and ϕ I. To get a non-zero overlap with our nucleon-antinucleon final state, we have to pick up the creation operators a + and a from the Fourier expansion of the field operators. Altogether we then have d 4 x d 3 k 1 d 3 k 2 f S i = ig 0 (2π) 6 4Eq1 E q2 4Ek1 E k2 a,q2 a +,q1 a +,k 1 a,k 2 0 e i(k 1+k 2 p 1 )x = ig d 4 x d 3 k 1 d 3 k 2 (2π) 6 4Eq1 E q2 4Ek1 E k2 (2π) 6 δ (3) (q 1 k 1 ) δ (3) (q 2 k 2 ) e i(k 1+k 2 p 1 )x (4.29) = ig (2π) 4 δ (4) (q 1 + q 2 p 1 ), where we have made repeatedly use of the commutation relations of the ladder operators as given in (3.16) and ignored contributions where annihilation operators act on the vacuum, since these vanish by definition. We have drawn first blood: the result in (4.29) is our first QFT amplitude. Notice that the delta function constraints the possible M N N decays. In particular, the decay can only happen at all if the mass of the meson is larger or equal to the mass of the nucleon-antinucleon state, i.e., m 2M. In order to see this, we simply boost our reference frame so that the meson is at rest p 1 = (m, 0, 0, 0). This is always possible. Momentum conservation, as imposed by the delta function, than implies that the nucleon and antinucleon are produced back-to-back, q 1 = q 2, and that m = 2 ( M 2 + q 1,2 2) 1/2 2M. 63

66 4.4 Wick s Theorem Using Dyson s formulas (4.18) and (4.22), we want to compute matrix elements such as f T ( H I (x 1 )... H I (x n ) ) i, (4.30) where i and f are assumed to be asymptotically free states. The ordering of the operators H I is fixed by time ordering. However, since the interaction Hamiltonian contains certain creation and annihilation operators, it would be convenient if we could start to move all annihilation operators to the right, where they can start eliminating particles in i. Recall that this is the definition of normal ordering as defined in (3.27). Wick s theorem tells us how to go from time-ordered products to normal-ordered products. Before stating Wick s theorem in its full generality, let s keep it simple and try to rederive something that we know already. This is always a good idea. Case of Two Fields The most simple matrix element of the form (4.30) is 0 T φ I (x)φ I (y) 0. (4.31) We already calculated this object in Section 3.7 and gave it the name Feynman propagator. What we want to do now is to rewrite it in such a way that it is easy to evaluate and to generalize the obtained result to the case with more than two fields. We start by decomposing the real scalar field in the interaction picture as φ I (x) = φ + I (x) + φ I (x), (4.32) with 25 d φ + 3 I (x) = k 1 d a (2π) 3 k e ikx, φ 3 I (x) = k 1 a 2Ek (2π) 3 k eikx. (4.33) 2Ek This decomposition can be done for any free field. It is useful since φ + I (x) 0 = 0, 0 φ I (x) = 0. (4.34) Now we consider the case x 0 > y 0 and compute the time-order product of the two scalar fields, T φ I (x)φ I (y) = φ I (x)φ I (y) = ( φ + I (x) + φ I (x))( φ + I (y) + φ I (y)) = φ + I (x)φ+ I (y) + φ I (x)φ+ I (y) + φ I (y)φ+ I (x) + φ I (x)φ I (y) (4.35) + [ φ + I (x), φ I (y)], where we have normal ordered the last line, i.e., brought all φ + I s to the right. To get rid of the φ + I (x)φ I (y) term, we have added the commutator [ φ + I (x), φ I (y)]. In the case x 0 < y 0, we find, repeating the above exercise, T φ I (x)φ ( y) = :φ I (x)φ I (y): + [ φ + I (y), φ I (x)], (4.36) 25 The superscripts ± do not make much sense, but I just follow Pauli and Heisenberg here. If you have to, complain with them. 64

67 where we have made use of the fact that the first four terms in the last line of (4.35) are simply the normal-ordered product of the two fields, :φ(x)φ(y):. In order to combine the results (4.35) and (4.36) into one equation, we define the contraction of two fields, [ φ + I φ I (x)φ I (y) = (x), φ I (y)], x 0 > y 0, [ (4.37) φ + I (y), φ I (x)], y 0 > x 0. This definition implies that the contraction of two φ I propagator: fields is nothing but the Feynman φ I (x)φ I (y) = D F (x y). (4.38) For a string of field operators φ I, the contraction of a pair of fields means replacing the contracted operators with the Feynman propagator, leaving all other operators untouched. Equipped with the definition (4.37), the relation between time-ordered and normal-ordered products of two fields can now be simply written as T φ I (x)φ I (y) = :φ I (x)φ I (y): + φ I (x)φ I (y). (4.39) Let me emphasize that while both T φ I (x)φ I (y) and :φ I (x)φ I (y): are operators, their difference is a complex function, namely the Feynman propagator or the contraction of two φ I fields. The formalism of contractions is also straightforwardly extended to our complex scalar field ϕ I. One has T ϕ I (x)ϕ I(y) = :ϕ I (x)ϕ I(y): + ϕ I (x)ϕ I(y), (4.40) prompting us to define the contraction in this case as ϕ I (x)ϕ I(y) = D F (x y). ϕ I (x)ϕ I (y) = ϕ I(x)ϕ I(y) = 0. (4.41) For convenience and brevity, I will from here on often drop the subscript I, whenever I calculate matrix elements of the form (4.30). There is however little room for confusion, since contractions will always involve interaction-picture fields. Strings of Fields With all this new notation at hand, the generalization to arbitrarily many fields is also easy to write down: T ( φ(x 1 )... φ(x n ) ) = : ( φ(x 1 )... φ(x n ) + all possible contractions ) :. (4.42) This identity is known as the Wick s theorem. Notice that for n = 2 the latter equation is equivalent to (4.39). Before proving Wick s theorem, let me tell you what the phrase all possible contractions means by giving a simple example. 65

68 For n = 4 we have, writing φ i instead of φ(x i ) for brevity, T ( φ 1 φ 2 φ 3 φ 4 ) = : ( φ1 φ 2 φ 3 φ 4 + φ 1 φ 2 φ 3 φ 4 + φ 1 φ 2 φ 3 φ 4 + φ 1 φ 2 φ 3 φ 4 + φ 1 φ 2 φ 3 φ 4 + φ 1 φ 2 φ 3 φ 4 + φ 1 φ 2 φ 3 φ 4 (4.43) + φ 1 φ 2 φ 3 φ 4 + φ 1 φ 2 φ 3 φ 4 + φ 1 φ 2 φ 3 φ 4 ) :. When the contracted field operator are not adjacent, we still define it to give a factor of D F. E.g., :φ 1 φ 2 φ 3 φ 4 : = D F (x 2 x 4 ) :φ 1 φ 3 :. (4.44) Since the VEV of any normal-ordered operator vanishes, i.e., 0 : O : 0 = 0, sandwiching any term of (4.43) in which there remain uncontracted field operators between the vacuum 0 gives zero. This means that only the three fully contracted terms in the last line of that equation survive and they are all complex functions. We therefore have 0 T ( φ 1 φ 2 φ 3 φ 4 ) 0 = DF (x 1 x 2 )D F (x 3 x 4 ) + D F (x 1 x 3 )D F (x 2 x 4 ) + D F (x 1 x 4 )D F (x 2 x 3 ), (4.45) which is a rather simple result and has, as we will see in the next section, a nice pictorial interpretation. Proof of Wick s Theorem We still like to prove Wick s theorem. Naturally this is done by induction. We have already proved the case n = 2. So let s assume that (4.42) is valid for n 1 and try to show that the latter equation also holds for n field operators. With out loss of generality we can assume that x 0 1 >... > x 0 n, since if this is not the case we simply relabel the points in an appropriate way. Such a relabeling leaves both sides of (4.42) unchanged. Then applying Wick s theorem to the string φ 2... φ n, we arrive at T ( φ 1... φ n ) = φ1... φ n = = φ 1 :(φ 2... φ n + all contraction not involving φ 1 ): = (φ φ 1 ) :(φ 2... φ n + all contraction not involving φ 1 ):. (4.46) We now want to move the φ ± 1 s into the : (... ) :. For φ 1 this is easy, since moving it in, it is already on the left-hand side and thus the resulting term is normal ordered. The term with φ + 1 is more complicated because we have to bring it into normal order by commuting φ + 1 to the right. E.g., consider the term without contractions, φ + 1 :φ 2... φ n : = :φ 2... φ n : φ [φ + 1, :φ 2... φ n :] = :φ + 1 φ 2... φ n : + : ( [φ + 1, φ 2 ]φ 3... φ n + φ 2 [φ + 1, φ 3 ]φ 4... φ n +... ) : (4.47) = : ( φ + 1 φ 2... φ n + φ 1 φ 2 φ 3... φ n + φ 1 φ 2 φ 3 φ 4... φ n +... ) :. 66

69 Here we first used the fact that the commutator of a single operator and a string of operators can be written as a sum of all possible strings of operators with two adjacent operators put into a commutator. The simplest relation of this type reads [φ 1, φ 2 φ 3 ] = [φ 1, φ 2 ]φ 3 + φ 2 [φ 1, φ 3 ] and is easy to prove. In the last step we then realized that under the assumption x 0 1 >... > x 0 n all commutator of two operators are equivalent to a contraction of the relevant fields. The first term in the last line of (4.47) combines with the φ 1 term of (4.46) to give : φ 1... φ n :, meaning that we have derived the first term on the right-hand side of Wick s theorem as well as all terms involving only one contraction of φ 1 with another field in (4.42). It is not too difficult to understand that repeating the above exercise (4.47) with all the remaining terms in (4.46) will then give all possible contractions of all the fields, including those of φ 1. Hence the induction step is complete and Wick s theorem is proved. 4.5 Second Look at Scattering Processes In order to see the real power of Wick s theorem let s put it to work and try to calculate NN NN scattering in the Yukawa theory (4.24). We first write down the expressions for the initial and final states, i = 4E p1 E p2 a +,p 1 a +,p 2 0 = p 1, p 2, f = 4E q1 E q2 a +,q 1 a +,q 2 0 = q 1, q 2. (4.48) We now look at the expansion of f S i in powers of the coupling constant g. In order to isolate the interesting part of the S-matrix, i.e., the part due to interactions, we define the T -matrix by S = 1 + it, (4.49) where the 1 describes the situation where nothing happens. The leading contribution to it occurs at second order in the interaction (4.24). We find ( ig) 2 d 4 xd 4 y T ( ϕ (x)ϕ(x)φ(x)ϕ (y)ϕ(y)φ(y) ). (4.50) 2 Applying Wick s theorem to the time-order production entering this expression, we get (besides others) a term D F (x y) :ϕ (x)ϕ(x)ϕ (y)ϕ(y):, (4.51) which features a contraction of the two φ fields. This term will contribute to the scattering, because the operator : ϕ (x)ϕ(x)ϕ (y)ϕ(y) : destroys the two nucleons in the initial state and generates those appearing in the final state. In fact, (4.51) is the only contribution to the process NN NN, since any other ordering of the field operators would lead to a vanishing matrix element. The matrix element of the normal-ordered operator in (4.51) is readily computed: q 1, q 2 :ϕ (x)ϕ(x)ϕ (y)ϕ(y): p 1, p 2 = q 1, q 2 ϕ (x)ϕ (y) 0 0 ϕ(x)ϕ(y) p 1, p 2 = ( e i(q 1x+q 2 y) + e i(q 1y+q 2 x) ) ( e i(p 1x+p 2 y) + e i(p 1y+p 2 x) ) = e i[(q 1 p 1 )x+(q 2 p 2 )y] + e i[(q 2 p 1 )x+(q 1 p 2 )y] + (x y), 67 (4.52)

70 where, in going to the third line, we have used the fact that for relativistically normalized states, 0 ϕ(x) p = e ipx. Putting things together, the matrix element (4.50) takes the form ( ig) 2 d 4 xd 4 y d 4 k [ 2 (2π) 4 ] e i[(q 1 p 1 )x+(q 2 p 2 )y] + e i[(q 2 p 1 )x+(q 1 p 2 )y] + (x y) ie ik(x y) k 2 m 2 + iɛ, (4.53) where the term in curly brackets arises from (4.52), while the final factor stems from the expression for the φ propagator (3.129). The (x y) terms double up with the others to cancel the factor of 1/2 in the prefactor ( ig) 2 /2, while the x and y integrals give delta functions. One arrives at ( ig) 2 d 4 k i(2π) 8 [ δ (4) (q (2π) 4 k 2 m 2 1 p 1 + k)δ (4) (q 2 p 2 k) + iɛ ] + δ (4) (q 2 p 1 + k)δ (4) (q 1 p 2 k). (4.54) Finally, we perform the k integration using the delta functions. We obtain [ ] i( ig) 2 1 (p 1 q 1 ) 2 m 2 + iɛ + 1 (2π) 4 δ (4) (p (p 1 q 2 ) 2 m p 2 q 1 q 2 ), (4.55) + iɛ where the delta function, like in (4.29), imposes momentum conservation. Let me note that, in fact, we can drop the iɛ in the propagators, since the denominators cannot become zero. In order to see this, we go to the center-of-mass (CM) frame, where p 1 = p 2 and, by momentum conservation p 1 = q 1. This ensures that the 4-momentum of the meson is k = (0, p 1 q 1 ), and in consequence k 2 < 0. We will see shortly another, much simpler way to reproduce the result (4.55) using Feynman diagrams. This will also shed light on the physical interpretation. Notice that the above calculation is also relevant for the scatterings N N N N and N N N N. Both reactions arise from the term (4.52) in Wick s theorem. However, we will never find a term that contributes to NN N N or N N NN, because these transitions would violate the conservation of the charge Q introduced in (3.95). 4.6 Feynman Diagrams As the above example demonstrates, to actually compute scattering amplitudes using Wick s theorem is (still) rather tedious. There s a much better way, which starts by drawing pretty pictures. This pictures represent the expansion of f S i and we will learn how to associate mathematical expressions with those pictures. The pictures, you probably already guessed it, are the famous Feynman diagrams. The Feynman-diagram approach turns out to be a powerful tool to calculate QFT amplitudes (or as Schwinger puts it in [1]: Like the silicon chips of more recent years, the Feynman diagram was bringing computation to the masses. ). We again start simple and consider the case of for fields, all at different space-time points, which we have already worked out in (4.45). Let us present each of the points x 1 to x 4 by a point and the propagators D F (x 1 x 2 ) etc. by a line joining the relevant points. Then the 68

71 right-hand side of (4.45) can be represented as a sum of three Feynman diagrams, T ( 1 2 ) 1 2 φ 1 φ 2 φ 3 φ 4 0 = (4.56) While this matrix element is not a measurable quantity, the pictures suggest a physical interpretation. Two particles are generated at two points and then each propagators to one of the other points, where they are both annihilated. This can happen in three possible ways corresponding to the three shown graphs. The total amplitude for this process is the sum of the three Feynman diagrams. Things get more interesting, if one considers expressions like (4.56) that contain field operators evaluated at the same space-time point. So let us have a look at the expansion of the propagator (4.31) of the real scalar field, 0 T { [ φ(x)φ(y) + φ(x)φ(y) i ] } dt H I (t) , (4.57) in the presence of the interaction term H I = λ/(4!)φ 4 of the φ 4 theory (4.6). The first term gives the free-field result, 0 T φ(x)φ(y) 0 = D F (x y), while the second term takes the form { 0 T φ(x)φ(y) ( i) dt d 3 z λ } 4! φ4 (z) 0 = 0 T { φ(x)φ(y) iλ 4! } d 4 z φ(z)φ(z)φ(z)φ(z) 0. (4.58) Now let s apply Wick s theorem (4.42) to (4.58). We get one term for each possible way to contract the six different φ s with each other in pairs. There are 15 such possibilities, but fortunately only two of these possibilities are really different. If we contract φ(x) and φ(y), there are 3 possible ways to contract the remaining φ(z) s. The other possibility is to contract φ(x) with φ(z) (four choices) and φ(y) with φ(z) (three choices), and φ(z) with φ(z) (one choice). There are 12 possible ways to do this, all giving the same result. In consequence, we have 0 T { φ(x)φ(y) ( i) = 3 iλ 4! + 12 iλ 4! dt D F (x y) d 3 z λ } 4! φ4 (z) 0 d 4 z D F (z z) D F (z z) d 4 z D F (x z) D F (y z) D F (z z). (4.59) We can understand the latter expression better if we represent each term as a Feynman graph. Again we draw each propagator as a line and each point as a dot. This time we have however to distinguish between the external points x and y and the internal point z, which is 69

72 associated with a factor iλ d 4 z. Neglecting the overall factors, we see that the expression (4.59) is equal to the sum of the following two diagrams x y z + x z y. (4.60) We refer to the lines in these diagrams as propagators, since they represent the propagation amplitudes D F (x y) etc. Internal points where four lines meet are called vertices. Since D F (x y) is the amplitude for a free Klein-Gordon particle to propagate between x and y, the diagrams actually interpret the analytic formula as a process of creation, propagation, and annihilation which takes place in space-time. Let s now move to a more complicated contraction that arises at order λ 3 in the φ 4 interaction (φ x = φ(x) etc.): ( ) 3 1 iλ 0 φ x φ y d 4 z φ z φ z φ z φ z d 4 w φ w φ w φ w φ w d 4 u φ u φ u φ u φ u 0 3! 4! = 1 ( ) 3 iλ d 4 z d 4 w d 4 u D F (x z)d F (z z)d F (z w) 3! 4! DF 2 (w u)d F (u u)d F (u y). (4.61) The number of different contractions that gives this result is large. One has 3! /2, (4.62) which means a total number or possibilities. Here the factor 3! arises from the interchange of the vertices z, w, and u, while the first 4 3 factor describes the placement of the contractions into the z vertex. The factor characterizes the placement of the contractions into the w vertex whereas the second 4 3 factor is associated to the placement of the contractions into the u vertex. Finally, the factor of 1/2 is due to the interchange of the w u contractions. The product in (4.62) is roughly 1/13 of the total number of contractions of 14 different field operators. The particular contraction (4.61) can be represented by the following cactus diagram: u (4.63) x z w 70 y.

73 It is conventional, for obvious reasons, to let this one diagram represent the sum of all identical terms. In practical applications one always draws the Feynman diagrams first, using it as a mnemonic device to write down the analytic expression. If this is done, one still has to figure out the multiplicative overall factor. Of course, one can do this as we have done it above by associating a factor d 4 z ( iλ/(4!)) with each vertex, putting in the 1/n! factor from the Taylor expansion, and then do the combinatorics by writing out the product of fields as in (4.61) and counting. Yet, typically the 1/n! factor from the Taylor series will cancel the n! factor arising from the interchanging the vertices, so that one can simply forget about this factors. Furthermore, the generic vertex has four different lines coming from four different places, so that the various contractions into the φφφφ operator generates a factor of 4! (as in the case of the w vertex in the above example). This factor of 4! cancels the denominator of iλ/(4!). It is therefore conventional to associate the expression d 4 z ( iλ) with each vertex. Applying this scheme to the Feynman graph in (4.63) gives a multiplicative factor that is too large by a factor of S = 8 = 2 2 2, which is called the symmetry factor of the diagram. Two factor of 2 come from lines that start and end on the same vertex, since the diagram is symmetric under the interchange of the ends of such lines (z and u in our case). The other factor of 2 comes from the two propagators connecting w and u, since the graph is symmetric under the interchange of these two lines. A third type of symmetry (not arising in the case at hand) is the equivalence of two vertices. In order to arrive at the correct overall factor, one has to divide by the symmetry factor, which is in general the number of possibilities to change parts of the diagrams without changing the result of the Feynman graph. Most people never need to evaluate Feynman graphs with a symmetry factor larger than 2, so there is no need to worry too much about these technicalities. But for completeness let me give some examples of nontrivial symmetry factors. Here they are (dropping the labels x and y at the external points): S = 2, S = = 8, (4.64) S = 3! = 6, S = 3! 2 = 12. Clearly, if you are in doubt about the symmetry factor you can always determine it by counting equivalent contractions, as we did above. We are now ready to summarize our rules needed to find the analytic expression for each piece of a given Feynman diagram in the φ 4 theory: 1. For each propagator one has x y = D F (x y). 71

74 2. For each vertex one has z = ( iλ) d 4 z. 3. For each external point one has x = Divide by the symmetry factor. Since these rules are written in terms of space-time points x, y, z, etc. these rules are called position-space Feynman rules. One way to interpret these rules is to think of the factor ( iλ) as the amplitude for the emission and/or absorption of particles at a vertex. The integral d 4 z tells us that we have to sum over all points where this process can occur. This means that this is nothing but the superposition principle of QM: when a process can happen in different ways, we add the amplitudes for each possibility. Furthermore, in order to calculate each individual amplitude the Feynman rules tell us to multiply the amplitudes (propagators and vertices) for each of independent part of the process. The above Feynman rules are given in position-space. Yet, in actual calculation it is (often) more convenient to work in the momentum-space by introducing the Fourier transformation of the propagator (3.129). To such a propagator one has to assign a 4-momentum p, indicating in general the direction of the momentum with an arrow (since D F (x y) = D F (y x) the direction of p is arbitrary). The z-dependent factors of the vertices in a diagram are then given by p 3 p 4 p 1 p 2 d 4 z e i(p 1+p 2 +p 3 p 4 ) z = (2π) 4 δ (4) (p 1 + p 2 + p 3 p 4 ). (4.65) In other words momentum is conserved at each vertex. The delta functions from the vertices can now be used to perform some of the momentum integrals from the propagators. We are left with the following momentum-space Feynman rules: 1. For each propagator one has = p i p 2 m 2 + iɛ. 2. For each vertex one has = iλ. 3. For each external point one has x = e ip x. p 72

75 4. Impose momentum conservation at each vertex. 5. Integrate over each undetermined momentum 6. Divide by the symmetry factor. d 4 l (2π) 4. Again, we can interpret each factor as the amplitude for that part of the process, with the integrations coming from the superposition principle. The exponential factor for an external point is just the amplitude for a particle at that point to have the needed momentum, or, depending on the direction of the arrow, for a particle with a certain momentum to be found at the specific point. 4.7 Third Look at Scattering Processes Let us now apply the things that we have learned to the case of NN NN scattering. At order g 2 we have to consider the two diagrams shown in Figure 4.1. Employing the relevant momentum-space Feynman rules, it is readily seen that the analytic expression for the sum of the displayed graphs agrees with the final result (4.55) of the calculation that we performed earlier in Section 4.5. In fact, there is a nice physical interpretation of the graphs. We talk, rather loosely, of the nucleons exchanging a meson which, in the first diagram, has momentum k = p 1 q 1 = p 2 q 2. This meson does not satisfy the usual energy dispersion relation, because k 2 m 2, where m is the mass of the meson. The meson is called a virtual particle and is said to be off-shell (or, sometimes, off mass-shell). Heuristically, it can t live long enough for its energy to be measured to great accuracy. In contrast, the momentum on the external, nucleon legs satisfy p 2 1 = p 2 2 = q 2 1 = q 2 2 = M 2, which means that the nucleons, having mass M, are on-shell. Similar considerations apply to the second diagram. It is important to notice that the appearance of the two diagrams above ensures that the particles satisfy Bose statistics. The diagrams describing the scattering of a nucleon and an antinucleon, N N N N, are a little bit different than the ones for NN NN. At lowest order, the corresponding graphs are shown in Figure 4.2. It is a simple matter to write down the amplitude using the relevant Feynman rules, i( ig) 2 [ 1 (p 1 + p 2 ) 2 m 2 + iɛ + 1 (p 1 q 1 ) 2 m 2 + iɛ ] (2π) 4 δ (4) (p 1 + p 2 q 1 q 2 ). (4.66) Notice that in the CM frame, p 1 = p 2, the denominator of the first term in the square bracket is 4 (M 2 + p 2 1) m 2. If m < 2M, then this term never vanishes and we may drop the iɛ. In contrast, if m > 2M, then the amplitude corresponding to the first diagram diverges at some value of p 1. In this case it turns out that we may also neglect the iɛ term, although for a different reason. In this case the meson is unstable when m > 2M and thus has a finite width Γ. When correctly treated, this instability adds a finite imaginary piece iγ to the denominator which makes the application of the iɛ prescription unnecessary. Nonetheless, the increase in the scattering amplitude which we see in the first diagram when 4 (M 2 + p 2 1) = m 2 is what 73

76 N p 1 q 1 N N p 1 N M + M q 1 q 2 N p 2 q 2 N N p 2 N Figure 4.1: Feynman diagrams contributing to NN NN scattering at order g 2. allows us to discover new particles. These appear as a resonance (a peak or bump) in the cross section (roughly the amplitude squared). We see that the amplitudes (4.55) and (4.66) (and in general all processes that include the exchange of just a single particle) depend on the same combinations of momenta in the denominators. There are standard names for various sums and differences of momenta that are known as Mandelstam variables. They are s = (p 1 + p 2 ) 2 = (q 1 + q 2 ) 2, t = (p 1 q 1 ) 2 = (p 2 q 2 ) 2, u = (p 1 q 2 ) 2 = (p 2 q 1 ) 2, (4.67) where, as in the explicit examples above, p 1 and p 2 are the momenta of the two initial-state particles, and q 1 and q 2 are the momenta of the two final-state particles. In order to get a feel for what these variables mean, let us assume (for simplicity) that all four particles are the same. In the CM frame, the initial two particles have the following 4-momenta p 1 = (E, 0, 0, p), p 2 = (E, 0, 0, p), (4.68) The particles then scatter at some angle θ and leave with momenta q 1 = (E, 0, p sin θ, p cos θ), q 2 = (E, 0, p sin θ, p cos θ). (4.69) Then from the definitions (4.67), we have that s = 4E 2, t = 2p 2 (1 cos θ), u = 2p 2 (1 + cos θ). (4.70) We see that the variable s measures the total center of mass energy of the collision, while the variables t and u are measures of the energy exchanged between particles (they are basically equivalent, just with the outgoing particles swapped around). Now the amplitudes that involve exchange of a single particle can be written simply in terms of the Mandelstam variables. E.g., for nucleon-nucleon scattering, the amplitude (4.55) is proportional to 26 A(NN NN) 26 Here and in the following we simply drop all iɛ terms. 1 t m u m, (4.71) 2 74

77 N p 1 M q 1 N + N p 1 M q 1 N N p 2 q 2 N N p 2 q 2 N Figure 4.2: Feynman diagrams contributing to N N N N scattering at order g 2. while in the case of nucleon-antinucleon scattering one finds A(N N N N) 1 s m t m. (4.72) 2 We say that the first case involves t- and u-channel diagrams. On the other hand, the nucleonantinucleon scattering is said to involve s- and t-channel exchange. Note finally that there is a relationship between the Mandelstam variables. In the cases of NN NN and N N N N scattering, which involves external particles with the same mass, one has s + t + u = 4M 2. (4.73) When the masses of the external particles are different this becomes s + t + u = 4 i=1 m2 i, where m i denotes the individual masses of the initial- and final-state particles. Let us now consider the case of meson-meson scattering, MM MM. The simplest diagram we can draw that describes this process is shown in Figure 4.3. It has a single loop, and momentum conservation at each vertex is no longer sufficient to determine every momentum passing through the diagram. Assigning the single undetermined momentum l to the right-hand propagator, all other momenta are fixed by the kinematics (the actual momenta assignments are not displayed in the figure). The amplitude corresponding to the displayed diagram is d i( ig) 4 4 l 1 1 (2π) 4 l 2 M 2 (l + q 1 ) 2 M (l p 1 + q 1 ) 2 M 2 (l q 2 ) 2 M 2 (2π)4 δ (4) (p 1 + p 2 q 1 q 2 ). (4.74) While an explicit calculation of this loop integral is beyond the scope of this lecture, notice that for large l, this integral goes as d 4 l/l 8, which means that it is UV finite (the integral is also IR finite since all propagators are massive). In general, loop integrals can have however both UV (l 2 ) and IR (l 2 0) singularities. The delta function follows from the conservation of 4-momentum which, in turn, follows from space-time translational invariance. It is common to all S-matrix elements. We will define the amplitude A(f i) by stripping off this momentum-conserving delta function, f S 1 i = i f T i = i(2π) 4 δ (4) (p f p i ) A(f i), (4.75) 75

78 M N M N N M N M Figure 4.3: Lowest order contribution to M M M M scattering. The momentum assignments are not explicitly shown. where p f (p i ) is the sum of the final (initial) 4-momenta, and the factor of i out front is a convention which is there to match non-relativistic QM. 4.8 Yukawa Potential So far we have calculated the quantum amplitudes for various scattering processes. But this quantities are a little bit abstract. In order to make contact to experiment let me show in the following how to translate the amplitude (4.55) for nucleon-nucleon scattering into something familiar from Newtonian mechanics, namely a potential, or force, between the particles. We start by asking a simple question in classical field theory that will turn out to be relevant in order to calculate the quantum process. Suppose that we have a fixed delta function source for our real scalar field φ, that persists for all times. What is the profile of φ(x)? In order to answer this question, we have to solve the static Klein-Gordon equation, ( 2 + m 2) φ(x) = δ (3) (x). (4.76) We can solve this equation by going to momentum-space φ(x) = d 3 p/ ((2π) 3 ) e ip x φ(p). After this Fourier transformation the relation (4.76) takes the form (p 2 + m 2 ) φ(p) = 1, which means that we can write the field as φ(x) = d 3 p (2π) 3 e ip x p 2 + m 2. (4.77) Let us compute this integral explicitly. Changing to polar coordinates, and writing p x = pr cos θ, we get φ(x) = 1 (2π) 2 = 1 (2π) 2 r = 1 2πr Re 0 dp [ dp p 2 2 sin (pr) p 2 + m 2 pr p sin (pr) p 2 + m 2 dp 2πi pe ipr p 2 + m 2 ]. (4.78) 76

79 We evaluate the last integral by closing the contour in the upper half plane p i, picking up the pole at p = im. This gives φ(x) = 1 4πr e mr. (4.79) We see that the field dies off exponentially quickly at distances 1/m, i.e., the Compton wavelength of the meson. It is now interesting to ask how the profile of the φ field (the meson) and the force between the ϕ particles (the nucleons) are related. Realize that in electrostatics where a charged particle acts as a delta-function source for the gauge potential A 0 with A µ = (φ, A) we have to face a similar problem. In this case one has 2 A 0 = δ (3) (x) which is solved by A 0 = 1/(4πr). The profile of A 0 then acts as the potential energy for another charged (test) particle moving in this background. Is such an interpretation also possible in the case of φ? Or phrased slightly different, is there a classical limit of the scalar Yukawa theory where the nucleons act as deltafunction sources for the meson field, creating the profile (4.79)? And, if so, is this profile then felt as a static potential? The answer is essentially yes, at least in the limit M m. But the correct way to describe the potential felt by the nucleons is not to talk about classical fields at all, but instead work directly with the quantum amplitudes. Let us see explicitly how this goes. We first compare the result of the first diagram in Figure 4.1 to the corresponding amplitude in non-relativistic QM which describes the interaction of two particles through a potential. In order for the comparison to be meaningful, we have to take the non-relativistic limit of (4.55). We work in the CM frame with p = p 1 = p 2 and q = q 1 = q 2 with p = q for elastic scattering. In the non-relativistic limit one has p M, which by momentum conservation implies q M. It is easy to check that in this limit the first term in (4.55) turns into ig 2 (p q) 2 + m 2. (4.80) We should now compare this result to the scattering amplitude in QM. In order to do this, we consider two particles separated by a distance x, interacting through a potential V (x). The amplitude for the particles to scatter from ±p into ±q can be computed in perturbation theory, using techniques familiar from non-relativistic QM. In Born approximation, i.e., to leading order in the perturbative expansion, the sought amplitude is given by q V (x) p = i d 3 r V (x) e i(p q) x. (4.81) Taking into account that there is a relative factor of (2M) 2 that arises in comparing the QFT amplitude to q V (x) p, which can be traced to the relativistic normalization of the states p 1, p 2, 27 we find after equating (4.80) and (4.81) the following relation d 3 r V (x) e i(p q) x λ 2 = (p q) 2 + m. (4.82) 2 27 Notice that this factor is also necessary to get the dimensions of the potential to work out correctly. 77

80 Here we have introduced the dimensionless parameter λ = g/(2m). The latter equation is trivially inverted, giving V (x) = λ 2 d 3 p (2π) 3 e ip x p 2 + m 2 = λ2 4πr e mr, (4.83) where in the last step have used the results (4.77) through (4.79). The potential V (x) is the famous Yukawa potential. The force has a range 1/m and the minus sign in (4.83) tells us that the potential is attractive. Hideki Yukawa made this potential the basis for his theory of the nuclear force and worked backwards from the range of the force (of about 1 fm) to predict the mass (of about 200 MeV) of the required boson the pion [2]. It is important to realize that QFT has given us an entirely new perspective on the nature of forces between particles. Rather than being a fundamental concept, the force arises from the virtual exchange of other particles, in this case the meson. 4.9 Connected and Amputated Feynman Diagrams We have explained in some detail how to compute scattering amplitudes by drawing all Feynman diagrams and by writing down the corresponding analytic expression for them using Feynman rules. In fact, there are a couple of caveats about what Feynman diagrams one should draw and calculate. Both of these caveats are related to the assumption made so far that the initial and final states are eigenstates of the free theory which, as we have mentioned before, is not correct. The two caveats are as follows. First, we consider only connected Feynman diagrams, where every part of the diagram is connected to at least one external line. We shall see shortly, that this will be related to the fact that the vacuum 0 of the free theory is not the true vacuum Ω of the interacting theory. An example of a disconnected diagram (or piece) is shown on the left-hand side in Figure 4.4. Second, we do not consider diagrams with loops on external lines so-called unamputated graphs. An example of such a diagram is depicted on the right-hand side of the latter figure. These diagrams are related to the fact that the one-particle states of the free theory are not the same as the one-particle states of the interacting theory. In particular, correctly dealing with these diagrams will account for the fact that particles in interacting QFTs are always surrounded by a swarm of virtual particles. We will refer to diagrams in which all loops on external legs have been removed as amputated graphs. Vacuum of the Interacting Theory We start out by discussing the properties of the vacuum Ω of the interacting theory. We will normalize the state Ω as Ω Ω = 1 and H Ω = 0. Since Ω is the ground state of H, we can isolate it by the following procedure. Imagine starting with the vacuum 0 of the free theory (i.e., H 0 0 = 0) and evolving it with H, e iht 0 = n e ient n n 0, (4.84) 78

81 Figure 4.4: Example of a disconnected (left-hand side) and an unamputated (righthand side) Feynman diagram in φ 4 theory. where E n ( n ) are the eigenvalues (eigenstates) of H. We must assume that Ω and 0 have some overlap, i.e., Ω 0 0. If this would not be the case the interaction term H I would not be a small perturbation compared to H 0. Under this assumption, we can rewrite (4.84) as follows e iht 0 = e ie 0t Ω Ω 0 + n 0 e ient n n 0, (4.85) where E 0 = Ω H 0 Ω. Since E n > E 0 for all n, we can get rid of the second term in (4.85) by sending t to infinity in a slightly imaginary direction, t (1 iɛ). 28 It follows that Ω = lim ( e ie 0 t Ω 0 ) 1 e iht 0. (4.86) t (1 iɛ) Since t is very large we can shift it by a small amount, let s say t 0, so that Ω = ( lim e ie 0 (t+t 0 ) Ω 0 ) 1 e ih(t+t 0 ) 0 t (1 iɛ) = lim ( e ie 0 (t 0 ( t)) Ω 0 ) 1 e ih(t 0 ( t)) e ih 0( t t 0 ) 0 t (1 iɛ) (4.87) = lim ( e ie 0 (t 0 ( t)) Ω 0 ) 1 U(t0, t) 0. t (1 iɛ) Here we have used in the second line that H 0 0 = 0 and employed in the third line the relation U(t, t ) = exp [ih 0 (t t 0 )] exp [ ih(t t )] exp [ ih 0 (t t 0 )] which follows from (4.14). We see that (ignoring the prefactor) we can get the ket Ω from 0 by simply evolving from t to t 0 with the time-evolution operator U. Similarly, we find for the bra Ω the expression Ω = lim 0 U(t, t 0 ) ( e ie 0(t t 0 ) 0 Ω ) 1. (4.88) t (1 iɛ) Correlation Functions There are many questions we want to ask in QFT that are not directly related to scattering experiments. E.g., we might want to compute the viscosity of the quark gluon plasma, or 28 Since the BCs of the Feynman propagator D F (x y) are such that the integration contour that is slightly rotated away from the Re p 0 -axis the contribution of the imaginary piece of t does not alter the final result. 79

82 understand the response of a condensed matter system to an experimental probe, or figure out the non-gaussianity of density perturbations arising in the cosmic microwave background from novel models of inflation. All of these questions are answered in the framework of QFT by computing elementary objects known as correlation functions. In the following we will define correlation functions, explain how to compute them using Feynman diagrams, and then relate them back to scattering amplitudes. In order to keep the following discussion as simple as possible, we will work in the real Klein-Gordon theory. We start by defining the n-point correlation (or Green s) function G (n) (x 1,..., x n ) = Ω T (φ H (x 1 )... φ H (x n )) Ω, (4.89) where φ H denotes the φ field in the Heisenberg picture of the full theory, rather than the interaction picture that we have been dealing with so far. The first question that one can ask, is how to compute G (n) in terms of matrix elements evaluated on 0, the vacuum of the free theory. Let me first state the result and then prove it. The result reads { [ t ]} 0 T φ I (x 1 )... φ I (x n ) exp i dt H I (t ) 0 G (n) t (x 1,..., x n ) = lim { [ t ]}. (4.90) t (1 iɛ) 0 T exp i dt H I (t ) 0 Notice that both the numerator and denominator appearing on the right-hand side of the latter equation can be calculated using the methods developed for S-matrix elements, namely Feynman diagrams (or alternatively Dyson s formula and Wick s theorem) after expanding the exponentials into a Taylor series. After stating the result (4.90), we still have to prove it. With out loss of generality we assume that x 0 1 >... > x 0 n > t 0. If this is not the case we simply relabel the points in an appropriate way. Such a relabeling leaves both sides of (4.90) unchanged. We then have G (n) (x 1,..., x n ) = Ω φ H (x 1 )... φ H (x n ) Ω = lim ( e ie 0 (t t 0 ) 0 Ω ) 1 0 U(t, t0 ) t (1 iɛ) [ U(x 0 1, t 0 ) ] φi (x 1 )U(x 0 1, t 0 ) [ U(x 0 2, t 0 ) ] φi (x 2 )U(x 0 2, t 0 )... t [ U(x 0 n, t 0 ) ] φi (x n )U(x 0 n, t 0 )U(t 0, t) 0 ( e ie 0(t 0 ( t)) Ω 0 ) 1 = lim ( e ie 0 (2t) 0 Ω 2) 1 t (1 iɛ) (4.91) 0 U(t, x 0 1)φ I (x 1 )U(x 0 1, x 0 2)... U(x 0 n 1, x 0 n)φ I (x n )U(x 0 n, t) 0 0 U(t, x 0 = lim 1)φ I (x 1 )U(x 0 1, x 0 2)... U(x 0 n 1, x 0 n)φ I (x n )U(x 0 n, t) 0. t (1 iɛ) 0 U(t, t) 0 Here we have first used (4.87) and (4.88) and rewritten all Heisenberg fields φ H in terms of interacting fields, φ H (x) = [ U(x 0, t 0 ) ] φi (x)u(x 0, t 0 ). (4.92) 80

83 Remember that U satisfies U(t 1, t 2 )U(t 2, t 3 ) = U(t 1, t 3 ) and U(t 1, t 3 ) [ U(t 2, t 3 ) ] = U(t1, t 2 ). In order to arrive at the last line, we have finally employed 1 = Ω Ω = ( e ie 0(2t) 0 Ω 2) 1 0 U(t, t) 0. (4.93) The proof of (4.90) is complete after noticing that all fields in (4.91) are in time order and that the product of U operators in the numerator reduces to U(t, t) = T exp [ i t t dt H I (t ) ]. Hence the last line in (4.91) is nothing but the right-hand side of (4.90). Exponentiation of Bubble Diagrams By means of (4.90) we can now (in principle) calculate any n-point correlation function. But what is the physical interpretation of this equation? We first express the denominator of (4.90) in terms of Feynman diagrams, lim t (1 iɛ) 0 U(t, t) 0 = (4.94) The disconnected Feynman diagrams appearing on the right-hand side of this relation are called vacuum bubbles. What is the value of the first non-trivial graph? Restoring the position label and the integration momenta, l 1 l 2 (4.95) it is readily seen that momentum conservation requires l 1 = l 2, so that the diagram evaluates to (2π) 4 δ (4) (0). This factor is also easily derived in position space, where one has d 4 z (const.) 2tV. (4.96) This result just tells us that the space-time process (4.95) can happen at any place in space, and at any time between t and t. Every disconnected diagram will have one such (2π) 4 δ (4) (0) = 2t V factor, where V denotes the volume of space. In fact, the contributions to G (n) from disconnected diagrams can be shown to exponentiate. To prove the linked-cluster theorem, we first label the various possible disconnected pieces: V i,,,,.... (4.97) Now we assume that a given Feynman diagram has n i pieces of the form V i for each i, in addition to its one piece that is connected. If we also denote the value of V i by v i, the value of a single Feynman graph is ( ) (v i ) n i (value of connected piece), (4.98) (n i )! 81 i

84 where 1/((n i )!) is the symmetry factor associated with interchanging the n i copies of the piece V i. The value of the sum of all diagrams is then given by ( ) (v i ) n i (value of connected piece), (4.99) (n i )! all connected diagrams all {n i } where all {n i } means all ordered sets {n 1, n 2,...} of non-negative integers. The sum of the connected diagrams factors out of this expression, giving ( ) (value of connected piece) ( ) (v i ) n i. (4.100) (n i )! all connected diagrams all {n i } In fact, not only the connected pieces factorize, but also the disconnected ones. One has ( ) (v i ) n i = (v i ) n i = ( ) exp (v i ) = exp v i. (4.101) (n i i )! (n i i )! i i all {n i } all {n i } We see that the combinatoric factors (as well as the symmetry factors) associated with each diagram are such that the whole series of disconnected pieces sums to an exponential. Taken together (4.99) through (4.101) imply that the sum of all diagrams is equal to the sum of all connected diagrams multiplied with the exponential of the sum of all disconnected graphs. Applying our findings concerning the exponentiation of bubble diagrams to (4.94), we arrive at the following pictorial identity [ lim 0 T exp i t (1 iɛ) t t ] dt H I (t ) 0 = exp i i. (4.102) The exponentiation of disconnected diagrams is also relevant in the case of the numerator of the right-hand side of (4.90). Let us consider the two-point correlation function G (2) for simplicity. In this case the numerator takes the form lim 0 T t (1 iɛ) { φ I (x)φ I (y) exp [ i t t ]} dt H I (t ) 0 = x y x y x y exp (4.103) 82

85 Combining now (4.102) and (4.103), it follows that the exponentials involving the sum of disconnected diagrams cancel between the numerator and denominator in the formula for the correlation functions. In the case of the two-point function, the final form of (4.90) is thus G (2) (x, y) = (4.104) x y x y x y x y The generalization to higher correlation function is straightforward and reads ( ) G (n) sum of all connected graphs (x 1,..., x n ) = Ω T (φ H (x 1 )... φ H (x n )) Ω =. (4.105) with n external points The disconnected diagrams exponentiate, factor, and cancel as before. It is important to remember that by disconnected we mean disconnected from all external points. In higher correlations functions, diagrams can also be disconnected in another sense. Consider, e.g., the four-point function G (4) (x 1, x 2, x 3, x 4 ) = (4.106) In many of the displayed diagrams, external points are disconnected from each other. Such diagrams do neither exponentiate nor factor, they contribute to the amplitude just as do the fully connected diagrams in which any point can be reached from any other by traveling along the lines. Energy Density of Vacuum An immediate consequence of the linked-cluster theorem is that all vacuum bubbles cancel when calculating correlation functions. Does this mean that the disconnected diagrams have no physical meaning at all? The place to look for the answer to this question is (4.91) and (4.93). Taken together these two equations imply that lim 0 T t (1 iɛ) { φ I (x 1 )... φ I (x n ) exp [ i t t ]} dt H I (t ) 0 ( = Ω T (φ H (x 1 )... φ H (x n )) Ω lim e ie 0 (2t) 0 Ω 2) (4.107) 1. t (1 iɛ) 83

86 Looking only at the t-dependent parts on both sides, it follows that [ ] [ ] exp v i exp ie 0 (2t). (4.108) i The sum of all vacuum bubbles is therefore related to the difference in the ground-state zeropoint energies of the interacting and the free theory, the latter of which was defined to be zero. Because each bubble graph V i contains a single factor of (2π) 4 δ (4) (0) = 2tV, one explicitly finds that the energy density of the ground state of the (interacting) φ 4 theory reads E 0 = E 0 V = i [ (2π) 4 δ (0)] (4). (4.109) Notice that the IR divergence arising from the infinite extent of space-time volume ( which we have first met in Section 3.2 and then again in (4.96) ) has been removed in E 0, leaving behind an highly UV-divergent expression that reflects our ignorance about the physics governing the high-energy regime. One-Particle States in Interacting Theory We now have an extremely beautiful formula (4.105) for computing an extremely abstract quantity the n-point correlation function. Our next task is to relate these objects back to S-matrix elements (4.25) ( or equivalent T -matrix elements (4.49) ), which will allow us to compute quantities that can actually be measured, namely decay rates and cross sections. In order to achieve this goal, we still have to learn how to deal with diagrams involving loops on the external lines. Let us first try to understand the problem with such graphs, looking at a specific example. We consider the following Feynman diagram l p 1 p 3 q 1 q 2 = 1 2 d 4 p 3 i p 2 3 m 2 d 4 l i l 2 m 2 (4.110) p 2 ( iλ)(2π) 4 δ (4) (p 2 + p 3 q 1 q 2 ) ( iλ)(2π) 4 δ (4) (p 1 p 3 ), appearing in φ 4 theory. We can integrate over p 3 using the second delta function. It tells us to evaluate 1 1 p 2 3 m 2 = p3 =p 1 p 2 1 m = (4.111) We get an infinity, since p 1, being the momentum of an external particle, is on-shell, i.e., p 2 1 = m 2. This is not good! Clearly, diagrams like (4.111) should not contribute to the 84

87 S-matrix elements. In fact, this is physically reasonable, since the external leg corrections, , (4.112) represent the evolution of one-particle state of the free theory into the one-particle state of the interacting theory, in the same way that the vacuum-bubble diagrams (4.97) represent the evolution of 0 into Ω. Since these corrections have nothing to do with the scattering process itself, it is somehow clear that one should exclude them from the calculation of the S-matrix. For a generic Feynman diagram with external legs, we define amputation in the following way. Starting from the tip of each external leg, find the last point at which the diagram can be cut by removing a single propagator, such that this operation separates the leg from the rest of the diagram. Cut there. Let me give an non-trivial example of a diagram that appears at O(λ 10 ), if one wants to compute φφ φφ scattering in φ 4 theory. Here it is: = amputation (4.113) So far we have learnt about the problem with external-leg corrections ( the become infinite for on-shell external states as implied by (4.111) ) and gave a simple prescription of how to solve the issue, i.e., by simply removing these corrections by amputation. A practitioner or an experimental physicist might be happy at this point, but as theorists we want more. So let s have a closer look at the connection between G (n) and S From Correlation Functions to Scattering Matrix Elements Before we start, let me warn you that this subsection will be more abstract than the preceding ones. Its main theme will be the singularities of Feynman diagrams viewed as analytic functions of their external momenta. Yet, we will see rather soon that this apparently esoteric subject is full of physical implications, and that it illuminates the relation between Feynman diagrams and the general principles of QFT. 85

88 Källén-Lehmann Spectral Representation We already know that in the free theory the matrix element 0 T φ(x)φ(y) 0 has a simple physical interpretation. It gives the amplitude for a particle to propagator from y to x. To what extent carries this over to the interacting theory? In order to answer this question, we will have a look at the two-point correlation function (4.104). Our analysis of G (2) will rely only on general principles of special relativity and QM, but will neither depend on the nature of the interactions nor on an expansion in perturbation theory. Yet, to simplify matters, we will restrict our consideration to the case of the real scalar field φ. Similar results can be obtained for correlation functions of fields with spin. We begin by studying the excited states of the interacting theory, with the corresponding energies being defined relative to the ground-state energy E 0. Let λ 0 be an excited eigenstate of the full Hamiltonian with vanishing total 3-momentum 0, i.e., P λ 0 = 0. That λ 0 can be an eigenstate of both H and P follows from the fact that [H, P ] = 0. Such a state can consist of an arbitrary number of particles or it can even be bound state. The simultaneous eigenvalues of H E 0 and P can be combined into a 4-vector p µ 0 = (m λ, 0), where m λ denotes the mass of the particular zero-momentum state. Being the generator of spacetime translations, P µ = (H E 0, P ) transforms as contravariant 4-vector under boosts, i.e., U 1 (Λ)P µ U(Λ) = Λ µ ν P ν where U(Λ) is the unitary operator that implements the Lorentz boost. This implies that by boosting λ 0 one can generate a new state λ p, which can have any 3-momenta p and is an eigenstate of H E 0 with energy E p (λ) = ( p 2 + m 2 λ )1/2. Or the other way round, any eigenstate with explicit 3-momentum can be boosted to a zeromomentum eigenstate. You are kindly asked to prove this statement explicitly. The sets of eigenvalues p µ = (E E 0, p) are thus organized into hyperboloids, as is shown in Figure 4.5. The lowest-lying isolated hyperboloid corresponds to the one-particle states of the interacting theory, whereas the other ones correspond to bound states that may or may not be present. Above a certain threshold value of m λ, a continuum of multiparticle states starts. From the above it follows that the states λ p form a complete set of states in the interacting theory, in the same way the states p do in the free theory. In turn, the completeness relation of the one-particle states in the free theory (3.66) is replaced by 1 = Ω Ω + λ d 3 p 1 (2π) 3 2E p (λ) λ p λ p, (4.114) where the first term corresponds to the ground state and the second one to all excited states. We now insert (4.114) into the two-point function G (2) (x, y) = Ω T φ(x)φ(y) Ω. 29 In the case x 0 > y 0, we obtain Ω φ(x)φ(y) Ω = Ω φ(x) Ω Ω φ(y) Ω + d 3 p 1 (2π) 3 2E p (λ) Ω φ(x) λ p λ p φ(y) Ω. λ (4.115) 29 For the sake of brevity, the labels H indicating Heisenberg fields will be dropped hereafter, whenever we discuss the properties of correlation functions. 86

89 H multiparticle continuum bound state one particle in motion m one particle at rest P Figure 4.5: The eigenvalues of P µ = (H, P ) are hyperboloids in the P H plane. For a typical theory the states consist of one or more particles of mass m. In consequence, there is a hyperboloid of one-particle states and a continuum of hyperboloids of two-, three-particle states, and so on. There may also be one or more bound state hyperboloids below the threshold for creation of two free particles. In the absence of preferred directions in the universe, the vacuum Ω should be invariant under space-time translations and Lorentz transformations, i.e., e ip x Ω = Ω and U(Λ) Ω = Ω. As part of an exercise you will show that this implies that Ω φ(x) Ω = Ω φ(0) Ω = v, (4.116) where v denotes the VEV of the field φ(x), usually taken to be zero. If v 0 than one should reformulate the theory using the shifted field φ(x) = φ(x) v, which by definition has vanishing VEV. By an appropriate choice of the dofs of the interacting theory one hence can always get rid of the first term in (4.115). The matrix elements entering the second term can be manipulated as follows Ω φ(x) λ p = Ω e ip x φ(0)e ip x λ p = e ipx Ω φ(0) λ p p 0 =E p(λ) = e ipx Ω U 1 (Λ)U(Λ)φ(0)U 1 (Λ)U(Λ) λ p p 0 =E p(λ) (4.117) = e ipx Ω φ(0) λ 0 p 0 =E p(λ), where U(Λ) implements a boost from p to 0. In order to arrive at the final expression, we have made use of the fact that Ω and φ(0) are Lorentz invariant For a field with spin we would need to keep track of its non-trivial transformation properties under the Lorentz group. 87

90 ρ(s) one-particle states bound states multiparticle continuum m 2 (2m) 2 s Figure 4.6: The spectra density ρ(s) for a typical interacting theory. The one-particle states contribute a delta function at m 2, i.e., the square of the physical mass of the particle. Multiparticle state form a continuous spectrum starting at (2m) 2. There may also be bound states below the two-particle threshold. Leaving out the VEV and using (4.117), the two-point correlation function (4.115) then takes the form (x 0 > y 0 ) Ω φ(x)φ(y) Ω = λ = λ Ω φ(0) λ 0 2 Ω φ(0) λ 0 2 d 3 p (2π) 3 d 4 p (2π) 4 e ip(x y) 2E p (λ) p 0 =E p(λ) i p 2 m 2 λ + iɛ e ip(x y), (4.118) where to arrive at the final result we have introduced an integration over p 0 employing (3.120). The integral in the last line of (4.118) is the Feynman propagator D F (x y; m 2 λ ) belonging to a φ-particle with mass m λ. We see that the particle interpretation has in fact changed in the interacting theory from free particles to dressed particles (quasi-particles), so the particles we are dealing with here are not the particles that we know from the free theory. An expression analog to the one in (4.118) holds in the case x 0 < y 0. Combining both cases one arrives at the Källén-Lehmann spectral representation of the two-point correlation function G (2) ds (x, y) = 2π ρ(s) D F (x y; s), (4.119) 0 where ρ(s) depends on the squared invariant mass s. This spectral density function is positive definite and given by ρ(s) = 2π δ(s m 2 λ) Ω φ(0) λ 0 2. (4.120) λ 88

91 Im (p 2 ) one-particle pole bound-state poles m 2 (2m) 2 multiparticle brunch cut Re (p 2 ) Figure 4.7: Analytic structure in the complex p 2 -plane of the Fourier transform of the two-point correlation function for a typical interacting theory. The one-particle states lead to an isolated pole at p 2 = m 2. States of two or more free particles give a brunch cut, while possible bound states show up as additional poles below (2m) 2. The spectral density for a typical theory is plotted in Figure 4.6. We see that the states in the interacting theory that describe one-particle states correspond to an isolated delta function in the spectral density, The factor ρ(s) = 2π δ(s m 2 ) Z + ( nothing else until s (2m) 2). (4.121) Z = Ω φ(0) λ 0 2, (4.122) is called the field-strength renormalization. It is the probability for φ(0) to create a one-particle state out of the vacuum Ω and m denotes the physical mass of the associated particle, being the energy eigenvalue in its rest frame. Notice that this physical mass is in general not equal to the bare mass parameter occurring in the Lagrangian of the φ 4 theory (4.6). To make the distinction between physical and bare quantities manifest, we will hereafter indicate bare quantities by a subscript 0. It is important to realize that only the physical mass m is directly observable, while the bare mass m 0 is not. In momentum-space the spectral decomposition (4.119) reads G (2) (p 2 ) = = d 4 x e ipx G (2) (x, 0) = iz p 2 m 2 + iɛ + (2m) 2 0 ds 2π ρ(s) i p 2 s + iɛ ds 2π ρ(s) i p 2 s + iɛ. (4.123) The analytic structure of this function in the complex p 2 -plane is depicted in Figure 4.7. The first term gives an isolated simple pole at p 2 = m 2, while the second term contributes a branch cut beginning at p 2 = (2m) 2. If there are any two-particle bound states these will appear as additional delta functions in (4.123) and thus as additional poles below the cut. Let us compare the results we have obtained in this subsection to those found in Section 3.7 for the free theory. The Fourier transform of the Feynman propagator (i.e., the two-point 89

92 correlation function in the theory of a free scalar field) reads (x 0 > 0) D F (p 2 ) = d 4 x e ipx D F (x) = d 4 x e ipx i 0 T φ(x)φ(0) 0 = p 2 m iɛ, (4.124) and is the amplitude for a particle to propagate from 0 to x. The relation (4.123) implies that the two-point correlation function of the most general theory of an interacting real scalar field φ takes a very similar form. The general expression is essentially a sum of scalar propagation amplitudes for states generated from the vacuum by the field φ(0). There are however two important differences between (4.123) and (4.124). First, (4.123) contains the field renormalization factor Z, which is one in the case of the free fields. The latter statement is easily shown explicitly by evaluating the matrix elements 0 φ(0) p and thus left as an exercise. Second, (4.123) contains contributions from multiparticle intermediate states with a continuous mass spectrum. In the free field theory, φ(0) can create only a single particle from 0. Notice that the generation of multiparticle states is the reason why the factor Z in general differs from unity in the interacting theory. Lehmann-Symanzik-Zimmermann Reduction Formula So far we have seen that the Fourier transform of the two-point correlation function (4.123) considered as an analytic function of p 2 has a simple pole at the square of the physical mass of the one-particle states, while multiparticle intermediate states give weaker branch cut singularities. In the following we will find that this rather formal observation generalizes to higher-point correlation functions and plays a crucial role in the derivation of a general relation between Green s functions and S-matrix elements. This relation has first been derived by Harry Lehmann, Kurt Symanzik, and Wolfhart Zimmermann [3] and is today known as the LSZ reduction formula. Combining the LSZ reduction formula with our Feynman rules for computing correlation functions (4.105) will then give us a master formula for S-matrix elements in terms of Feynman diagrams. For simplicity, we will again carry out the whole analysis for the case of a real scalar field. In the following we would like use the single-particle pole structure of G (2) (p 2 ) in the vicinity of p 2 m 2 to obtain the asymptotic in and out states of the theory and in particular their matrix elements, out q 1,..., q n p A, p B in = q 1,..., q n S p A, p B. (4.125) These matrix elements are plane-wave amplitudes that describe the scattering of a initial two-particle momentum state p A, p B in, constructed in the far past (t = t ), into a n-particle momentum state q 1,..., q n out, which represents the final-state particles in the far future (t = t + ). 31 The basic idea to derive the desired master formula is as follows. In order to calculate the S-matrix element for a 2 n scattering process, we start with the correlation function 31 Because human built detectors are in general not able to resolve positions down to the de Broglie wavelengths of the particles, it is correct to work with plane-wave states in the Heisenberg picture rather than wave packets to describe the collision. 90

93 involving (n + 2) Heisenberg fields. If we Fourier-transform this function with respect to the coordinate of any one of these fields, we will find a pole of the form (4.123) in the corresponding Fourier-transformed variable. We will argue that the one-particle states associated with these poles are in fact asymptotic states, i.e., states given by the limit of well-separated wave packets as they become concentrated around definite momenta. Taking the limit in which all (n + 2) external particles go on-shell, we can then interpret the coefficient of the multiple pole as an S-matrix element. We first study the Fourier-transform of the (n + 2)-point correlation function with respect to one argument x, d 4 x e ipx Ω T (φ x φ 1... φ n+1 ) Ω. (4.126) Here the shorthands φ x = φ(x), φ 1 = φ(y 1 ), etc. have been used and all φ s are Heisenberg fields. We would now like to identify poles in the variable p 0. To do this, we divide the integral over x 0 into three regions, t t+ dx 0 = dx 0 + dx 0 + dx 0, (4.127) t t + where t < min {yi 0 } and t + > max {yi 0 } with i = 1,..., n + 1. In the region x 0 [t, t + ] the result of the integral is an analytic function of p 0 without poles, since the region is bounded and the integrand depends on p 0 through the analytic function exp(ip 0 x 0 ). In the other two regions the integrand still has no poles, but the integration intervals are unbounded. Therefore singularities in p 0 may develop upon integration. Consider the third region, i.e., x 0 [t +, [. In this case x 0 is the latest time, so φ x stands first in the time-ordered product. In order to determine the pole structure of (4.126), we insert the completeness relation (4.114), assuming that the field φ has a vanishing VEV. 32 The integral over the third region then becomes dx 0 d 3 x e i(p0 x 0 p x) t + λ d 3 k 1 (2π) 3 2E k (λ) Ω φ(x) λ k λ k T (φ 1... φ n+1 ) Ω. (4.128) Using (4.117) and including a damping factor exp ( ɛx 0 ) with infinitesimal ɛ to ensure that the integral is well-defined, 33 the above integral takes the form d dx 0 3 k 1 i(p0 k0 +iɛ)x0 e Ω φ(0) λ λ t + (2π) 3 0 (2π) 3 δ (3) (p k) 2E k (λ) λ k T (φ 1... φ n+1 ) Ω (4.129) k 0 =E k (λ) = λ 1 ie i(p0 E p(λ)+iɛ) t + 2E p (λ) p 0 E p (λ) + iɛ Ω φ(0) λ 0 λ p T (φ 1... φ n+1 ) Ω. 32 If this is not the case we reformulate the theory in terms of the φ field. 33 This regularization is equivalent to the iɛ prescription used in (3.129) and the tilted time-axis prescription introduced in (4.86). 91

94 Here we have used d 3 x exp( i(p k)x) = (2π) 3 δ (3) (p k). The expression (4.129) has the same residue at p 0 = E p (λ) iɛ as the term i/(p 2 m 2 λ + iɛ) = i/( (p 0 ) 2 (E p (λ)) 2 + iɛ ) appearing the two-point correlation function (4.118). Like before this singularity will be either a single pole or a brunch cut, depending on whether the rest energy m λ is isolated or not. The one-particle state in the far future corresponds to an isolated pole at the on-shell energy p 0 = E p. In this case, (4.129) gives d 4 x e ipx Ω T (φ x φ 1... φ n+1 ) Ω p0 E p i Z p 2 m 2 + iɛ out p T (φ 1... φ n+1 ) Ω. (4.130) In order to obtain this result we have identified the matrix element Ω φ(0) λ 0 appearing in (4.129) with Z 1/2 using (4.122), absorbing the left over phase into the definition of λ 0. We have furthermore used the notation p out = λ p one particle for a one-particle eigenstate with momentum p that is created at asymptotically large times in the future. In order to evaluate the contribution from the first region, i.e., x 0 ], t ], one puts φ x last in the time-ordered product. Performing steps similar to the ones for the first integration interval (the actual calculation is left as an exercise), one find that the one-particle state in the far past corresponds to an isolated pole at the on-shell energy p 0 = E p, d 4 x e ipx Ω T (φ x φ 1... φ n+1 ) Ω p0 E p i Z p 2 m 2 + iɛ Ω T (φ 1... φ n+1 ) p in, (4.131) where p in = λ p one particle denotes the one-particle eigenstate with momentum p which is constructed at asymptotically large times in the past. We now want to repeat the same exercise for the remaining field coordinates y 1, etc. In the asymptotic treatment of multiparticle states it is, however, better to use normalized wave packets. In that case x is constrained to lie within a small band about the trajectory of a particle with momentum p, with the spatial extent of the band being determined by the wave packet. In this way the particles do not interfere and can effectively be considered free at asymptotic times, unlike plane-wave states. Instead of a simple Fourier transform d 4 x exp (ipx), we should hence have used d 3 q (2π) 3 d 4 x e ip0 x 0 e iq x ψ(q), (4.132) in (4.126), where ψ(q) is a function that is peaked around p, and at the end taken the limit of a sharply peaked wave packet ψ(q) (2π) 3 δ (3) (q p). With this modification the right-hand side in (4.129) would turn into λ d 3 q (2π) ψ(q) 1 3 2E q (λ) p 0 E p d 3 q (2π) 3 ψ(q) i p 0 E q (λ) + iɛ Ω φ(0) λ 0 λ q T (φ 1... φ n+1 ) Ω i Z p 2 m 2 + iɛ out q T (φ 1... φ n+1 ) Ω, (4.133) where p = (p 0, q). We see that the one-particle singularity is now a branch cut, whose length is the width in momentum space of the wave packet ψ(q). It follows that if the width of the 92

95 ψ(q) is taken to zero, the brunch cut sharpens up to a pole. In this limit (4.133) reduces to the simple form (4.130). The same line of reasoning applies to the pole structure that appears in the far past. In this case one recovers (4.131). The procedure described above can be generalized to the (n + 2)-particle case we are interested in by integrating each of the coordinates against a wave packet. Let me spare you the gory details of the actual calculation and only tell you about the final result. It turns out that by smearing each coordinate one can extract the leading singularities that turn out to be products of poles in the separate energy variables. The physics behind this factorization is that an (n + 2)-particle asymptotic state is created/annihilated by (n + 2) field operators that are constrained to lie in distant wave packets and therefore are effectively localized. Under these conditions an (n + 2)-particle excitation in the continuum can be represented by (n + 2) distinct (i.e., independent) one-particle excitations of the ground state. At the end one arrives at ( ) ( n ) G (n+2) (p A, p B, q 1,..., q n ) = d 4 x i e ip ix i d 4 y i e iq jy j Ω T (φ A φ B φ 1... φ n ) Ω p 0 i Ep i qj 0 Eq j = ( i=a,b ( i=a,b i=a,b j=1 i ) ( Z n i ) Z p 2 i m2 + iɛ q 2 j=1 j out q 1,..., q n p A, p B in m2 + iɛ i ) ( Z n i ) Z p 2 i m2 + iɛ q 2 j=1 j q 1,..., q n S p A, p B, (4.134) m2 + iɛ where the use of exp ( iq j y j ) ensures that the particles in the in state have positive energy. The latter relation is the famous LSZ reduction formula. It implies that the S-matrix element involving two particles in the in state and n particles in the out state can be obtained from the corresponding Fourier-transformed (n + 2)-point correlation function by extracting the leading singularities in the energies p 0 i and qj 0, which coincide with the situations where the external particles become on-shell. Diagrammatic Master Formula Our final goal is to reformulate the above procedure in the language of Feynman diagrams. For concreteness, we will first analyze the relation between the diagrammatic expansion of the scalar field four-point function and the S-matrix element and then generalize this result to the case of 2 n scattering. We will consider explicitly the fully connected Feynman diagrams contributing to the Fourier-transformed correlation functions. By a similar analysis, it is straightforward to show that disconnected diagrams should be disregarded, because they do not have the singularity structure with a product of four ( (n + 2) ) poles, appearing on the right-hand side of the LSZ reduction formula (4.134). The exact four-point correlation function is shown in Figure 4.8. In this figure we have indicated explicitly the diagrammatic corrections on each external leg. The light gray blob in 93

96 p B q 2 amp. p A q 1 Figure 4.8: Structure of the exact four-point correlation function in scalar field theory. the centre of the diagram represents the sum of all amputated four-point graphs, amp. = , (4.135) while the dark gray circles indicate the two-point Green s function aka the full propagator. The full propagator can be written as a Dyson series, = 1PI + 1PI 1PI +..., (4.136) where 1PI = iσ(p 2 ) = , (4.137) is the collection of all one-particle irreducible (1PI) self-energy diagrams. Diagrams are called 1PI if they cannot be split in two by removing a single line. The Dyson series (4.136) is in fact a geometrical series, which can be summed up according to = = i p 2 m iɛ + i p 2 m iɛ i p 2 m 2 0 Σ(p 2 ) + iɛ. ( iσ(p 2 ) ) i p 2 m iɛ +... (4.138) We see that the full propagator has a simple pole located at the physical mass m, which is shifted away from the bare mass m 0 by the self-energy: ( p 2 m 2 0 Σ(p 2 ) ) p 2=m 2 = 0, = m 2 = m Σ(m 2 ). (4.139) 94

97 Notice that our sign convention for the 1PI self-energy Σ(p 2 ) implies that a positive contribution to Σ(p 2 ) corresponds to a positive shift of the scalar particle mass. Close to its simple pole at p 2 m 2 the denominator of the full propagator (4.138) can be expanded in the following way p 2 m 2 0 Σ(p 2 ) = ( p 2 m 2) ( 1 Σ (m 2 ) ) + O ( (p 2 m 2 ) 2), (4.140) where Σ (m 2 ) stands for Σ(p 2 )/( p 2 ) p 2 =m 2. This implies that just like in the Källén-Lehmann spectral representation (4.123), the full propagator has a single-particle pole of the form p 0 E p iz + (regular terms), (4.141) p 2 m 2 + iɛ with 1 Z = 1 Σ (m 2 ). (4.142) As a result, the sum of all fully connected 2 2 diagrams contains a product of four poles iz p 2 A m2 + iɛ iz p 2 B m2 + iɛ iz q 2 1 m 2 + iɛ iz q 2 2 m 2 + iɛ, (4.143) multiplying the amputated four-point diagrams. This is exactly the singularity on the righthand side of the LSZ reduction formula (4.134). Comparing the coefficients of the product of poles, we conclude that the S-matrix element of the process φ(p A )φ(p B ) φ(q 1 )φ(q 2 ) can be expressed through p B q 1, q 2 S p A, p B = ( Z ) 4 amp., (4.144) q 2 where the light gray blob represents the sum of amputated four-point diagrams with all external momenta being on-shell. This is the sought diagrammatic master formula for the case of 2 2 scattering of scalar fields. An identical analysis can be applied to the Fourier-transformed (n + 2)-point correlator. In this case the relation between the S-matrix element and the Feynman graphs reads 34 p A q 1 p B q n ( ) n+2 q 1,..., q n S p A, p B = Z amp... (4.145) Notice that the renormalization factors Z 1/2 are irrelevant for calculations at the leading order of perturbation theory, but are important in the calculation of higher-order corrections. This completes the derivation of the connection between scattering matrix elements and fully connected amputated Feynman diagrams. 34 If the external particles are of different species, each has its own renormalization factor Z 1/2. Furthermore, if the particles have spin, there will be additional polarization factors on the right-hand side of the equation. 95 p A q 1

98 4.11 Decay Widths and Cross Sections As in usual QM, also in QFT the probabilities for things to happen are the (modulus) square of the quantum amplitudes. In this subsection we will compute these probabilities, known as decay widths and cross sections. One small subtlety here is that any T -matrix element (4.75) comes with a factor of (2π) 4 δ (4) (p f p i ), so that we end up with the square of a delta function. As we will see in a moment, this subtlety is a result of the fact that we are working in an infinite space. Fermi s Golden Rule In order to start the discussion, let me derive something familiar, namely Fermi s golden rule using Dyson s formula (4.18). For two energy eigenstates m and n with E m E n, one has in Born approximation n U(t) m = i n t 0 = i n H int m dt H I (t ) m t 0 dt e iωt = n H int m eiωt 1 ω, (4.146) where ω = E n E m and we have used in the first step the equality (4.11) to express H I in terms of H int. The probability P m n (t) for the transition from m to n to happen in the time t, is thus given by P m n (t) = n U(t) m cos (ωt) = 2 n H int m. (4.147) ω 2 The function (1 cos (ωt))/ω 2 is visualized in Figure 4.9. The ω-dependence indicates that most transitions occur in a region between energy eigenstates separated by E = 2π/t, i.e., the half-width of the function. Looking at the figure one furthermore observes that as t, the function shown in the plot approaches a delta function. In order to find the normalization, we evaluate dω 1 cos (ωt) ω 2 = πt. (4.148) This implies that 1 cos (ωt) t δ(ω). (4.149) πω 2 t Consider now a distribution of final states with density ρ(e n ). In this case one has to integrate over E n and obtains P m n (t) = de n ρ(e n ) n U(t) m cos (ωt) = de n ρ(e n ) 2 n H int m ω 2 (4.150) t 2πt n H int m 2 ρ(e m ). 96

99 t 2 2 2π t 2π t Figure 4.9: Graphical representation of (1 cos (ωt))/ω 2 appearing in P m n (t). It follows that the probability for the transition per unit time for states around the same energy E m E n = E takes the form P m n (t) = 2π n H int m 2 ρ(e), (4.151) This result is known as Fermi s golden rule. In the above derivation, we were rather careful with taking the limit t. Suppose we were a little bit sloppier, and first chose to compute the amplitude for the initial state m at t to evolve into the final state n at t. Then we would get i n dt H I (t ) m = i n H int m 2π δ(ω). (4.152) Now when squaring the amplitude, we find P m n (t) = n H int m 2 (2π) 2 [δ(ω)] 2. Tracking through the previous computation, we realize that the extra infinity arises because P m n (t) is the probability for the transition to happen in infinite time. We thus can write the delta functions as (2π) 2 [δ(ω)] 2 = 2π δ(ω)t, where t is a shorthand for t. The reason that we have stressed this point is because the T -matrix element in (4.75) has been computed in the same way as (4.152), which means that we have to reinterpret the square of the delta function arising from f T i 2 as a space-time volume factor. Decay Rates We would now like to calculate the probability for a single-particle initial state i of momentum p i and rest mass m to decay into the final state f consisting of n particles with total momentum p f = n j=1 q j. This quantity is given by the ratio P n = f S i 2 i i f f. (4.153) The states i and f obey the relativistic normalization formula (3.64), i i = (2π) 3 2E pi δ (3) (0) = 2E pi V, (4.154) 97

100 where we have replaced the delta function δ (3) (0) by the volume V of the space. Similarly, one has for the final state n f f = 2E qj V. (4.155) j=1 If the initial-state particle is at rest, i.e., E pi = m and p i = 0, we get using (4.75) for the i f decay probability P n = 1 2mV n j=1 1 2E qj V [ (2π) 4 δ (4) (p f p i ) ] 2 A(i f) 2 = 1 2mV (2π)4 δ (4) (p f p i ) A(i f) 2 V t n j=1 1 2E qj V. (4.156) Notice that in order to arrive at the second line we have replaced one of the delta functions (2π) 4 δ (4) (0) by the space-time volume V t. We can now divide out t to get the transition function per unit time. After integrating over all possible momenta of the final-state particles, i.e., V d 3 q j /(2π) 3, we then obtain in terms of the relativistically-invariant n-body phase-space element 35 dπ n = (2π) 4 δ (4) (p f p i ) n j=1 d 3 q j (2π) 3 1 2E qj, (4.157) the following expression for the partial decay width into the considered n-particle final state Γ n = 1 dπ n A(i f) 2. (4.158) 2m Notice that the factors of the spatial volume V in the measure V d 3 q j /(2π) 3 have cancelled those in (4.156), while the factors 1/(2E qj ) in (4.156) have conspired with the 3-momentum integrals in V d 3 q j /(2π) 3 to produce Lorentz-invariant measures (3.61). In consequence, the density of final states (4.157) is a Lorentz-invariant quantity. After summation over all possible n-particle final states, one finally finds the so-called total decay width Γ = 1 dπ n A(i f) 2, (4.159) 2m n with dπ n corresponding to a given final state. The total decay width is equal to the reciprocal of the half-life τ = 1/Γ of the decaying particle. If the decaying particle is not at rest, the decay rate becomes mγ/e pi. This leads to an increased half-life E pi τ/m = τ/ 1 v 2 = γτ, where v is the velocity of the decaying particle. Of course, this is a well-known effect related to time dilation. E.g. taking the muon lifetime at rest as the laboratory value of 2.22 µs, the lifetime of a cosmic ray produced muon traveling at 98% of the speed of light is about five times longer. 35 This object is in some textbooks denoted by dps n. 98

101 In terms of the partial and total decay width, (4.158) and (4.159), the branching ratio (or branching fraction) for the n-particle decay i f reads Needless to say that B(i f) [0, 1] and n B(i f) = 1. Cross Sections B(i f) = Γ n Γ. (4.160) Consider now a beam of particles of type B hitting a target at rest consisting of particles of type A. The case of two colliding particle beams like e + e (LEP), p p (Tevatron) or pp (LHC) can be obtained from this by an appropriate Lorentz boost. Let s start by assuming constant densities ρ A and ρ B in the target and the beam over their whole extents l A and l B. The number of scattering events will then be proportional to (ρ A l A ) (ρ B l B ) O, (4.161) where O denotes the cross-sectional overlap area common to both the beam and the target. The experimental set-up is illustrated in Figure The ratio σ = # scattering events (Oρ A l A ) (Oρ B l B ) /O = # scattering events N A N B /O, (4.162) defines the cross section σ as the effective area of a chunk taken out of the beam by each particle in the target. The quantities N A and N B are the numbers of A and B particles that are relevant for scattering, i.e., the particles that at some point in time belong to the overlap between target and beam. Notice that all of this can be equally well formulated in terms of time-related quantities like the scattering rate and the incoming particle flux. Simply replace the number (#) of scattering events by the number of scattering events per second and l B ρ B by the flux v B ρ B of beam particles. In reality ρ A and ρ B are not constant, since the colliding particles are described by wave packets and both target and beam have a density profile. However, the range of the interaction between the colliding particles is much smaller than the width of the individual wave packets perpendicular to the beam, which in turn is much smaller than the actual diameter of the beam. Therefore, to very good approximation ρ A and ρ B can be considered as locally constant on QM (i.e., interaction) length scales, whereas the density profiles inside the target and beam can be incorporated properly by averaging over the overlap region l A l B d 2 x ρ A (x ) ρ B (x ) = N A N B /O. (4.163) Here x is the spatial coordinate perpendicular to the beam. From this it follows that # scattering events = σ N A N B /O, (4.164) where σ can be calculated for effectively constant values of ρ A and ρ B corresponding to approximately plane-wave initial states. By the way, we do not have to restrict ourselves to the total 99

102 l A O beam ρ B v B l B ρ A target Figure 4.10: Incident beam of particles with density ρ B, extent l B, and velocity v B hitting a target of density ρ A and extent l A. The overlap area of the beam and target is denoted by O. number of scattering events. In a similar way we can study the cross section for scattering into the region d 3 q 1... d 3 q n around the n-particle final-state momentum point q 1,... q n. This is actually what detectors usually do, 36 since they detect particles with energy and momentum in certain finite bins, which are given by the detector resolution. These bins cannot resolve the momentum spread of any of the wave packets, so in the final state we should use plane waves as well. Calculating cross sections therefore amounts to computing transition probabilities in momentum space. These transition probabilities are universal in the sense that they are independent of details of the experiment, like the properties of the beams, the targets or the preparation of the initial-state particles. Consider an initial state consisting of one target and one beam particle in the momentum state i = p A, p B scattering into a final state f = q 1,..., q n. In analogy with the calculation that lead to (4.159), the corresponding differential transition probability per unit time and flux is given by dσ = 1 F dπ n 4E pa E pb V A(i f) 2, (4.165) which is usually referred to as the differential cross section. In the latter expression F stands for the flux associated with the incoming beam of particles. In the CM frame of the collision this flux reads F = v rel V = v A v B V = p A/E A p B /E B V = p CME CM E A E B V, (4.166) where E CM = E A +E B is the total CM energy and p CM is the momentum of either of the particles in the CM frame. To find this result we have used that the 4-momentum of a massive particle reads p µ 0 = (m, 0) in its rest frame, and becomes p µ = ( γ (E 0 + v p 0 ), γ (p 0 + E 0 v) ) = 36 Provided that the particle positions cannot be resolved at the level of the de Broglie wavelengths of the particles, which typically is the case in human-built detectors. 100

103 mγ (1, v) upon a boost with velocity v. In the CM frame we thus find the expression dπ n σ CM = 4 E A p B E B p A A(i f) 2, (4.167) for the total cross section. The flux factor 1/4 E A p B E B p A 1 is not Lorentz invariant, but invariant under boosts along the beam direction, as expected for a cross-sectional area perpendicular to the beam. Notice that the expression for dσ as given in (4.165) is also valid for identical particles in the final state. Finding a set of particles in the required momentum bin effectively identifies the particles. However, when integrating dσ to obtain the total cross section σ CM for the scattering into the n particles one has to restrict this integration to inequivalent configurations. E.g., the total cross section for the 2 2 reaction N N MM in the scalar Yukawa theory is obtained as σ CM = 1/2 dσ Problems i) Draw the Feynman diagrams that contribute to N N MM at O(g 2 ) in the Yukawa theory. Write down the corresponding amplitude and express your result through the Mandelstam variables. Can the amplitude develop a pole? Calculate the leading-order contribution (4.74) to MM MM scattering in the limit of small external momenta, i.e., p 2 1 M 2 etc. To do so, first relate the d-dimensional integral d d l 1 (2π) d l 2 M = i 2 (4π) Γ(1 d/2) (M 2 ) d/2 1, (4.168) d/2 to the Feynman integral appearing in (4.74), then set d = 4 2ɛ, and finally take the limit ɛ 0. Can you recover the qualitative behavior of your result in the effective theory obtained after integrating out the nucleon fields? Think in terms of higher-dimensional operators. ii) Calculate the Born-level potential for N N N N scattering in the Yukawa theory. Compare your result to (4.83). What does your finding tell you about the nature of scalar interactions? Compute the leading-order potential for φφ φφ scattering in φ 4 theory. What is the physical meaning of your result? iii) Consider the anharmonic oscillator with Hamiltonian H = 1 2 p ω2 x 2 + λ 3! x3. (4.169) The goal of this exercise is it to calculate matrix elements of the form Ω x n (0) Ω, (4.170) 101

104 where n = 1, 2,... and Ω denotes the ground state of the perturbed Hamiltonian (we write x n (0) rather than x n because the time associated with these operators is important) using Feynman diagrams and comparing the obtained results with those following from standard time-independent perturbation theory. There are two types of vertices in the possible Feynman diagrams, namely external and internal vertices. External vertices have a single line and correspond to the x(0) factors entering the matrix elements (4.170). They are labeled by the time the operators are evaluated, t = 0 in our case. Internal vertices have three lines, corresponding to the perturbation λ/(3!) x 3 in (4.169) and are labeled by a parameter t. Each internal vertex has a different parameter. The Feynman diagrams are constructed using the following Feynman rules: 0 = 1, (external vertex), t = ( iλ) 3! dt, (internal vertex), (4.171) s t = D(s, t) = 1 2ω e iω s t, (propagator). Here the limits of the t-integration are (1 iɛ). We need the iɛ because without it, the Feynman integrals would not converge. Yet, the final results of the integrals turn out to be independent of ɛ, so that in practice we could omit the iɛ from our notation. Draw the relevant Feynman diagrams that contribute at O(1) and O(λ) to the matrix elements (4.170) with n = 1, 2, 3. Determine the corresponding weight factors and calculate the graphs using (a > 0) dt e ia t = 2 ia. (4.172) Try to reproduce your results using standard time-independent perturbation theory. Proceed to compute the O(λ 2 ) correction to Ω x 2 (0) Ω. Remember to employ (4.105). The Feynman integrals appearing in this case are of the form (a, b, c > 0) ds dt e ia s e ib t e ic s t 2 = (a + b)(b + c) + 2 (a + b)(a + c) + 2 (a + c)(b + c). (4.173) This result can be easily derived from (4.172). If you want, verify that you get the same answer for the O(λ 2 ) correction to Ω x 2 (0) Ω using standard QM perturbation theory. iv) Consider a Lorentz transformation Λ that boosts p µ 0 = (m λ, 0) to Λ µ ν p ν 0 = (E p (λ), p). Show that λ p = U(Λ) λ 0 satisfies P µ λ p = Λ µ ν p ν 0 λ p where P µ = (H E 0, P ). The ground state Ω of any interacting theory has to be Poincaré invariant, since the vacuum ought not to have a preferred direction. Show that Ω φ(x) Ω = v for any φ(x) if Ω φ(0) Ω = v. 102

105 Compute the spectral density function ρ(s) and the field renormalization factor Z of the free scalar theory by explicitly calculating 0 φ(0) p. v) Evaluate the leading singularity of the Fourier-transformed (n + 2)-point correlation function (4.126) arising from the first integration region in (4.127). You should find the result given in (4.131). vi) In Section 4.10 we learnt that the two-point correlation function of the φ 4 theory viewed as an analytic function of the momentum p 2 has a branch-cut singularity associated with multiparticle intermediate states. This finding should not come as a surprise to those familiar with non-relativistic scattering theory, where the amplitudes considered as a function of energy have branch cuts on the positive real axis. The imaginary part of the scattering amplitude appears as a discontinuity across this branch cut. By the optical theorem the imaginary part of the forward-scattering amplitude is then proportional to the total cross section. In this exercise we will derive the QFT version of the optical theorem for a 2 2 scattering process. Derive a equation for the product T T involving the T -matrix starting from the unitarity of the S-matrix. What is the physical reason for the unitarity of the S-matrix? Calculate the matrix element of this relation between the two-particle initial and final states p 1, p 2 and q 1, q 2. In order to compute the matrix element of T T insert a complete set of states k i with i = 1,...,. Give a pictorial representation of the resulting identity. Set p 1 = q 1 and p 2 = q 2 and relate the matrix element of T T to a total cross section. Use this result to derive the standard form of the optical theorem, i.e., Im A(p 1, p 2 p 1, p 2 ) = 2E CM p CM σ(p 1, p 2 anything). (4.174) Here E CM is the total CM energy, p CM is the momentum of either of the particles in the CM frame, and σ(p 1, p 2 anything) is the total cross section for the production of all final states. vii) The generalized optical theorem 2Im A(a b) = f dπ f A(a f) A (b f). (4.175) is true not only for S-matrix elements, but for any amplitude that we can define in terms of Feynman diagrams. Here a and b denote asymptotic states, the sum f runs over all possible sets of final states, and dπ f is the corresponding phase-space element (4.157). In this exercise we will learn that the optical theorem can also be used to deal with unstable particles, which never appear in asymptotic states. Recall that the exact two-point function of a scalar field takes the form (4.138). Use the diagrammatic master formula (4.157) to derive a relation between the amplitude A(p p) describing 1 1 scattering and the quantity iσ(p 2 ) that is the sum of all 1PI insertions into the boson propagator (4.137). 103

106 The latter relation can be used to study the imaginary part of Σ(p 2 ). In order to do so, we change the definition of the physical mass of the φ-particle from (4.139) into m 2 = m Re Σ(m 2 ). (4.176) Assume now that the full propagator (4.138) appears in the s channel of a 2 2 Feynman diagram. Compute the cross section for the process in the vicinity of the resonance. Neglect all overall factors. Compare your result to the relativistic Breit-Wigner formula for the cross section in the region of a resonance, 2 σ 1 s m 2 + imγ. (4.177) Here m is the mass of the resonance and Γ is its width. Identify the width Γ with the imaginary part of Σ(p 2 ) assuming that the resonance is narrow, i.e., Γ m. What does this mean for the lifetime of the particle? Calculate Im Σ(m 2 ), and hence Γ, using the optical theorem (4.175). You should recover the result (4.159). viii) Derive the explicit form of the 2-body phase-space element dπ 2 from (4.157). Using your result calculate the angular differential cross section (dσ/dω) CM for a generic 2 2 process in the CM frame. The solid angle dω is given by dφ d cos θ where θ [ π, π] is the polar scattering angle (with respect to the beam axis) and φ [0, 2π[ the azimuthal scattering angle (around the beam axis). Consider also the case where the external particles all have the same mass. In this case you should obtain ( ) dσ = A(p 1, p 2 q 1, q 2 ) 2, (4.178) dω 64π 2 s CM where A(p 1, p 2 q 1, q 2 ) represents the relevant scattering matrix element. ix) The interactions of pions at low energy can be described by a phenomenological model called the linear sigma model, ( 1 H = d 3 x 2 Π2 i + 1 ) 2 ( Φ i) 2 + V (Φ 2 ). (4.179) Here Φ i with i = 1,..., N are real scalar fields and Π i denotes the conjugate momentum derived from Φ i. The potential is given by V (Φ 2 ) = 1 2 m2 (Φ i ) 2 + λ 4 (Φ2 i ) 2. (4.180) Note that for m 2 > 0 and λ = 0 the above Hamiltonian just consists out of N copies of the Klein-Gordon Hamiltonian. If one now assumes λ to be a small perturbation, one can calculate scattering amplitudes in a series expansion in λ. Show that the propagator of the Φ i fields is Φ i (x)φ j (y) = δ ij D F (x y), (4.181) 104

107 where D F (x y) is the standard Klein-Gordon propagator with mass m. Show furthermore that there is one type of vertex given by k i l j = 2iλ (δ ij δ kl + δ il δ jk + δ ik δ jl ). (4.182) A vertex involving two Φ 1 and two Φ 2 thus has the value 2iλ, while a vertex where four fields of the same type attach receives a factor 6iλ. Compute the cross sections for Φ 1 Φ 2 Φ 1 Φ 2, Φ 1 Φ 1 Φ 2 Φ 2, and Φ 1 Φ 1 Φ 1 Φ 1 scattering to first order in λ. Work in the CM frame. Now consider the case m 2 < 0. In this case, the potential has a local maximum rather than a minimum at Φ i = 0. Since the potential is symmetric under SO(N) rotations of Φ = (Φ 1,..., Φ N ), we can choose to write the fields close to the new minimum as Φ(x) = ( π 1 (x),..., π N 1 (x), v + σ(x) ) T, (4.183) where v is a constant chosen to minimize the potential (4.180), σ(x) is a small deviation, and π i (x) denote the remaining fields, called pions. Show, that with such a potential we have a theory of one massive sigma field and (N 1) massless pion fields, interacting through cubic and quartic potential terms. Assign Feynman rules to the propagators σ(x)σ(y) =, π i (x)π j (y) = i j, (4.184) and the vertices k i l j. (4.185) i j i j 105

108 References [1] J. Schwinger, Renormalization Theory of Quantum Electrodynamics: An Individual View, in The Birth of Particle Physics, Cambridge University Press (1983), 329 p. [2] H. Yukawa, Proc. Phys. Math. Soc. Jap. 17, 48 (1935). [3] H. Lehmann, K. Symanzik and W. Zimmermann, Nuovo Cim. 1, 205 (1955). 106

109 5 Dirac Theory We have seen that quantization of scalar fields gives rise to spin-zero particles. But most particles in nature have an intrinsic angular momentum or spin. These arise naturally in field theory by considering fields which themselves transform non-trivially under the Lorentz group. In this section we will describe the Dirac equation, whose quantization gives rise to fermionic spin-1/2 particles. In order to motivate the Dirac equation, we will start by studying the appropriate representation of the Lorentz group. We already know from Section 2.4 that if one considers infinitesimal Lorentz transformations (2.61), the matrices ω µν (2.63) entering the transformations have to be antisymmetric. Such an object has six independent parameters which agrees with the number of transformations of the Lorentz group, i.e., three rotations and three boosts. In the following it will turn out to be useful to introduce a basis of this six 4 4 antisymmetric matrices. We call our matrices (M αβ ) µν with α, β, µ, ν = 0, 1, 2, 3 and write the basis of six matrices as (M αβ ) µν = η αµ η βν η βµ η αν, (5.1) where the indices α and β denote which basis element we are dealing with, while µ and ν belong to the 4 4 matrices. Notice that (M αβ ) µν is antisymmetric in both α, β and µ, ν. If we use these matrices in practical applications (e.g., if we want to multiply them together or act on some field) we will typically need to lower one index, (M αβ ) µ ν = ηαµ δ β ν η βµ δ α ν. (5.2) Since we lowered the index with the Minkowski metric, we pick up various minus signs which means that when written in this form, the matrices are no longer necessarily antisymmetric. E.g., one has (M 01 ) µ ν = , (M12 ) µ ν = (5.3) The matrix M 01, which is real and symmetric, generates boost in the x direction, while M 12 is real and antisymmetric and generates rotations in the x y plane. In terms of M αβ, we can now write any infinitesimal ω µ ν as ω µ ν = 1 2 Ω αβ (M αβ ) µ ν, (5.4) where the matrix Ω αβ contains six numbers and is antisymmetric in α and β. This matrix parametrizes the Lorentz transformation we are doing. The basis of the six matrices M αβ forms the generators of the Lorentz transformations. The generators obey the Lorentz Lie algebra relations, [M αβ, M γδ ] = η βγ M αδ η αγ M βδ + η αδ M βγ η βδ M αγ. (5.5) 107

110 Here the matrix indices have been suppressed. A finite Lorentz transformation Λ can be constructed from (5.4) by building the exponential ( ) 1 Λ = exp 2 Ω αβ M αβ. (5.6) Let me stress again what each of these objects are: the M αβ are six 4 4 basis elements of the Lorentz group, while the Ω αβ are six numbers telling us what kind of Lorentz transformation we are doing. 5.1 Spinor Representation We now want to find other matrices which satisfy the Lorentz algebra commutation relations (5.5). In the following, we will construct the spinor representation of the Lorentz group using a trick due to Dirac. We start by defining something which, at first sight, has nothing to do with the Lorentz group. It is the Clifford algebra (or Dirac algebra) {γ µ, γ ν } = 2η µν 1, (5.7) where {a, b} = ab+ba is the usual anticommutator, γ µ denotes a set of four matrices (the Dirac matrices), and 1 is the n n unit matrix with n being the dimensionality of the representation. The relation (5.7) implies that we have to look for matrices γ µ that satisfy γ µ γ ν = γ ν γ µ when µ ν, (γ 0 ) 2 = 1, (5.8) and (γ i ) 2 = 1, (5.9) for i = 1, 2, 3. It is not difficult to convince oneself that the simplest representation of the Clifford algebra (for four-dimensional Minkowski space) is in terms of 4 4 matrices. In fact, there are many 4 4 matrices γ µ that obey (5.7). 37 E.g., we may take the so-called Weyl or chiral representation, ( ) ( ) γ =, γ i 0 σ i =, (5.10) 1 0 σ i 0 where each element is a 2 2 matrix itself and σ i denotes the Pauli matrices ( ) ( ) ( ) σ =, σ 2 0 i =, σ =. (5.11) 1 0 i The latter matrices themselves satisfy {σ i, σ j } = 2δ ij. Using these properties one easily shows that (5.10) indeed satisfies (5.7). This is left as a homework problem. 37 One can construct any other representation of the Clifford algebra from a specific one by taking Mγ µ M 1 for any invertible matrix M. However, up to this equivalence, it turns out that there is a unique irreducible representation of the Clifford algebra, and the matrices (5.8) provide an example. 108

111 So what is the connection between the Clifford algebra and the Lorentz group? In order to answer the question, we consider the commutator of two Dirac matrices γ µ, S µν = 1 4 [γµ, γ ν ]. (5.12) In our representation, the 0i and ij components of S µν are given explicitly by ( ) S 0i = 1 σ i 0, (5.13) 2 0 σ i and ( ) S ij = i σ k 0 2 ɛijk. (5.14) 0 σ k It is straightforward to show and thus part of an exercise, that these matrices (irrespectively of their representation) satisfy and [S µν, γ κ ] = γ µ η νκ γ ν η κµ. (5.15) [S µν, S κλ ] = η νκ S µλ η µκ S νλ + η µλ S νκ η νλ S µκ. (5.16) The latter equality tells us that the matrices S µν form a representation of the Clifford algebra (5.5). We now also understand the physical meaning of (5.13) and (5.14). The former object induces a Lorentz boost, while the latter generates a three-dimensional rotation. Dirac Spinors The S µν are 4 4 matrices, because the γ µ are. So far we haven t given an index to the rows and columns of these matrices. Let s call the indices α and β. We furthermore need a field that the (S µν ) α β act upon. The sought field has to have four complex components labelled α and we call it ψ α (x). This object is the famous Dirac spinor. Under Lorentz transformations, we have ψ α (x) S(Λ) α β ψβ (x ). (5.17) where x = Λ 1 x and the full Lorentz transformation S(Λ) takes the form ( ) 1 S(Λ) = exp 2 Ω γδ S γδ, (5.18) and the expression for Λ is given in (5.6). Although the basis of generators S γδ and M γδ is different we use the same six numbers Ω γδ in both S(Λ) and Λ. This ensures that we are doing the same Lorentz transformation on ψ and x. Both S(Λ) and Λ are 4 4 matrices. So how can we be sure that the spinor representation (5.18) is something new, and isn t equivalent to the familiar vector representation (2.60)? 109

112 In order to convince ourselves that the two representations are truly different, we look at rotations. If we write the rotation parameters as Ω ij = ɛ ijk ϕ k, then (5.6) and (5.18) become ( ) 0 0 ϕ 3 ϕ 2 Λ = exp 0 ϕ 3 0 ϕ 1, S(Λ) = e i/2 ϕ σ 0, (5.19) i/2 ϕ σ 0 e 0 ϕ 2 ϕ 1 0 where in order to arrive at the right-hand sides one has to remember that Ω 12 = Ω 21 = ϕ 3, etc. We now consider a rotation by 2π around the z-axis, which means to take ϕ = (0, 0, 2π). It follows that π 0 Λ = exp 0 2π 0 0 = 1, S(Λ) = ( e iπσ3 0 0 e iπσ3 This implies that under a 2π rotation a vector and spinor transforms as follows ) = 1. (5.20) A µ (x) A µ (x), ψ α (x) ψ α (x). (5.21) The latter relation tells us that spinors have the unintuitive property that a 2π rotation does not return them to their initial state, but a 4π rotation does. So S(Λ) definitely differs from the vector representation Λ. For later convenience let me also give explicitly the analogs of (5.19) for Lorentz boosts. Writing the boost parameter as Ω i0 = Ω 0i = χ i, one finds 0 χ 1 χ 2 χ 3 χ Λ = exp χ , S(Λ) = χ ( e i/2 χ σ 0 i/2 χ σ 0 e ). (5.22) Another important question to ask is whether or not S(Λ) is a unitary representation of the Lorentz group. 38 From (5.18), we infer that S(Λ) is unitary if S µν is anti-hermitian, i.e., (S µν ) = S µν. But we have (S µν ) = 1 4 [(γµ ), (γ ν ) ], (5.23) which can be anti-hermitian if all γ µ are hermitian or all are anti-hermitian. However, we can never arrange for this to happen since (5.8) and (5.9) imply that S µν has both real and imaginary eigenvalues, and a anti-hermitian matrix ought only to have imaginary ones. E.g., in the Weyl representation (5.10), we have the property (γ 0 ) = γ 0, (γ i ) = γ i. (5.24) 38 Notice that using the Weyl representation the relations (5.13) and (5.14) already tell us explicitly that rotations are unitary while boosts are not. This observation is also true for the vector representation. 110

113 In fact the Lorentz group being non-compact, has no finite-dimensional representations that are unitary. But this does not matter to us, since our spinor ψ is not a QM wavefunction, but a classical field. Dirac Action With the new field ψ at hand we now want to construct Lorentz-invariant EOMs involving it. In order to do this we try to write down a Lorentz-invariant action that is bi-linear in ψ. We consider the product ψ (x)ψ(x) = (ψ ) T (x)ψ(x), (5.25) where ψ (x) is the usual adjoint of a multi-component object. Under a Lorentz transformation Λ, one has ψ (x)ψ(x) ψ (x )S (Λ)S(Λ)ψ(x ), (5.26) which is not Lorentz invariant since S(Λ) is not unitary, i.e., S (Λ)S(Λ) 1. This means that ψ ψ is not a Lorentz scalar and thus not the right building block for constructing the action. Yet, it is easy to see what went wrong and to correct for it. From (5.24) we find that for µ = 0, 1, 2, 3, one has (γ µ ) = γ 0 γ µ γ 0, (5.27) which in turn implies that and This suggests that instead of ψ we should better use (S µν ) = γ 0 S µν γ 0, (5.28) S (Λ) = γ 0 S(Λ)γ 0. (5.29) ψ(x) = ψ (x)γ 0, (5.30) as a building block in our Dirac action. This object is called adjoint Dirac spinor. Equipped with ψ and ψ let us now see what kind of Lorentz covariant objects we can form out of them. We first consider ψψ. It is a simple exercise to show that this object transforms under a Lorentz transformation Λ as ψ(x)ψ(x) ψ(x )ψ(x ), (5.31) which tells us that it is a Lorentz scalar. A term d 4 x ψ(x)ψ(x) is thus Lorentz invariant since det(λ) = 1. Next we consider ψγ µ ψ. This term has the following transformation property ψ(x)γ µ ψ(x) Λ µ ν ψ(x )γ ν ψ(x ), (5.32) under Lorentz transformations. This claim is proven as part of an exercise. From (5.32) we infer that ψγ µ ψ is a Lorentz vector. This means that we can treat the µ index on the γ µ matrices as a true vector index. In particular, we can form Lorentz scalars by contracting it with other Lorentz indices. As a result terms like d 4 x ψ(x)a/(x)ψ(x) and d 4 x ψ(x) /ψ(x) are Lorentz invariant. Here we have introduced the shorthand notation a/ = γ µ a µ for any 111

114 contravariant vector a µ. Finally, we consider ψσ µν ψ with σ µν = i/2 [γ µ, γ ν ]. Not surprisingly this object behaves like a Lorentz tensor ψ(x)σ µν ψ(x) Λ µ κλ ν λ ψ(x )σ κλ ψ(x ). (5.33) This result is again easy to derive by considering separately the properties of ψ, ψ, and γ µ under Lorentz transformations. From (5.33) it follows that terms like d 4 x ψ(x)σ µν ψ(x)f µν (x), where all indices are contracted are Lorentz invariant. We are now equipped with the three bi-linears ψψ, ψγ µ ψ, and ψσ µν ψ, each of which transforms covariantly under the Lorentz group. We can try to build a Lorentz-invariant action from these. In fact, we need only the first two terms. We write S = d 4 x ψ(x) (i / m) ψ(x). (5.34) This is the Dirac action we were looking for. Since [S] = 0, [d 4 x] = 4, [ µ ] = 1, and [m] = 1, we can read off the mass dimension of the spinor field and its adjoint. We have [ψ] = [ ψ] = 3/2. The factor of i is there to make the action (5.34) real. Upon complex conjugation, it cancels a minus sign that comes from integration by parts. As we will see soon, after quantization the Dirac theory describes particles and antiparticles of mass m and spin 1/2. Notice that the Lagrangian is of first order, rather than the second-order Lagrangians we were working with for scalar fields. Also, the mass parameter appears in the Lagrangian as m, which can be positive or negative. Dirac Equation The EOMs for ψ and ψ follow from (5.34) by varying independently with respect to ψ and ψ, respectively. In the first case, we obtain 39 This is the Dirac equation. In the second case it follows that (i / m) ψ = 0. (5.35) ψ (i / + m) = 0, (5.36) which is the hermitian-conjugate form of (5.35). Here the derivative acts to the left. Both (5.35) and (5.36) are first order in derivatives, yet miraculously Lorentz invariant. As an homework assignment you are asked to show this explicitly. In contrast, in the case of a scalar field a first-order EOM would necessarily break Lorentz invariance, because one would always need to introduce a privileged vector that saturates the open index of µ. The γ µ matrices provide this index in the case of the Dirac equation. It is also important to realize that the Dirac equation mixes up different components of ψ through γ µ. However, each individual component itself solves the Klein-Gordon equation (2.46). In order to see this, we compute 0 = (iγ µ µ + m) (iγ ν ν m) ψ = ( γ µ γ ν µ ν + m 2) ψ = ( 1/2 {γ µ, γ ν } µ ν + m 2) ψ = ( µ µ + m 2) ψ. 39 Hereafter we will often drop the coordinate x in ψ(x) etc. (5.37) 112

115 The final expression contains no γ µ matrices, and so applies to each component ψ α of the spinor field separately. Chiral Spinors We have seen that in the chiral representation (5.10) both the spinor rotations (5.19) and boosts (5.20) are block diagonal. This means that the Dirac representation is reducible. It decomposes into two irreducible representations, acting only on two-component spinors ψ L,R, which in the chiral representation, are defined by ( ) ψ = ψ R ψ L. (5.38) The two-component objects ψ L,R are called chiral spinors and the labels L, R stand for leftand right-handed chirality. They transform in the same way under rotations, but oppositely under boosts: ψ L,R e i/2 ϕ σ ψ L,R, ψ L,R e 1/2 χ σ ψ L,R, (5.39) In group theory language ψ L is in the (1/2, 0) representation of the Lorentz group, ψ R is in the (0, 1/2) representation, and ψ belongs to (1/2, 0) (0, 1/2). 40 Strictly speaking, the Dirac spinor is a representation of the double cover of SO + (1, 3) = SL(2, C)/Z 2. Here SO + (1, 3) denotes the proper, orthochronous or restricted Lorentz group, which consists of those Lorentz transformations that preserve the orientation of space and direction of time, while SL(2, C) is the complex special linear group and Z 2 is the two element cyclic group. The fact that the Lorentz group is doubly connected is the source of the rotation-by-4π property (5.21) of spinors. The relations (5.38) and (5.39) correspond to the chiral representation. But what happens if we choose a different representation γ µ of the Clifford algebra, where the Lorentz group matrices S(Λ) are not block diagonal? Is there an invariant way to define chiral spinors? We can do this by defining the fifth Dirac matrix γ 5 = iγ 0 γ 1 γ 2 γ 3 = i 4! ɛµνκλ γ µ γ ν γ κ γ λ, (5.40) where ɛ µνκλ is the totally antisymmetric Levi-Civita tensor with ɛ 0123 = ɛ 0123 = 1. The γ 5 matrix has the following properties, all of which can be verified using (5.40) and the anticommutation relations (5.7): (γ 5 ) = γ 5, (γ 5 ) 2 = 1, {γ 5, γ µ } = 0. (5.41) The reason that this Dirac matrix is called γ 5 is that the set of matrices γ M = {γ µ, iγ 5 } satisfy the five-dimensional Clifford algebra, i.e., {γ M, γ N } = 2η MN where M, N = 0, 1, 2, 3, 4. It is also not difficult to check that [S µν, γ 5 ] = 0, (5.42) 40 Using this terminology, scalars belong to the (0, 0) representation, vectors are in the (1/2, 1/2) representation, and the electromagnetic field-strength tensor transforms as (1, 0) (0, 1) under the Lorentz group. 113

116 which means that γ 5 is a scalar 41 under rotations and boosts. The latter relation also tells us that eigenvectors of γ 5 whose eigenvalues are different transform without mixing, and as a result the Dirac representation must be reducible. This criterion for reducibility is Schur s lemma. It follows that P L,R = 1 γ5, (5.43) 2 form Lorentz-invariant projection operators. They satisfy (please show this) P 2 L,R = P L,R, P L,R P R,L = 0, P L + P R = 1. (5.44) One can also check easily that for the Weyl representation (5.10), one has explicitly ( ) γ =. (5.45) 0 1 We see that P L,R project onto left- and right-handed spinors, i.e., ψ L,R = P L,R ψ. Weyl Equations The Dirac Lagrangian can be written in terms of the chiral fields (5.38) as L = ψ (i / m) ψ = i ( ψl /ψ L + ψ R /ψ R ) m ( ψl ψ R + ψ R ψ L ), (5.46) where ψ L,R = ψp R,L. After a slight change of notation, σ µ = (1, σ), σ µ = (1, σ), (5.47) and multiplying with ψ = ψ L + ψ R from the left the corresponding EOMs read i ( ψ L σµ µ ψ L + ψ R σµ µ ψ R ) m ( ψ L ψ R + ψ R ψ L) = 0. (5.48) We see that a massive fermion requires both components ψ L and ψ R, since they are coupled via the mass term, which is chirality flipping. The kinetic term on the other hand is chirality conserving. This means that a massless fermion can be described by a single Weyl spinor ψ L or ψ R alone. The corresponding Euler-Lagrange equations go by the name of Weyl equations: iσ µ µ ψ L = 0, i σ µ µ ψ R = 0. (5.49) In many practical applications it is overwhelmingly convenient to employ two-component Weyl spinor notation, rather than the four-component Dirac spinors. This is due to the fact that the Lagrangian of the SM and essentially all of its extensions violate parity, i.e., the leftand right-handed fermionic components couple differently to the electroweak gauge group. If one uses four-component spinor notation, then there are a lot of clumsy left- and right-handed projection operators. This is not the case if one employs the two-component Weyl fermion notation, which treats fermionic dofs with different gauge quantum numbers separately from the start, as nature intended for us to do. Plenty of details on and many useful techniques to deal with Weyl fermions can be found in [1], which I highly recommend for further reading. 41 In fact, we will see soon that it is a pseudo-scalar and not a scalar. 114

117 Dofs Counting At this point a couple of comments about the dofs counting seem to be indicated. In classical mechanics, the number of dofs of a system is equal to the dimension of the configuration space or, equivalently, half the dimension of the phase space. In field theory we have an infinite number of dofs, but it makes sense to count the number of dofs per spatial point, which at least should be finite. E.g., in this sense a real scalar field φ has a single dofs. At the quantum level, this translates to the fact that it gives rise to a single type of particle. A classical complex scalar field, on the other hand, has two dofs, corresponding to the particle and its antiparticle in the QFT. But what about a Dirac spinor? One might think that there are eight dofs, since ψ has four complex components. But this is wrong! Crucially, and in contrast to the scalar field, the EOM of ψ is first order rather than second order. In particular, for the Dirac theory, the momentum conjugate to the spinor ψ is given by π ψ = L ψ = iψ, (5.50) which is not proportional to the time derivative of ψ. The phase space of a spinor is hence parameterized by ψ and ψ, while for a scalar it is parameterized by φ and π φ = φ. So the phase space of the Dirac spinor ψ has eight real dimensions and correspondingly the number of real dofs is four. We will learn soon that, in the QFT, this counting manifests itself as two dofs (i.e., spin up and down) for the particle, and another two for the antiparticle. A similar counting for the Weyl fermion tells us that it has two dofs. Majorana Fermions Our spinor ψ is a complex object. It has to be, since the representation S(Λ) is typically also complex. This means that if we were to try to make ψ real, e.g., by imposing ψ = ψ, then it would not stay real once we make a Lorentz transformation. However, there is a way to impose a reality condition on ψ. In order to motivate this possibility, its simplest to look at a novel basis for the Clifford algebra (5.9), known as the Majorana basis ( ) γ 0 0 σ 2 =, γ 1 = σ 2 0 ( iσ 3 0 ) ( ), γ 2 0 σ 2 =, γ 3 = 0 iσ 3 σ 2 0 ( iσ iσ 1 ). (5.51) What is special about these matrices is that they are all pure imaginary, i.e., (γ µ ) = γ µ. This implies that the generators (5.12), and hence the full Lorentz transformations (5.18) are real. In the specific basis (5.51), we can therefore work with a real spinor simply by imposing the condition, ψ = ψ, (5.52) which is preserved under Lorentz transformations. Such spinors are called Majorana spinors. Can this procedure be generalized to an arbitrary basis of Dirac matrices? We only ask that the basis satisfies (5.24). We then define the charge conjugate of a Dirac spinor ψ as ψ c = Cψ, (5.53) 115

118 where C is a 4 4 matrix obeying C C = 1, C γ µ C = (γ µ ). (5.54) The first relation tells us that charge conjugation can be described by an unitary operator. Let us first check that (5.53) is a sensible definition, meaning that ψ c transforms nicely under Lorentz transformations. One has ψ c (x) C S (Λ)ψ (x ) = S(Λ)Cψ (x ) = S(Λ)ψ c (x ). (5.55) Here we made use of the properties (5.54) to commute the matrix C past S (Λ) to the right. Comparing the latter result to (5.17), we see that ψ and ψ c transform in the same way under the Lorentz group. In fact, not only does ψ c transforms nicely under rotations and boosts, but it satisfies the Dirac equation, if ψ does. This follows from, (i / m)ψ = 0, = ( i / m)ψ = 0, = C ( i / m)ψ = (i / m)ψ c = 0, (5.56) where we have again employed (5.54). Finally, we can now impose the Lorentz-invariant reality condition on the Dirac spinor, to yield a Majorana spinor, ψ = ψ c. (5.57) After quantization, the Majorana spinor gives rise to a Majorana fermion that is its own antiparticle. This is exactly the same as in the case of scalar fields, where we have seen that a real scalar field gives rise to a spin-zero boson that is its own antiparticle. So how does the matrix C look like? This, of course, depends a lot on the basis. In the Majorana basis (5.51), where all the Dirac matrices are purely imaginary, one simply has C = 1, and in consequence the condition (5.57) turns into (5.52). In the chiral basis (5.10), on the other hand, only γ 2 is imaginary, and we may take 42 C = iγ 2. (5.58) It is also interesting to see how the Majorana condition (5.57) looks in terms of the decomposition into left- and right-handed Weyl spinors. Plugging in the various definition, we find ψ R = iσ 2 ψl and ψ L = iσ 2 ψr. In other words, a Majorana spinor can be written in terms of chiral spinors as ( ) ψ = ψ R iσ 2 ψ R. (5.59) Notice that it is not possible to impose the Majorana condition, ψ = ψ c, at the same time as the Weyl condition, ψ L = 0 or ψ R = 0. Instead the Majorana condition relates left- and right-handed spinors via (5.59). In an exercise you will learn more about Majorana fermions. So let s move on. 42 Be aware, in many texts an extra factor of γ 0 is absorbed into the definition of C. 116

119 5.2 Discrete Symmetries of Dirac Theory In addition to the continuous Lorentz transformations we have considered so far, there are two other space-time operations that are potential symmetries of any QFT, namely parity and time reversal. Parity, denoted by P, sends x = (t, x) P (t, x) = x P, (5.60) reversing the handedness of space. Times reversal, denoted by T, sends x = (t, x) T ( t, x) = x T, (5.61) interchanging the forward and backward light-cone. Since parity has an important role to play in the SM and, in particular, the theory of the electroweak interactions, let s first have a look at the action of P on spinors and bi-linears constructed from them. Parity In order to understand what happens to a spinor under parity, we consider how rotations and boosts act on Weyl spinors. In the chiral representation, the corresponding transformation properties have already been spelled out in (5.39). We also know that under parity rotations do not flip sign, while boosts do, since P acting on a particle should reverse its momentum, but not its spin. This tells us that parity exchanges right- and left-handed spinors, ψ L,R (x) P ψ R,L (x P ). (5.62) Using this knowledge and the fact that changing the parity twice is the identity, i.e., P 2 = 1, we see that the action of parity on ψ can be described in the Weyl basis by This 4 4 matrix satisfies P = γ 0. (5.63) P P = 1, P γ µ P = (γ µ ), (5.64) so also parity can be implemented by an unitary operator. Our spinor transforms under P as ψ(x) P P ψ(x P ). (5.65) Notice that if ψ(x) satisfies the Dirac equation (5.35), so does the parity-transformed spinor P ψ(x P ), since one has (iγ 0 t + iγ i i m)p ψ(t, x) = P (iγ 0 t iγ i i m)ψ(t, x) = 0. (5.66) Here the extra minus sign from passing P through γ i is compensated by the derivative acting on x instead of x. Let me now consider how the covariant interaction terms we have constructed before transform under P. We start with ψψ. Obviously, one has ψ(x)ψ(x) P ψ(x P )ψ(x P ), (5.67) 117

120 given that (γ 0 ) 2 = 1 and (γ 0 ) = γ 0. This is the transformation of a scalar. In the case of the ψγ µ ψ, we find instead ψ(x)γ µ ψ(x) P ( 1) µ ψ(xp )γ µ ψ(x P ), (5.68) where ( 1) µ = 1 for µ = 0 and ( 1) µ = 1 for µ = 1, 2, 3. Notice that the factor ( 1) µ arises from the combination of (5.24) and (5.27). The latter transformation property tells us that ψγ µ ψ transforms as a vector, with the spatial part changing sign. You can also check easily that ψσ µν ψ transforms as a tensor, namely ψ(x)σ µν ψ(x) P ( 1) µ ( 1) ν ψ(xp )σ µν ψ(x P ). (5.69) Using γ 5, we can form two more Lorentz-covariant objects, i.e., ψγ 5 ψ and ψγ 5 γ µ ψ. How do these transform under parity? In the first case, we obtain ψ(x)γ 5 ψ(x) P ψ(x P )γ 5 ψ(x P ), (5.70) where we have used the last relation in (5.41) and (γ 0 ) 2 = 1. In the second case, a straightforward calculation gives ψ(x)γ µ γ 5 ψ(x) P ( 1) µ ψ(xp )γ µ γ 5 ψ(x P ). (5.71) The minus signs in (5.70) and (5.81) earns the objects ψγ 5 ψ and ψγ µ γ 5 ψ the names pseudoscalar and pseudo-vector (or axial-vector). To summarize, we have the following spinor bilinears, ψψ : scalar, ψγ µ ψ : vector, ψσ µν ψ : tensor, ψγ 5 ψ : pseudo-scalar, (5.72) ψγ µ γ 5 ψ : pseudo-vector. The total number of bi-linears is ( (4 3)/ ) = 16 which is all we could hope for from a 4-component object. We are now equipped with new terms involving γ 5 that we can start to add to our Lagrangian to construct new theories. Typically such terms will break parity invariance of the theory, although this is not always true. E.g., the term φ ψγ 5 ψ does not break parity if φ is itself a pseudo-scalar. nature makes use of these parity-violating interactions by using γ 5 in the electroweak force. A theory which treats ψ L,R on an equal footing is called a vector-like theory. In contrast, a theory in which ψ L,R appear differently is called a chiral theory. 118

121 Time Reversal Another obvious question that we should address is how our building blocks in (5.66) transform under T. In order to answer this question, we first have to understand how the time-reversal symmetry is correctly implemented in a QM context. The implementation turns out to be more subtle than in the case of C and P, since the relevant operator is in the case of T not unitary but anti-unitary, i.e., T = T 1 with ψ T ψ = T ψ ψ, where ψ and ψ denote arbitrary multiparticle quantum states. A straightforward way to realize that the operator implementing T must be anti-unitary is to consider the behavior of the Schrödinger equation for a free particle under time reversal. In classical mechanics, a free particle has a time-reversal invariant motion, and it is reasonable that we would like to retain this property in QM as well. But the operator t is T -odd while is T -even. This is impossible to reconcile with the Schrödinger equation unless time reversal changes i i and ψ ψ. The operator thus has to be anti-unitary. Note that the anti-unitary of T implies that it does not have meaningful eigenvalues, contrary to what happens in the case of C and P. As there is no quantum number associated with time reversal, no conservation law exists when the action is invariant under time reversal. Just as for parity, we define time-reversal transformation in QFT by its action on states. We require that T should reverse the particle momentum and its spin. It is not too difficult to figure out, that the transformation of the Dirac spinor under time reversal involves in the chiral basis the matrix which satisfies The transformation itself takes the following form T = iγ 1 γ 3, (5.73) T T = 1, T γ µ T = ( 1) µ (γ µ ), (5.74) ψ(x) T T ψ (x T ). (5.75) The first thing to notice is that if ψ(x) obeys the Dirac equation, the same is true for T ψ (x T ). This follows, because (iγ 0 t + iγ i i m)t ψ (t, x) = T (i(γ 0 ) t i(γ i ) i m)ψ (t, x) = 0. (5.76) Notice that the minus sign between the (γ 0 ) and (γ i ) term is compensated by the derivative acting on t rather then t, and that the final result follows after complex conjugation which sends i i. We are now read to consider the transformation properties of the building blocks (5.72). I simply quote the results without proof, leaving the actual derivations to you as an useful exercise. For the scalar ψψ one finds that ψ(x)ψ(x) In the case of the vector ψγ µ ψ, one has instead ψ(x)γ µ ψ(x) T ψ(x T )ψ(x T ). (5.77) T ( 1) µ ψ(xt )γ µ ψ(x T ). (5.78) 119

122 This is exactly the transformation property we want for vectors, since it leaves ψ /ψ and ψa/ψ invariant under time reversal. Notice that the minus sign appearing for the space-components in (5.78) is cancelled by those appearing in the transformation of the derivative µ and the electromagnetic field A µ, respectively. One furthermore shows, that the tensor ψσ µν ψ behaves like ψ(x)σ µν T ψ(x) ( 1) µ ( 1) ν ψ(xt )σ µν ψ(x T ), (5.79) under time reversal. We finally want to know the transformation properties of the covariants involving γ 5. For the pseudo-scalar ψγ 5 ψ, one obtains ψ(x)γ 5 ψ(x) while the action of T on the pseudo-vector ψγ µ γ 5 ψ is given by Charge Conjugation ψ(x)γ µ γ 5 ψ(x) T ψ(x T )γ 5 ψ(x T ), (5.80) T ( 1) µ ψ(xt )γ µ γ 5 ψ(x T ). (5.81) The last of the three discrete symmetries is the particle-antiparticle symmetry C, which we meet already at the end of Section 5.1 when discussing the properties of Majorana fermions. In physical terms, charge conjugation is conventionally defined to take a fermion with a given spin orientation into an antifermion with the same spin orientation. As we have seen in (5.56), this transformation is a symmetry of the Dirac equation. Once again we want to know how C acts on fermion bi-linears. I again quote the relevant results without giving the details of their derivation. The scalar ψψ transforms under C as ψ(x)ψ(x) In the case of the vector ψγ µ ψ, one has instead ψ(x)γ µ ψ(x) Under C the tensor ψσ µν ψ behaves like ψ(x)σ µν ψ(x) For the pseudo-scalar ψγ 5 ψ, one arrives at ψ(x)γ 5 ψ(x) while the action of C on the pseudo-vector ψγ µ γ 5 ψ reads ψ(x)γ µ γ 5 ψ(x) C ψ(x)ψ(x). (5.82) C ψ(x)γ µ ψ(x). (5.83) C ψ(x)σ µν ψ(x). (5.84) C ψ(x)γ 5 ψ(x), (5.85) C ψ(x)γ µ γ 5 ψ(x). (5.86) 120

123 CP and CP T Symmetry We saw that the free Dirac equation (5.35) is invariant under P, T, and C separately. Yet, we can build more general QFTs that violate any of these discrete symmetries by adding to the Dirac Lagrangian appropriate perturbations. These additional terms must transform as a Lorentz scalar. The various fermionic bi-linears that can be used to construct such terms are shown in Table 1. The last line of this table tells us that all Lorentz-scalar combinations of ψ and ψ are invariant under the combined symmetry CP T. Actually, it is quite generally true that one cannot build a Lorentz-invariant QFT with a hermitian Hamiltonian that violates CP T. More precisely, one can prove the following three statements [2]: first, an interacting theory that violates CP T invariance necessarily violates Lorentz invariance, second, CP T invariance is not sufficient for out-of-cone Lorentz invariance, and third, theories that violate CP T by having different particle and antiparticle masses must be non-local. This implies that any study of CP T violation includes also Lorentz violation. Several experimental searches of such violations have been performed during the last few years. A detailed list of results of these experimental searches are summarized in [3]. So far no evidence for neither CP T nor Lorentz violation has been found. The consequences of the CP T invariance are far-reaching. The most celebrated ones are the equality of masses and total decay width (or lifetimes) for particles and antiparticles. Both statements are easy to prove. Try it! What about the other discrete symmetries in nature? Are they conserved? Although P is conserved in electromagnetism, strong interactions, and gravity, it turns out to be violated in electroweak interactions. The SM incorporates parity violation by expressing the electroweak interaction as a chiral gauge interaction. Only the left-handed components of particles and right-handed components of antiparticles participate in the electroweak interactions in the SM. This implies that P is not a symmetry of our universe, unless a hidden mirror sector exists in which parity is violated in the opposite way (a left-right symmetry ). It was suggested several times and in different contexts that parity might not be conserved, but in the absence of compelling evidence these suggestions were not taken seriously. A careful review by Tsung Dao Lee and Chen Ning Yang [4] showed that while P conservation had been verified in decays by the strong or electromagnetic interactions, it was untested in the electroweak interaction. They proposed several possible direct experimental tests. They were almost ignored, but Lee was able to convince his colleague Chien-Shiung Wu to look for P violation. In 1957, Wu s group conducted an ingenious experiment showing that in the case of the β-decay of Co 60, nature knows left from right [5]. The discovery of P violation immediately explained the outstanding τ θ puzzle related to the decay of charged kaons. So P is broken in nature, what about CP then? The first thing to notice in this respect is that a symmetry of a QM system can be restored if another symmetry can be found such that the combined symmetry remains unbroken. This rather subtle point about the structure of Hilbert space was realized shortly after the discovery of P violation, and it was proposed that charge conjugation was the desired symmetry to restore order. As a result, the CP symmetry was proposed in 1957 by Lev Landau as the true symmetry between matter and antimatter. In other words, a process in which all particles are exchanged with their antiparticles was assumed to be equivalent to the mirror image of the original process. The discovery of CP violation in 1964 in the decays of neutral kaons [6], which resulted in the Nobel Prize in Physics 121

124 Symmetry ψψ ψγ µ ψ ψσ µν ψ ψγ 5 ψ ψγ µ γ 5 ψ µ A µ F µν P +1 ( 1) µ ( 1) µ ( 1) ν 1 ( 1) µ ( 1) µ ( 1) µ ( 1) µ ( 1) ν T +1 ( 1) µ ( 1) µ ( 1) µ 1 ( 1) µ ( 1) µ ( 1) µ ( 1) µ ( 1) ν C CP +1 ( 1) µ ( 1) µ ( 1) ν 1 ( 1) µ ( 1) µ ( 1) µ ( 1) µ ( 1) ν CP T Table 1: Transformation properties of fermion bi-linears as well as µ, A µ, and F µν under the discrete P, T, and C symmetries and the combinations CP and CP T. in 1980 for its discoverers James Cronin and Val Fitch, shocked particle physics and opened the door to questions still at the core of particle physics and cosmology today. CP violation is incorporated in the SM by including a complex phase in the matrix describing quark mixing. In such a scheme a necessary condition for CP violation can then be shown to be the presence of at least three generations of quarks. This possibility was suggested by Makoto Kobayashi and Toshihide Maskawa in a seminal paper [7] in 1973, which earned them one half of the Nobel Prize in Physics in The past decade has seen tremendous progress in the study of CP violation. In particular, the so-called B factories (BaBar and Belle) have collected and analyzed an impressive amount of experimental data, that led to the confirmation of the Kobayashi-Maskawa (KM) mechanism of CP violation. Yet, the dynamical origin of CP violation remains a puzzling mystery which awaits to be unraveled. Another unsolved theoretical questions in this context is why the universe is made entirely of matter, rather than consisting of equal parts of matter and antimatter. It can be demonstrated that, to create an imbalance in matter and antimatter from an initial condition of balance, three necessary conditions [8] must be satisfied, one of which is the existence of CP violation. The other two are baryon-number violation and the presence of interactions out of thermal equilibrium. These conditions have been formulated first in 1967 by Andrei Sakharov. The SM contains only two sources that can break the CP symmetry. The first of these, involves the aforementioned KM phase, but can account for only a small portion of the needed CP violation. The second of these, resides in the quantum chromodynamics (QCD) Lagrangian and goes by the name of θ parameter. It has not been found experimentally. The fact that one would expect the θ parameter to lead to either no or CP violation that is way too large is the essence of the strong CP problem [9]. There are several proposed solutions to solve this problem. The most well-known is based on an idea original due to Robert Peccei and Helen Quinn [10], involving new scalar particles called axions. Oops! Looks like I am getting carried away here. Let s get focused and return to the discussion of the Dirac theory. 122

125 5.3 Continuous Symmetries of Dirac Theory Besides the discrete P, T, and C symmetries, the Dirac action (5.34) enjoys a number of continuous symmetries. In the following we will discuss space-time translations, Lorentz transformations, the internal vector and axial-vector symmetry, and compute the associated conserved currents. Space-Time Translations Under infinitesimal space-time translations (2.18), the Dirac spinor ψ transforms as δψ = ɛ µ µ ψ. (5.87) Given that the Dirac Lagrangian depends on µ ψ but not on ψ µ, we can use the standard formula (2.20) to obtain the energy-momentum tensor T µν = i ψγ µ ν ψ η µν L. (5.88) Since a current is conserved only when the EOMs are obeyed, we do not lose anything by imposing the Euler-Lagrange equation already on T µν. In the case of a scalar field this does not really buy us anything, because the EOMs are second order in derivatives, while the energy-momentum tensor is first order. However, for a spinor field the EOMs are first order (5.35). This means that we can ignore the second term in (5.88), leaving us with T µν = i ψγ µ ν ψ. (5.89) It follows that the total energy is given by E = d 3 x T 00 = d 3 x i ψγ 0 ψ = d 3 x ψ γ 0 ( iγ + m) ψ, (5.90) where in order to obtain the final expression we have employed the Dirac equation. The components of the total momentum are given by P i = d 3 x T 0i = d 3 x i ψγ 0 i ψ. (5.91) Both the total energy and momentum are of course conserved. Lorentz Transformations Under a Lorentz transformation, the Dirac spinor transforms as (5.17) which, in infinitesimal form, reads δψ α = ω µ ν x ν µ ψ α Ω γδ (S γδ ) α β ψβ. (5.92) From (5.6) it follows that ω µ ν = 1/2 Ω γδ (M γδ ) µ ν, where the generators of the Lorentz group (M γδ ) µ ν take the form (5.2). After direct substitution, this tells us that ωµν = Ω µν, and as a result (5.91) becomes δψ α = ω µ ν (x ν µ ψ α + 1 ) 2 (Sγδ ) αβ ψβ. (5.93) 123

126 The conserved current arising from Lorentz transformations now follows from the same calculation we saw for the scalar field (2.67). Yet, there are two small differences. First, we are allowed to neglect terms proportional to L in the computation and, second, we pick up an extra piece in the current from the second term in (5.92). At the end one has (J λ ) µν = x µ T λν x ν T λµ i ψγ λ S µν ψ. (5.94) After quantization, when (J λ ) µν is turned into an operator, this extra term will be responsible for providing the single-particle states with internal angular momentum, telling us that the quantization of a Dirac spinor gives rise to a particle carrying spin 1/2. Vector Symmetry The Dirac Lagrangian is invariant under global phase rotations of the spinor, i.e., This symmetry gives rise to the conserved current ψ e iα ψ. (5.95) j µ V = ψγ µ ψ, (5.96) where the index V stands for vector, reflecting the fact that the left- and right-handed spinors ψ L,R transform in the same way under phase rotations. It is straightforward to check using (5.35) and (5.36), that j µ V is indeed conserved under the EOMs, µ j µ V = ( ψ µ)γ µ ψ + ψγ µ ( µ ψ) = im ψψ im ψψ = 0. (5.97) The conserved quantity arising from the vector symmetry is Q = d 3 x jv 0 = d 3 x ψγ 0 ψ = d 3 x ψ ψ. (5.98) We will see shortly that this has the interpretation of electric charge, or particle number, for fermions. Axial-Vector Symmetry In the case of massless fermions, the Dirac Lagrangian possesses an extra internal symmetry, which rotates left- and right-handed fermions in opposite directions, ψ e iαγ5 ψ, ψ ψe iαγ 5. (5.99) Here the second transformation follows from the first by noticing that exp ( iαγ 5 ) γ 0 = γ 0 exp (iαγ 5 ) as a consequence of the anti-commutation relation in (5.41). Invariance under the global phase rotation (5.98) leads to the conserved current j µ A = ψγ µ γ 5 ψ, (5.100) 124

127 where the subscript A stands for axial-vector. This current is only conserved if the mass parameter m in the Dirac action (5.34) is equal to zero. Indeed, with the full Dirac Lagrangian we may compute µ j µ A = ( ψ µ)γ µ γ 5 ψ + ψγ µ γ 5 ( µ ψ) = 2im ψγ 5 ψ, (5.101) which is non-vanishing only if m 0. However, in the quantum theory things become more interesting for the axial-vector current. When the theory is coupled to gauge fields, the axial transformation remains a symmetry of the classical Lagrangian. But the symmetry does not survive the quantization process [11 13]. It is the prototypical example of an anomaly: a symmetry of the classical theory that is not preserved at the quantum level. In fact, the axial anomaly has important physical implications. It does not only determine the neutral pion decay π 0 2γ, but also provides an indirect way to determine the number of color dofs. For further reading, I recommend the recent review article [14]. 5.4 Solutions to Dirac Equation In order to get some feeling for the physics of the Dirac equation (5.35), we now discuss its plane-wave solutions. The fact that the Dirac field ψ obeys the Klein-Gordon equation, tells us that it can be written as a linear combination of plane waves. We make the ansatz ψ(x) = u(p) e ipx, (5.102) where u(p) is a 4-component spinor that is independent of x, but does depend on the 3- momentum p. 43 Notice that (5.102) is a positive frequency solution, because ψ exp ( iet). Inserting the above ansatz into the Dirac equation takes the form ( ) m p µ σ µ (p/ m) u(p) = u(p) = 0, (5.103) p µ σ µ m where we have used the notation (5.47). In order to find the solution to this equation, we write u(p) = (u 1, u 2 ) T. In terms of the two-component spinors u 1,2 the relation (5.103) reads (p σ) u 2 = mu 1, (p σ) u 1 = mu 2, (5.104) where p σ = p µ σ µ and p σ = p µ σ µ. However, these equations are not independent from each other, since (p σ)(p σ) = p 2 0 p i p j σ i σ j = p 2 0 p i p j δ ij = p µ p µ = m 2. (5.105) We conclude that any spinor of the form ( ) (p σ) ξ u(p) = N, (5.106) mξ 43 In an abuse of notation we denote hereafter the 4-component Dirac spinors by u(p) and not u(p) etc. 125

128 with constant N is a solution to (5.103). In order to make this more symmetric, we choose N = 1/m and ξ = p σ ξ. Then u 1 = (p σ) p σ ξ = m p σ ξ, and putting things together one obtains ( p ) σ ξ u(p) =, (5.107) p σ ξ where ξ is a 2-component spinor that can be chosen to satisfy ξ ξ = 1. Here it is understood that in taking the square root of a matrix, we take the positive root of each eigenvalue. Further solutions to the Dirac equation follow from the ansatz ψ(x) = v(p) e ipx. (5.108) These solutions oscillate in time as ψ exp (iet) and are therefore called negative frequency solutions. Realize however that both (5.102) and (5.108) are solutions to the classical field equations and both have positive total energy (5.90). The Dirac equation (5.35) requires that the 4-component spinor v(p) satisfies ( ) m p σ (p/ + m) v(p) = v(p) = 0. (5.109) p σ m Following the line of reasoning that lead to (5.106), it is easy to show that the latter equation is solved by ( p ) σ η v(p) =, (5.110) p σ η for some constant 2-component spinor η taken to be normalized as η η = 1. Spin-Up and Spin-Down Solutions In order to make contact to QM, consider the positive frequency solution with mass m and vanishing 3-momentum p = 0, i.e., the rest frame of the associated particle. In this case the solution to (5.103) takes the form u(p) = ( ) ξ m, (5.111) ξ where ξ is an arbitrary 2-component spinor. We can interpret the spinor ξ by looking at the rotation generator (5.19). We see that ξ transforms under rotations as an ordinary 2- component spinor of the rotation group, and therefore determines the spin orientation of the Dirac solution in the usual way. E.g., when ξ T = (1, 0), the corresponding field has spin up along the z-axis. After quantization, this will become the spin of the associated particle In the rest of this section, we will indulge in an abuse of terminology and refer to the classical solutions to the Dirac equations as particles, even though they have no such interpretation before quantization. 126

129 Starting from (5.111), we now consider the particle with spin ξ T = (1, 0) and boost it along the z-direction with p µ = (E, 0, 0, p z ). The solution (5.107) to the Dirac equation becomes ) ( ) (1 E pz 0 u(p) = ) = (1 e y/2 1 0 m ( ), (5.112) E + pz e y/ where in the last step we have introduced the rapidity which is related to E and p z via y = 1 2 ln ( E + pz E p z ), (5.113) E = m 2 ( e y + e y), p z = m 2 ( e y e y). (5.114) Notice that rapidities are, unlike speeds at relativistic velocities, additive quantities. This feature explains why in particle physics rapidities are often used instead of velocities. For large boosts, i.e., E m or equivalent y 1, the result (5.112) turns into u(p) 2E (5.115) In the same limit, one obtains for a particle with spin ξ T = (0, 1) the expression u(p) 2E (5.116) This implies that in the limit y the states degenerate into the 2-component spinors of a massless particle. We now also understand the reason for the factor of m in (5.111). It is necessary to keep the spinor expressions finite in the massless limit. Helicity The solutions (5.115) and (5.116) are the eigenstates of the helicity operator ( ) h = i 2 ɛ ijk p i S jk = 1 2 p σ i 0 i, (5.117) 0 σ i 127 0

130 where S ij is the rotation generator (5.14). The massless field in (5.115) has helicity 1/2 and is said to be right-handed, while the one in (5.116) has helicity 1/2 and is called left-handed. Notice that the helicity of a massive particle depends on the frame of reference, since one can always boost to a frame in which its momentum is in the opposite direction, but its spin is unchanged. For a massless particle which travels at the speed of light one cannot perform such a boost. This also explains the origin of the notation ψ L,R for Weyl spinors. The solutions of the Weyl equations (5.49) are states of definite helicity, corresponding to left- and righthanded particles, respectively. The Lorentz invariance of helicity (for a massless particle) is manifest in the notation of Weyl spinors, since ψ L and ψ R live in different representations of the Lorentz group. Spinor Products There are a number of identities that will be very useful in the following section, regarding the (inner) products of the spinors u(p) and v(p). For convenience, we define a basis ξ r and η r with r = 1, 2 for the 2-component spinors such that E.g., one can take ξ r ξ s = δ rs, η r η s = δ rs. (5.118) ( ) ( ) ξ 1 1 =, ξ 2 0 =, (5.119) 0 1 and similarly for η r. Let us first look at the positive frequency solutions u(p). We can take the inner product of 4-component spinors in two different ways, i.e., u r (p) u s (p) or ū r (p) u s (p). Of course, only the latter object is Lorentz invariant, but it will turn out that the former is needed when we will quantize the theory. So let me state both. One has u r (p) u s (p) = (ξ r p σ, ξ r ( p ) σ ξ s p σ) p σ ξ s (5.120) = ξ r (p σ) ξ s + ξ r (p σ) ξ s = 2ξ r p 0 ξ s = 2p 0 δ rs, while the Lorentz-invariant inner product is ū r (p) u s (p) = (ξ r p σ, ξ r ( ) ( p ) 0 1 σ ξ s p σ) 1 0 p σ ξ s = ξ r p σ p σ ξ s + ξ r p σ p σ ξ s = 2mδ rs. (5.121) Here we have used (5.105) in order to arrive at the final expression. For the negative frequency solutions v(p), one derives in an analog way v r (p) v s (p) = 2p 0 δ rs, v r (p) v s (p) = 2mδ rs. (5.122) 128

131 We can also compute the Lorentz-invariant inner product between u r (p) and v(p). We find ū r (p) v s (p) = (ξ r p σ, ξ r ( ) ( p ) 0 1 σ η s p σ) 1 0 p σ η s (5.123) = ξ r p σ p σ η s ξ r p σ p σ η s = 0, and similarly for v r (p) u s (p) = 0. The solutions u(p) and v(p) are thus orthogonal to each other. Let us furthermore calculate u r (p) v s ( p) and v r ( p) u s (p). 45 Defining p µ = (p 0, p), one has in the first case u r (p) v s ( p) = (ξ r p σ, ξ r ( ) p σ η s p σ) p σ η s (5.124) = ξ r p σ p σ η s ξ r p σ p σ η s. Here the term under the first square root is given by (p σ)( p σ) = (p 0 p i σ i )(p 0 + p i σ i ) = p 2 0 p 2 = m 2. The same result holds for (p σ)( p σ). This means that the two terms in the last line of (5.124) cancel, leaving us with Spin Sums u r (p) v s ( p) = v r ( p) u s (p) = 0. (5.125) In evaluating Feynman diagrams, we will often wish to sum over the polarization states of a fermion. We can derive the relevant spin sums (or completeness relations) by simple calculations. We start by computing u r (p)ū r (p) = ( p ) σ ξ r (ξ r p σ ξ r p σ, ξ r ( ) 0 1 p σ) 1 0 r=1,2 = r=1,2 ( p σ p σ p σ p σ p σ p σ p σ p σ ) = ( ) m p σ = p/ + m. p σ m (5.126) Notice that the two spinors appearing on the left-hand side of (5.126) are not contracted. In the derivation of the latter equation, we have used that ( ) ξ r ξ r 1 0 =. (5.127) 0 1 Similarly, one derives Again, it is crucial that r=1,2 v r (p) v r (p) = p/ m. (5.128) r=1,2 ( ) η r η r 1 0 =. (5.129) 0 1 r=1,2 45 Our notation is such that with u( p) we in fact mean u( p) etc. 129

132 5.5 Quantization of Dirac Theory We are now ready to construct the quantum version of the free Dirac field, starting from the relevant action (5.34). We will first proceed naively and treat ψ as we have done in the case of the scalar field. Yet, we will see pretty fast that things go wrong, and we will have to reconsider how to quantize the Dirac theory. Walking on this blind alley will, however, allow us to better understand the relation between spin and statistics. So at the end, it will be a quite useful detour. Little Detour We start in the usual way by calculating the momentum conjugate to ψ. In fact, we already did this in (5.50), and know that π = iψ, which does not involve the time derivative of ψ. This makes perfectly sense, because the Dirac equation is first order in time, so that we need only to specify ψ and ψ on an initial time slice to determine the full evolution. In order to quantize the theory we then proceed in analogy with the Klein-Gordon field, and promote ψ and ψ to operators, satisfying the following canonical (equal time) commutation relations [ψ α (x), ψ β (y)] = [ψ α (x), ψ β (y)] = 0, [ψ α (x), ψ β (y)] = δ (3) (x y) δ αβ, (5.130) where α and β denote spinor indices. This already looks peculiar. If ψ were real-valued the left-hand side would be antisymmetric under exchange of x and y, while the right-hand side is symmetric. But ψ is complex, so we do not have a contradiction yet. In fact, we will soon learn that much worse problems arise when we impose commutation relations on the Dirac field. But it is instructive to see how far we can get, in order to better understand the relation between spin and statistics. So let s press on. Since we are dealing with a free theory, where any classical solution is a sum of plane waves, we may write the quantum operators in the Schrödinger picture as ψ(x) = d 3 p 1 ] [a r (2π) 3 p u r (p) e ip x + b r p v r (p) e ip x, 2Ep r=1,2 ψ (x) = r=1,2 d 3 p 1 ] [a r (2π) 3 p u r (p) e ip x + b r p v r (p) e ip x, 2Ep (5.131) where the operators a r p and b r p create particles associated to the positive energy solutions u r (p) exp (ip x) and negative energy solutions v r (p) exp ( ip x), respectively. The a r p and b r p are the corresponding annihilation operators. Like in the case of scalar fields the commutation relations of the fields (5.130) lead to commutation relations for the ladder operators. The nonvanishing commutators are [a r p, a s q ] = (2π) 3 δ (3) (p q) δ rs, [b r p, b s q ] = (2π) 3 δ (3) (p q) δ rs. (5.132) 130

133 Notice that the commutator [b r p, b s q ] has a strange minus sign on the right-hand side. It is not obvious that this sign causes trouble, but we should be aware of it. With the commutation relations (5.132) at hand, it is straightforward to show that the relations (5.130) hold. One has [ψ(x), ψ (y)] = d 3 pd 3 q 1 [ [a r (2π) p, a s 6 q ] u r (p)u s (q) e i(p x y q) 4Ep E q r,s=1,2 = r=1,2 + [b r p, b s q] v r (p)v s (q) e i(p x y q) ] d 3 p (2π) 3 1 2E p [ u r (p)ū r (p) γ 0 e ip (x y) + v r (p) v r (p) γ 0 e ip (x y) ]. (5.133) In order to simplify this further, we now employ the completeness relations (5.126) and (5.128). It follows that d [ψ(x), ψ 3 p 1 [ ] (y)] = (p/ + m) γ 0 e ip (x y) + (p/ m) γ 0 e ip (x y) (2π) 3 2E p d 3 p 1 [ (p0 = γ 0 + p γ + m ) γ 0 + ( p (2π) 3 0 γ 0 p γ m ) ] γ 0 e ip (x y) (5.134) 2E p d 3 p = eip (x y) = δ (3) (x y), (2π) 3 as promised. Notice that to obtain the second line we have change the integration from p to p for what concerns the second term. We also see that the minus sign in the second relation of (5.132) is crucial here, since it is necessary so that the terms p γ = p i γ i cancel in the final expression. It is also easy to show that the first commutation relation in (5.130) is satisfied once the equations (5.132) are imposed. I leave it to the reader to perform the explicit computation. Equipped with (5.132), we can find the explicit form of the Dirac Hamiltonian in terms of ladder operators. The Hamiltonian can be simply read off from (5.90), since E = H = d 3 x H. Hence, we have H = ψ ( iγ + m) ψ, (5.135) as a starting point, which we would like to turn into an operator. We first look at ( iγ + m) ψ = d 3 p 1 [ a r (2π) 3 p ( p γ + m) u r (p) e ip x 2Ep r=1,2 + b r p (p γ + m) v r (p) e ip x ]. (5.136) In order to find this result it is important to notice that p x = p i x i, which explains the additional minus sign of the p γ terms. Using now (5.103) and (5.109) to replace the p γ 131

134 terms, leads to ( iγ + m) ψ = r=1,2 d 3 p Ep [ ] (2π) 3 2 γ0 a r p u r (p) e ip x b r p v r (p) e ip x, (5.137) We now use this expression to write the Hamiltonian as H = d 3 xd 3 pd 3 q E [ ] p a s (2π) 6 q u s (q) e iq x + b s q v s (q) e iq x 4E q r,s=1,2 = r,s=1,2 [ a r p u r (p) e ip x b r p v r (p) e ip x ] d 3 p 1 [ ( a s (2π) 3 p a r p u s (p) u r (p) ) ( b s p b r p v s (p) v r (p) ) 2 a s p b r p ( u s (p) v r ( p) ) b s pa r p ( v s (p) v r ( p) ) ], (5.138) where in the last two terms we have changed p to p. Now is the right time to employ the formulas in (5.120), (5.122), and (5.125), that allow us to get rid of the spinor products. We arrive at the simple result H = d 3 p [ ] (2π) E 3 p a r p a r p b r pb r p r=1,2 = r=1,2 d 3 p [ ] (2π) E 3 p a r p a r p b r p b r p + (2π) 3 δ (3) (0). (5.139) The delta-function term should be familiar to you by now. It is easily dealt with by normal ordering. However, the term b r p b r p is a complete mess, since it implies that the Hamiltonian is not bounded below, meaning that our quantum theory makes no sense. Taken seriously it would tell us that we could tumble to states of lower and lower energy by continually producing particles by the action of b r p. Since the above calculation was a little subtle, you might think that it s possible to rescue the theory to get the minus signs to work out right. You can play around with different things, but you ll always find this minus sign cropping up somewhere. And, in fact, it s telling us something important that we missed. Further insight in the structure of the Dirac theory, can be gained by investigating the causality of the theory. To do this we should calculate [ψ(x), ψ (y)], or more conveniently [ψ(x), ψ(y)], at non-equal times and hope to find that this commutator is zero outside the light-cone. We start this exercise by switching to the Heisenberg picture thereby restoring the time-dependence of ψ and ψ. From (3.104) and (3.106), we infer that while (a r p) H = e iht a r p e iht = e iept a r p, (a r p ) H = e iht a r p e iht = e iept a r p, (5.140) (b r p) H = e iht b r p e iht = e iept b r p, (b r p ) H = e iht b r p e iht = e iept b r p. (5.141) 132

135 It immediately follows that ψ(x) = r=1,2 ψ(x) = r=1,2 d 3 p 1 ] [a r (2π) 3 p u r (p) e ipx + b r p v r (p) e ipx, 2Ep d 3 p 1 ] [a r (2π) 3 p ū r (p) e ipx + b r p v r (p) e ipx. 2Ep (5.142) We can now compute the commutator. One has [ψ α (x), ψ β (y)] = d 3 p 1 [ ] (u r (p)ū r (p)) αβ e ip(x y) + (v r (p) v r (p)) αβ e ip(x y) (2π) 3 2E p = r=1,2 d 3 p 1 [ ] (p/ + m) αβ e ip(x y) + (p/ m) αβ e ip(x y) (2π) 3 2E p = (i / x + m) αβ d 3 p (2π) 3 1 2E p [ e ip(x y) e ip(x y)]. Looking back at (3.110) and (3.111), we see that this means that (5.143) [ψ α (x), ψ β (y)] = (i / x + m) αβ (x y). (5.144) This expression vanishes outside the light-cone, because the commutator of the real scalar field (x y) = [φ(x), φ(y)] does. As a result the quantum version of the Dirac theory is causal. Although there is no problem with causality, it is worthwhile to stare at the commutator in (5.144) a bit longer. If 0 is the vacuum state of the theory, for all r and p, then [ψ α (x), ψ β (y)] = 0 [ψ α (x), ψ β (y)] 0 a r p 0 = b r p 0 = 0. (5.145) = 0 ψ α (x) ψ β (y) 0 0 ψβ (y)ψ α (x) 0. (5.146) It is important to realize now, that the first (second) matrix element receives only contribution from terms containing the u r (p) (v r (p)) spinors. Explicitly, one has in the first case 0 ψ α (x) ψ β (y) d 3 pd 3 q 1 0 = (u r (p)ū s (q)) αβ e i(px qy) 0 a r (2π) pa s 6 q 0, (5.147) 4Ep E q r,s=1,2 and a similar expression holds in the second case. It is now crucial to ask the following questions. Can we say something about the matrix elements 0 a r pa s q 0 based on the classical symmetries of the Dirac theory? In particular, how does Lorentz invariance constrain the form of the relevant matrix elements? For the ground 133

136 state 0 to be invariant under translations, we must have exp (ip x) 0 = 0. In analogy to (5.140) the action of exp (ip x) on the ladder operators can be shown to lead to e ip x a r p e ip x = e ip x a r p, e ip x a r p e ip x = e ip x a r p. (5.148) Analog expressions hold in the case of b r p and b r p. Therefore, 0 a r pa s q 0 = 0 a r pa s q e ip x 0 = e i(p q) x 0 e ip x a r pa s q 0 = e i(p q) x 0 a r pa s q 0. (5.149) This implies that the matrix element can only be non-zero if p = q. Similarly, it can be shown that rotational invariance of 0 requires that r = s, which should be intuitively clear. From these considerations, one concludes that the matrix element can be written as 0 a r pa s q 0 = (2π) 3 δ (3) (p q) δ rs A(p), (5.150) where A(p) is an arbitrary function that is so far undetermined. Inserting the latter result into (5.147), gives 0 ψ α (x) ψ β (y) d 3 p 1 0 = (u r (p)ū r (p)) αβ e ip(x y) A(p) (2π) 3 2E p = r=1,2 d 3 p 1 (p/ + m) αβ e ip(x y) A(p). (2π) 3 2E p (5.151) For this expression to be invariant under boosts, we have to require that A(p) must be a Lorentz scalar, i.e., A(p) = A(p 2 ). In fact, since p 2 = m 2 it follows that A has to be a positive constant. The positivity of A is the result of the positivity of the norm of states in any self-respecting Hilbert space. Hence, 0 ψ α (x) ψ β (y) 0 = A (i / + m) αβ d 3 p (2π) 3 1 2E p e ip(x y). (5.152) In a similar fashion, we can also calculate the second matrix element in (5.146). The final result reads 0 ψβ (y)ψ α (x) 0 = B (i / + m) αβ d 3 p 1 e ip(x y). (5.153) (2π) 3 2E p where B is another positive constant. The minus sign is important. It arises from the completeness relation (5.128) of the v r (p) spinors and the sign of x in the exponential. From (5.152) and (5.153) we see that the two terms in the last line of (5.146) would indeed cancel if A = B. Yet, this is impossible since A and B must both be positive. So how to resolve this apparent contradiction? Setting A = B = 1, it follows from (5.152) and (5.153) that (outside the light-cone) 0 ψ α (x) ψ β (y) 0 = 0 ψβ (y)ψ α (x) 0, (5.154) which means that the spinor fields anticommute at space-like separation. This suggests that postulating the commutation relations (5.130) for the spinor fields, was the mistake that lead to the negative energy problem in (5.139). 134

137 Fermionic Quantization The key piece of physics that we obviously missed before is that spin-1/2 particles are fermions, meaning that they obey Fermi-Dirac statistics with the quantum state picking up a minus sign upon the interchange of any two particles as indicated by (5.154). This fact is embedded into the structure of relativistic QFT: the spin-statistics theorem tells us that integer spin fields must be quantized as bosons, while half-integer spin fields must be quantized as fermions. Any attempt to do otherwise will lead to an inconsistency. All inconsistencies are removed by postulating the equal-time anticommutation relation for the Dirac field, {ψ α (x), ψ β (y)} = {ψ α (x), ψ β (y)} = 0, {ψ α (x), ψ β (y)} = δ (3) (x y) δ αβ, (5.155) instead of (5.130). In this case we still have the expansions (5.131) and (5.142) in terms of the ladder operators a r p, a r p, b r p, and b r p, but the line of reasoning that lead to (5.132) now tells us that r=1,2 {a r p, a s q } = (2π) 3 δ (3) (p q) δ rs, {b r p, b s q } = (2π) 3 δ (3) (p q) δ rs, (5.156) while all other anticommutators vanish identically. Using these anticommutator relations, we can now compute the Hamiltonian again, finding H = d 3 p [ ] (2π) E 3 p a r p a r p b r pb r p = r=1,2 d 3 p [ ] (2π) E 3 p a r p a r p + b r p b r p (2π) 3 δ (3) (0). (5.157) We see that the anticommutators have saved us from the indignity of an unbounded Hamiltonian. Notice that when normal ordering, we now throw away a negative infinite contribution proportional to (2π) 3 δ (3) (0) and not a positive one as in the case of the scalar field (3.19). In principle, the negative contribution from fermionic fields could (partially) cancel the positive contribution arising from bosonic fields. So one could hope that if there is a symmetry relating fermions and bosons to each other, a so-called supersymmetry, the cosmological constant problem might be solvable. In fact, it can be shown that supersymmetry solves the cosmological constant problem halfway, but does not render a complete solution. If you want to figure out what halfway actually means, I recommend to have a look at the excellent review [15] and the relevant references therein. For completeness let me also quote the expression for the momentum operator. Inserting the expansions (5.131) into (5.91), one finds after a straightforward calculation and normal ordering the following result P = d 3 p [ ] (2π) p a r 3 p a r p + b r p b r p. (5.158) r=1,2 135

138 Fermi-Dirac Statistic Although the ladder operators now obey anticommutation relations, the Hamiltonian (5.157) has nice commutation relations with them. You can check easily that [H, a r p] = E p a r p, [H, a r p ] = E p a r p, (5.159) and likewise in the case of b r p and b r p. As in the scalar case (3.43), this implies that we can again construct a tower of energy eigenstates by acting on 0 with a r p and b r p to create particles and antiparticles. E.g., we have the one-particle state p, r = a r p 0, (5.160) with momentum p and spin quantum number r. The two-particle state obeys p 1, r 1 ; p 2, r 2 = a r 1 p 1 a r 2 p 2 0, (5.161) p 1, r 1 ; p 2, r 2 = a r 1 p 1 a r 2 p 2 0 = a r 2 p 2 a r 1 p 1 0 = p 2, r 2 ; p 1, r 1, (5.162) due to (5.156). This confirms that the particles do satisfy Fermi-Dirac statistics as anticipated. In particular, we have the Pauli s exclusion principle p, r; p, r for all p and r. Finally, if one wants to be sure about the spin of the particle, one could act with the angular momentum operator J i = ɛ ijk d 3 x (J 0 ) jk constructed from (5.94) to confirm that a stationary particle p = 0, r does indeed carry intrinsic angular momentum 1/2. This exercise, which is left to the reader, will show that in the case of p = 0, r only the third term in (5.94) will give a non-vanishing contribution to the internal angular momentum. Dirac s Hole Interpretation Before discussing the propagator of the Dirac field, a historical remark seems to be in order. Dirac originally viewed his equation (5.35) as a relativistic version of the Schrödinger equation, considering ψ as the wavefunction ot a single particle with spin 1/2 (a fact which is put in by hand in Diracs theory). In order to reinforce this interpretation, he wrote (5.35) as i ψ t = iα ψ + mβψ = Hψ, (5.163) with α = γ 0 γ and β = γ 0. The operator H appearing in the above equation is then understood as the one-particle Hamiltonian. Notice that this viewpoint is quite different from the one we held so far, where ψ is a classical field that gets quantized. In Dirac s view, the Hamiltonian is defined by (5.163), while for us the Hamiltonian is given by the field operator (5.157). But for the moment let s stick to (5.163) and see where it lead Dirac/leads us. With the interpretation of ψ as a single-particle wavefunction, the plane-wave solutions (5.102) and (5.108) are thought of as energy eigenstates, satisfying i ψ t = i t u(p)e ipx = E p u(p)e ipx = E p ψ, (5.164) 136

139 and an analog relation for ψ = v(p)e ipx with E p replaced by E p. The plane-wave solutions thus look like positive and negative energy solutions. The spectrum is again unbounded from below, because there are states v(p) with arbitrary low energy E p. At first glance this is disastrous, just like the unbounded field theory Hamiltonian of (5.157). Paul Dirac s ingenious solution to this problem was to turn to the Pauli exclusion principle. In 1930, Dirac proposed that in the true vacuum of the universe, all the negative energy states are filled, so that only the positive energy states are accessible. The filled negative energy states are referred to as the Dirac sea. Although you might worry about the infinite negative charge of the vacuum, Dirac argued that only charge differences would be observable (a trick reminiscent of the normal ordering prescription we use for field operators). Having avoided the problem with the anomalous negative-energy quantum states by introducing an infinite sea comprised of occupied negative energy states, Dirac realized that his theory made a shocking prediction. Suppose that a negative energy state is excited to a positive energy state, leaving behind a hole in the Dirac sea. The hole would have all the properties of the electron, except it would carry positive charge. After flirting with the idea that it may be the proton, 46 Dirac concluded that the hole is a new particle, the positron. It took only couple of years before the positron was discovered experimentally in 1932 by Carl Anderson, with all the physical properties predicted for the Dirac hole. Although Dirac s physical insight led him to the right answer, we now understand that the interpretation of the Dirac equation as a single-particle wavefunction is not really correct. E.g., Dirac s argument for antimatter relies crucially on the particles being fermions while, as we have seen already in this course, antiparticles exist for both fermions and bosons. What we really learn from Dirac s analysis is that there is no consistent way to interpret the Dirac equation as a single-particle wavefunction. It is instead to be thought of as a classical field which has only positive energy solutions, since the Hamiltonian (5.90) is positive definite. Quantization of this field then gives rise to both particle and antiparticle excitations and makes the vacuum the state in which no particles exist instead of an infinite sea of particles. This picture is much more convincing, especially since it recaptures all the valid predictions of the Dirac sea, such as electron-positron annihilation. On the other hand, the field formulation does not eliminate all the difficulties raised by the Dirac sea. In particular, the problem of the vacuum possessing infinite energy, is still present. Feynman Propagator We now look at the anticommutator of the fields ψ(x) and ψ(y). Dropping the indices α and β from here on, we simply write is(x y) = {ψ(x), ψ(y)}. (5.165) 46 Robert Oppenheimer pointed out that an electron and its hole would be able to annihilate each other, releasing energy on the order of the electron s rest energy in the form of energetic photons. If holes were protons, stable atoms would thus not exist, which is clearly in contradiction with observations. Hermann Weyl also noted that a hole should have the same mass as an electron, whereas the proton is about 2000 times heavier. 137

140 Inserting the expansions (5.142), we essentially only have to repeat the calculation that lead to (5.143), to obtain is(x y) = (i / x + m) [ D(x y) D(y x) ], (5.166) where D(x y) is the propagator (3.114) of the real scalar field. The object is(x y) is called the fermionic propagator. Some comments seem to be in order here. For space-like separated points (x y) 2 < 0, we have already seen in (3.117) that D(x y) D(y x) = 0. In the bosonic theory, we made a big deal out of this, since it ensured that [φ(x), φ(y)] = 0 for (x y) 2 < 0, which we took as a proof of causality. However, in the case of fermions we now have {ψ(x), ψ(y)} = 0 for (x y) 2 < 0. What happened to causality? The best that we can say is that all our observables are bi-linear in ψ and ψ, e.g., the Hamiltonian operator (5.157) or the momentum operator (5.158). These objects still commute outside the light-cone. The theory remains causal as long as individual fermionic operators are not observable. If you think this is a weak argument, remember that no one has ever seen a physical device come back to minus itself when you rotate by 2π! Notice furthermore, that the propagator satisfies (i / x m)s(x y) = 0, since ( x 2 + m 2 )D(x y) = 0 using the on-shell condition p 2 = m 2. By a similar calculation to that above, we can determine the VEVs of the bi-linears, 0 ψ(x) ψ(y) 0 = 0 ψ(y)ψ(x) 0 = d 3 p (2π) 3 (p/ + m) e ip(x y), d 3 p (2π) 3 (p/ m) eip(x y), (5.167) which allows us to define the fermionic Feynman propagator S F (x y), which is a 4 4 matrix, as the following time-ordered product S F (x y) = 0 T ψ(x) ψ(y) 0 = θ(x 0 y 0 ) 0 ψ(x) ψ(y) 0 θ(y 0 x 0 ) 0 ψ(y)ψ(x) 0, (5.168) where the minus sign in front of the second term is crucial in the QFT of fermions. When (x y) 2 < 0, there is no invariant way to determine whether x 0 > y 0 or x 0 < y 0. In this case the minus sign is necessary to make the two definitions agree since {ψ(x), ψ(y)} = 0 outside the light-cone. In full analogy to the scalar case, there is also a 4-momentum integral representation for the Feynman propagator. It reads and satisfies S F (x y) = i d 4 p p/ + m (2π) 4 e ip(x y) p 2 m 2 + iɛ, (5.169) (i / x m)s F (x y) = iδ (4) (x y), (5.170) which means that S F (x y) is a Green s function of the Dirac operator. The minus sign that we see in (5.168) also occurs for any string of operators inside any time-ordered product. While bosonic operators commute inside T, fermionic operators anticommute. We have this same behavior for normal-ordered products as well, with fermionic 138

141 operators receiving a minus sign when their order is changed. With the understanding that all fermionic operators anticommute inside time- or normal-ordered products, Wick s theorem proceeds just as in the bosonic case, which has been outlined in great detail in Section 4.4. In the fermionic case, we define the contraction as Yukawa Theory ψ(x) ψ(y) = T ψ(x) ψ(y) :ψ(x) ψ(y): = S F (x y). (5.171) Based on the experiences gained in Section 4.6, it is now straightforward to work out the Feynman rules needed to calculate fermion correlation functions. Let us for definiteness consider the case of the Yukawa theory, L = 1 2 ( µφ) µ2 φ 2 + ψ (i / m) ψ λφ ψψ, (5.172) which describes the interaction of a scalar field with mass µ and a Dirac field with mass m. Couplings of this type appear in the SM, between fermions and the Higgs boson, and give mass to the fermionic dofs after electroweak symmetry breaking. In that context, the fermions can be charged leptons (possibly neutrinos) or quarks. If you wish (5.172) is thus the proper version of the scalar Yukawa theory of (4.8). Notice that there is however an important difference following from the dimensions of the involved fields. We still have [φ] = 1, but the kinetic terms of the fermion requires that [ψ] = 3/2. Thus, unlike in the case with only scalars, the coupling is dimensionless, i.e., [λ] = 0. In order to get a grip on the Feynman rules, let us study ψψ ψψ scattering. This is pretty much the same calculation we have already performed in Section 4.5. The only minor modification is, that now the particles that scatter have spin, while the nucleons N we considered earlier on are scalars. In analogy to (4.48) we write the initial and final states as i = 4E p1 E p2 a r 1 p 1 a r 2 p 2 0 = p 1, r 1 ; p 2, r 2, f = 4E q1 E q2 a s 1 q 1 a s 2 q 2 0 = q 1, s 1 ; q 2, s 2. (5.173) Notice that for these states one has to be careful when one takes the adjoint since the fermionic creation operators anticommute. E.g., the final-state bra is f = 4E q1 E q2 0 a s 2 q 2 a s 1 q 1. (5.174) To get a contribution to the scattering of two fermions, we have to calculate the O(λ 2 ) corrections to the T -matrix element i f T i. The relevant contribution to it takes the form ( iλ) 2 d 4 xd 4 y T ( φ(x) 2 ψ(x)ψ(x) φ(y) ψ(y)ψ(y) ), (5.175) where all fields are interacting ones. Just like in the case of the bosonic calculation, the contribution to ψψ ψψ scattering comes from the term where the two φ fields are contracted, D F (x y) : ψ(x)ψ(x) ψ(y)ψ(y):. (5.176) 139

142 ψ p 1 q 1 ψ ψ p 1 ψ φ + φ q 1 q 2 ψ p 2 q 2 ψ ψ p 2 ψ Figure 5.1: Feynman diagrams contributing to ψψ ψψ scattering at order λ 2. We can now study the action of the fermionic operators on i. By expanding the ψ operators, but not the ψ fields, we find : ψ(x)ψ(x) ψ(y)ψ(y): a r 1 p 1 a r 2 d 3 k 1 d 3 k 2 ( p 2 0 = ψ(x) u t 1 (k (2π) 6 1 ) ) ( ψ(y) u t 2 (k 2 ) ) e i(k 1 x+k 2 y) 4Ek1 E k2 a t 1 k1 a t 2 k1 a r 1 p 1 a r 2 p 2 0. (5.177) Here the b t 1 k 1 and b t 2 k 2 terms in the expansion of ψ have been ignored since the do not contribute to the considered process at O(λ 2 ) and the brackets indicate how the spinor indices are contracted. Notice finally that the overall minus sign arises from moving ψ(x) past ψ(y). By anticommuting the annihilation operators past the creation operators and performing the momentum integrations using the delta functions, we then get for the right-hand side of (5.177) the following expression 1 ( ( ψ(x) u r 1 (p 1 ) ) ( ψ(y) u r 2 (p 2 ) ) e 4Ep1 E p2 i(p 1 x+p 2 y) + ( ψ(x) u r 2 (p 2 ) ) ( ψ(y) u r 1 (p 1 ) ) e i(p 1 y+p 2 x) ) 0. (5.178) Note the minus sign between the two individual terms. We now let this expression act on f from the right. Let us first have a look what happens to the first term in (5.178). Ignoring prefactors and exponentials, we have 0 a s 2 q 2 a s 1 q 1 ( ψ(x) u r 1 (p 1 ) ) ( ψ(y) u r 2 (p 2 ) ) 0 = i(q e 1 x+q2 y) (ū s 1 (q 1 ) u r 1 (p 1 )) (ū s 2 (q 2 ) u r 2 (p 2 )) 4Eq1 E q2 (5.179) ei(q 1 y+q 2 x) 4Eq1 E q2 (ū s 1 (q 1 ) u r 2 (p 2 )) (ū s 2 (q 2 ) u r 1 (p 1 )). In fact, the second term in (5.178) can be shown to give the same result up to a sign. Both terms thus add, which cancels the factor of 1/2 in (5.175). Furthermore, the square roots of 140

143 energies in (5.179) cancel against the relativistic normalizations of the states (5.173). Putting everything together and including the Feynman propagator of the φ field, we end up with ( (ū s 1 (q 1 ) u r 1 (p 1 )) (ū s 2 (q 2 ) u r 2 (p 2 )) e i[(q 1 p 1 ) x+(q 2 p 2 ) y] ( iλ) 2 d 4 xd 4 yd 4 k (2π) 4 ie ik (x y) k 2 µ 2 + iɛ (5.180) ) (ū s 1 (q 1 ) u r 2 (p 2 )) (ū s 2 (q 2 ) u r 1 (p 1 )) e i[(q 2 p 1 ) x+(q 1 p 2 ) y]. Performing the integrations over x and y and suppressing a factor i(2π) 4, which will end up in i f T i = i(2π) 4 δ (4) (p 1 + p 2 q 1 q 2 ) A(ψψ ψψ), this becomes ( iλ) 2 d 4 k ( (ū s 1 (q k 2 µ 2 1 ) u r 1 (p 1 )) (ū s 2 (q 2 ) u r 2 (p 2 )) δ (4) (q 1 p 1 + k) δ (4) (q 2 p 2 k) + iɛ (5.181) ) (ū s 1 (q 1 ) u r 2 (p 2 )) (ū s 2 (q 2 ) u r 1 (p 1 )) δ (4) (q 1 p 1 + k) δ (4) (q 2 p 2 k), from which we can immediately read of the result for the scattering amplitude [ A(ψψ ψψ) = ( iλ) 2 (ū s 1 (q 1 ) u r 1 (p 1 )) (ū s 2 (q 2 ) u r 2 (p 2 )) (p 1 q 1 ) 2 µ 2 + iɛ (ūs 1 (q 1 ) u r 2 (p 2 )) (ū s 2 (q 2 ) u r 1 (p 1 )) (p 1 q 2 ) 2 µ 2 + iɛ ]. (5.182) Honestly, the derivation of the expression for A(ψψ ψψ) was a bit tedious. Can it be done more easily? Yes, it can! Of course, the trick is again to use Feynman diagrams and rules. The lowest-order Feynman graphs for the scattering of two fermions into two fermions are shown in Figure 5.1. Starring at those diagrams as well as (5.182), it is, in fact, easy to guess the Feynman rules that reproduce the final result for the ψψ ψψ scattering amplitude. The relevant momentum-space Feynman rules involving fermions and antifermions turn out to be: i (p/ + m) 1. For each propagator one has = p p 2 m 2 + iɛ. 2. For each vertex one has = iλ. 3. For each external fermion one has p = u s (p) (initial state), p = ū s (p) (final state). 141

144 4. For each external antifermion one has p = v s (p) (initial state), p = v s (p) (final state). 5. Impose momentum conservation at each vertex. 6. Integrate over each undetermined momentum 7. Figure out the overall sign of the diagram. d 4 l (2π) 4. The Feynman rule for the propagator of the scalar field φ (indicated by a dashed line) has already been given in Section 4.6 and external scalar legs just give a trivial factor of 1. Several comments regarding the above rules are in order. First, the direction of the momentum on a fermion line is significant. On external lines, the direction of the momentum is always ingoing (outgoing) for initial-state (final-state) particles. This follows from the expansion of the operators ψ and ψ, where the annihilation (creation) operators are multiplied by exp ( ipx) (exp (ipx)) as can be seen from (5.142). On internal lines, represented by propagators, the momentum must be assigned in the direction of the particle-number flow (for electrons, this is the direction of the negative charge flow). It is conventional to draw arrows on fermion lines to represent the direction of the particle-number flow. The momentum assigned to a fermion then flows in the direction of this arrow, while in the case of an antifermion particle-number and momentum flow are opposite to each other. Hence an additional arrow, identifying the momentum flow, has been drawn next to the antifermion line. Second, in the case of the Yukawa theory the 1/n! factor from the Taylor expansion of the time-ordered exponential is always cancelled by the n! ways of interchanging the vertices to obtain the same contraction. In the case at hand there is thus no need for symmetry factors, given that the fields in the interaction term λφ ψψ cannot replace each other in a contraction. Third, the Dirac indices contract together along fermion lines. This happened in the case of ψψ ψψ scattering (5.182), but will also happen in more complicated diagrams like e.g. p 4 p 3 p 2 p 1 ū(p 4 ) i (p / 3+ m) (p 2 3 m 2 ) i (p/ 2 + m) (p 2 2 m 2 ) u(p 1). (5.183) Fourth and finally, we should understand how to determine the correct overall sign of the diagrams. Let us return to the case of fermion-fermion scattering (5.182). Here the t-channel diagram has a plus sign, while the u-channel contribution receives a minus sign. Where does the relative minus sign between the two graphs come from? Let us look at the Wick 142

145 contractions. For the contractions corresponding to the t-channel diagram in Figure 5.1, we have 0 a s 2 q 2 a s 1 q 1 ψx ψ x ψy ψ y a r 1 p 1 a r 2 p 2 0. (5.184) This contraction can be untangled by moving ψ y = ψ(y) two spaces to the left, and so one picks up a factor ( 1) 2 = 1. On the other hand, the contraction corresponding to the u-channel diagram in Figure 5.1 reads 0 a s 2 q 2 a s 1 q 1 ψx ψ x ψy ψ y a r 1 p 1 a r 2 p 2 0. (5.185) Here we only have to move ψ y one space to the left, giving a factor of 1. The relative minus sign between the two diagrams is a reflection of the Fermi-Dirac statistics. In more complicated graphs the overall sign can be determined most easily by noting that ( ψψ) x = ψ(x)ψ(x) as well as any other pair of fermions, commutes with any operator. Thus, e.g.... ( ψψ) x ( ψψ) y ( ψψ) z ( ψψ) w... =... (+1) ( ψψ) x ( ψψ) z ( ψψ) y ( ψψ) w... =... S F (x z)s F (z y)s F (y w)..., (5.186) with S F (x y) given in (5.169). Notice that in the case of the simplest closed fermion loop in the Yukawa theory the latter prescription leads to = ( ψψ) x ( ψψ) y = ( 1) tr [ ψ y ψx ψ x ψy ] (5.187) = ( 1) tr [S F (y x)s F (x y)]. Due to the cyclic property of the trace changing the ordering of S F (y x) and S F (x y) of course gives the same result. The result (5.187) extends straightforwardly to all closed fermion lines. A fermion loop hence always gives a factor of 1 and the trace of the product of fermion propagators that make up the loop. Equipped with the Feynman rules for the Yukawa theory, we can now calculate the cross sections for some simple scattering processes. This is quite a good exercise. You should try it! 5.6 Problems i) Show explicitly that the Weyl representation (5.10) satisfies the Clifford algebra (5.7). Derive the properties (5.15) and (5.16) of the matrices S µν introduced in (5.12). Show that the term ψψ, ψγ µ ψ, and ψσ µν ψ transforms as in (5.30), (5.31), and (5.32), i.e., it is a Lorentz scalar, vector, and tensor, respectively. It might be advantageous to look 143

146 at infinitesimal transformations and consider separately the transformation properties of ψ, ψ, and γ µ under the action of (5.6). Calculate the transformation properties of (5.35) and (5.36) under Lorentz transformations. You should find that theses EOMs are form invariant. Verify that the fifth Dirac matrix γ 5 defined as in (5.40) satisfies (5.41) and (5.42). Prove that the chiral projectors P L,R introduced in (5.43) obey the relations (5.44). ii) Prove the Gordon identity [ (p ū(p )γ µ u(p) = ū(p + p) µ ) 2m Here q µ = (p p) µ. ] + iσµν q ν u(p). (5.188) 2m iii) Use (5.7) to show that the following identities involving contractions of 4-dimensional Dirac matrices are correct: γ µ γ µ = 4, γ µ γ ν γ µ = 2γ ν, γ µ γ ν γ κ γ µ = 4η νκ, γ µ γ ν γ κ γ λ γ µ = 2γ λ γ κ γ ν. (5.189) Employ the anticommutation relation (5.7) in combination with the cyclic property of the trace to prove the following identities: tr (1) = 4, tr (any odd number of Dirac matrices) = 0, tr (γ µ γ ν ) = 4η µν, tr ( γ µ γ ν γ κ γ λ) = 4 ( η µν η κλ η µκ η νλ + η µλ η νκ), tr ( γ 5) = 0, (5.190) tr ( γ µ γ ν γ 5) = 0, tr ( γ µ γ ν γ κ γ λ γ 5) = 4iɛ µνκλ. iv) Products of Dirac bi-linears obey relations known as Fierz identities. The simplest of these formulas reads (ū 1 γ µ P L u 2 ) (ū 3 γ µ P L u 4 ) = (ū 1 γ µ P L u 4 ) (ū 3 γ µ P L u 2 ), (5.191) where u i with i = 1,..., 4 are 4-component Dirac spinors (the momentum dependence has been dropped here for simplicity) and P L is the left-handed projector introduced in (5.43). In fact, there are similar rearrangement formulas for any product (ū1 Γ A u 2 ) (ū3 Γ B u 4 ). (5.192) 144

147 Here Γ A and Γ B are any of the 16 combinations of Dirac matrices listed in (5.72). The goal of this exercise is to derive these Fierz identities. To begin with, normalize the 16 matrices Γ A such that tr [ Γ A, Γ B] = 4δ ab. (5.193) This gives Γ A = {1, γ 0, iγ j,...}. Write down all elements of this set. The general form of the Fierz identity is (ū1 Γ A u 2 ) (ū3 Γ B u 4 ) = M,N ) ) CMN AB (ū1 Γ M u 4 (ū3 Γ N u 2, (5.194) with unknown coefficients C AB MN. Using the completeness of the set ΓA, show that C AB MN = 1 16 tr [ Γ M Γ A Γ N Γ B]. (5.195) Employing (5.194) and (5.195) prove (5.191). In addition work out the explicit Fierz transformation of the product (ū 1 u 2 )(ū 3 u 4 ). v) In Section 4.8 we saw that the Yukawa potential for NN NN scattering is attractive. Repeat the calculation for ψψ ψψ, ψ ψ ψ ψ, and ψ ψ ψ ψ scattering in the non-relativistic limit. You might want to use (5.120) to (5.122). If you understood how to calculate the Yukawa potential the derivation of the Coulomb potential, which encodes the interactions of electrons/positrons and the photon field A µ in the non-relativistic limit, is also not difficult. Consider again the three different cases of particle-particle, particle-antiparticle, and antiparticle-antiparticle scattering. The QED Feynman rules for the photon (γ) propagator and vertex between electron (Q = 1) and positron (Q = 1) are given by γ µ ν p = ig µν p 2 + iɛ, e γ µ = iqeγ µ. e The propagator of a tensor boson, such as the graviton (G), i.e., the force carrier of gravity, looks like µ ν G p ρ σ = 1 2 [ ] i ( g µρ )( g νσ ) + ( g µσ )( g νρ ) p 2 + iɛ. 145

148 Can you derive from this result the orientation of the gravitational force? vi) The Furry theorem states that the sum of all Feynman graphs in QED with an odd number of external photons (off or on the photon mass shell) and no other external lines vanish. In order to proof Furry s theorem, consider (n = 0, 1,...) Ω T j µ 1 V (x 1)... j µ 2n+1 V (x 2n+1 ) Ω, (5.196) and show by invoking symmetry arguments that a matrix element of this form vanishes. Here Ω denotes the true vacuum of the interacting theory and j µ V (x) is the vector current introduced in (5.96). vii) The goal of this exercise is to introduce the spinor method and to derive some identities that will be very useful to calculate scattering amplitudes in the high-energy limit, where the involved particles can be treated as massless. Derive an explicit solution for the Dirac equation p/ u(p) = 0, (5.197) of a massless fermion. To do so, write out p/ using the basis ( ) ( ) ( ) γ =, γ i 0 σ i =, γ =, (5.198) 1 0 σ i of Dirac matrices. To keep the notation compact you might want to introduce p ± = p 0 ± p 3, e ±iϕp = p 1 ± ip 2 (p1 ) 2 + (p 2 ) 2. (5.199) Use the projection operators P L,R in the basis (5.198) to decompose the original solution into two helicity solutions u ± (p) = P R,L u(p). (5.200) Give the explicit form of u ± and ū ±. Show that u ±u ± = 2p 0, which fixes the normalization of the spinors. Relate u + (ū + ) with (ū ) T ((u ) T ) using γ 0 and γ 2. What is the physics behind these relations? So far we have only talked about the positive-energy solutions u. How do the negative-energy solutions v fit into the picture? In particular, how are u ± (ū ± ) and v ± ( v ± ) related in the case of a massless fermion? Consider now a set of massless momenta p i with i = 1, 2,..., n. We introduce a bra and ket notation with the spinor labelled by the index i corresponding to the momentum p i, The basic spinor product are defined as i ± = u ± (p i ), i ± = ū ± (p i ). (5.201) ij = i j +, [ij] = i + j. (5.202) 146

149 What happens to i ± j ±? Show the antisymmetry of the spinor products, i.e., ij = ji, [ij] = [ji], (5.203) by using either the explicit expressions for u ± and ū ± you have derived earlier or the charge conjugation properties of the spinors. For the case when both energies are positive, i.e., p 0 i > 0 and p 0 j > 0, derive analytic expressions for the spinor products (5.202). Express you result through s ij = (p i + p j ) 2 = 2p i p j, (5.204) and cos φ ij = p1 i p + j p1 j p + i s ij p + i p+ j, sin φ ij = p2 i p + j p2 j p + i s ij p + i p+ j. (5.205) So what is the connection between spinor products and Lorentz products of momenta? Use your explicit result to show that the two types of spinor products are related by complex conjugation, ij = [ji]. (5.206) Since spinor products should have simple properties under crossing symmetry, one defines the spinor product ij for negative energies by analytic continuation from the positiveenergy case, but with p i,j replaced by p i,j if p 0 i,j < 0. The spinor product [ij] is then defined through the identity Consider now the spinor string ij [ji] = tr [ P L p/ i p/ j ] = sij. (5.207) [i γ µ j = ū + (p i )γ µ u + (p j ), (5.208) a quantity that naturally appears as the current describing the emission of a vector boson from a right-handed massless fermion line. Notice that the helicity labels on the spinors can always be suppressed in favor of angle or square brackets as in the spinor products. So one has, i = i, [i = i +, i = i +, and i] = i. Show the charge conjugation property of the current Prove that [i γ µ j = j γ µ i]. (5.209) i [i = P R p/ i, i] i = P L p/ i, (5.210) and use these projection operators to show the correctness of the Gordon identity [i γ µ i = i γ µ i] = 2p µ i. (5.211) 147

150 Show by the use of (5.208) that holds and derive from this relation the Schouten identity i j j i = ji P R, (5.212) ij kl = ik jl + il kj. (5.213) The same identity applies when angle brackets are replaced by square brackets. In explicit calculation (5.208) is a powerful tool since its application can lead to enormous algebraic simplifications. Use the Fierz transformation to show the simple relation as well as the Fierz identity A similar relation holds for γ µ i γ µ j]. (γ µ P R ) ij (γ µ P L ) kl = 2(P L ) il (P R ) kj, (5.214) i γ µ j][k γ µ l = 2 il [jk], (5.215) γ µ [i γ µ j = 2 ( i] j + j [i ). (5.216) viii) The goal of this exercise is to calculate the squared tree-level matrix elements of the processes dū e ν e and dū e ν e g using the spinor formalism developed above. The first process dū e ν e describes the production of a massive W boson from the collision of a down (d) and an antiup quark (ū) and the subsequent decay of the W boson into an electron (e ) and an electron antineutrino ( ν e ). Draw the relevant Feynman diagram and write down the corresponding amplitude in spinor notation using the Feynman rules: W µ ν p = ig µν p 2 M 2 W + iɛ, ū W µ = i g w 2 V ud γ µ P L, d 148

151 ν e W µ = i g w 2 γ µ P L. e Here M W denotes the mass of the W boson, g w is the weak gauge coupling, and V ud 1 is the complex 11 element of the Cabibbo-Kobayashi-Maskawa (CKM) matrix, which describes quark mixing in the SM. Notice that the W boson only couples to the left-handed component of the quark and lepton fields. The deeper significance of this property will become clear once you learn more about the SM of particle physics. Simplify your result for the amplitude using the charge conjugation property of the current (5.209) and the Fierz identity (5.215). Calculate the squared matrix element and express your result in terms of scalar products of momenta. The second process is similar to the first one, but more complicated since it involves the emission of an additional gluon (g) from one of the external quark legs. Draw the possible Feynman graphs for dū e ν e g at tree level and write down the amplitude. In addition to the Feynman rules given already you will need: q g µ = i g s T a γ µ. q p µ = ɛ µ (p) (initial state), µ = ɛ µ(p) (final state). p Here g s is the coupling constant of QCD and T a with a = 1,..., 8 are the generators of the associated gauge group, i.e., SU(3) c. The symbol ɛ µ (p) stands for the polarization vector of the initial- or final-state gluon. In order to calculate the squared matrix element for dū e ν e g, we also need to introduce a spinor representation for the polarization vector for gluons with definite helicity a = ±, ɛ + µ (p, ξ) = [p γ µ ξ 2 ξp, ɛ µ (p, ξ) = p γ µ ξ] 2[ξp], (5.217) where p is the gluon momentum and ξ is an auxiliary massless vector, called the reference momentum, reflecting the freedom of on-shell gauge transformations. The objects introduced in (5.217) have the following properties. Since p/ p ± = 0, the polarization vector ɛ ± (p, ξ) is transverse to p, i.e., ɛ ± (p, ξ) p = 0, (5.218) 149

152 for any choice of ξ with pξ 0. Complex conjugation acts on the polarization vectors like ( ɛ ± µ (p, ξ) ) = ɛ µ (p, ξ), (5.219) and they are normalized as follows ɛ ± (p, ξ) ( ɛ ± (p, ξ) ) = 1, ɛ ± (p, ξ) ( ɛ (p, ξ) ) = 0. (5.220) They also fulfill a complettness relation, which reads a=± ɛ a µ(p, ξ) (ɛ a ν(p, ξ)) = η µν + p µξ ν + ξ µ p ν pξ. (5.221) Equipped with the definition and the properties of the polarization vectors you can now actually calculate the matrix element for W + g production. Consider the case of the emission of a positive and negative helicity gluon separately and keep the gauge vector ξ arbitrary. Use the charge conjugation, Schouten, and Fierz identities, (5.209), (5.213), and (5.215), as well as the projection operators (5.210), to reduce both amplitudes to combinations of basic spinor products (5.202). Also employ momentum conservation, n i=1 pµ i = 0, which leads to the identity n i=1,i j,k [ji] ik = 0. (5.222) It is important that your final result for the helicity amplitudes is independent of the choice of ξ. Could you have obtained your results far more simple by a specific choice of ξ? Square the dū e ν e g amplitude and simplify your answer as much as possible. In the fundamental representation the generators T a fulfill tr ( T a T b) = T F δ ab with T F = 1/2 and T a T a = C F where C F = (N 2 c 1)/(2N c ) = 4/3 for N c = 3 corresponding to the QCD gauge group SU(3) c. ix) Heavy quark effective theory (HQET) is an effective field theory designed to systematically exploit the simplifications of the interactions of QCD in the heavy-quark limit for the case of hadrons containing a single heavy quark such as the B and D meson. The first goal of this exercise is to derive the interactions of a heavy quark with the light dofs starting from the Lagrangian L = Q (id/ m Q ) Q, (5.223) where Q is a Dirac spinor representing the heavy quark of mass m Q and D µ = µ ig s T a G a µ, (5.224) is the covariant derivative, which describes the minimal coupling of quarks to a gluon. It depends on the QCD coupling constant g s, the gluon fields G a µ with a = 1,..., 8, and the generators T a of SU(3) c. The obtained effective description will not only allow us to 150

153 show that the Lagrangian (5.223) has a spin-flavor symmetry in the limit m Q, but also provides a systematic and rigorous way to obtain corrections to the infinite mass limit. To warm up solve the free Dirac equation for a heavy quark at rest. Use the decomposition Q(x) = e im Qt Q(0), (5.225) and plague it into (5.35). What do you observe? The heavy-quark momentum p µ can always be decomposed as p µ = m Q v µ + k µ, (5.226) where v µ is the 4-velocity of the heavy hadron. Once m Q v µ, the large kinematical part of the momentum is singled out, the remaining component k µ is determined by soft QCD bound-state interactions, and thus k 2 m 2 Q. In order to work in an arbitrary frame one defines P ± = 1 ± v/, (5.227) 2 with v/v/ = v 2 = 1. Show that P ± are projection operators and find the explicit form of them in the rest frame. Remove the large-frequency part of the x-dependence in Q(x) resulting from the large momentum m Q v µ by plugging Q(x) = e im Qvx Q(x) ] = e im Qvx [P + Q(x) + P Q(x) (5.228) = e im Qvx [ h v (x) + H v (x) ], into the Lagrangian (5.223). Notice that (5.228) is the covariant generalization of decomposing Q(x) into upper and lower components. Why? To decouple the simplified Dirac equation multiply it by the projection operators and use P ± a/ = a/ P ± vap ±, (5.229) where a µ = aµ vav µ for any 4-vector a µ. From the two resulting equations derive a relation between H v (x) and h v (x) valid up to terms of O(1/m 2 Q ). Employ the relation between H v (x) and h v (x) to eliminate the field H v (x) from the system of equations. Using [D µ, Dν ] = igg µν, (5.230) find the final form of the EOM of the heavy-quark field h v (x). In (5.229), we have introduced the QCD field strength tensor G µν = G a µνt a. In its explicit form the field strength is given by G a µν = µ G a ν ν G a µ + gf abc G b µg c ν with [T a, T b ] = if abc T c, where f abc are the fully antisymmetric structure constants. 151

154 Write down the Lagrangian that leads to the EOM for h v (x) including O(1/m Q ) terms. Discuss the spin and flavor properties of the leading term and the power corrections in the 1/m Q expansion. By going to the heavy-quark rest frame determine the physical meaning of the two O(1/m Q ) corrections. Explain the appearance of the spin and flavor symmetry (and its breaking) in physical terms. Compare your findings for heavy-light meson systems with the physics of the hydrogen atom. Point out similarities/differences. Derive the Feynman rules for the heavy-quark propagator once starting from the HQET Lagrangian and once by expanding the propagator of the free Dirac theory. Give also the Feynman rule for the interaction of the heavy quark with the gluon. The masses of the vector and pseudoscalar B and D mesons are experimentally determined to be M B = 5.33 GeV, M B = 5.28 GeV and M D = 2.00 GeV, M D = 1.86 GeV, respectively. These numbers imply that M 2 B M 2 B = 0.53 GeV 2, M 2 D M 2 D = 0.54 GeV 2, (5.231) which suggests that the difference between the square of a heavy-light vector meson mass and the square of a heavy-light pseudoscalar meson mass is a constant. Can you explain this behavior qualitatively using the heavy-quark symmetries you have derived above? References [1] S. P. Martin, arxiv:hep-ph/ [2] O. W. Greenberg, Phys. Rev. Lett. 89, (2002) [arxiv:hep-ph/ ]. [3] V. A. Kostelecky and N. Russell, arxiv: [hep-ph]. [4] T. D. Lee and C. N. Yang, Phys. Rev. 104 (1956) 254. [5] C. S. Wu, E. Ambler, R. W. Hayward, D. D. Hoppes and R. P. Hudson, Phys. Rev. 105, 1413 (1957). [6] J. H. Christenson, J. W. Cronin, V. L. Fitch and R. Turlay, Phys. Rev. Lett. 13, 138 (1964). [7] M. Kobayashi and T. Maskawa, Prog. Theor. Phys. 49, 652 (1973). [8] A. D. Sakharov, Pisma Zh. Eksp. Teor. Fiz. 5 (1967) 32 [JETP Lett. 5 (1967) 24] [Sov. Phys. Usp. 34 (1991) 392] [Usp. Fiz. Nauk 161 (1991) 61]. [9] M. Dine, arxiv:hep-ph/ [10] R. D. Peccei and H. R. Quinn, Phys. Rev. Lett. 38, 1440 (1977). [11] S. L. Adler, Phys. Rev. 177, 2426 (1969). [12] J. S. Bell and R. Jackiw, Nuovo Cim. A 60, 47 (1969). 152

155 [13] S. L. Adler and W. A. Bardeen, Phys. Rev. 182, 1517 (1969). [14] S. L. Adler, arxiv:hep-th/ [15] S. M. Carroll, The Cosmological Constant, Living Rev. Relativity 3, 1 (2001), 153

University of Cambridge Part III Mathematical Tripos

University of Cambridge Part III Mathematical Tripos Preprint typeset in JHEP style - HYPER VERSION Michaelmas Term, 2006 and 2007 Quantum Field Theory University of Cambridge Part III Mathematical Tripos Dr David Tong Department of Applied Mathematics and

More information

Feynman diagrams. 1 Aim of the game 2

Feynman diagrams. 1 Aim of the game 2 Feynman diagrams Contents 1 Aim of the game 2 2 Rules 2 2.1 Vertices................................ 3 2.2 Anti-particles............................. 3 2.3 Distinct diagrams...........................

More information

Concepts in Theoretical Physics

Concepts in Theoretical Physics Concepts in Theoretical Physics Lecture 6: Particle Physics David Tong e 2 The Structure of Things 4πc 1 137 e d ν u Four fundamental particles Repeated twice! va, 9608085, 9902033 Four fundamental forces

More information

Chapter 22 The Hamiltonian and Lagrangian densities. from my book: Understanding Relativistic Quantum Field Theory. Hans de Vries

Chapter 22 The Hamiltonian and Lagrangian densities. from my book: Understanding Relativistic Quantum Field Theory. Hans de Vries Chapter 22 The Hamiltonian and Lagrangian densities from my book: Understanding Relativistic Quantum Field Theory Hans de Vries January 2, 2009 2 Chapter Contents 22 The Hamiltonian and Lagrangian densities

More information

The career of a young theoretical physicist consists of treating the harmonic oscillator in ever-increasing levels of abstraction.

The career of a young theoretical physicist consists of treating the harmonic oscillator in ever-increasing levels of abstraction. 2. Free Fields The career of a young theoretical physicist consists of treating the harmonic oscillator in ever-increasing levels of abstraction. Sidney Coleman 2.1 Canonical Quantization In quantum mechanics,

More information

Introduction to SME and Scattering Theory. Don Colladay. New College of Florida Sarasota, FL, 34243, U.S.A.

Introduction to SME and Scattering Theory. Don Colladay. New College of Florida Sarasota, FL, 34243, U.S.A. June 2012 Introduction to SME and Scattering Theory Don Colladay New College of Florida Sarasota, FL, 34243, U.S.A. This lecture was given at the IUCSS summer school during June of 2012. It contains a

More information

Special Theory of Relativity

Special Theory of Relativity June 1, 2010 1 1 J.D.Jackson, Classical Electrodynamics, 3rd Edition, Chapter 11 Introduction Einstein s theory of special relativity is based on the assumption (which might be a deep-rooted superstition

More information

Time Ordered Perturbation Theory

Time Ordered Perturbation Theory Michael Dine Department of Physics University of California, Santa Cruz October 2013 Quantization of the Free Electromagnetic Field We have so far quantized the free scalar field and the free Dirac field.

More information

State of Stress at Point

State of Stress at Point State of Stress at Point Einstein Notation The basic idea of Einstein notation is that a covector and a vector can form a scalar: This is typically written as an explicit sum: According to this convention,

More information

STRING THEORY: Past, Present, and Future

STRING THEORY: Past, Present, and Future STRING THEORY: Past, Present, and Future John H. Schwarz Simons Center March 25, 2014 1 OUTLINE I) Early History and Basic Concepts II) String Theory for Unification III) Superstring Revolutions IV) Remaining

More information

Theoretical Particle Physics FYTN04: Oral Exam Questions, version ht15

Theoretical Particle Physics FYTN04: Oral Exam Questions, version ht15 Theoretical Particle Physics FYTN04: Oral Exam Questions, version ht15 Examples of The questions are roughly ordered by chapter but are often connected across the different chapters. Ordering is as in

More information

Unified Lecture # 4 Vectors

Unified Lecture # 4 Vectors Fall 2005 Unified Lecture # 4 Vectors These notes were written by J. Peraire as a review of vectors for Dynamics 16.07. They have been adapted for Unified Engineering by R. Radovitzky. References [1] Feynmann,

More information

Particle Physics. Michaelmas Term 2011 Prof Mark Thomson. Handout 7 : Symmetries and the Quark Model. Introduction/Aims

Particle Physics. Michaelmas Term 2011 Prof Mark Thomson. Handout 7 : Symmetries and the Quark Model. Introduction/Aims Particle Physics Michaelmas Term 2011 Prof Mark Thomson Handout 7 : Symmetries and the Quark Model Prof. M.A. Thomson Michaelmas 2011 206 Introduction/Aims Symmetries play a central role in particle physics;

More information

University of Cambridge Part III Mathematical Tripos

University of Cambridge Part III Mathematical Tripos Preprint typeset in JHEP style - HYPER VERSION Michaelmas Term, 2006 and 2007 Quantum Field Theory University of Cambridge Part III Mathematical Tripos Dr David Tong Department of Applied Mathematics and

More information

Lecture L3 - Vectors, Matrices and Coordinate Transformations

Lecture L3 - Vectors, Matrices and Coordinate Transformations S. Widnall 16.07 Dynamics Fall 2009 Lecture notes based on J. Peraire Version 2.0 Lecture L3 - Vectors, Matrices and Coordinate Transformations By using vectors and defining appropriate operations between

More information

Elasticity Theory Basics

Elasticity Theory Basics G22.3033-002: Topics in Computer Graphics: Lecture #7 Geometric Modeling New York University Elasticity Theory Basics Lecture #7: 20 October 2003 Lecturer: Denis Zorin Scribe: Adrian Secord, Yotam Gingold

More information

Theory of electrons and positrons

Theory of electrons and positrons P AUL A. M. DIRAC Theory of electrons and positrons Nobel Lecture, December 12, 1933 Matter has been found by experimental physicists to be made up of small particles of various kinds, the particles of

More information

Solutions to Problems in Goldstein, Classical Mechanics, Second Edition. Chapter 7

Solutions to Problems in Goldstein, Classical Mechanics, Second Edition. Chapter 7 Solutions to Problems in Goldstein, Classical Mechanics, Second Edition Homer Reid April 21, 2002 Chapter 7 Problem 7.2 Obtain the Lorentz transformation in which the velocity is at an infinitesimal angle

More information

Derivation of the relativistic momentum and relativistic equation of motion from Newton s second law and Minkowskian space-time geometry

Derivation of the relativistic momentum and relativistic equation of motion from Newton s second law and Minkowskian space-time geometry Apeiron, Vol. 15, No. 3, July 2008 206 Derivation of the relativistic momentum and relativistic equation of motion from Newton s second law and Minkowskian space-time geometry Krzysztof Rȩbilas Zak lad

More information

Quantum Mechanics and Representation Theory

Quantum Mechanics and Representation Theory Quantum Mechanics and Representation Theory Peter Woit Columbia University Texas Tech, November 21 2013 Peter Woit (Columbia University) Quantum Mechanics and Representation Theory November 2013 1 / 30

More information

2 Session Two - Complex Numbers and Vectors

2 Session Two - Complex Numbers and Vectors PH2011 Physics 2A Maths Revision - Session 2: Complex Numbers and Vectors 1 2 Session Two - Complex Numbers and Vectors 2.1 What is a Complex Number? The material on complex numbers should be familiar

More information

Linear Algebra Notes for Marsden and Tromba Vector Calculus

Linear Algebra Notes for Marsden and Tromba Vector Calculus Linear Algebra Notes for Marsden and Tromba Vector Calculus n-dimensional Euclidean Space and Matrices Definition of n space As was learned in Math b, a point in Euclidean three space can be thought of

More information

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS Systems of Equations and Matrices Representation of a linear system The general system of m equations in n unknowns can be written a x + a 2 x 2 + + a n x n b a

More information

The Quantum Harmonic Oscillator Stephen Webb

The Quantum Harmonic Oscillator Stephen Webb The Quantum Harmonic Oscillator Stephen Webb The Importance of the Harmonic Oscillator The quantum harmonic oscillator holds a unique importance in quantum mechanics, as it is both one of the few problems

More information

Generally Covariant Quantum Mechanics

Generally Covariant Quantum Mechanics Chapter 15 Generally Covariant Quantum Mechanics by Myron W. Evans, Alpha Foundation s Institutute for Advance Study (AIAS). ([email protected], www.aias.us, www.atomicprecision.com) Dedicated to the Late

More information

Let s first see how precession works in quantitative detail. The system is illustrated below: ...

Let s first see how precession works in quantitative detail. The system is illustrated below: ... lecture 20 Topics: Precession of tops Nutation Vectors in the body frame The free symmetric top in the body frame Euler s equations The free symmetric top ala Euler s The tennis racket theorem As you know,

More information

2. Spin Chemistry and the Vector Model

2. Spin Chemistry and the Vector Model 2. Spin Chemistry and the Vector Model The story of magnetic resonance spectroscopy and intersystem crossing is essentially a choreography of the twisting motion which causes reorientation or rephasing

More information

Chapter 9 Unitary Groups and SU(N)

Chapter 9 Unitary Groups and SU(N) Chapter 9 Unitary Groups and SU(N) The irreducible representations of SO(3) are appropriate for describing the degeneracies of states of quantum mechanical systems which have rotational symmetry in three

More information

Till now, almost all attention has been focussed on discussing the state of a quantum system.

Till now, almost all attention has been focussed on discussing the state of a quantum system. Chapter 13 Observables and Measurements in Quantum Mechanics Till now, almost all attention has been focussed on discussing the state of a quantum system. As we have seen, this is most succinctly done

More information

Assessment Plan for Learning Outcomes for BA/BS in Physics

Assessment Plan for Learning Outcomes for BA/BS in Physics Department of Physics and Astronomy Goals and Learning Outcomes 1. Students know basic physics principles [BS, BA, MS] 1.1 Students can demonstrate an understanding of Newton s laws 1.2 Students can demonstrate

More information

3. Derive the partition function for the ideal monoatomic gas. Use Boltzmann statistics, and a quantum mechanical model for the gas.

3. Derive the partition function for the ideal monoatomic gas. Use Boltzmann statistics, and a quantum mechanical model for the gas. Tentamen i Statistisk Fysik I den tjugosjunde februari 2009, under tiden 9.00-15.00. Lärare: Ingemar Bengtsson. Hjälpmedel: Penna, suddgummi och linjal. Bedömning: 3 poäng/uppgift. Betyg: 0-3 = F, 4-6

More information

PX408: Relativistic Quantum Mechanics

PX408: Relativistic Quantum Mechanics January 2016 PX408: Relativistic Quantum Mechanics Tim Gershon ([email protected]) Handout 1: Revision & Notation Relativistic quantum mechanics, as its name implies, can be thought of as the bringing

More information

Chapter 17. Orthogonal Matrices and Symmetries of Space

Chapter 17. Orthogonal Matrices and Symmetries of Space Chapter 17. Orthogonal Matrices and Symmetries of Space Take a random matrix, say 1 3 A = 4 5 6, 7 8 9 and compare the lengths of e 1 and Ae 1. The vector e 1 has length 1, while Ae 1 = (1, 4, 7) has length

More information

MASTER OF SCIENCE IN PHYSICS MASTER OF SCIENCES IN PHYSICS (MS PHYS) (LIST OF COURSES BY SEMESTER, THESIS OPTION)

MASTER OF SCIENCE IN PHYSICS MASTER OF SCIENCES IN PHYSICS (MS PHYS) (LIST OF COURSES BY SEMESTER, THESIS OPTION) MASTER OF SCIENCE IN PHYSICS Admission Requirements 1. Possession of a BS degree from a reputable institution or, for non-physics majors, a GPA of 2.5 or better in at least 15 units in the following advanced

More information

Three Pictures of Quantum Mechanics. Thomas R. Shafer April 17, 2009

Three Pictures of Quantum Mechanics. Thomas R. Shafer April 17, 2009 Three Pictures of Quantum Mechanics Thomas R. Shafer April 17, 2009 Outline of the Talk Brief review of (or introduction to) quantum mechanics. 3 different viewpoints on calculation. Schrödinger, Heisenberg,

More information

Contents. Goldstone Bosons in 3He-A Soft Modes Dynamics and Lie Algebra of Group G:

Contents. Goldstone Bosons in 3He-A Soft Modes Dynamics and Lie Algebra of Group G: ... Vlll Contents 3. Textures and Supercurrents in Superfluid Phases of 3He 3.1. Textures, Gradient Energy and Rigidity 3.2. Why Superfuids are Superfluid 3.3. Superfluidity and Response to a Transverse

More information

(Most of the material presented in this chapter is taken from Thornton and Marion, Chap. 7)

(Most of the material presented in this chapter is taken from Thornton and Marion, Chap. 7) Chapter 4. Lagrangian Dynamics (Most of the material presented in this chapter is taken from Thornton and Marion, Chap. 7 4.1 Important Notes on Notation In this chapter, unless otherwise stated, the following

More information

High Energy Physics. Lecture 4 More kinematics and a picture show of particle collisions

High Energy Physics. Lecture 4 More kinematics and a picture show of particle collisions High Energy Physics Lecture 4 More kinematics and a picture show of particle collisions 1 Recall from the previous lecture: the momentum of the scattered Particle in an elastic collision is given by p

More information

ASEN 3112 - Structures. MDOF Dynamic Systems. ASEN 3112 Lecture 1 Slide 1

ASEN 3112 - Structures. MDOF Dynamic Systems. ASEN 3112 Lecture 1 Slide 1 19 MDOF Dynamic Systems ASEN 3112 Lecture 1 Slide 1 A Two-DOF Mass-Spring-Dashpot Dynamic System Consider the lumped-parameter, mass-spring-dashpot dynamic system shown in the Figure. It has two point

More information

DO PHYSICS ONLINE FROM QUANTA TO QUARKS QUANTUM (WAVE) MECHANICS

DO PHYSICS ONLINE FROM QUANTA TO QUARKS QUANTUM (WAVE) MECHANICS DO PHYSICS ONLINE FROM QUANTA TO QUARKS QUANTUM (WAVE) MECHANICS Quantum Mechanics or wave mechanics is the best mathematical theory used today to describe and predict the behaviour of particles and waves.

More information

1 Lecture 3: Operators in Quantum Mechanics

1 Lecture 3: Operators in Quantum Mechanics 1 Lecture 3: Operators in Quantum Mechanics 1.1 Basic notions of operator algebra. In the previous lectures we have met operators: ˆx and ˆp = i h they are called fundamental operators. Many operators

More information

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS. + + x 2. x n. a 11 a 12 a 1n b 1 a 21 a 22 a 2n b 2 a 31 a 32 a 3n b 3. a m1 a m2 a mn b m

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS. + + x 2. x n. a 11 a 12 a 1n b 1 a 21 a 22 a 2n b 2 a 31 a 32 a 3n b 3. a m1 a m2 a mn b m MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS 1. SYSTEMS OF EQUATIONS AND MATRICES 1.1. Representation of a linear system. The general system of m equations in n unknowns can be written a 11 x 1 + a 12 x 2 +

More information

Continued Fractions and the Euclidean Algorithm

Continued Fractions and the Euclidean Algorithm Continued Fractions and the Euclidean Algorithm Lecture notes prepared for MATH 326, Spring 997 Department of Mathematics and Statistics University at Albany William F Hammond Table of Contents Introduction

More information

arxiv:1111.4354v2 [physics.acc-ph] 27 Oct 2014

arxiv:1111.4354v2 [physics.acc-ph] 27 Oct 2014 Theory of Electromagnetic Fields Andrzej Wolski University of Liverpool, and the Cockcroft Institute, UK arxiv:1111.4354v2 [physics.acc-ph] 27 Oct 2014 Abstract We discuss the theory of electromagnetic

More information

CBE 6333, R. Levicky 1 Differential Balance Equations

CBE 6333, R. Levicky 1 Differential Balance Equations CBE 6333, R. Levicky 1 Differential Balance Equations We have previously derived integral balances for mass, momentum, and energy for a control volume. The control volume was assumed to be some large object,

More information

arxiv:1408.3381v1 [physics.gen-ph] 17 Sep 2013

arxiv:1408.3381v1 [physics.gen-ph] 17 Sep 2013 Derivation of the relativistic momentum and relativistic equation of motion from Newton s second law and Minkowskian space-time geometry arxiv:1408.3381v1 [physics.gen-ph] 17 Sep 2013 Krzysztof Rȩbilas

More information

1 Introduction. 1 There may, of course, in principle, exist other universes, but they are not accessible to our

1 Introduction. 1 There may, of course, in principle, exist other universes, but they are not accessible to our 1 1 Introduction Cosmology is the study of the universe as a whole, its structure, its origin, and its evolution. Cosmology is soundly based on observations, mostly astronomical, and laws of physics. These

More information

MATH10212 Linear Algebra. Systems of Linear Equations. Definition. An n-dimensional vector is a row or a column of n numbers (or letters): a 1.

MATH10212 Linear Algebra. Systems of Linear Equations. Definition. An n-dimensional vector is a row or a column of n numbers (or letters): a 1. MATH10212 Linear Algebra Textbook: D. Poole, Linear Algebra: A Modern Introduction. Thompson, 2006. ISBN 0-534-40596-7. Systems of Linear Equations Definition. An n-dimensional vector is a row or a column

More information

6.4 Normal Distribution

6.4 Normal Distribution Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under

More information

8.04: Quantum Mechanics Professor Allan Adams Massachusetts Institute of Technology. Problem Set 5

8.04: Quantum Mechanics Professor Allan Adams Massachusetts Institute of Technology. Problem Set 5 8.04: Quantum Mechanics Professor Allan Adams Massachusetts Institute of Technology Tuesday March 5 Problem Set 5 Due Tuesday March 12 at 11.00AM Assigned Reading: E&R 6 9, App-I Li. 7 1 4 Ga. 4 7, 6 1,2

More information

5 Homogeneous systems

5 Homogeneous systems 5 Homogeneous systems Definition: A homogeneous (ho-mo-jeen -i-us) system of linear algebraic equations is one in which all the numbers on the right hand side are equal to : a x +... + a n x n =.. a m

More information

15.062 Data Mining: Algorithms and Applications Matrix Math Review

15.062 Data Mining: Algorithms and Applications Matrix Math Review .6 Data Mining: Algorithms and Applications Matrix Math Review The purpose of this document is to give a brief review of selected linear algebra concepts that will be useful for the course and to develop

More information

Math 4310 Handout - Quotient Vector Spaces

Math 4310 Handout - Quotient Vector Spaces Math 4310 Handout - Quotient Vector Spaces Dan Collins The textbook defines a subspace of a vector space in Chapter 4, but it avoids ever discussing the notion of a quotient space. This is understandable

More information

A Theory for the Cosmological Constant and its Explanation of the Gravitational Constant

A Theory for the Cosmological Constant and its Explanation of the Gravitational Constant A Theory for the Cosmological Constant and its Explanation of the Gravitational Constant H.M.Mok Radiation Health Unit, 3/F., Saiwanho Health Centre, Hong Kong SAR Govt, 8 Tai Hong St., Saiwanho, Hong

More information

5.61 Physical Chemistry 25 Helium Atom page 1 HELIUM ATOM

5.61 Physical Chemistry 25 Helium Atom page 1 HELIUM ATOM 5.6 Physical Chemistry 5 Helium Atom page HELIUM ATOM Now that we have treated the Hydrogen like atoms in some detail, we now proceed to discuss the next simplest system: the Helium atom. In this situation,

More information

Notes on Orthogonal and Symmetric Matrices MENU, Winter 2013

Notes on Orthogonal and Symmetric Matrices MENU, Winter 2013 Notes on Orthogonal and Symmetric Matrices MENU, Winter 201 These notes summarize the main properties and uses of orthogonal and symmetric matrices. We covered quite a bit of material regarding these topics,

More information

Physics 9e/Cutnell. correlated to the. College Board AP Physics 1 Course Objectives

Physics 9e/Cutnell. correlated to the. College Board AP Physics 1 Course Objectives Physics 9e/Cutnell correlated to the College Board AP Physics 1 Course Objectives Big Idea 1: Objects and systems have properties such as mass and charge. Systems may have internal structure. Enduring

More information

The continuous and discrete Fourier transforms

The continuous and discrete Fourier transforms FYSA21 Mathematical Tools in Science The continuous and discrete Fourier transforms Lennart Lindegren Lund Observatory (Department of Astronomy, Lund University) 1 The continuous Fourier transform 1.1

More information

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Module No. #01 Lecture No. #15 Special Distributions-VI Today, I am going to introduce

More information

PHY4604 Introduction to Quantum Mechanics Fall 2004 Practice Test 3 November 22, 2004

PHY4604 Introduction to Quantum Mechanics Fall 2004 Practice Test 3 November 22, 2004 PHY464 Introduction to Quantum Mechanics Fall 4 Practice Test 3 November, 4 These problems are similar but not identical to the actual test. One or two parts will actually show up.. Short answer. (a) Recall

More information

Does Quantum Mechanics Make Sense? Size

Does Quantum Mechanics Make Sense? Size Does Quantum Mechanics Make Sense? Some relatively simple concepts show why the answer is yes. Size Classical Mechanics Quantum Mechanics Relative Absolute What does relative vs. absolute size mean? Why

More information

Continuous Groups, Lie Groups, and Lie Algebras

Continuous Groups, Lie Groups, and Lie Algebras Chapter 7 Continuous Groups, Lie Groups, and Lie Algebras Zeno was concerned with three problems... These are the problem of the infinitesimal, the infinite, and continuity... Bertrand Russell The groups

More information

REALIZING EINSTEIN S DREAM Exploring Our Mysterious Universe

REALIZING EINSTEIN S DREAM Exploring Our Mysterious Universe REALIZING EINSTEIN S DREAM Exploring Our Mysterious Universe The End of Physics Albert A. Michelson, at the dedication of Ryerson Physics Lab, U. of Chicago, 1894 The Miracle Year - 1905 Relativity Quantum

More information

Mechanics 1: Conservation of Energy and Momentum

Mechanics 1: Conservation of Energy and Momentum Mechanics : Conservation of Energy and Momentum If a certain quantity associated with a system does not change in time. We say that it is conserved, and the system possesses a conservation law. Conservation

More information

P.A.M. Dirac Received May 29, 1931

P.A.M. Dirac Received May 29, 1931 P.A.M. Dirac, Proc. Roy. Soc. A 133, 60 1931 Quantised Singularities in the Electromagnetic Field P.A.M. Dirac Received May 29, 1931 1. Introduction The steady progress of physics requires for its theoretical

More information

So let us begin our quest to find the holy grail of real analysis.

So let us begin our quest to find the holy grail of real analysis. 1 Section 5.2 The Complete Ordered Field: Purpose of Section We present an axiomatic description of the real numbers as a complete ordered field. The axioms which describe the arithmetic of the real numbers

More information

Cross section, Flux, Luminosity, Scattering Rates

Cross section, Flux, Luminosity, Scattering Rates Cross section, Flux, Luminosity, Scattering Rates Table of Contents Paul Avery (Andrey Korytov) Sep. 9, 013 1 Introduction... 1 Cross section, flux and scattering... 1 3 Scattering length λ and λ ρ...

More information

Name Partners Date. Energy Diagrams I

Name Partners Date. Energy Diagrams I Name Partners Date Visual Quantum Mechanics The Next Generation Energy Diagrams I Goal Changes in energy are a good way to describe an object s motion. Here you will construct energy diagrams for a toy

More information

Unification - The Standard Model

Unification - The Standard Model Unification - The Standard Model Information on Physics level 4 Undergraduate Course PT.4.6 K.S. Stelle, Office Huxley 519 November 9, 2015 Rapid Feedback to be handed in to the UG office Level 3 (day

More information

3. Open Strings and D-Branes

3. Open Strings and D-Branes 3. Open Strings and D-Branes In this section we discuss the dynamics of open strings. Clearly their distinguishing feature is the existence of two end points. Our goal is to understand the effect of these

More information

Vector or Pseudovector?

Vector or Pseudovector? Vector or Pseudovector? Jeffrey A. Phillips Loyola Marymount University Los Angeles, CA 90045 By using a corner reflector it is possible to perform an inversion or improper transformation thereby identifying

More information

SYSTEMS OF EQUATIONS AND MATRICES WITH THE TI-89. by Joseph Collison

SYSTEMS OF EQUATIONS AND MATRICES WITH THE TI-89. by Joseph Collison SYSTEMS OF EQUATIONS AND MATRICES WITH THE TI-89 by Joseph Collison Copyright 2000 by Joseph Collison All rights reserved Reproduction or translation of any part of this work beyond that permitted by Sections

More information

Inner Product Spaces

Inner Product Spaces Math 571 Inner Product Spaces 1. Preliminaries An inner product space is a vector space V along with a function, called an inner product which associates each pair of vectors u, v with a scalar u, v, and

More information

CHAPTER 5 Round-off errors

CHAPTER 5 Round-off errors CHAPTER 5 Round-off errors In the two previous chapters we have seen how numbers can be represented in the binary numeral system and how this is the basis for representing numbers in computers. Since any

More information

Boardworks AS Physics

Boardworks AS Physics Boardworks AS Physics Vectors 24 slides 11 Flash activities Prefixes, scalars and vectors Guide to the SI unit prefixes of orders of magnitude Matching powers of ten to their SI unit prefixes Guide to

More information

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 2

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 2 CS 70 Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 2 Proofs Intuitively, the concept of proof should already be familiar We all like to assert things, and few of us

More information

FLAP P11.2 The quantum harmonic oscillator

FLAP P11.2 The quantum harmonic oscillator F L E X I B L E L E A R N I N G A P P R O A C H T O P H Y S I C S Module P. Opening items. Module introduction. Fast track questions.3 Ready to study? The harmonic oscillator. Classical description of

More information

Second Order Linear Nonhomogeneous Differential Equations; Method of Undetermined Coefficients. y + p(t) y + q(t) y = g(t), g(t) 0.

Second Order Linear Nonhomogeneous Differential Equations; Method of Undetermined Coefficients. y + p(t) y + q(t) y = g(t), g(t) 0. Second Order Linear Nonhomogeneous Differential Equations; Method of Undetermined Coefficients We will now turn our attention to nonhomogeneous second order linear equations, equations with the standard

More information

7. DYNAMIC LIGHT SCATTERING 7.1 First order temporal autocorrelation function.

7. DYNAMIC LIGHT SCATTERING 7.1 First order temporal autocorrelation function. 7. DYNAMIC LIGHT SCATTERING 7. First order temporal autocorrelation function. Dynamic light scattering (DLS) studies the properties of inhomogeneous and dynamic media. A generic situation is illustrated

More information

4.5 Linear Dependence and Linear Independence

4.5 Linear Dependence and Linear Independence 4.5 Linear Dependence and Linear Independence 267 32. {v 1, v 2 }, where v 1, v 2 are collinear vectors in R 3. 33. Prove that if S and S are subsets of a vector space V such that S is a subset of S, then

More information

Constrained optimization.

Constrained optimization. ams/econ 11b supplementary notes ucsc Constrained optimization. c 2010, Yonatan Katznelson 1. Constraints In many of the optimization problems that arise in economics, there are restrictions on the values

More information

Chapter 15 Collision Theory

Chapter 15 Collision Theory Chapter 15 Collision Theory 151 Introduction 1 15 Reference Frames Relative and Velocities 1 151 Center of Mass Reference Frame 15 Relative Velocities 3 153 Characterizing Collisions 5 154 One-Dimensional

More information

SCATTERING CROSS SECTIONS AND LORENTZ VIOLATION DON COLLADAY

SCATTERING CROSS SECTIONS AND LORENTZ VIOLATION DON COLLADAY SCATTERING CROSS SECTIONS AND LORENTZ VIOLATION DON COLLADAY New College of Florida, 5700 Tamiami Trail, Sarasota, FL 34243, USA E-mail: [email protected] To date, a significant effort has been made

More information

Gauge theories and the standard model of elementary particle physics

Gauge theories and the standard model of elementary particle physics Gauge theories and the standard model of elementary particle physics Mark Hamilton 21st July 2014 1 / 35 Table of contents 1 The standard model 2 3 2 / 35 The standard model The standard model is the most

More information

DIFFERENTIABILITY OF COMPLEX FUNCTIONS. Contents

DIFFERENTIABILITY OF COMPLEX FUNCTIONS. Contents DIFFERENTIABILITY OF COMPLEX FUNCTIONS Contents 1. Limit definition of a derivative 1 2. Holomorphic functions, the Cauchy-Riemann equations 3 3. Differentiability of real functions 5 4. A sufficient condition

More information

NOTES ON LINEAR TRANSFORMATIONS

NOTES ON LINEAR TRANSFORMATIONS NOTES ON LINEAR TRANSFORMATIONS Definition 1. Let V and W be vector spaces. A function T : V W is a linear transformation from V to W if the following two properties hold. i T v + v = T v + T v for all

More information

Introduction to Elementary Particle Physics. Note 01 Page 1 of 8. Natural Units

Introduction to Elementary Particle Physics. Note 01 Page 1 of 8. Natural Units Introduction to Elementary Particle Physics. Note 01 Page 1 of 8 Natural Units There are 4 primary SI units: three kinematical (meter, second, kilogram) and one electrical (Ampere 1 ) It is common in the

More information

Time dependence in quantum mechanics Notes on Quantum Mechanics

Time dependence in quantum mechanics Notes on Quantum Mechanics Time dependence in quantum mechanics Notes on Quantum Mechanics http://quantum.bu.edu/notes/quantummechanics/timedependence.pdf Last updated Thursday, November 20, 2003 13:22:37-05:00 Copyright 2003 Dan

More information

Notes on Quantum Mechanics

Notes on Quantum Mechanics Notes on Quantum Mechanics K. Schulten Department of Physics and Beckman Institute University of Illinois at Urbana Champaign 405 N. Mathews Street, Urbana, IL 680 USA April 8, 000 Preface i Preface The

More information

Physics 235 Chapter 1. Chapter 1 Matrices, Vectors, and Vector Calculus

Physics 235 Chapter 1. Chapter 1 Matrices, Vectors, and Vector Calculus Chapter 1 Matrices, Vectors, and Vector Calculus In this chapter, we will focus on the mathematical tools required for the course. The main concepts that will be covered are: Coordinate transformations

More information

The Matrix Elements of a 3 3 Orthogonal Matrix Revisited

The Matrix Elements of a 3 3 Orthogonal Matrix Revisited Physics 116A Winter 2011 The Matrix Elements of a 3 3 Orthogonal Matrix Revisited 1. Introduction In a class handout entitled, Three-Dimensional Proper and Improper Rotation Matrices, I provided a derivation

More information

Chapter 3. Cartesian Products and Relations. 3.1 Cartesian Products

Chapter 3. Cartesian Products and Relations. 3.1 Cartesian Products Chapter 3 Cartesian Products and Relations The material in this chapter is the first real encounter with abstraction. Relations are very general thing they are a special type of subset. After introducing

More information

Section 1.1. Introduction to R n

Section 1.1. Introduction to R n The Calculus of Functions of Several Variables Section. Introduction to R n Calculus is the study of functional relationships and how related quantities change with each other. In your first exposure to

More information

PHYS 1624 University Physics I. PHYS 2644 University Physics II

PHYS 1624 University Physics I. PHYS 2644 University Physics II PHYS 1624 Physics I An introduction to mechanics, heat, and wave motion. This is a calculus- based course for Scientists and Engineers. 4 hours (3 lecture/3 lab) Prerequisites: Credit for MATH 2413 (Calculus

More information

PYTHAGOREAN TRIPLES KEITH CONRAD

PYTHAGOREAN TRIPLES KEITH CONRAD PYTHAGOREAN TRIPLES KEITH CONRAD 1. Introduction A Pythagorean triple is a triple of positive integers (a, b, c) where a + b = c. Examples include (3, 4, 5), (5, 1, 13), and (8, 15, 17). Below is an ancient

More information

0.33 d down 1 1. 0.33 c charm + 2 3. 0 0 1.5 s strange 1 3. 0 0 0.5 t top + 2 3. 0 0 172 b bottom 1 3

0.33 d down 1 1. 0.33 c charm + 2 3. 0 0 1.5 s strange 1 3. 0 0 0.5 t top + 2 3. 0 0 172 b bottom 1 3 Chapter 16 Constituent Quark Model Quarks are fundamental spin- 1 particles from which all hadrons are made up. Baryons consist of three quarks, whereas mesons consist of a quark and an anti-quark. There

More information

PEDAGOGY: THE BUBBLE ANALOGY AND THE DIFFERENCE BETWEEN GRAVITATIONAL FORCES AND ROCKET THRUST IN SPATIAL FLOW THEORIES OF GRAVITY *

PEDAGOGY: THE BUBBLE ANALOGY AND THE DIFFERENCE BETWEEN GRAVITATIONAL FORCES AND ROCKET THRUST IN SPATIAL FLOW THEORIES OF GRAVITY * PEDAGOGY: THE BUBBLE ANALOGY AND THE DIFFERENCE BETWEEN GRAVITATIONAL FORCES AND ROCKET THRUST IN SPATIAL FLOW THEORIES OF GRAVITY * Tom Martin Gravity Research Institute Boulder, Colorado 80306-1258 [email protected]

More information

a 11 x 1 + a 12 x 2 + + a 1n x n = b 1 a 21 x 1 + a 22 x 2 + + a 2n x n = b 2.

a 11 x 1 + a 12 x 2 + + a 1n x n = b 1 a 21 x 1 + a 22 x 2 + + a 2n x n = b 2. Chapter 1 LINEAR EQUATIONS 1.1 Introduction to linear equations A linear equation in n unknowns x 1, x,, x n is an equation of the form a 1 x 1 + a x + + a n x n = b, where a 1, a,..., a n, b are given

More information