Appendices
Appendix A: Elements of General Relativity
The following is a brief introduction to general relativity (GR). The goal is to present the minimal theoretical background for cosmology.
A.1 Spacetime and Relativity
The speed of light is the same in all inertial reference frame. Consider two inertial frame 𝑆 and 𝑆ʹ. The frame 𝑆ʹ is moving with a constant velocity 𝑣 in the 𝑥-direction against 𝑆. Then the coordinates in 𝑆ʹ are related to those in 𝑆 by the following Lorentz transformation
(A.1) 𝑡ʹ = 𝛾(𝑡 - 𝑣𝑥/𝑐2), 𝑥ʹ = 𝛾(𝑥 - 𝑣𝑡), 𝑦ʹ = 𝑦, 𝑧ʹ = 𝑧, where 𝛾 ≡ 1/ √(1 - 𝑣2/𝑐2) [Lorentz factor]
Consider 3-dimensional Euclidean space with coordinates 𝐱 = (𝑥, 𝑦, 𝑧) as defined in a frame 𝑆. A second frame 𝑆ʹ may have coordinates 𝐱ʹ = (𝑥ʹ, 𝑦ʹ, 𝑧ʹ), where 𝐱ʹ = 𝑅𝐱, for some rotation matrix 𝑅. The two coordinate systems share the same origin, so that the coordinates in 𝑆ʹ become a mixture of the coordinates in 𝑆. Similarly Lorentz transformation can be thought of as rotation between time and space. The mixing of space and time has profound implications: (1) evens that are simultaneous in on frame are not simultaneous in another; (2) moving clocks run slow ("time dilation"); and (3) moving rods are shorten ("length contraction").
A.1.2 Spacetime and Four-Vectors
Although a rotation changes the components of a vector ∆𝐱 connecting two points in space, it will not change the distance ∣∆𝐱∣ between the points, ∣∆𝐱∣2 = ∆𝑥2 + ∆𝑦2 + ∆𝑧2 is an invariant. Similarly, although time and space are relative, all observers will agree on the spacetime interval:
(A.2) ∆𝑠2 = -𝑐2∆𝑡2 + ∆𝑥2 + ∆𝑦2 + ∆𝑧2.
We can demonstrate this explicitly for the the specific transformation in (A.1) as follows
(A.3) ∆𝑠2 = -𝑐2(∆𝑡ʹ)2 + (∆𝑥ʹ)2 = -𝛾2(𝑐∆𝑡 - 𝑣∆𝑥/𝑐)2 + 𝛾2(∆𝑥 - 𝑣∆𝑡)2 = -𝛾2(𝑐2 - 𝑣2)(∆𝑡)22 + 𝛾2(∆𝑥 - 𝑣∆𝑡)2 + 𝛾2(1 - 𝑣2/𝑐2)(∆𝑡)2 = -𝑐2∆𝑡2 + ∆𝑥2.
In GR we encounter the spacetime interval between points that are infinitesimally close to each other. We write the interval is
(A.4) d𝑠2 = -𝑐2d𝑡2 + d𝑥2 + d𝑦2 + d𝑧2,
and call it line elements.
Two events that are timelike separated have ∆𝑠2 < 0. Two events with ∆𝑠2 > 0v are said to be spacelike separated. Finally two events with ∆𝑠2 = 0 are lightlike separated and can be connected by a light ray. The set of all points that are lightlike separated from a point 𝑝 defines its lightcone. Points tjat are timeike separated from 𝑝 lie this lightcone, while spacelike separated points are out the lightcone. T respect causality a particle must travel on a timelike path through spaetime. We call this path the particle's worldline.
We can combine time and space in relativity into a four-vector
(A.5) 𝑥𝜇 = (𝑐𝑡, 𝑥, 𝑦, 𝑧), where 𝜇 runs from 0 to 3 and the zeroth component is time.
To make the symmetry between time and space more manifest, we will use units where speed of light is set to unity, 𝑐 ≡ 1. The line element (A. 4) can then be written as
(A.6) d𝑠2 = 𝜂𝜇𝜈d𝑥𝜇d𝑥𝜈
where 𝜂𝜇𝜈 is the Minkowski metric
(A.7) 𝜂𝜇𝜈 = ⌈ -1 0 0 0 ⌉
┃ 0 1 0 0 ┃
┃ 0 0 1 0 ┃
⌊ 0 0 0 1 ⌋
In (A. 6) we used Einstein's summation convention which declares repeated indices to be summed over.
Under a Lorentz transformation the spacetime four-vector transforms as
(A.8) 𝑥ʹ𝜇 = 𝛬𝜇𝜈𝑥𝜈,
where 𝛬𝜇𝜈 is a 4 × 4 matrix, which for the specific transformation in (𝛢. 1) is
(A.9) 𝛬𝜇𝜈 = ⌈ 𝛾 -𝛾𝑣 0 0 ⌉
┃ -𝛾𝑣 𝛾 0 0 ┃
┃ 0 0 1 0 ┃
⌊ 0 0 0 1 ⌋
In general, the invariance of the elements (A. 6) requires that
(A.10) 𝜂𝜌𝜎 = 𝛬𝜇𝜌𝛬𝜈𝜎𝜂𝜇𝜈,
and the set of matrices satisfying these constraint defines the Lorentz group.
The metric can also be used to lower the index of the vector 𝑥𝜇 to produce the components of the dual co-vector
(A.11) 𝑥𝜇 = 𝜂𝜇𝜈𝑥𝜈 = (-𝑡, 𝑥, 𝑦, 𝑧).
Sometimes 𝑥𝜇 is called a covariant vector, while 𝑥𝜇 is a contravariant vector. To raise an index, we need the inverse metric 𝜂𝜇𝜈, defined by 𝜂𝜇𝜈𝜂𝜇𝜈 = 𝛿𝜇𝜈, so that 𝑥𝜇 = 𝜂𝜇𝜈𝑥𝜈. An important co-vector is the differential operator
(A.12) ∂𝜇 ≡ ∂/∂𝑥𝜇 = (∂𝑡, ∂𝑥, ∂𝑦, ∂𝑧).
The inner product of a vector and a co-vector is
(A.13) 𝑥𝜇𝑥𝜇 = -𝑡 + 𝐱 ⋅ 𝐱.
In order for this inner product to be Lorentz invariant, the components of a co-vector must transform as
(A.14) 𝑥ʹ𝜇 = (𝛬-1)𝜇𝜈𝑥𝜈, where (𝛬-1)𝜇𝜈 is the inverse of 𝛬𝜇𝜈.
Natural generalization such vectors and co-vectors are tensors. tensor of rank (𝑚, 𝑛) has 𝑚 contravariant (upper) indices and 𝑛 covariant (lower) indices:
(A.15) 𝛵𝜇1∙∙∙𝜇𝑚𝜈1∙∙∙𝜈𝑛,
The transformation of such a tensor is
(A.16) (𝛵ʹ)𝜇1∙∙∙𝜇𝑚𝜈1∙∙∙𝜈𝑛 = 𝛬𝜇1𝜎1∙ ∙ ∙ (𝛬-1)𝜌1𝜎1𝜈1∙ ∙ ∙ 𝛵𝜎1∙∙∙𝜎𝑚𝜌1∙∙∙𝜌𝑛.
The most complicated tensors in SR are electromagnetic field strength 𝐹𝜇𝜈 and the energy-momentum tensor
𝛵𝜇𝜈. In GR the most complicated one is the Riemann tensor 𝑅𝜇𝜈𝜌𝜎.
If a physical law can be written in terms of spacetime tensors, it means that it holds in any reference frame. So if the law is true in one inertial frame, it will be true in any Lorentz-transformed frame. Newton's law cannot be written in terms of spacetime tensors but Maxwell's equations can be written in tensorial form and therefore consistent with with relativity. Einstein was motivated by Maxwell's equations because they imply that the speed of light should be independent of the motion of the observer.
Exercise A.1 Consider the inhomogenous Maxwell equations:
(A.17-8) 𝛁 ⋅ 𝐄 = 𝜌, 𝛁 × 𝐁 - ∂𝑡𝐄 = 𝐉.
Defining the 4-vector current 𝐽𝜇 = (𝜌, 𝐽𝑖) and the field -strength tensor,
(A.19) 𝐹𝜇𝜈 = ⌈ 0 𝐸𝑥 𝐸𝑦 𝐸𝑧 ⌉
┃ -𝐸𝑥 0 𝛣𝑧 -𝛣𝑦 ┃
┃ -𝐸𝑦 -𝛣𝑧 0 𝛣𝑥 ┃
⌊ -𝐸𝑧 𝛣𝑦 -𝛣𝑥 0 ⌋
show that these equations can be written as
(A.20) ∂𝜈𝐹𝜇𝜈 = 𝐽𝜇, where ∂𝜈 ≡ ∂/∂𝑥𝜈.
Hints: Write the Maxwell equation in components, using (𝐄)𝑖 = 𝐹0𝑖 and (𝛁 × 𝐁)𝑖 = 𝜖𝑖𝑗𝑘∂𝑗𝛣𝑘 = ∂𝑗𝐹𝑖𝑗.
[Solution] We know that 𝛁 ⋅ 𝐄 = ∂𝑖(𝐄)𝑖 = ∂𝑖𝐹0𝑖 and (𝛁 × 𝐁) = ∂𝑗𝐹𝑖𝑗,
(a) ∂𝑖𝐹0𝑖 = 𝜌 = 𝐽0,
(b) ∂𝑗𝐹𝑖𝑗 - ∂0𝐹0𝑖 = 𝐽𝑖.
Since from (A.19), 𝐹00 = 0 and ∂0𝐹0𝑖 = -∂0𝐹𝑖0,
(c) ∂𝑖𝐹0𝑖 + ∂0𝐹00 = ∂𝜈𝐹0𝜈 =𝐽0,
(d) ∂𝑗𝐹𝑖𝑗 + ∂0𝐹𝑖0 = ∂𝜈𝐹𝑖𝜈 = 𝐽𝑖, So we have
(e) ∂𝜈𝐹𝜇𝜈 = 𝐽𝜇. ▮
A.1.3 Relativistic Kinematics
Consider a massive particle moving through spacetime. The trajectory of the particle is specified by the function 𝑥𝜇(𝜆), where 𝜆 is a parameter labeling the points along the particle's worldline. As 𝜆, we can choose to use the time experienced by the particle called the proper time 𝜏. In the rest frame, we have
(A.21) d𝜏2 = -d𝑠2.
In a general frame, the spatial position 𝐱 of the particle will be a function of time 𝑡. In terms of these coordinates
(A.22) d𝜏 = √(d𝑡2 - d𝐱2) = d𝑡√[1 - (d𝐱/d𝑡)2] = d𝑡√(1 - 𝑣2) = d𝑡/𝛾.
Given the function 𝑥𝜇(𝜏), we can define the four-velocity of the particle
(A.23) 𝑈𝜇 ≡ d𝑥𝜇/d𝜏.
Since 𝜏 is a Lorentz invariant, 𝑈𝜇 transforms in the same way as 𝑥𝜇 and is also a 4-vector. But d𝑥𝜇/d𝑡 is not a 4-vector since both 𝑥𝜇 and 𝑡 change under a Lorentz transformation. Since 𝑈𝜇 is a 4-vector, the inner product 𝑈𝜇𝑈𝜇 is a Lorentz invariant, 𝑈𝜇𝑈𝜇 = -1. Finally from (A.22) the 4-velocity in a general frame is
(A.24) 𝑈𝜇 = 𝛾(1, 𝐯),
while in the rest frame of the particle it becomes 𝑈𝜇 = (1, 0, 0, 0).
Another important quantity is the four momentum
(A.25) 𝑃𝜇 = 𝑚𝑈𝜇,
where 𝑚 is the mass of the particle. Given (A.24), we have 𝑃𝜇 = 𝛾𝑚(1, 𝐯). The spatial part gives the relativistic generalization of the 3-momentum, 𝐩 = 𝛾𝑚𝐯, while the time component is the energy of the particle 𝐸 = 𝛾𝑚. The inner product of the 4-momentum is 𝑃𝜇𝑃𝜇 = -𝑚2, which for 𝑃𝜇 = (𝐸, 𝑝𝑖) becomes
(A.26) 𝐸2 = 𝐩2 + 𝑚2.
This is the generalization of the famous 𝐸 = 𝑚𝑐2 to include kinetic energy.
In case of massless particles like photons, they travel on lightlike trajectory with d𝑠2 = 0. The result in (A.26) still holds in the massless limit and it gives
(A.27) 𝐸 = √(𝐩2 + 𝑚2) → ∣𝐩∣.
The 4-momentum of a massless particle is 𝑃𝜇 = (∣𝐩∣, 𝐩), with 𝑃𝜇𝑃𝜇 = 0.
Exercise A.2 Recall from quantum mechanics that he energy of a photon is
(A.28) 𝐸 = 𝘩𝑐/𝜆,
where 𝘩 is Plank's constant and 𝜆 is the wavelength of the light. Consider an observer traveling with a velocity 𝑣 toward a photon moving in the 𝑥-direction. By transforming to the frame of observer, determine the shift in the wavelength of the photon. This is the relativistic Doppler effect.
[Solution] By the Lorentz transform
(a) 𝐸ʹ = 𝛾(𝐸 + 𝑣𝑝𝑥) = 1/√(1 - 𝑣2) 𝐸(1 + 𝑣𝑝𝑥/∣𝐩∣) = √[(1 + 𝑣)/(1 - 𝑣)] 𝐸.
According to (A.28) 𝐸 ∝ 1/𝜆, hence we find
(b) 𝜆ʹ = √[(1 - 𝑣)/(1 + 𝑣)] 𝜆, where √[(1 - 𝑣)/(1 + 𝑣)] < 1, so the light to be seen is blueshfted. ▮
A.1.3 Relativistic Dynamics
We will discuss the coarse-grained dynamics of a large collection of particles. Now we want to follow the evolution of average quantities, such as the number density 𝑛, energy density 𝜌 and pressure 𝑃 in relativity.
Number density
Consider a box of volume 𝑉 entered around a position 𝐱. So the density of particles is 𝑛 = 𝑁/𝑉. Taking the box size to be small we can think of this as the local density at the point 𝐱. Consider a frame 𝑆ʹ where the box is moving with a velocity 𝑣. The dimension of the box will be Lorentz contracted along the direction of travel, so the volume now is 𝑉ʹ = 𝑉/𝛾. Since the number of particles inside the box stays the same, the number density in this frame will be 𝑛ʹ = 𝛾𝑛. Using (A.24) we may write this as
(A.29) 𝑛ʹ = 𝑛𝑈0,
where 𝑛 is the number density in the rest frame and 𝑈0 is the time component of the 4-velocity of the box. This suggests hat the number density is the time component of a 4-vector called the number current:
(A.30) 𝛮𝜇 ≡ 𝑛𝑈𝜇.
This 4-vector has components 𝛮𝜇 = (𝑛ʹ, 𝐧ʹ), where we reserve 𝑛 for the density in the rest frame. The spatial part of this four-vector is the number current density, 𝐧ʹ = 𝛾𝑛𝐯. Given an era element 𝐀, the inner product 𝐧ʹ ⋅ 𝐀 describes the number of particles flowing across the area per unit ti
me.
If particles are neither created and nor destroyed, then the number density only changes when particles flow into or out of the volume. Locally this is described by the following continuity equation:
(A.31) ∂𝑛ʹ/∂𝑡 = -𝛁 ⋅ 𝐧
̂ʹ,
Using the number current 4-vector, this equation can be written as
(A.32) ∂𝜇𝑁𝜇 = 0, where ∂𝜇 was defined in (A.12).
Energy-momentum tensor
Of particular importance in GR are the density of energy and momentum, since these are the source for the pacetime curvature.
We would now like to write the energy and momentum densities as the time comonents of 4-vector currents. We then combine these currents into a single object, 𝛵0𝜇, where 𝛵00 is the density of the energy and 𝛵0𝑖 is the density of momentum (in the direction 𝑥𝑖). We are building a new rank-2 tensor 𝛵𝜇𝜈 called the energy-momentum tensor. The second index tells us whether we are taking about the energy (𝜈 = 0) or the momentum (𝜈 = 𝑖), while the first index tells us whether we are talking density (𝜇 = 0) or the flow (𝜇 = 𝑖). Hence, we have
𝛵00 = density of energy, 𝛵𝑖0 = flow of energy,
𝛵0𝑖 = density of momentum, 𝛵𝑖𝑗 = flow of momentum.
Note that each component of momentum has its own flux. For example, 𝛵12 is the flow of the 𝑥2-momentum along the 𝑥1-direction. The flow of the momentum density creates a stress (= force per unit area) and 𝛵𝑖𝑗 is therefore often called the stress tensor. Its diagonal components are the pressure and the off-diagonal components are the anisotropic stress. Integrating over space gives the total energy and momentum, or 𝑃𝜈 = ∫ d3𝑥 𝛵0𝜈. By analogy with (A.32), we write the following conservation equation for he energy-momentum tensor:
(A.33) ∂𝜇𝛵𝜇𝜈 = 0.
These are four equations: one for the energy density (𝜈 = 0) and three for different components of the momentum density (𝜈 = 𝑖).
As a simple example, let us return to our particles in the box. Ignoring the kinetic energies of the individual particles, the total energy density in the rest frame is 𝜌 = 𝑚𝑛. In the boosted frame, the energy of the particles and their number density each increase by a factor of 𝛾, so that 𝜌ʹ = 𝛾𝜌. Similarly, the momentum density becomes 𝛑ʹ = 𝛾2𝜌𝐯. Using (A.24), we can also write this as
(A.34) 𝜌ʹ = 𝜌𝑈0𝑈0,
(A.35) 𝜋ʹ𝑖 = 𝜌𝑈0𝑈𝑖,
where 𝜌 is the energy density in the rest frame. The energy-momentum tensor of the particles inside the box can be expressed as
(A.36) 𝛵𝜇𝜈 = 𝜌𝑈𝜇𝑈𝜈, where 𝛵0𝜈 = (𝜌ʹ, 𝛑ʹ).
If we include the random motion of the particles, then we get an extra contribution from the pressure 𝑃 created by the motion. Since the pressure is isotropic, the energy-momentum tensor in the rest frame must be diagonal:
(A.37) 𝛵𝜇𝜈 = diag(𝜌, 𝑃, 𝑃, 𝑃)
In a general frame, this becomes
(A.38) 𝛵𝜇𝜈 = (𝜌 + 𝑃)𝑈𝜇𝑈𝜈 + 𝑃𝜂𝜇𝜈.
This is the energy-momentum tensor of a perfect fluid. It plays an important role in cosmology, since on large scale all matter can be modeled by a perfect fluids.
Relativistic field theory
In modern physics, fields are fundamental and particles are derived concept. Elementary particles are now understood to be quantum excitations of fields and the standard Model of particle physics is a relativistic quantum field theory. Even in classical physics, fields-like gravitational field and the electromagnetic field- play an important role. In SR the dynamics of fields can be expressed as follows.
Consider a real scalar field 𝜙(𝑡, 𝐱). The field has a "kinetic energy" (density) 1/2 𝜙̇2, a "gradient energy" 1/2 (∇𝜙)2 and a "potential energy" 𝑉(𝜙). The kinetic and gradient energies can be combined into a Lorentz-invariant "kinetic term"
(A.39) -1/2 𝜂𝜇𝜈∂𝜇𝜙∂𝜈𝜙 = 1/2 𝜙̇2 - 1/2 (∇𝜙)2,
which is often abbreviated as -1/2 (∂𝜙)2. The Lagrangian of the theory takes the form of "kinetic minus potential energy":
(A.40) 𝐿 = ∫ d3𝑥 [-1/2 𝜂𝜇𝜈∂𝜇𝜙∂𝜈𝜙 - 𝑉(𝜙)],
and the action is
(A.41) 𝑆 = ∫𝑡2𝑡1 d𝑡 𝐿.
The evolution of the field configuration 𝜙(𝑡, 𝐱) between two times 𝑡1 and 𝑡2 follows from the principle of least action. Consider an infinitesimal change of the field, 𝜙 → 𝜙 + 𝛿𝜙. The corresponding variation of the action is
(A.42) 𝛿𝑆 ≡ 𝑆[𝜙 + 𝛿𝜙] - 𝑆[𝜙]
= ∫ d4 [-𝜂𝜇𝜈∂𝜇𝜙∂𝜈𝛿𝜙 - d𝑉/d𝜙 𝛿𝜙] = ∫ d4 [𝜂𝜇𝜈∂𝜇∂𝜈𝜙 - d𝑉/d𝜙]𝛿𝜙,
where the the first term in the second line has been integrated by parts. The resulting boundary term has been dropped because 𝛿𝜙(𝑡1, 𝐱) = 𝛿𝜙(𝑡2, 𝐱) = 0. In order for the variation 𝛿𝑆 to vanish for arbitrary 𝛿𝜙, the bracket in (A. 42) must vanish. This gives the Klein-Gordon equation
(A.43) □𝜙 = d𝑉/d𝜙,
where □ ≡ 𝜂𝜇𝜈∂𝜇∂𝜈 is the d'Alembertian operator.
A.1.5 Gravity and Relativity
Newtonian gravity is not consistent with relativity. Consider a particle of mass 𝑚 in a gravitational field 𝛷(𝐱, 𝑡). The force it experience is given by 𝐅 = -𝑚𝛁𝛷, where the gravitational potential satisfies the Poisson equation:
(A.44) ∇2𝛷 = 4𝜋𝐺𝜌.
The Green's function solution to the Poisson equation is
(A.45) 𝛷(𝐱, 𝑡) = -𝐺 ∫ d3 𝑥ʹ 𝜌(𝐱ʹ, 𝑡)/∣𝐱 - 𝐱ʹ∣,
which describes how a matter distribution with mass density 𝜌(𝐱, 𝑡) creates the potential. The problem with this expressiom is that any change in 𝜌(𝐱, 𝑡) propagates instantaneously throughout space in obvious violation of relativity. The above equation does not take the same form in every inertial frame.
A similar issue arises in Coulomb's law of electrostatics. The equation for the electric potential 𝜙 takes a very similar form, ∇2𝜙 = -𝜌𝑒/𝜖0, where 𝜌𝑒(𝐱, 𝑡) is the charge density. A change in the charge density would also be experienced instantaneously through space. Of course, we know the resolution is Maxwell;s equations of electrodynamics, which can be written in tensorial form using vector potential 𝛢𝜇 = (𝜙, 𝐀) and the vector current 𝐽𝜇 = (𝜌𝑒, 𝐉𝑒): see Exercise A.1. Our challenge will be to find the analog of Maxwell's equations for gravity.
A.2 Gravity Is Geometry
In the rest of this appendix we will give a basic introduction to GR.
A.2.1 The Equivalence Principle
The origin of GR lies in the simple question: Why do objects with different masses fall at the same rate? We think we know the answer: the mass of an object cancels in Newton's law
(A.46) 𝑚 𝐚 = 𝑚 𝐠, where 𝐠 is the local gravitational acceleration.
However we should really distinguish the two masses by giving them different names:
(A.47) 𝑚𝐼 𝐚 = 𝑚𝐺 𝐠,
The gravitational mas, 𝑚𝐺, is a source for the gravitational field, while inertial mass, 𝑚𝐼, characterizes the dynamical response to any forces. There is a nontrivial result that experiment find
(A.48) 𝑚𝐼/𝑚𝐺 = 1 ± 10-13.
In Newtonian gravity, this equality appears to be accident, while in GR, the observation that 𝑚𝐼 = 𝑚𝐺 is taken as a fundamental property of gravity called the weak equivalence principle (WEP).
There are two other forces which are also proportional to the inertial mass. These are
(A.49) Centrifugal force: 𝐅 = -𝑚𝐼𝛚 × (𝛚 × 𝐫), Coriolis force: 𝐅 = -2𝑚𝐼𝛚 × 𝐫̇.
Here we understand that these forces are "fictitious forces" in a non-inertial frame. Could gravity also be a fictitious force, arising only because we are in a non-inertial reference frame?
An important consequence of the equivalence principle is that gravity is "universal," meaning that it acts in the same way on all objects. Consider a particle in a gravitational field 𝐠. Using WEP the equation of motion of the particle is
(A.50) d𝐱̇/d𝑡 = 𝐠(𝐱(𝑡), 𝑡).
Imagine being confined to a sealed box. see (Fig. A.1) Consider first the case (right) where the box is sitting in an electric field. We can tell the situation easily by studying the motion of an electron and proton. However, in case of gravity, as Einstein's famous thought experiment, when we see two test particles with different masses are falling in exactly the same way, 𝐚 = -𝐠, we cannot tell the difference between a uniform gravitational field (left) and uniform acceleration (left). We conclude that
A uniform gravitational field is indistinguishable from uniform acceleration.
A corollary of this observation is the fact that the effects of gravity can be removed by falling freely, 𝐚 = 𝐠, the particles in the box will not fall to the ground. Einstein called this his "happiest thought": a freely falling observer doesn't feel a gravitational field (see Fig. A.2).
Einstein said that there is no experiment that distinguish uniform acceleration from a uniform gravitational field. This generalization of the WEP is called the Einstein equivalence principle (EEP).It implies that, in a small region (so that the gravitational field is approximately uniform), we can always find coordinates so that there is no acceleration. THese coordinates correspond to a local inertial frame where the spacetime is approximately Minkowsy space. Said differently:
In a small region, the laws of physics reduces to those of special relativity.
As we will see, the EEP suggests that the effects of gravity are associated with the curvature of spacetime which becomes relevant on larger scales where the field cannot be approximated as being uniform.
Consider a box that is freely falling towards the Earth (see Fig. A.3). We again drop two test particles. They will then accelerate toward each other because of a force towards the center of the Earth. This is an example of a tidal force arising from the non-uniformity of the gravitational field. Note that these tidal forces cause initially "parallel" trajectories to converge. As we will see, this violation of Euclidean geometry is a manifestation of the curvature of spacetime.
A.2.2 Gravity as Curved Spacetime
We will give a simple argument which will link the equivalence principle rather directly to the curvature of spacetime. For clarity, we will restore light speed 𝑐 factor in this section.
Let us begin by describing a famous observational consequence of the equivalence principle, the gravitational redshift. Consider Alice and Bob in a uniform gravitational field of strength 𝑔 in the negative 𝑧-direction (see Fig/ A.4. They are at heights 𝑧𝛢 = 0 and 𝑧𝛣 = 𝘩, respectively. If Alice send out a light signal with wavelength 𝜆𝛢 = 𝜆0. What is the wavelength 𝜆𝛣 received by Bob? By the equivalence principle, we can take Alice and Bob to be moving with acceleration 𝐚 = 𝐠 in the positive 𝑧-direction in Minkowski spacetime. (see Fig. A,4). Assuming ∆𝑣/𝑐 to be small, the light reaches Bob after a time ∆𝑡 ≈ 𝘩/𝑐. By this time Bob's velocity has increased by ∆𝑣 = 𝑔∆𝑡 = 𝑔 𝘩/𝑐. Due to the Doppler effect, the received light will have a silghtly longer wavelength, 𝜆𝛣 = 𝜆0 + ∆𝜆, with
(A.51) ∆𝜆/𝜆0 = ∆𝑣/𝑐 = 𝑔𝘩/𝑐2.
By the equivalence principle, light emitted from the ground with wavelength 𝜆0 must therefore be "redshifted" by the amount
(A.52) ∆𝜆/𝜆0 = ∆𝛷/𝑐2,
where ∆𝛷 = 𝑔𝘩 is the change in the gravitational potential. This gravitational redshift was first measured by Pound and Rebka in 1959.
We can also think of the gravitational redshift as an effect of time dilation. The period of the emitted light is 𝛵𝛢 = 𝜆𝛢/𝑐 and that of the received light is 𝛵𝛣 = 𝜆𝛣/𝑐. The resukt in (A.52) then implies that
(A.53) 𝛵𝛣 = [1 + (𝛷𝛣 - 𝛷𝛢)/𝑐2] 𝛵𝛢.
We conclude that time run slower in a region of stronger gravity (smaller 𝛷). In the example above, we have 𝛷𝛢 < 𝛷𝛣 (Alice feels a stronger gravitational field than Bob), so that 𝛵𝛢 < 𝛵𝛣 (time runs slower for Alice than for Bob). The result holds for any type of clock in a gravitational field. It also applies to the heart rate of the observer. This means that Alice will see Bob aging more rapidly. This "gravitational twin paradox" has been tested with atomic clocks on planes.3
Let us finally see why all of this implies that spacetime is curved. Anlie now sends out two pulses of light, eprataed by a time interval ∆𝑡𝛢 (as measured by her clock). Bob receives the signals spaced out by ∆𝑡𝛣 (as measured by his clock). Fig. A.5 shows the corresponding spacetime diagram. When in the diagram, drawing the congruent worldline ∆𝑡𝛣 = ∆𝑡𝛢, in apparent contradiction to (A.53), we implicitly assumed that the spacetime is flat. The resolution to the paradox is to accept that the spacetime is curved.
To see this more explicitly, consider a spacetime is not given by d𝑠2 = -d𝑡2 + d𝐱2, but by
(A.54) d𝑠2 = -[1 + 2𝛷(𝐱)/𝑐2] d𝑡2 + d𝐱2, with 𝛷 ≪ 𝑐2.
In these coordinates, the interval ∆𝑡 is the same for Alice and Bob, but their observed proper times are different. The proper time interval between the signals sent by Alice is
(A.55) ∆𝜏𝛢 = √[-𝑔00(𝐱)] ∆𝑡 = √[1 + 2𝛷𝛢/𝑐2] ∆𝑡 ≈ (1 + 𝛷𝛢/𝑐2) ∆𝑡,
where we used ∆𝐱 = 0 and expanded to first order in small 𝛷𝛢 = 𝛷(𝐱𝛢). similarly, the proper time between the signals received by Bob is
(A.56) ∆𝜏𝛣 = (1 + 𝛷𝛣/𝑐2) ∆𝑡.
Combining (A.55) and (A.56), we find
(A.57) ∆𝜏𝛣 = (1 + 𝛷𝛣/𝑐2) (1 + 𝛷𝛣/𝑐2)-1 ∆𝜏𝛢 ≈ [1 + (𝛷𝛣 - 𝛷𝛢)/𝑐2] ∆𝜏𝛢,
which is the same as (A.53). The time dilation has therefore been explained by the geometry of spacetime.
A.3 Motion in Curved Spacetime
General relativity contains two key ideas: (1) "spacetime curvature tells matter how to move" (equivalence principle) and (2) "matter tells spacetime how to curve" (Einstein equation). We will show that in the absence of any non-gravitational force, particles move along special paths in the curved space time called geodesics.
A.3.1 Relativistic Action
The action of a relativistic point particle is
(A.58) 𝑆 = -𝑚 ∫ d𝜏,
where 𝜏 is the proper time along the wordline of the particle and 𝑚 is its mass. The action must be a Lorentz scalar, so that all observers compute the same value for it. All observers will agree on the amount of time that elapsed on a clock carried by the moving particle.
As a useful consistency check, we evaluate the action (A.58) for a particular observer in Minkowski spacetime. Using (A.22), the action can be written as following integral over time
(A.59) 𝑆 = -𝑚 ∫ d𝑡 √(1 - 𝑣2), where 𝑣2 = 𝛿𝑖𝑗𝑥̇𝑖𝑥̇𝑗.
For 𝑣 ≪ 1, the integrand is -𝑚 + 1/2 𝑚𝑣2. We see that the Lagrangian is simply the kinetic energy of the particle plus a constant that doesn't affect the equation of motion.
Substituting the line element (A.54) into (A.58), we get
(A.60) 𝑆 = -𝑚 ∫ d𝑡 √[(1 + 2𝛷) - 𝑣2] ≈ ∫ d𝑡 ( -𝑚 + 1/2 𝑚𝑣2 - 𝑚𝛷 + ∙ ∙ ∙ ),
where we expanded the square root for small 𝑣 and 𝛷. We see that the metric perturbation 𝛷 indeed plays the role of the gravitational potential in Newtonian gravity. It is now also obvious why the inertial mas (appearing in the kinetic term 1/2 𝑚𝑣2) is the same as the gravitational mass (appearing in the potential 𝑚𝛷).
A.3.2 Geodesic Equation
Let us now use the action (A.58) to study the motion of particles in a general curved spacetime with metric 𝑔𝜇𝜈(𝑡, 𝐱). Consider an arbitrary timelike curve 𝓒 connecting two points 𝑝 and 𝑞 (see Fig. A.6). A geodesic is the preferred curve for which the action is an extremum. We first introduce a parameter 𝜆 to label points along the curve, 𝑥𝜇(𝜆), with 𝓒(0) = 𝑝 and 𝓒(1) = 𝑞. The action for curve 𝓒 can be written
as
(A.61) 𝑆[𝑥𝜇(𝜆)] = -𝑚 ∫10 d𝜆 √(-𝑔𝜇𝜈 d𝑥𝜇/d𝜆 d𝑥𝜈/d𝜆).
This action has an extremum if path satisfies the geodesic equation
(A.62) d2𝑥𝜇/d𝜏2 + 𝛤𝜇𝛼𝛽 d𝑥𝛼/d𝜏 d𝑥𝛽/d𝜏 = 0,
where 𝜏 is the proper time along the curve and
(A.63) 𝛤𝜇𝛼𝛽 ≡ 1/2 𝑔𝜇𝜆(∂𝛼𝑔𝛽𝜆 + ∂𝛽𝑔𝛼𝜆 - ∂𝜆𝑔𝛼𝛽)
is the Christoffel symbol (or "connection coefficient").
Derivation Consider a small variation of path 𝑥𝜇 → 𝑥𝜇 + 𝛿𝑥𝜇. The corresponding variation of the action is
(A.64-5) 𝛿𝑆 ≡ 𝑆[𝑥𝜇 + 𝛿𝑥𝜇] - 𝑆[𝑥𝜇]
= 𝑚 ∫10 d𝜆 1/2𝐿 (𝛿𝑔𝜇𝜈 d𝑥𝜇/d𝜆 d𝑥𝜈/d𝜆 + 2𝑔𝜇𝜈 d𝑥𝜇/d𝜆 d𝛿𝑥𝜈/d𝜆), where 𝐿2 ≡ -𝑔𝜇𝜈 d𝑥𝜇/d𝜆 d𝑥𝜈/d𝜆 = (d𝜏/d𝜆)2.
Since d𝜏/d𝜆 = 𝐿, (A.64) becomes
(A.66) 𝛿𝑆 = 𝑚 ∫ d𝜏 (d𝜆/d𝜏) 1/2𝐿 [𝛿𝑔𝜇𝜈𝐿2ẋ𝜇ẋ𝜈 + 2𝐿2𝑔𝜇𝜈ẋ𝜇 𝛿(ẋ𝜈)] = 𝑚 ∫ d𝜏 (d𝜆/d𝜏) 1/2 [∂𝛼𝑔𝜇𝜈 𝛿𝑥𝛼ẋ𝜇ẋ𝜈 + 2𝑔𝜇𝜈𝑥̇𝜇 𝛿(ẋ𝜈)],
where ẋ𝜇 ≡ d𝑥𝜇/d𝜏. Integrating the second term of (A.66) by parts, we get
(A.67) 𝛿𝑆 = 𝑚 ∫ d𝜏 (1/2 ∂𝛼𝑔𝜇𝜈ẋ𝜇ẋ𝜈 𝛿𝑥𝛼 - 𝑔𝜇𝜈ẍ𝜇 𝛿𝑥𝜈 - ∂𝛼𝑔𝜇𝜈ẋ𝛼ẋ𝜇 𝛿𝑥𝜈)
In the last term, we can replace ∂𝛼𝑔𝜇𝜈 with 1/2 (∂𝛼𝑔𝜇𝜈 + ∂𝜇𝑔𝛼𝜈) because it is contracted with an object that is symmetric in 𝛼 and 𝜇. After relabeling of indices, we get
(A.68) 𝛿𝑆 = -𝑚 ∫ d𝜏 [𝑔𝜇𝜈ẍ𝜇 + 1/2 (∂𝛼𝑔𝛽𝜈 + ∂𝛽𝑔𝛼𝜈 - ∂𝜈𝑔𝛼𝛽) ẋ𝛼ẋ𝛽] 𝛿𝑥𝜈.
Factoring out the metric 𝑔𝜇𝜈, we can write this as
(A.69) 𝛿𝑆 = -𝑚 ∫ d𝜏 𝑔𝜇𝜈 (ẍ𝜇 + 𝛤𝜇𝛼𝛽 d𝑥𝛼/d𝜏 d𝑥𝛽/d𝜏) 𝛿𝑥𝜈, where 𝛤𝜇𝛼𝛽 is the Christoffel symbol in (A.63).
In order for 𝛿𝑆 to vanish for arbitrary 𝛿𝑥𝜈, the bracket in (A.69) must vanish and we obtain the geodesic equation (A.62). ▮
We see that the simple action (A.58) gives rise to a relatively complex geodesic equation, which is often applied in the cosmological context (see Section 2.2.1). It is also useful to the geodesic equation in terms of 4-velocity 𝑈𝜇 = d𝑥𝜇/d𝜏 Using the chain rule to write
(A.70) d/d𝜏 𝑈𝜇(𝑥𝛼(𝜏)) = d𝑥𝛼/d𝜏 ∂𝑈𝜇/∂𝑥𝛼 = 𝑈𝛼 ∂𝑈𝜇/∂𝑥𝛼,
equation (A.62) becomes
(A.71) 𝑈𝛼 (∂𝑈𝜇/∂𝑥𝛼 + 𝛤𝜇𝛼𝛽𝑈𝛽) = 0.
The term in the bracket is so-called covariant derivative of the 4-vector 𝑈𝜇 which we write as
(A.72) ∇𝛼𝑈𝜇 ≡ ∂𝛼𝑈𝜇 + 𝛤𝜇𝛼𝛽𝑈𝛽.
The geodesic equation can then be written in the following compact way:
(A.73) 𝑈𝛼∇𝛼𝑈𝜇 = 0.
We will see below this form of geodesic equation can also be derived directly by the concept "parallel transport" of the 4-velocity.
Newtonian limit
In Newtonian gravity, the equation of motion for a test particle in a gravitational field is
(A.74) d2𝑥𝑖/d𝑡2 = -∂𝑖𝛷.
Let us recover this result from the Newtonian limit of the geodesic equation (A.62). The Newtonian approximation assumes that (1) particles are moving slowly (relative to 𝑐), (2) the gravitational field is weak (so can be treated as a perturbation of Minkowski space), and (3) the field is also static. The first condition means that
(A.75) d𝑥𝑖/d𝜏 ≪ d𝑡/d𝜏
,
so that (A.62) becomes
(A.76) d2𝑥𝜇/d𝜏2 + 𝛤𝜇00 (d𝑡/d𝜏)2 = 0.
In the static, weak limit, we write the metric (and its inverse) as
(A.77) 𝑔𝜇𝜈 = 𝜂𝜇𝜈 + 𝘩𝜇𝜈, 𝑔𝜇𝜈 = 𝜂𝜇𝜈 - 𝘩𝜇𝜈,
where the perturbation is small, ∣𝘩𝜇𝜈∣ ≪ 1, and time independent. The first order in 𝘩𝜇𝜈, the relevant component of Christoffel symbol is
(A.78) 𝛤𝜇00 = 1/2 𝑔𝜇𝜆(∂0𝑔𝜆0 + ∂0𝑔𝜆0 - ∂𝜆𝑔00) = -1/2 𝜂𝜇𝑗∂𝑗𝘩00.
The 𝜇 = 0 component of (A.76) reads d2𝑡/d𝜏2 = 0, so that d𝑡/d𝜏 is a constant, while the 𝜇 = 𝑖 component becomes
(A.79) d2𝑥𝑖/d𝜏2 = 1/2 (d𝑡/d𝜏)2∂𝑖𝘩00.
Dividing both sides by (d𝑡/d𝜏)2, we get
(A.80) d2𝑥𝑖/d𝑡2 = 1/2 ∂𝑖𝘩00.
which matches (A.74) if
(A.81) 𝘩00 = -2𝛷.
Note that this result is consistent with (A.54).
Massless particles
For massless particles, (A.61) doesn't hold, while the general form of the geodesic equation still holds
(A.82) d2𝑥𝜇/d𝜆2 + 𝛤𝜇𝛼𝛽 d𝑥𝛼/d𝜆 d𝑥𝛽/d𝜆 = 0,
but 𝜆 cannot be identified with proper time. Instead we choose 𝜆 such that 𝑃𝜇 = d𝑥𝜇/d𝜆, where 𝑃𝜇 is 4- momentum of the particle. The geodesic equation for a massless particle can be written as 𝑃𝛼∇𝛼𝑃𝜇 = 0, with 𝑃𝜇𝑃𝜇 = 0 (see Section 2.2.1).
A.3.3 A Mathematical Interlude*
In this section we will fill in some of the mathematical details which is important to get a full appreciation for the logical structure of the theory.
Tensors in general relativity
In SR all physical law must be written in terms of tensors because we want them to be valid in every inertial frame. The same is true in GR, but the theory now deals with non-inertial frame.
Consider a general coordinate transformation 𝑥𝜇 → 𝑥ʹ𝜇(𝑥). The infinitesimal coordinate differentials transform as
(A.83) d𝑥ʹ𝜇 = ∂𝑥ʹ𝜇/∂𝑥𝜈 d𝑥𝜈 ≡ 𝑆𝜇𝜈 d𝑥𝜈,
where the transformation matrix 𝑆𝜇𝜈 play the same role as 𝛬𝜇𝜈 in a Lorentz transformation, but it crucially it now depends on the position 𝑥𝜇. A 4-vector field 𝑉𝜇(𝑥) is an object that transforms in the same way as d𝑥𝜇:
(A.84) 𝑉ʹ𝜇 = 𝑆𝜇𝜈𝑉𝜇.
Note that although d𝑥𝜇 transforms like a vector, 𝑥𝜇 does not.
To determine the transformation of a co-vector 𝑊𝜇, we use that the inner product 𝑊𝜇𝑉𝜇 must be an invariant. The transformation of 𝑊𝜇 therefore involves the inverse matrix
(A.85) 𝑊ʹ𝜇 = (𝑆-1)𝜇𝜈 = ∂𝑥𝜈/∂𝑥ʹ𝜇 𝑊𝜈.
By inspection of (A.84) and (A.85) we can guess that an arbitrary tensor transform as
(A.86) (𝛵ʹ)𝜇1∙ ∙ ∙ 𝜇𝑚𝜈1∙ ∙ ∙ 𝜈𝑛 = 𝑆𝜇1𝜎1 ∙ ∙ ∙ (𝑆-1)𝜈1𝜌1 ∙ ∙ ∙ 𝛵𝜎1∙ ∙ ∙ 𝜎𝑚𝜎1∙ ∙ ∙ 𝜎𝑛.
This is the generalization of the analogous result (A.16) in SR. An important tensor is the metric tensor 𝑔𝜇𝜈. Its transformation can also be derived from the invariance of the line element:
(A.87) d𝑠2 = 𝑔ʹ𝜇𝜈(𝑥ʹ)d𝑥ʹ𝜇d𝑥ʹ𝜈 = 𝑔𝜌𝜎(𝑥)d𝑥𝜌d𝑥𝜎 = 𝑔𝜌𝜎(𝑥) ∂𝑥𝜌/∂𝑥ʹ𝜇 ∂𝑥𝜎/∂𝑥ʹ𝜈 d𝑥ʹ𝜇d𝑥ʹ𝜈,
so that
(A.88) 𝑔ʹ𝜇𝜈 = 𝑔𝜌𝜎 ∂𝑥𝜌/∂𝑥ʹ𝜇 d𝑥𝜎/d𝑥ʹ𝜈 ≡ (𝑆-1)𝜇𝜌(𝑆-1)𝜈𝜎𝑔𝜌𝜎,
which is consistent with the general transformation law (A.86).
Derivatives of tensors
Let us see how derivatives of scalars, vectors, and tensors transform under a general coordinate transformation. Consider first the partial derivative of a scalar field, ∂𝜇𝜙. Since 𝜙ʹ(𝑥ʹ) = 𝜙(𝑥), we
get
(A.89) ∂𝜇𝜙ʹ(𝑥ʹ) = ∂𝜙ʹ(𝑥ʹ)/d𝑥ʹ𝜇 = ∂𝑥𝜈/∂𝑥ʹ𝜇 ∂𝜙(𝑥)/d𝑥𝜈 ≡ (𝑆-1)𝜇𝜈 ∂𝜈𝜙(𝑥).
Unsurprisingly, ∂𝜇𝜙 transforms like a covariant vector. However, when we take the partial derivative of a vector, ∂𝜆𝑉𝜇, this case is different:
(A.90) ∂ʹ𝜆𝑉ʹ𝜇(𝑥ʹ) = ∂𝑉ʹ𝜇(𝑥ʹ)/∂𝑥ʹ𝜆 = ∂𝑥𝜎/∂𝑥ʹ𝜆 ∂/∂𝑥𝜎 [𝑆𝜇𝜈(𝑥)𝑉𝜈(𝑥)]
(A.91) = (𝑆-1)𝜆𝜎𝑆𝜇𝜈∂𝜎𝑉𝜈 + [(𝑆-1)𝜆𝜎∂𝜎𝑆𝜇𝜈]𝑉𝜈.
The second term in (A.91) spoils the transformation law. So we would like to define a new derivative ∇𝜆𝑉𝜇 that does transform like a tensor:
(A.92) ∇ʹ𝜆𝑉ʹ𝜇 = (𝑆-1)𝜆𝜎𝑆𝜇𝜈∇𝜎𝑉𝜈.
In the next exercise we will show that the covariant derivative defined in (A. 72) has precisely this property.4
Exercise A.3 Show that the Christoffel symbol defined in (A.63) transform as
(A.93) 𝛤ʹ𝜇𝜆𝜈 = ∂𝑥ʹ𝜇/∂𝑥𝜌 ∂𝑥𝜎/∂𝑥ʹ𝜆 ∂𝑥𝜂/∂𝑥ʹ𝜈 𝛤𝜌𝜎𝜂 + ∂𝑥ʹ𝜇/∂𝑥𝜂 ∂2𝑥𝜂/∂𝑥ʹ𝜆∂𝑥ʹ𝜈 = 𝑆𝜇𝜌(𝑆-1)𝜆𝜎(𝑆-1)𝜈𝜂 𝛤𝜌𝜎𝜂 + 𝑆𝜇𝜂(𝑆-1)𝜆𝜌∂𝜌(𝑆-1)𝜈𝜂.
We see that the Christoffel symbol does not transform as a tensor. Show that (A.92) holds!
Hint: We need ∂𝑆-1 = -𝑆-1(∂𝑆)𝑆-1, which holds because 𝑆𝑆-1 = 1. [Solution is omitted.]
So far, we only saw the covariant derivative of a contravariant vector:
(A.94) ∇𝜇𝑉𝜈 = ∂𝜇𝑉𝜈 + 𝛤𝜈𝜇𝛼𝑉𝛼.
To determine the covariant derivative of a covariant vector consider how it acts on the scalar 𝑓 ≡ 𝑊𝜈𝑉𝜈. Using ∇𝜇𝑓 = ∂𝜇𝑓, we get
(A.95) ∇𝜇𝑊𝜈 = ∂𝜇𝑊𝜈 - 𝛤𝛼𝜇𝜈𝑊𝛼.
Exercise A.4 Derive (A.95) from (A.94)
[Solution] Using ∇𝜇𝑓 = ∂𝜇𝑓, we find
(a) ∇𝜇(𝑊𝜈𝑉𝜈) = ∂𝜇(𝑊𝜈𝑉𝜈) = (∂𝜇𝑊𝜈)𝑉𝜈 + 𝑊𝜈(∂𝜇𝑉𝜈)
(b) ∇𝜇(𝑊𝜈𝑉𝜈) = (∇𝜇𝑊𝜈)𝑉𝜈 + 𝑊𝜈(∇𝜇𝑉𝜈) = (∇𝜇𝑊𝜈)𝑉𝜈 + 𝑊𝜈(∂𝜇𝑉𝜈 + 𝛤𝜈𝜇𝛼𝑉𝛼).
Comparing the last equality in (a) and (b), we find
(c) (∂𝜇𝑊𝜈)𝑉𝜈 + 𝑊𝜈(∂𝜇𝑉𝜈) = (∇𝜇𝑊𝜈)𝑉𝜈 + 𝑊𝜈(∂𝜇𝑉𝜈 + 𝛤𝜈𝜇𝛼𝑉𝛼) ⇒ (∇𝜇𝑊𝜈)𝑉𝜈 = (∂𝜇𝑊𝜈)𝑉𝜈 - 𝛤𝜈𝜇𝛼𝑊𝜈𝑉𝛼 = (∂𝜇𝑊𝜈 - 𝛤𝛼𝜇𝜈𝑊𝛼)𝑉𝜈 ⇒
(d) ∇𝜇𝑊𝜈 = ∂𝜇𝑊𝜈 - 𝛤𝛼𝜇𝜈𝑊𝛼. ▮
The covariant derivative of the mixed tensor 𝛵𝜇𝜈 can be derive similarly by considering 𝑓 ≡ 𝛵𝜇𝜈𝑉𝜈𝑊𝜇. This gives
(A.96) ∇𝜎𝛵𝜇𝜈 = ∂𝜎𝛵𝜇𝜈 + 𝛤𝜇𝜎𝛼𝛵𝛼𝜈 - 𝛤𝛼𝜎𝜈𝛵𝜇𝛼.
From flat to curved spacetime
As we saw the covariant derivative of a tensor transforms like a tensor, while the partial derivative does not, A simple prescription to upgrade equations from flat space to curved space is to replace every partial derivative by a covariant derivative, ∂𝜇 → ∇𝜇.5 In (A.20) we showed that the inhomogeneous Maxwell equations can be written as ∂𝜈𝐹𝜇𝜈 = 𝐽𝜇. The generalization of this equation to curved spacetime is simply
(A.97) ∇𝜈𝐹𝜇𝜈 = 𝐽𝜇,
where the dependence on the metric is encoded and this describes the dynamics of electromagnetic fields in GR. Similarly, we saw that the conservation of the energy-momentum tensor in SR implies ∂𝜈𝛵𝜇𝜈 = 0. In GR this becomes
(A.98) ∇𝜈𝛵𝜇𝜈 = 0.
The covariant derivative depends on the metric and hence defines a coupling between matter and the gravitational degrees of freedom.
Parallel transport and geodesics
Given the notion of a covariant derivative, we can define the concept of a parallel transport of a tensor. In flat spacetime "parallel transport" simply means translating a vector along a curve while "keeping it constant." So a vector 𝑉𝜇 is constant along a curve 𝑥𝜇(𝜆) if its components don't depend on the parameter 𝜆:
(A.99) d𝑉𝜇/d𝜆 = d𝑥𝜈/d𝜆 ∂𝜈𝑉𝜇 = 0. (flat spacetime).
We generalize this to curved spacetimes by replacing the partial derivative in (A.99) by a covariant derivative. This gives so-called directional covariant derivative. a vector is parallel transport in GR if the directional covariant derivative of the vector along a curve vanishes:
(A.100) 𝐷𝑉𝜇/𝐷𝜆 = d𝑥𝜈/d𝜆 ∇𝜈𝑉𝜇 = 0. (curved spacetime).
Although we wrote the equation for a vector field, an analogous equation applies for arbitrary tensors. Write out te covariant derivative, the equation of parallel transport becomes
(A.101) d𝑉𝜇/d𝜆 + 𝛤𝜇𝜎𝜈 d𝑥𝜎/d𝜆 𝑉𝜇 = 0,
which tells us that the components of the vector will now change along the curve and that this change is determined by the connection 𝛤𝜇𝜎𝜈.
Using parallel transport, we can give an alternative definition of a geodesic as the curve generated by a tangent vector d𝑥𝜇/d𝜆 that is transported parallel to itself. This generalizes the notion of a straight line in flat space, which can be thought of as the path that parallel transports its own tangent vector. Substituting 𝑉𝜇 = d𝑥𝜇/d𝜆 into (A.101), we get
(A.102) d2𝑥𝜇/d𝜆2 + 𝛤𝜇𝜎𝜈 d𝑥𝜎/d𝜆 d𝑥𝜈/d𝜆 = 0,
which is indeed the same as the geodesic equation, if we write 𝜆 = 𝑎𝑥 + 𝑏 for a massive particle.
A.4 The Einstein Equation
So far. we described how test particles move in arbitrary curved spacetime. Next we will discuss the spacetime curvature is determined by the local matter distribution. We are in search of the following relationship:
(A.103) (a measure of local spacetime curvature) = (a measure of local stress-energy density).
A.4.1 Tidal Forces and Curvature
We will study the relative acceleration of two test particles in GR and find that the motion is determined by the Riemann tensor an object of fundamental importance in differential geometry.
Consider two particles 𝐱(𝑡) and 𝐱(𝑡) + 𝐛(𝑡). In Newtonian gravity they satisf
y
(A.104-5) d2𝑥𝑖/d𝑡2 = -∂𝑗𝛷(𝑥𝑗), d2(𝑥𝑖 + 𝑏𝑖)/d𝑡2 = -∂𝑗𝛷(𝑥𝑗 + 𝑏𝑗)
Subtracting (A.104) from (A.105), and expanding the result to first order, we get
(A.106) d2𝑏𝑖/d𝑡2 = -∂𝑗∂𝑖𝛷 𝑏𝑗. [verification needed]
We see the relativistic acceleration of particles is determines by the tidal tensor ∂𝑖∂𝑗𝛷. The Poisson equation relates the trace of this tidal tensor to the mass density
(A.107) ∇2𝛷 = ∂𝑖∂𝑗𝛷 = 4π𝐺𝜌.
Let us find of (A.106) in GR where it is called the geodesic deviation equation. The analog of the tidal tensor will give us local measure of the spacetime curvature. Its trace will lead to the Einstein equation.
Consider two geodesics separated by an infinitesimal vector 𝛣𝜇 (see Fig. A.7). We define the "relative velocity" of the two geodesics as the directional covariant derivative of 𝛣𝜇 along one of the geodesics:
(A.108) 𝑉𝜇 ≡ 𝐷𝛣𝜇/𝐷𝜏 = 𝑈𝜈∇𝜈𝛣𝜇 = d𝛣𝜇/d𝜏 + 𝛤𝜇𝜎𝜈𝛣𝜎, where 𝑈𝜇 = d𝑥/d𝜏.
The "relative acceleration" is
(A.109) 𝛢𝜇 ≡ 𝐷2𝛣𝜇/𝐷𝜏2 = 𝑈𝜈∇𝜈𝑉𝜇 = d𝑉𝜇/d𝜏 + 𝛤𝜇𝜎𝜈𝑉𝜎.
After some work (see below the derivation), we find
(A.110) 𝐷2𝛣𝜇/𝐷𝜏2 = -𝑅𝜇𝜈𝜎𝜌𝑈𝜈𝑈𝜎𝛣𝜌.
where we defined the Riemann tensor
(A.111) 𝑅𝜇𝜈𝜎𝜌 ≡ ∂𝜌𝛤𝜇𝜈𝜎 - ∂𝜎𝛤𝜇𝜈𝜌 + 𝛤𝜇𝜌𝜆𝛤𝜆𝜈𝜎 - 𝛤𝜇𝜎𝜆𝛤𝜆𝜈𝜌.
This scary beast is the analog of the tidal tensor in Newtonian gravity.
Derivation Substituting (A.108) into (A.109), we get
(A.112) 𝛢𝛼 = d𝑉𝛼/d𝜏 + 𝛤𝛼𝛽𝛾𝑈𝛽𝑉𝛾 = d/d𝜏 (d𝛣𝛼/d𝜏 + 𝛤𝛼𝛽𝛾𝑈𝛽𝛣𝛾) + 𝛤𝛼𝛽𝛾𝑈𝛽 (d𝛣𝛾/d𝜏 + 𝛤𝛾𝛿𝜖𝑈𝛿𝛣𝜖)
= d2𝛣𝛾/d𝜏2 + d𝛤𝛼𝛽𝛾/d𝜏 𝑈𝛽𝛣𝛾 + 𝛤𝛼𝛽𝛾 d𝑈𝛽/d𝜏 + 2𝛤𝛼𝛽𝛾𝑈𝛽 d𝑈𝛽/d𝜏 + 𝛤𝛼𝛽𝛾𝛤𝛾𝛿𝜖𝑈𝛽𝑈𝛿𝛣𝜖.
(A.113-4) d𝛤𝛼𝛽𝛾/d𝜏 = -𝑈𝛿∂𝛿𝛤𝛼𝛽𝛾, d𝑈𝛽/d𝜏 = -𝛤𝛽𝛿𝜖𝑈𝛿𝑈𝜖),
where (A.114) follows fr the geodesic equation (A.71). We therefore get
(A.115) 𝛢𝛼 = d2𝛣𝛾/d𝜏2 + 2𝛤𝛼𝛽𝛾𝑈𝛽 d𝑈𝛽/d𝜏 + (∂𝛿𝛤𝛼𝛽𝛾 - 𝛤𝜖𝛿𝛽𝛤𝛼𝜖𝛾 + 𝛤𝛼𝛽𝜖𝛤𝜖𝛿𝛾) 𝑈𝛽𝑈𝛿𝛣𝛾,
where we relabeled some dummy indices to extract 𝑈𝛽𝑈𝛿𝛣𝛾 from three of terms. To replace the derivatives of 𝛣𝛼, we note that 𝑥𝛼(𝜏) + 𝛣𝛼(𝜏) obeys the geodesic equation
(A.116) d2(𝑥𝛼 + 𝛣𝛼)𝛾/d𝜏2 + 𝛤𝛼𝛽𝛾(𝑥𝛿 + 𝛣𝛿) d(𝑥𝛽 + 𝛣𝛽)/d𝜏 d(𝑥𝛾 + 𝛣𝛾)/d𝜏 = 0.
Subtracting the geodesic equation for 𝑥𝛼(𝜏) and expanding the result to linear order in 𝛣𝛼, we get
(A.117) d2𝛣𝛾/d𝜏2 + 2𝛤𝛼𝛽𝛾𝑈𝛽 d𝑈𝛽/d𝜏 = -∂𝛿𝛤𝛼𝛽𝛾𝛣𝛿𝑈𝛽𝑈𝛾 = -∂𝛾𝛤𝛼𝛽𝛿𝑈𝛽𝑈𝛿𝛣𝛾,
Substituting this into (A.115), we find
(A.118) 𝛢𝛼 = -(∂𝛾𝛤𝛼𝛽𝛿 - ∂𝛿𝛤𝛼𝛽𝛾 + 𝛤𝜖𝛿𝛽𝛤𝛼𝜖𝛾 - 𝛤𝛼𝛽𝜖𝛤𝜖𝛿𝛾)≡ 𝑅𝛼𝛽𝛾𝛿 𝑈𝛽𝑈𝛿𝛣𝛾. ▮
In the local inertial frame of a freely falling observer, with 4-velocity 𝑈𝜇 = (1, 0, 0, 0), the geodesic deviation equation becomes
(A.119) d2𝛣𝜇/d𝜏2 = -𝑅𝜇0𝜈0𝛣𝜈.
For the static, weak-field metric (A.54), we have 𝑅𝑖0𝑗0 = ∂𝑗𝛤𝑖00 = ∂𝑖𝑗𝛷 and (A.119) reduces to Newtonian result (A.106). [RE (A.74-81)]
A.4.2 The Mathematics of Curvature*
There is a lot of beautiful mathematics underlying the physics of spacetime curvature. We will see a very brief snapshot though.
Parallel transport and curvature
An important property of the parallel transport of a vector on a curved manifold is that it depends on the path along which the vector is transported. This is illustrated in Fog. A.8 for the case 2-sphere.
This path dependence of the parallel transport gives another way diagnose whether the spacetime is curved. Consider a parallelogram spanned by the infinitesimal vectors 𝛢𝜌 and 𝛣𝜎 (see Fig. A.8) and imagine parallel transporting a vector 𝑉𝜇. From the equation of parallel transport (A.101), we have that the change of the vector along a side 𝛿𝑥𝜌 is
(A.120) 𝛿𝑉𝜇 = -𝛤𝜇𝜈𝜌𝑉𝜈𝛿𝑥𝜌.
On "path 1" we parallel transport the vector first along 𝛢𝜌 and then along 𝛣𝜎 while on "path 2" we reverse the order (giving the gray path in Fig. A.8). Using (A.120), we get
(A.121) 𝛿𝑉𝜇(1) = -𝛤𝜇𝜈𝜌(𝑥)𝑉𝜈(𝑥)𝛢𝜌 - 𝛤𝜇𝜈𝜌(𝑥 + 𝛢)𝑉𝜈(𝑥 + 𝛢) 𝛣𝜌, 𝛿𝑉𝜇(2) = -𝛤𝜇𝜈𝜌(𝑥)𝑉𝜈(𝑥)𝛣𝜌 - 𝛤𝜇𝜈𝜌(𝑥 + 𝛣)𝑉𝜈(𝑥 + 𝛣) 𝛢𝜌,
and the difference is
(A.122) 𝛿𝑉𝜇 ≡ 𝛿𝑉𝜇(1) - 𝛿𝑉𝜇(2) = ∂𝜎(𝛤𝜇𝜈𝜌𝑉𝜈) 𝛣𝜎𝛢𝜌 - ∂𝜎(𝛤𝜇𝜈𝜌𝑉𝜈) 𝛢𝜎𝛣𝜌,
where we have Taylor expanded the arguments for small 𝛢𝜌 and 𝛣𝜌. Swapping the dummy indices on he second term, 𝜌 ⟷ 𝜎, and differentiating the products, we find
(A.123) 𝛿𝑉𝜇 = (∂𝜎𝛤𝜇𝜈𝜌𝑉𝜈 + 𝛤𝜇𝜈𝜌∂𝜎𝑉𝜈 - ∂𝜌𝛤𝜇𝜈𝜌𝑉𝜈 + 𝛤𝜇𝜈𝜌∂𝜌𝑉𝜈)𝛢𝜌𝛣𝜎.
Using (A.101) again, we have ∂𝜎𝑉𝜈 = -𝛤𝜈𝜎𝜆𝑉𝜆 and hence (A.123) becomes
(A.124) 𝛿𝑉𝜇 = 𝑅𝜇𝜈𝜌𝜎𝑉𝜈𝛢𝜌𝛣𝜎,
where 𝑅𝜇𝜈𝜌𝜎 is the Riemann tensor as defined in (A.111)
Properties of the Riemann tensor
Only 20 of the 44 = 256 components of 𝑅𝜇𝜈𝜌𝜎 are independent, because the Riemann tensor has a lot of symmetries that relate its different components. These symmetries are easiest present to present for the the Riemann tensor with only lower indices 𝑅𝜇𝜈𝜌𝜎 = 𝑔𝜇𝜆𝑅𝜆𝜈𝜌𝜎. We then have
(A.125-8) 𝑅𝜇𝜈𝜌𝜎 = -𝑅𝜈𝜇𝜌𝜎, 𝑅𝜇𝜈𝜌𝜎 = -𝑅𝜇𝜈𝜎𝜌, 𝑅𝜇𝜈𝜌𝜎 = 𝑅𝜌𝜎𝜇𝜈, 𝑅𝜇𝜈𝜌𝜎 + 𝑅𝜇𝜌𝜎𝜈 + 𝑅𝜇𝜎𝜈𝜌 = 0.
In addition, the Riemann tensor satisfies an important differential identity called the Bianchi identity, states that the sum of the cyclic permutations of the first three indices of ∇𝜆𝑅𝜇𝜈𝜌𝜎 vanishes:
(A.129) ∇𝜆𝑅𝜇𝜈𝜌𝜎 + ∇𝜈𝑅𝜆𝜇𝜌𝜎 + ∇𝜇𝑅𝜆𝜈𝜌𝜎 = 0.
This is the analog of homogeneous Maxwell equation ∂𝜆𝐹𝜇𝜈 + ∂𝜈𝐹𝜆𝜇 + ∂𝜇𝐹𝜈𝜆 = 0.
A.4.3 Guessing the Einstein Equation
First, we will "guess" it. Then, we will construct an action for the metric and show that the corresponding equation of motion lead to the same Einstein equation.
We are searching for the relativistic generalization of the Poission equation:
(A.130) ∇2𝛷 = 4π𝐺𝜌.
We would like to write this equation in tensorial form, so that it is valid independently of the choice of coordinates. On the right side 𝜌 suggests that 𝛵𝜇𝜈 should appear, while on the left side 𝛷 suggests that we expect an object with two derivatives acting on the metric. We recall that the Poisson equation is the trace of the tidal tensor, ∂𝑖∂𝑗𝛷, and the generalization of the tidal tensor is the Riemann tensor, 𝑅𝜇𝜈𝜌𝜎. This suggests that the trace of the Riemann tensor would be an interesting object. Taking the trace means contracting the upper index with a lower index and so this leads to the Ricci tensor
(A.131) 𝑅𝜇𝜈 ≡ 𝑅𝜆𝜇𝜆𝜈 = ∂𝜆𝛤𝜆𝜇𝜈 - ∂𝜈𝛤𝜆𝜇𝜆 + 𝛤𝜆𝜌𝜆𝛤𝜌𝜇𝜈 - 𝛤𝜌𝜇𝜆𝛤𝜆𝜈𝜌.
This gas all the properties we want: it is a symmetric (0, 2) tensor with second-order derivatives acting on the metric.
A first and second guess
Einstein's first guess for the field equation of GR was
(A.132) 𝑅𝜇𝜈 =? 𝜅𝛵𝜇𝜈, where 𝜅 is a constant.
However, it doesn't work because we can have ∇𝜇𝑅𝜇𝜈 ≠ 0, which would be inconsistent with the conservation of energy-momentum tensor, ∇𝜇𝛵𝜇𝜈 = 0. To see this, we consider the double contraction of the Bianchi identity (A. 129):
(A.133) 0 = 𝑔𝜎𝜆𝑔𝜇𝜌(∇𝜆𝑅𝜇𝜈𝜌𝜎 + ∇𝜈𝑅𝜆𝜇𝜌𝜎 + ∇𝜇𝑅𝜆𝜈𝜌𝜎) = ∇𝜎𝑅𝜈𝜎 - ∇𝜈𝑅 + ∇𝜇𝑅𝜈𝜌,
where 𝑅 = 𝑔𝜇𝜈 is the Ricci scalar. This implies that
(A.134) ∇𝜇𝑅𝜇𝜈 = 1/2 ∇𝜈𝑅,
which doesn't vanish except where 𝑅 (and hence 𝛵 = 𝑔𝜇𝜈𝛵𝜇𝜈) is a constant.
To fix the problem we simply not that (A. 134) can be written as
(A.135) ∇𝜇(𝑅𝜇𝜈 - 1/2 𝑔𝜇𝜈𝑅) = 0.
This suggests an alternative measure of curvature, the so called Eisenstein tensor.
(A.136) 𝐺𝜇𝜈 ≡ 𝑅𝜇𝜈 - 1/2 𝑔𝜇𝜈𝑅.
which is consistent with the conservation of the energy-momentum tensor. So the improved guess of the Einstein equation is
(A.137) 𝐺𝜇𝜈 =? 𝜅𝛵𝜇𝜈.
Now, we have to verify that it reduces to the Poisson equation (A. 136) in the Newtonian limit.
Newtonian limit
Contracting both sides of (A.137) gives
(A.138) 𝑅 = -𝜅𝛵,
where we use that 𝛿𝜇𝜇 = 4. Substituting this back into (A.137), we get trace-reverced Einstein equation:
(A.139) 𝑅𝜇𝜈 = 𝜅(𝛵𝜇𝜈 - 1/2 𝑔𝜇𝜈𝛵)
.
In the Newtonian limit, the energy-momentum tensor takes the form of a pressureless fluid, with 𝛵00 = 𝜌 and 𝑔00𝛵00 ≈ -𝛵00 = -𝜌. Note that we have taken 𝜌 to be small and could used the unperturbated metric at leading order. The temporal component of (A.139) is
(A.140) 𝑅00 = 1/2 𝜅𝜌.
We would like to evaluate 𝑅00 in the static, weak-field limit, where the metric can be written as 𝑔𝜇𝜈 = 𝜂𝜇𝜈 + 𝘩𝜇𝜈, cf. (A.77). In addition
(A.141) 𝑅00 = 𝑅𝑖0𝑖0 = ∂𝑖𝛤𝑖00 - ∂0𝛤𝑖𝑖0 + 𝛤𝑖𝑗𝜆𝛤𝜆00 - 𝛤𝑖0𝜆𝛤𝜆𝑗0 = ∂𝑖𝛤𝑖00.
Where we used 𝑅0000 = 0 and dropped the terms of the form 𝛤2 which are second order in the metric perturbation, because the the Christoffel symbols are first order.[verification needed] We also dropped ∂0𝛤𝑖𝑖0 because the metric perturbation is assumed to be time independent. In (A.78) we have 𝛤𝑖00 = -1/2 ∂𝑖𝘩00. At first order in the metric perturbation, we find
(A.142) 𝑅00 = 1/2 ∇2𝘩00,
and (A.140) becomes
(A.143) ∇2𝘩00 = -𝜅𝜌.
Recall 𝘩00 = -2𝛷 in (A.140) reproduce the Poisson equation (A.130) if 𝜅 = 8𝜋𝐺.
The Einstein equation
The final form of the Einstein equation then is
(A.144) 𝐺𝜇𝜈 = 8π𝐺 𝛵𝜇𝜈.
This is one of the most beautiful equations ever written down. It describes a wide range of phenomena from Newtonian physics to the expansion of the universe and black holes.
Note that (A.144) are ten second-order partial differential equations for the metric. Because the Bianchi identity, ∇𝜇𝐺𝜇𝜈 = 0, imposes four constraints, we have only six independent equations.
This counting makes sense since there are four coordinate transformations and hence the metric has six independent components.
A.4.4 Einstein-Hilbert Action
An alternative way of deriving the Einstein equations from an action principle. The action must be an integral over a scalar function. Moreover, this scalar function should be a measure of the local spacetime curvature and be at most second order in derivatives of the metric. The unique such object is Ricci scalar8 and the corresponding Einstein-Hilbert action is
(A.145) 𝑆 = ∫ d4𝑥√(-𝑔) 𝑅,
where 𝑔 ≡ det 𝑔𝜇𝜈 is the determinant of the metric. The factor of √(-𝑔) was introduced so that the volume element d4𝑥√(-𝑔) is invariant under a coordinate transformation.
The Einstein equation follows by varying the action with respect to the (inverse) metric. Writing the Ricci scalar as 𝑅 = 𝑔𝜇𝜈𝑅𝜇𝜈, we have
(A.146) 𝛿𝑆 = ∫ d4𝑥 [𝛿√(-𝑔) 𝑔𝜇𝜈𝑅𝜇𝜈 + √(-𝑔) 𝛿𝑔𝜇𝜈 𝑅𝜇𝜈 + √(-𝑔) 𝑔𝜇𝜈 𝛿𝑅𝜇𝜈].
It can be shown that the last term is a total derivative 𝑔𝜇𝜈 𝛿𝑅𝜇𝜈 = ∇𝜇𝑋𝜇, with 𝑋𝜇 ≡ 𝑔𝜌𝜈 𝛿𝛤𝜇𝜌𝜈 - 𝑔𝜇𝜈 𝛿𝛤𝜌𝜈𝜌, and can be dropped withot affecting the equation of motion. Te evaluate the first term, we use that
(A.147) 𝛿√(-𝑔) = -1/2 √(-𝑔) 𝑔𝜇𝜈 𝛿𝑔𝜇𝜈.
To prove this idensity, we have to use that any diagonalizable matrix 𝑀 satisfies log(det 𝑀) = Tr(log 𝑀). Substituting (A.147) into (A.146), we find
(A.148) 𝛿𝑆 = ∫ d4𝑥 (𝑅𝜇𝜈 - 1/2 𝑔𝜇𝜈𝑅) 𝛿𝑔𝜇𝜈.
For the action to be an extremum, this variation must vanish for arbitrary 𝛿𝑔𝜇𝜈. This is only case if 𝐺𝜇𝜈 = 0, which is the vacuum Einstein equation.
Exercise A.5 Show that the last term in (A.146) is a total derivative
(A.149) 𝑔𝜇𝜈 𝛿𝑅𝜇𝜈 = ∇𝜇𝑋𝜇, with 𝑋𝜇 ≡ 𝑔𝜌𝜈 𝛿𝛤𝜇𝜌𝜈 - 𝑔𝜇𝜈 𝛿𝛤𝜌𝜈𝜌.
Hint: We may use that 𝛤𝜇𝜌𝜈 is a tensor, although 𝛤𝜇𝜌𝜈 is not. Consider "normal coordinates" at a point 𝑝, for which 𝛤𝜇𝜌𝜈(𝑝) = 0. Compute 𝛿𝑅𝜇𝜈 in these coordinates. The result is a tensor equation and therefore holds in all coordinate systems.
[Solution] If we choose to work in normal coordinates 𝛤𝜇𝜌𝜈(𝑝) = 0, we have
(a) 𝛿𝑅𝜇𝜈 = ∂𝜆𝛿𝛤𝜆𝜇𝜈 - ∂𝜈𝛿𝛤𝜆𝜇𝜆 + 𝛤𝜆𝜌𝜆𝛤𝜌𝜇𝜈 - 𝛤𝜌𝜇𝜆𝛤𝜆𝜈𝜌 = ∇𝜆𝛿𝛤𝜆𝜇𝜈 - ∇𝜈𝛿𝛤𝜆𝜇𝜆
So we find
(b) 𝑔𝜇𝜈 𝛿𝑅𝜇𝜈 = 𝑔𝜇𝜈 (∇𝜎𝛿𝛤𝜎𝜇𝜈 - ∇𝜈𝛿𝛤𝜎𝜇𝜎) = ∇𝜎 (𝑔𝜇𝜈𝛿𝛤𝜎𝜇𝜈) - ∇𝜈 (𝑔𝜇𝜈𝛿𝛤𝜎𝜇𝜎) = ∇𝜎 (𝑔𝜇𝜈𝛿𝛤𝜎𝜇𝜈 - 𝑔𝜇𝜎𝛿𝛤𝜌𝜇𝜌) = ∇𝜎𝑋𝜎. ▮
A.4.5 Including Matter
To get the non-vacuum Einstein equation, we add an action for matter to Einstein-Hilbert action. The complete action is
(A.150) 𝑆 = 1/2𝜅 ∫ d4𝑥√(-𝑔) 𝑅 + 𝑆𝑀,
where the constant 𝜅 allows for a difference in the relative normalization of the gravitational action and the matter action. Varying this action with respect to the metric gives
(A.151) 𝑆 = 1/2 ∫ d4𝑥√(-𝑔) (1/𝜅 𝐺𝜇𝜈 - 𝛵𝜇𝜈) 𝛿𝑔𝜇𝜈,
where we defined the energy-momentum tensor as
(A.152) 𝛵𝜇𝜈 ≡ -2/√(-𝑔) 𝛿𝑆𝑀/𝛿𝑔𝜇𝜈. The action (A.150) has an extremum when the metric satisfies (A.137): 𝐺𝜇𝜈 = 𝜅𝛵𝜇𝜈. Fixing the constant 𝜅 in the same way before gives the Einstein equation (A.144).
A.4.6 The Cosmological Constant
There is one other term that can be added to the left side of the Einstein equation which is consistent with the local conservation of 𝛵𝜇𝜈, namely a term of the form 𝛬𝑔𝜇𝜈 = 0. Einstein added such a term and called it the cosmological constant. The modified form of the Einstein equation is
(A.153) 𝐺𝜇𝜈 + 𝛬𝑔𝜇𝜈 = 8π𝐺 𝛵𝜇𝜈.
It has also become common practice to identify this cosmological constant with the stress-energy of the vacuum (if any) and include it on the right side as a contribution to the energy-momentum tensor. The action leading to (A.153) is
(A.154) 𝑆 = 1/16π𝐺 ∫ d4𝑥√(-𝑔) (𝑅 - 2𝛬) + 𝑆𝑀.
We see that the cosmological constant corresponds to a pure volume term in the action.
A.4.7 Some Simple Solutions
The Einstein equations are complicated nonlinear functions of the metric, but a few exact solutions exist in solution with a large amount of symmetry. We will first consider the vacuum Einstein equation (𝛵𝜇𝜈 = 0) with a cosmological constant. Contracting both sides of (A. 153) with the metric, we get 𝑅 = 4𝛬 and hence
(A.155) 𝑅𝜇𝜈 = 𝛬𝑔𝜇𝜈.
There are a few famous solutions to this equation:
• First, we set 𝛬 = 0. The Minkowski line elements,
(A.156) d𝑠2 = -d𝑡2 + d𝐱2,
satiafies the vacuum Einstein equation 𝑅𝜇𝜈 = 0. A more nontrivial solution to the same equation is the Schwarzschild metric:
(A.157) d𝑠2 = -(1 - 2𝐺𝑀/𝑟) d𝑡2 + (1 - 2𝐺𝑀/𝑟)-1 d𝑟2 + 𝑟2d𝛺22,
where d𝛺22 ≡ d𝜃2 + sin2𝜃 d𝜙2 is the metric on the unit 2-sphere. This solution describes the spacetime around a spherically symmetric object of mass 𝑀.
• Next, we consider the case of positive cosmological constant, 𝛬 > 0. The corresponding solution to the Einstein equation is de Sitter space (in static patch coordinates):
(A.158) d𝑠2 = -(1 - 𝑟2/𝑅2) d𝑡2 + (1 - 𝑟2/𝑅2)-1 d𝑟2 + 𝑟2d𝛺22,
where 𝑅2 ≡ 3/𝛬. The static patch coordinates cover only part of the de Sitter geometry, namely that accessible to a single observer which is bounded by the cosmological horizon at 𝑟 = 𝑅. Alternative coordinates that cover the whole space are so-called global coordinates
(A.159) d𝑠2 = -d𝛵2 + 𝑅2 cosh2 (𝛵/𝑅) d𝛺32,
where d𝛺32 = d𝜓2 + sin2 𝜓 d𝛺32 is the metric on the unit 3-sphere. In these coordinates, we can think of de Sitter space as an evolving 3-sphere that starts infinitely large at 𝛵 → -∞, shrinks to a minimal size at 𝛵 = 0 and then expand to infinit e size at 𝛵 → +∞. In applications to inflation, we often use the planar coordinates
(A.160) d𝑠2 = -d𝑡̂2 + 𝑒2𝑖/𝑅 (d𝑟2 + 𝑟2d𝛺22),
which cover half of the global geometry. This describes an exponentially expanding universe with flat spatial slicles (although this timedependence only becomes physical when the time translation invariance of de Sitter space is broken by additional matter fields like inflaton). Refer to [2, 3] about more on the geometry of de Sitter space and the different coorfinates.
• Finally, we can also take thr cosmological constant to be negative, 𝛬 < 0. The corresponding solution is anti-de Sitter space:
(A.161) d𝑠2 = -(1 + 𝑟2/𝑅2) d𝑡2 + (1 + 𝑟2/𝑅2)-1 d𝑟2 + 𝑟2d𝛺22, where 𝑅2 ≡ -3/𝛬.
Last, the Robertson-Walker metric for a homogeneous and isotropic universe is (see Chapter 2)
(A.162) d𝑠2 = -d𝑡2 + 𝑎2(𝑡) d𝐱2,
where the scale factor 𝑎(𝑡) describes the expansion of the universe. This solves the Einstein equation if the energy-momentum tensor is that of a perfect fluid, 𝛵𝜇𝜈 = diag(𝜌, 𝑃, 𝑃, 𝑃) (in the rest frame), and the scale factor satisfies the Friedmann equations:
(A.163-4) (𝑎̇/𝑎)2 = 8π𝐺/3 𝜌. (𝑑2𝑎/𝑑𝑡2)/𝑎 = -4π𝐺/3 (𝜌 + 3𝑃).
These equations play an important role in cosmology.
In A.5 Summary, there is one more description that the Newtonian limit of gravity is captured by line elements
(A.171) d𝑠2 = -(1 + 2𝛷)d𝑡2 + (1 -2𝛷)𝛿𝑖𝑗d𝑥𝑖d𝑥𝑗
where 𝛷(𝑥𝑖) ≪ 1. With this metric, the geodesic equation and the Einstein Equation reduces to d2𝑥𝑖/d𝑡2 = -∂𝑖𝛷 and ∇2𝛷 = 4π𝐺𝜌.
4 The Christoffel symbol defined in (A.63) is a special case of a connection called Levi-Civita connection.
5 It is possible to find coordinate-call "Riemann normal coordinates"-so that they vanish at a given point, 𝛤𝜇𝛼𝛽(𝑝) = 0, where covariant derivatives reduce to partial derivatives and the physics becomes that of SR.
8 Gravity as an effective field theory also contains higher-order curvature terms such as 𝑅2 or 𝑅𝜇𝜈𝑅𝜇𝜈. These are only important at very short distances.