Skip to content

8.2: Momentum

In this chapter we talk about:

  • Electromagnetic momentum
  • Maxwell stress tensor
  • Conservation of electromagnetic momentum
  • Angular momentum in EM fields

8.2.1: Electromagnetic Momentum

As it turns out, if you disregard the momentum associated with electromagnetic fields, Newton's laws appear not to work out! Consider a basic system in cartesian coordinates of two moving point charges:

Figure 8.3

What happens between the two charges? Well, the magnetic field of q1 q_1 points into the page at q2 q_2 , so the magnetic force on q2 q_2 is to the right, and the magnetic field of q2 q_2 is out of the page at q1 q_1 , so the magnetic force on q1 q_1 is upward. The net electric force between the two charges is repulsive and opposite, but the magnetic forces aren't, so the electromagnetic force on q1 q_1 on q2 q_2 is equal but not opposite to the force of q2 q_2 on q1 q_1 , in violation of Newton's third law! We've got a problem, and we're going to solve it by invoking the momentum of the EM field.

8.2.2 The Maxwell Stress Tensor

The way to recover conservation of momentum proceeds the same way we recovered the conservation of energy via the Poynting vector. Starting with the basic Coulomb/Lorentz laws, we'll write down an expression for the electromagnetic force on charges in a volume. We're going to integrate that over all space, which can have any distribution of charge, and relate that expression for force to an expression which only involves the field. In the interest of brevity, we'll skip around a bit and leave the full derivations for the real textbook.

Suppose we have a volume V V containing some distribution of charge, current, and electromagnetic fields. The total force on that volume is

F=V(E+v×B)ρdτ(dq=ρdτ)=V(ρE+J×B)dτ(J=ρv) \vec{F} = \int_V ( \vec E + \vec v \cross \vec B) \rho \dd \tau \qquad (\dd q = \rho \dd \tau) \\ = \int _V (\rho \vec E + \vec J \cross \vec B ) \dd \tau \qquad (\vec J = \rho \vec v)

Again, the goal is to replace anything that looks like a source in favor of fields, using Maxwell's equations. It's handy to define the force per unit volume f f :

fρE+J×B \vec f \equiv \rho \vec E + \vec J \cross \vec B

ρ=ϵ0(E)(Gauss’ Law)J=1μ0×Bϵ0Et \rho = \epsilon_0 ( \div \vec E ) \quad \text{(Gauss' Law)} \\ \vec J = \frac{1}{\mu_0} \curl \vec B - \epsilon_0 \pdv{\vec E}{t}

f=ϵ0(E)E+(×Bμ0ϵ0Et)×B \rightarrow \vec f = \epsilon_0 ( \div \vec E) \vec E + \left( \frac{\curl \vec B}{\mu_0} - \epsilon_0 \pdv{\vec E}{t} \right) \cross \vec B

Skipping through a few steps, we cut to the chase. Similar to the derivation of the Poynting theorem, also using the other two Maxwell equations we haven't yet, we get

f=Tϵ0μ0St \vec f = \div \overline{\vec T} - \epsilon_0 \mu_0 \pdv{\vec S}{t}

T(TxxTxyTxzTyxTyyTyzTzxTzyTzz)Tijϵ0(EiEj12δijE2)+1μ0(BiBj12δijB2) \overline{\vec T} \equiv \begin{pmatrix} T_{xx} & T_{xy} & T_{xz} \\ T_{yx} & T_{yy} & T_{yz} \\ T_{zx} & T_{zy} & T_{zz} \end{pmatrix} \\ T_{ij} \equiv \epsilon_0 \left( E_i E_j - \frac{1}{2} \delta_{ij} E^2 \right) + \frac{1}{\mu_0} \left(B_i B_j - \frac{1}{2} \delta_{ij} B^2 \right)

where S \vec S is the Poynting vector and T \overline{\vec T} is the so-called "Maxwell stress tensor." To keep in mind what kind of units we're talking about here, f \vec f has units force per unit volume, and the divergence will strip one spatial dimension, so the Maxwell stress tensor will have units of stress (force per unit area).

The tensor has diagonal "pressure" terms and off-diagonal "shear" terms. For "pressure" forces, the force and area are in the same direction, and in the "shear" case the force and area are orthogonal.

The divergence term T \div \overline{\vec T} is itself a vector

T=(i^x+j^y+k^z)T=ϵ0[(E)Ej+(E)Ej12jE2]+1μ0[(B)Bj+(B)Bj12jB2] \div \overline{\vec T} = \left( \vu{i} \pdv{}{x} + \vu j \pdv{}{y} + \vu k \pdv{}{z} \right) \cdot \overline{\vec T} \\ = \epsilon_0 \left[ (\div \vec E) E_j + (\vec E \cdot \grad) E_j - \frac{1}{2} \grad _j E^2 \right] \\ + \frac{1}{\mu_0} \left[ ( \div \vec B) B_j + (\vec B \cdot \grad) B_j - \frac{1}{2} \grad _j B^2 \right]

As we do the volume integral to go from f \vec f to F \vec F

F=V(Tϵ0μ0St)dτ=Tdaϵ0μ0tSdτ \vec F = \int _V \left(\div\overline{\vec T} - \epsilon_0 \mu_0 \pdv{\vec S}{t} \right) \dd \tau \\ = \oint \overline{\vec{T}} \cdot \dd \vec a - \epsilon_0 \mu_0 \pdv{}{t} \int \vec{S} \dd \tau

8.2.3 Conservation of Electromagnetic Momentum

F=dpmechdt=ϵ0μ0ddtVSdτ+Tda \vec F = \dv{\vec{p}_{mech}}{t} = - \epsilon_0 \mu_0 \dv{}{t} \int_V \vec S \dd \tau + \oint \overline{\vec T} \cdot \dd \vec a

The first term on the right is related to the momentum stored in the electromagnetic field. The second term is the rate at which momentum flows across the surface, and we describe the left-hand-side as the rate of change of the momentum of charges within the volume.

We identify another useful term as the first integrand on the right:

gϵ0μ0S=ϵ0(E×B)(momentum density in EM fields) \vec g \equiv \epsilon_0 \mu_0 \vec S = \epsilon_0 (\vec E \cross B) \quad \text{(momentum density in EM fields)}

which is the momentum density within the fields. Just as a note, the signs here are swapped from the Poynting theorem - the Maxwell stress tensor is defined such that momentum flowing into the region corresponds with increasing T \overline{\vec T} , and vice-versa, opposite the case we had with S \vec S .

In a charge-free region,

ddtVgdτ+STda=0 - \dv{}{t} \int_V \vec g \dd \tau + \oint_S \overline{\vec T} \cdot \dd \vec a = 0

and since the above is true for all regions V V , we have our familiar continuity-type equation

gt+T=0 - \pdv{\vec g}{t} + \div \overline{\vec T} = 0

Example: Problem 8.7

Consider an infinite parallel-plate capacitor, with the lower plate (at z=d/2 z = -d/2 ) carrying surface charge density σ -\sigma , and the upper plate (at z=+d/2 z = +d/2 ) carrying charge density +σ + \sigma .

(a) Determine all elements of the stress tensor, in the region between the plates. Lucky for us, the magnetic field between the plates is

B=0 \vec B = 0

The electric field is very simple

E=σϵ0(z^) \vec E = \frac{\sigma}{\epsilon_0} ( - \vu z)

So we can already tell that the off-diagonal terms will be zero, since they all contain two factors of EiEj E_i E_j , one of which will be zero. The B B terms on the diagonal will be zero, and

T=σ22ϵ0(100010001) \overline{\vec T} = \frac{\sigma^2}{2 \epsilon_0} \begin{pmatrix} -1 & 0 & 0 \\ 0 & -1 & 0 \\ 0 & 0 & 1 \end{pmatrix}

(b) Use F=STda \vec F = \oint_S \overline{\vec T} \cdot \dd \vec a at the boundary to determine the electromagnetic force per unit area on the top plate.

The force on the top plate will be

F/A=1ASTda=1Aσ22ϵ0(100010001)(00A)=σ22ϵ0z^ \vec F / A = \frac{1}{A} \oint _S \overline{\vec T} \cdot \dd \vec a \\ = \frac{1}{A} \frac{\sigma ^2}{2 \epsilon_0} \begin{pmatrix} -1 & 0 & 0 \\ 0 & -1 & 0 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} 0 \\ 0 \\ -A \end{pmatrix} \\ = - \frac{\sigma^2}{2 \epsilon_0} \vu z

(c) What is the electromagnetic momentum per unit area, per unit time, crossing the xy plane (or any other plane parallel to that one, between the plates)?

This is just to show us that the momentum flow through the interior of the capacitor is the same as whatever force is pulling the top and bottom plates apart.

Tzz=σ2/2ϵ0 - T_{zz} = - \sigma^2 / 2 \epsilon_0

is the momentum in the z direction crossing a surface perpendicular to z, per unit area, per unit time.

8.2.4 Angular Momentum in EM Fields

As a reminder, we associate with the electromagnetic fields an energy density

uem=12ϵ0E2+μ02B2 u_{em} = \frac{1}{2} \epsilon_0 E^2 + \frac{\mu_0}{2} B^2

and a momentum density

g=ϵ0(E×B) \vec g = \epsilon_0 ( \vec E \cross \vec B)

For that matter, we define angular momentum in the normal fashion

l=r×g=ϵ0[r×(E×B)] \vec l = \vec r \cross \vec g = \epsilon_0 [ \vec r \cross ( \vec E \cross \vec B) ]

where the presence of r \vec r means it's defined about some point or axis.

Example 8.4

Imagine a very long solenoid with radius R R , n n turns per unit length, and current I I . Coaxial with the solenoid are two long cylindrical (non-conducting) shells of length l l - one inside the solenoid at radius a a carries a charge +Q +Q distributed uniformly over its surface; the other outside the solenoid at radius b b carries charge Q -Q . When the current in the solenoid is gradually reduced to nothing, the cylinders begin to rotate. Question: where does the angular momentum come from?

Figure 8.7

We'll start by writing down the initial angular momentum of the system, then see what happens when we start to change the current. It is assumed that the solenoid is very long compared with the charged cylinders, and that the length of each charged cylinder is much much greater than its radius.

The initial E \vec E is given by looking at a Gaussian cylinder between the two charged cylinders

E2πsl=Qϵ0 E 2 \pi s l = \frac{Q}{\epsilon_0}

E=Q2πϵ0l1ss^a<s<b \rightarrow \vec E = \frac{Q}{2 \pi \epsilon_0 l} \frac{1}{s} \vu s \qquad a < s < b

The initial B \vec B is just that of the solenoid, namely

B={μ0nIz^0<s<R0,s>R \vec B = \begin{cases} \mu_0 n I \vu z & \qquad 0 < s < R \\ 0, & \qquad s > R \end{cases}

The linear momentum density we defined earlier is

g=ϵ0μ0S=ϵ0(E×B)=QnIμ022πs(ϕ^) \vec g = \epsilon_0 \mu_0 \vec S = \epsilon_0 ( \vec E \cross \vec B) = \frac{Q n I \mu_0 ^2}{2 \pi s} (- \vu \phi)

And the angular momentum density with respect to the cylindrical axis is

l=s×g \vec l = \vec s \cross \vec g

And it will point in the z^ - \vu z direction

L=Vldτ=Vs×ϵ0Q2πϵ0l1ss^×μ0Inz^dτ \vec L = \int _V \vec l \dd \tau = \int_V \vec s \cross \epsilon_0 \frac{Q}{2 \pi \epsilon_0 l} \frac{1}{s} \vu s \cross \mu_0 I n \vu z \dd \tau

=μ0nIQ2πlVs^×ϕ^dτ=μ0nIQ2πlz^π(R2a2)l=μnnIQ2(R2a2)z^ = - \frac{\mu_0 n I Q }{2 \pi l} \int _V \vu s \cross \vu \phi \dd \tau = - \frac{\mu_0 n I Q}{2 \pi l} \vu z \pi (R^2 - a^2 ) l = - \frac{\mu_n n I Q}{2} (R^2 - a^2) \vu z

So that's the initial angular momentum. Of course, nothing is moving, so this is just the momentum stored in the EM fields. Assuming nothing like friction complicates the situation, any angular momentum lost by the EM fields will be transferred to the cylinders as rotation.

As we crank down the current, we know that a time variation in the magnetic field implies an induced EMF E=dΦdt \mathcal{E} = - \dv{\Phi}{t} . E \vec E experienced at radii a a and b b are now different. Use Faraday

Ea2πa=μ0ndIdtπa2Ea=μ0na2dIdtϕ^ E_a 2 \pi a = - \mu_0 n \dv{I}{t} \pi a^2 \rightarrow \vec E_a = - \frac{\mu_0 n a}{2} \dv{I}{t} \vu \phi

This field is in the direction to torque cylinder +Q +Q

Γa=s×(QEa)=μ0na2dIdtQaϕ^ \vec{\Gamma_a} = \vec s \cross (Q \vec{E_a}) = - \frac{\mu_0 n a}{2} \dv{I}{t} Q a \vu \phi

=μ0na22QdIdtz^ = - \frac{\mu_0 n a^2}{2} Q \dv{I}{t} \vu z

Integrating torque over time, we get the changed angular momentum of the cylinder

La=0tτdt=μ0na22Q0dIdtdtz^=μ0na22Qz^0dI \vec{L_a} = \int_0 ^t \tau \dd t = - \frac{\mu_0 n a^2}{2} Q \int_0 ^\infty \dv{I}{t} \dd t \vu z = - \frac{\mu_0 n a^2}{2} Q \vu z \int_0 ^\infty \dd I

=μ0nI2a2z^ = \frac{\mu_0 n I}{2} a^2 \vu z

What happens to the outer cylinder? Something very similar, but when we calculated from Faraday's law our loop radius is b b , and the area of flux is πR2 \pi R^2

Eb=μ0nR22adIdtϕ^ \vec E_b = - \frac{\mu_0 n R^2}{2a} \dv{I}{t} \vu \phi

Fb=QEb \vec{F_b} = - Q \vec{E}_b

Γb=bs^×Fb \vec {\Gamma _b} = b \vu s \cross \vec{F_b}

Lb=μ0nR2Q20dIdtdtz^=μ0nR2QI2z^ \rightarrow \vec{L}_b = \frac{\mu_0 n R^2 Q}{2} \int_0 ^\infty \dv{I}{t} \dd t \vu z = - \frac{\mu_0 n R^2 Q I}{2} \vu z

So the total angular momentum once the current is finally turned down is

La+Lb=μ0nIQ2(R2a2)z^ \vec{L_a} + \vec{L_b} = - \frac{\mu_0 n I Q}{2} (R^2 - a^2) \vu z

which is exactly the angular momentum that was stored in the fields, so we've successfully conserved angular momentum :)