8.2: Momentum#

In this chapter we talk about:

Electromagnetic momentum
Maxwell stress tensor
Conservation of electromagnetic momentum
Angular momentum in EM fields

8.2.1: Electromagnetic Momentum#

As it turns out, if you disregard the momentum associated with electromagnetic fields, Newton’s laws appear not to work out! Consider a basic system in cartesian coordinates of two moving point charges:

Figure 8.3

What happens between the two charges? Well, the magnetic field of \( q_1 \) points into the page at \( q_2 \), so the magnetic force on \( q_2 \) is to the right, and the magnetic field of \( q_2 \) is out of the page at \( q_1 \) , so the magnetic force on \( q_1 \) is upward. The net electric force between the two charges is repulsive and opposite, but the magnetic forces aren’t, so the electromagnetic force on \( q_1 \) on \( q_2 \) is equal but not opposite to the force of \( q_2 \) on \( q_1 \), in violation of Newton’s third law! We’ve got a problem, and we’re going to solve it by invoking the momentum of the EM field.

8.2.2 The Maxwell Stress Tensor#

The way to recover conservation of momentum proceeds the same way we recovered the conservation of energy via the Poynting vector. Starting with the basic Coulomb/Lorentz laws, we’ll write down an expression for the electromagnetic force on charges in a volume. We’re going to integrate that over all space, which can have any distribution of charge, and relate that expression for force to an expression which only involves the field. In the interest of brevity, we’ll skip around a bit and leave the full derivations for the real textbook.

Suppose we have a volume \( V \) containing some distribution of charge, current, and electromagnetic fields. The total force on that volume is

\[\vec{F} = \int_V ( \vec E + \vec v \cross \vec B) \rho \dd \tau \qquad (\dd q = \rho \dd \tau) \\ = \int _V (\rho \vec E + \vec J \cross \vec B ) \dd \tau \qquad (\vec J = \rho \vec v)\]

Again, the goal is to replace anything that looks like a source in favor of fields, using Maxwell’s equations. It’s handy to define the force per unit volume \( f \):

\[\vec f \equiv \rho \vec E + \vec J \cross \vec B\] \[\rho = \epsilon_0 ( \div \vec E ) \quad \text{(Gauss' Law)} \\ \vec J = \frac{1}{\mu_0} \curl \vec B - \epsilon_0 \pdv{\vec E}{t}\] \[\rightarrow \vec f = \epsilon_0 ( \div \vec E) \vec E + \left( \frac{\curl \vec B}{\mu_0} - \epsilon_0 \pdv{\vec E}{t} \right) \cross \vec B\]

Skipping through a few steps, we cut to the chase. Similar to the derivation of the Poynting theorem, also using the other two Maxwell equations we haven’t yet, we get

\[\vec f = \div \overline{\vec T} - \epsilon_0 \mu_0 \pdv{\vec S}{t}\] \[\overline{\vec T} \equiv \begin{pmatrix} T_{xx} & T_{xy} & T_{xz} \\ T_{yx} & T_{yy} & T_{yz} \\ T_{zx} & T_{zy} & T_{zz} \end{pmatrix} \\ T_{ij} \equiv \epsilon_0 \left( E_i E_j - \frac{1}{2} \delta_{ij} E^2 \right) + \frac{1}{\mu_0} \left(B_i B_j - \frac{1}{2} \delta_{ij} B^2 \right)\]

where \( \vec S \) is the Poynting vector and \( \overline{\vec T} \) is the so-called “Maxwell stress tensor.” To keep in mind what kind of units we’re talking about here, \( \vec f \) has units force per unit volume, and the divergence will strip one spatial dimension, so the Maxwell stress tensor will have units of stress (force per unit area).

The tensor has diagonal “pressure” terms and off-diagonal “shear” terms. For “pressure” forces, the force and area are in the same direction, and in the “shear” case the force and area are orthogonal.

The divergence term \( \div \overline{\vec T} \) is itself a vector

\[\div \overline{\vec T} = \left( \vu{i} \pdv{}{x} + \vu j \pdv{}{y} + \vu k \pdv{}{z} \right) \cdot \overline{\vec T} \\ = \epsilon_0 \left[ (\div \vec E) E_j + (\vec E \cdot \grad) E_j - \frac{1}{2} \grad _j E^2 \right] \\ + \frac{1}{\mu_0} \left[ ( \div \vec B) B_j + (\vec B \cdot \grad) B_j - \frac{1}{2} \grad _j B^2 \right]\]

As we do the volume integral to go from \( \vec f \) to \( \vec F \)

\[\vec F = \int _V \left(\div\overline{\vec T} - \epsilon_0 \mu_0 \pdv{\vec S}{t} \right) \dd \tau \\ = \oint \overline{\vec{T}} \cdot \dd \vec a - \epsilon_0 \mu_0 \pdv{}{t} \int \vec{S} \dd \tau\]

8.2.3 Conservation of Electromagnetic Momentum#

\[\vec F = \dv{\vec{p}_{mech}}{t} = - \epsilon_0 \mu_0 \dv{}{t} \int_V \vec S \dd \tau + \oint \overline{\vec T} \cdot \dd \vec a\]

The first term on the right is related to the momentum stored in the electromagnetic field. The second term is the rate at which momentum flows across the surface, and we describe the left-hand-side as the rate of change of the momentum of charges within the volume.

We identify another useful term as the first integrand on the right:

\[\vec g \equiv \epsilon_0 \mu_0 \vec S = \epsilon_0 (\vec E \cross B) \quad \text{(momentum density in EM fields)}\]

which is the momentum density within the fields. Just as a note, the signs here are swapped from the Poynting theorem - the Maxwell stress tensor is defined such that momentum flowing into the region corresponds with increasing \( \overline{\vec T} \), and vice-versa, opposite the case we had with \( \vec S \).

In a charge-free region,

\[- \dv{}{t} \int_V \vec g \dd \tau + \oint_S \overline{\vec T} \cdot \dd \vec a = 0\]

and since the above is true for all regions \( V \), we have our familiar continuity-type equation

\[- \pdv{\vec g}{t} + \div \overline{\vec T} = 0\]

Example: Problem 8.7#

Consider an infinite parallel-plate capacitor, with the lower plate (at \( z = -d/2 \)) carrying surface charge density \( -\sigma \), and the upper plate (at \( z = +d/2 \)) carrying charge density \( + \sigma \).

(a) Determine all elements of the stress tensor, in the region between the plates.

(b) Use \( \vec F = \oint_S \overline{\vec T} \cdot \dd \vec a \) at the boundary to determine the electromagnetic force per unit area on the top plate.

(c) What is the electromagnetic momentum per unit area, per unit time, crossing the xy plane (or any other plane parallel to that one, between the plates)?

(a) Lucky for us, the magnetic field between the plates is

\[\vec B = 0\]

The electric field is very simple

\[\vec E = \frac{\sigma}{\epsilon_0} ( - \vu z)\]

So we can already tell that the off-diagonal terms will be zero, since they all contain two factors of \( E_i E_j \), one of which will be zero. The \( B \) terms on the diagonal will be zero, and

\[\overline{\vec T} = \frac{\sigma^2}{2 \epsilon_0} \begin{pmatrix} -1 & 0 & 0 \\ 0 & -1 & 0 \\ 0 & 0 & 1 \end{pmatrix}\]

(b) The force on the top plate will be

\[\vec F / A = \frac{1}{A} \oint _S \overline{\vec T} \cdot \dd \vec a \\ = \frac{1}{A} \frac{\sigma ^2}{2 \epsilon_0} \begin{pmatrix} -1 & 0 & 0 \\ 0 & -1 & 0 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} 0 \\ 0 \\ -A \end{pmatrix} \\ = - \frac{\sigma^2}{2 \epsilon_0} \vu z\]

(c) This is just to show us that the momentum flow through the interior of the capacitor is the same as whatever force is pulling the top and bottom plates apart.

\[- T_{zz} = - \sigma^2 / 2 \epsilon_0 \]

is the momentum in the z direction crossing a surface perpendicular to z, per unit area, per unit time.

8.2.4 Angular Momentum in EM Fields#

As a reminder, we associate with the electromagnetic fields an energy density

\[u_{em} = \frac{1}{2} \epsilon_0 E^2 + \frac{\mu_0}{2} B^2\]

and a momentum density

\[\vec g = \epsilon_0 ( \vec E \cross \vec B)\]

For that matter, we define angular momentum in the normal fashion

\[\vec l = \vec r \cross \vec g = \epsilon_0 [ \vec r \cross ( \vec E \cross \vec B) ]\]

where the presence of \( \vec r \) means it’s defined about some point or axis.

Example 8.4#

Imagine a very long solenoid with radius \( R \), \( n \) turns per unit length, and current \( I \). Coaxial with the solenoid are two long cylindrical (non-conducting) shells of length \( l \) - one inside the solenoid at radius \( a \) carries a charge \( +Q \) distributed uniformly over its surface; the other outside the solenoid at radius \( b \) carries charge \( -Q \) . When the current in the solenoid is gradually reduced to nothing, the cylinders begin to rotate. Question: where does the angular momentum come from?

Figure 8.7

We’ll start by writing down the initial angular momentum of the system, then see what happens when we start to change the current. It is assumed that the solenoid is very long compared with the charged cylinders, and that the length of each charged cylinder is much much greater than its radius.

The initial \( \vec E \) is given by looking at a Gaussian cylinder between the two charged cylinders

\[E 2 \pi s l = \frac{Q}{\epsilon_0}\] \[\rightarrow \vec E = \frac{Q}{2 \pi \epsilon_0 l} \frac{1}{s} \vu s \qquad a < s < b\]

The initial \( \vec B \) is just that of the solenoid, namely

\[\vec B = \begin{cases} \mu_0 n I \vu z & \qquad 0 < s < R \\ 0, & \qquad s > R \end{cases}\]

The linear momentum density we defined earlier is

\[\vec g = \epsilon_0 \mu_0 \vec S = \epsilon_0 ( \vec E \cross \vec B) = \frac{Q n I \mu_0 ^2}{2 \pi s} (- \vu \phi) \]

And the angular momentum density with respect to the cylindrical axis is

\[\vec l = \vec s \cross \vec g\]

And it will point in the \( - \vu z \) direction

\[\vec L = \int _V \vec l \dd \tau = \int_V \vec s \cross \epsilon_0 \frac{Q}{2 \pi \epsilon_0 l} \frac{1}{s} \vu s \cross \mu_0 I n \vu z \dd \tau\] \[= - \frac{\mu_0 n I Q }{2 \pi l} \int _V \vu s \cross \vu \phi \dd \tau = - \frac{\mu_0 n I Q}{2 \pi l} \vu z \pi (R^2 - a^2 ) l = - \frac{\mu_n n I Q}{2} (R^2 - a^2) \vu z\]

So that’s the initial angular momentum. Of course, nothing is moving, so this is just the momentum stored in the EM fields. Assuming nothing like friction complicates the situation, any angular momentum lost by the EM fields will be transferred to the cylinders as rotation.

As we crank down the current, we know that a time variation in the magnetic field implies an induced EMF \( \mathcal{E} = - \dv{\Phi}{t} \). \( \vec E \) experienced at radii \( a \) and \( b \) are now different. Use Faraday

\[E_a 2 \pi a = - \mu_0 n \dv{I}{t} \pi a^2 \rightarrow \vec E_a = - \frac{\mu_0 n a}{2} \dv{I}{t} \vu \phi\]

This field is in the direction to torque cylinder \( +Q \)

\[\vec{\Gamma_a} = \vec s \cross (Q \vec{E_a}) = - \frac{\mu_0 n a}{2} \dv{I}{t} Q a \vu \phi\] \[= - \frac{\mu_0 n a^2}{2} Q \dv{I}{t} \vu z\]

Integrating torque over time, we get the changed angular momentum of the cylinder

\[\vec{L_a} = \int_0 ^t \tau \dd t = - \frac{\mu_0 n a^2}{2} Q \int_0 ^\infty \dv{I}{t} \dd t \vu z = - \frac{\mu_0 n a^2}{2} Q \vu z \int_0 ^\infty \dd I\] \[= \frac{\mu_0 n I}{2} a^2 \vu z\]

What happens to the outer cylinder? Something very similar, but when we calculated from Faraday’s law our loop radius is \( b \), and the area of flux is \( \pi R^2 \)

\[\vec E_b = - \frac{\mu_0 n R^2}{2a} \dv{I}{t} \vu \phi\] \[\vec{F_b} = - Q \vec{E}_b\] \[\vec {\Gamma _b} = b \vu s \cross \vec{F_b}\] \[\rightarrow \vec{L}_b = \frac{\mu_0 n R^2 Q}{2} \int_0 ^\infty \dv{I}{t} \dd t \vu z = - \frac{\mu_0 n R^2 Q I}{2} \vu z\]

So the total angular momentum once the current is finally turned down is

\[\vec{L_a} + \vec{L_b} = - \frac{\mu_0 n I Q}{2} (R^2 - a^2) \vu z\]

which is exactly the angular momentum that was stored in the fields, so we’ve successfully conserved angular momentum :)