Dirac equation
Encyclopedia : D : DI : DIR : Dirac equation
In physics, the Dirac equation is a relativistic quantum mechanical wave equation formulated by British physicist Paul Dirac in 1928 and provides a description of elementary spin-½ particles, such as electrons, consistent with both the principles of quantum mechanics and the theory of special relativity. The equation demands the existence of antiparticles and actually predated their experimental discovery, making the discovery of the positron, the antiparticle of the electron, one of the greatest triumphs of modern theoretical physics.
Since the Dirac equation was originally invented to describe the electron, we will generally speak of "electrons" in this article. Actually, the equation also applies to quarks, which are also elementary spin-½ particles. A modified Dirac equation can be used to approximately describe protons and neutrons, which are not elementary particles (they are made up of quarks). Another modification of the Dirac equation, called the Majorana equation, is thought to describe neutrinos.
The Dirac equation is
- [ \left(\alpha_0 mc^2 + \sum_^3 \alpha_j p_j \, c\right) \psi (\mathbf,t) = i \hbar \frac (\mathbf,t) ]
The α's are linear operators that act on the wavefunction. Their most fundamental property is that they must anticommute with each other. In other words,
- [\alpha_i\alpha_j = -\alpha_j\alpha_i,]
A convenient (but not unique) choice of [\alpha]s is
- [\alpha_0 = \begin 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & -1 \end, \quad \alpha_1 = \begin 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 \\ 1 & 0 & 0 & 0 \end, ]
- [\alpha_2 = \begin 0 & 0 & 0 & -i \\ 0 & 0 & i & 0 \\ 0 & -i& 0 & 0 \\ i & 0 & 0 & 0 \end, \quad \alpha_3 = \begin 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & -1 \\ 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \end, ]
The Dirac equation describes the probability amplitudes for a single electron. This is a single-particle theory; in other words, it does not account for the creation and destruction of the particles. It gives a good prediction of the magnetic moment of the electron and explains much of the fine structure observed in atomic spectral lines. It also explains the spin of the electron. Two of the four solutions of the equation correspond to the two spin states of the electron. The other two solutions make the peculiar prediction that there exist an infinite set of quantum states in which the electron possesses negative energy. This strange result led Dirac to predict, via a remarkable hypothesis known as "hole theory," the existence of particles behaving like positively-charged electrons. Dirac thought at first these particles might be protons. He was chagrined when the strict predition of his equation (which actually specifies particles of the same mass as the electron) was verified by the discovery of the positron in 1932. When asked later why he hadn't actually boldy predicted the yet unfound positron with its correct mass, Dirac answered "Pure cowardice!" He shared the Nobel Prize anyway, in 1933.
Despite these successes, Dirac's theory is flawed by its neglect of the possibility of creating and destroying particles, one of the basic consequences of relativity. This difficulty is resolved by reformulating it as a quantum field theory. Adding a quantized electromagnetic field to this theory leads to the theory of quantum electrodynamics (QED). Moreover the equation cannot fully account for particles of negative energy but is restricted to positive energy particles.
A similar equation for spin 3/2 particles is called the Rarita-Schwinger equation.
Derivation of the Dirac equation
The Dirac equation is a relativistic extension of the Schrödinger equation, which describes the time-evolution of a quantum mechanical system:
- [ H \left| \psi (t) \right\rangle = i \hbar \left| \psi (t) \right\rangle.]
- [ H \psi (\mathbf,t) = i \hbar \frac (\mathbf,t) ]
We have to specify the Hamiltonian so that it appropriately describes the total energy of the system in question. Let us consider a "free" electron isolated from all external force fields. For a non-relativistic model, we adopt a Hamiltonian analogous to the kinetic energy of classical mechanics (ignoring spin for the moment):
- [ H = \sum_^3 \frac, ]
- [p_j \psi(\mathbf,t) \equiv - i \hbar \, \frac (\mathbf,t)]
- [E = \sqrt^3 (p_jc)^2}.]
- [ \sqrt^3 (p_jc)^2} \; \psi = i \hbar \frac. ]
- [E \cdot I = \alpha_0 mc^2 + c \sum_^3 \alpha_i p_i. ]
- [i\hbar \frac = \left[ c sum_^3 alpha_i p_i + alpha_0 mc^2 right] \psi]
- [ E^2 = (mc^2)^2 + \sum_^3 (p_jc)^2 = \left( \alpha_0 mc^2 + \sum_^3 \alpha_j p_j \, c \right)^2. ]
- [ \alpha_0^2 = I, ]
- [ \alpha_i \alpha_0 + \alpha_0 \alpha_i = 0 \,, \quad i = 1,2,3, ]
- [ \quad \alpha_i \alpha_j + \alpha_j \alpha_i = 2 \delta_ \,,\quad i,j = 1, 2, 3. ]
- [\left\ = 2\delta_ \cdot I \,,\quad \mu,\nu = 0, 1, 2, 3]
These conditions cannot be satisfied if the α's are ordinary numbers, but they can be satisfied if the α's are matrices. The matrices must be Hermitian, so that the Hamiltonian is Hermitian. The smallest matrices that work are 4×4 matrices, but there is more than one possible choice, or representation, of matrices. Although the choice of representation does not affect the properties of the Dirac equation, it does affect the physical meaning of the individual components of the wavefunction.
In the introduction, we presented the representation used by Dirac. This representation can be more compactly written as
- [\alpha_0 = \begin I & 0 \\ 0 & -I \end \quad \alpha_j = \begin 0 & \sigma_j \\ \sigma_j & 0 \end ]
The Hamiltonian in this equation,
- [ H = \,\alpha_0 mc^2 + \sum_^3 \alpha_j p_j \, c ]
Nature of the wavefunction
Since the wavefunction ψ is acted on by the 4×4 Dirac matrices, it must be a four-component object. We will see, in the next section, that the wavefunction contains two sets of degrees of freedom, one associated with positive energies and the other with negative energies, with each set containing two degrees of freedom that describe the probability amplitudes for the spin to be pointing "up" or "down" along a specified direction.
We may explicitly write the wavefunction as a column matrix:
- [\psi(\mathbf,t) \equiv \begin\psi_1(\mathbf,t) \\ \psi_2(\mathbf,t) \\ \psi_3(\mathbf,t) \\ \psi_4(\mathbf,t) \end. ]
- [\psi^\dagger(\mathbf,t) \equiv \begin\psi_1^*(\mathbf,t) & \psi_2^*(\mathbf,t) & \psi_3^*(\mathbf,t) & \psi_4^*(\mathbf,t) \end ]
As in ordinary single-particle quantum mechanics, the "absolute square" of the wavefunction gives the probability density of the particle at each position x and time t. In this case, the "absolute square" is the scalar product of the wavefunction with its dual:
- [\psi^\dagger \psi \, (\mathbf,t) = \sum_^4 \psi_j^*(\mathbf,t) \psi_j(\mathbf,t). ]
- [\int \psi^\dagger \psi \, (\mathbf,t) \; d^3x = 1. ]
- [\frac \psi^\dagger \psi \, (\mathbf,t) = - \nabla \cdot \mathbf. ]
- [ J_j = c \psi^\dagger \alpha_j \psi.]
The values of the wavefunction components depend on the coordinate system. Dirac showed how ψ transforms under general changes of coordinate system, including rotations in three-dimensional space as well as Lorentz transformations between relativistic frames of reference. It turns out that ψ does not transform like a vector under rotations and is in fact a type of object known as a spinor.
Energy spectrum
It is instructive to find the energy eigenstates of the Dirac Hamiltonian. To do this, we must solve the time-independent Schrödinger equation,
- [H \psi_0 (\mathbf) = E \psi_0(\mathbf) ]
- [\psi (\mathbf, t) = \psi_0 (\mathbf) e^. ]
- [ \psi_0 = w e^} ]
- [ \begin mc^2 & 0 & pc & 0 \\ 0 & mc^2 & 0 & -pc \\ pc & 0 & -mc^2 & 0 \\ 0 & -pc & 0 & -mc^2 \end w = E w. ]
- [E_\pm (p) = \pm \sqrt.]
- [\left\pc \\ 0 \\ \epsilon \\ 0 \end \,,\, \begin0 \\ pc \\ 0 \\ - \epsilon \end \right\} \times \frac}]
- [\left\-\epsilon \\ 0 \\ pc \\ 0 \end \,,\, \begin0 \\ \epsilon \\ 0 \\ pc \end \right\} \times \frac}]
- [\epsilon \equiv |E| - mc^2.]
In the non-relativistic limit, the ε spinor component reduces to the kinetic energy of the particle, which is negligible compared to pc:
- [\epsilon \sim \frac \ll pc. ]
Hole theory
The negative E solutions found in the preceding section are problematic, for it was assumed that the particle has a positive energy. Mathematically speaking, however, there seems to be no reason for us to reject the negative-energy solutions. Since they exist, we cannot simply ignore them, for once we include the interaction between the electron and the electromagnetic field, any electron placed in a positive-energy eigenstate would decay into negative-energy eigenstates of successively lower energy by emitting excess energy in the form of photons. Real electrons obviously do not behave in this way.
To cope with this problem, Dirac introduced the hypothesis, known as hole theory, that the vacuum is the many-body quantum state in which all the negative-energy electron eigenstates are occupied. This description of the vacuum as a "sea" of electrons is called the Dirac sea. Since the Pauli exclusion principle forbids electrons from occupying the same state, any additional electron would be forced to occupy a positive-energy eigenstate, and positive-energy electrons would be forbidden from decaying into negative-energy eigenstates.
Dirac further reasoned that if the negative-energy eigenstates are incompletely filled, each unoccupied eigenstate – called a hole – would behave like a positively charged particle. The hole possesses a positive energy, since energy is required to create a particle–hole pair from the vacuum. As noted above, Dirac initially thought that the hole might be the proton, but Hermann Weyl pointed out that the hole should behave as if it had the same mass as an electron, whereas the proton is over 1800 times heavier. The hole was eventually identified as the positron, experimentally discovered by Carl Anderson in 1932.
It is not entirely satisfactory to describe the "vacuum" using an infinite sea of negative-energy electrons. We must postulate that the negative-energy electrons do not contribute to the total energy and momentum of the vacuum, which would otherwise be infinite, and that the negative-energy electrons do not produce an electric field, although they can be affected by an external field. These difficulties led physicists to abandon hole theory in favour of Dirac field theory, which bypasses the problem of negative energy states by treating positrons as true particles.
In certain applications of condensed matter physics, however, the underlying concepts of "hole theory" are valid. The sea of conduction electrons in an electrical conductor, called a Fermi sea, contains electrons with energies up to the chemical potential of the system. An unfilled state in the Fermi sea behaves like a positively-charged electron, though it is referred to as a "hole" rather than a "positron". The negative charge of the Fermi sea is balanced by the positively-charged ionic lattice of the material.
Quaternion representation
The Dirac equation can be written nicely in quaternion notation. We write it in terms of two quaternion fields representing the left-handed (Ψ) and right-handed (Φ) electrons:
- [ \partial_t\psi i + i \partial_x \psi+j \partial_y \psi + k\partial_z \psi= m_e \phi j, ]
- [ \partial_t\phi i - i \partial_x \phi-j \partial_y \phi- k\partial_z \phi = m_e \psi j. ]
Electromagnetic interaction
So far, we have considered an electron that is not in contact with any external fields. Proceeding by analogy with the Hamiltonian of a charged particle in classical electrodynamics, we can modify the Dirac Hamiltonian to include the effect of an electromagnetic field. The revised Hamiltonian is (in SI units):
- [H = \alpha_0 mc^2 + \sum_^3 \alpha_j \left[p_j - e A_j(mathbf, t) right] c + e \varphi(\mathbf, t) ]
By setting φ = 0 and working in the non-relativistic limit, Dirac solved for the top two components in the positive-energy wavefunctions (which, as discussed earlier, are the dominant components in the non-relativistic limit), obtaining
- [ \left( \frac \sum_j |p_j - e A_j(\mathbf, t)|^2 - \frac \sum_j \sigma_j B_j(\mathbf) \right) \begin\psi_1 \\ \psi_2 \end]
[= (E - mc^2) \begin\psi_1 \\ \psi_2 \end]
For several years after the discovery of the Dirac equation, most physicists believed that it also described the proton and the neutron, which are both spin-½ particles. However, beginning with the experiments of Stern and Frisch in 1933, the magnetic moments of these particles were found to disagree significantly with the predictions of the Dirac equation. The proton has a magnetic moment 2.79 times larger than predicted (with the proton mass inserted for m in the above formulas), i.e., a g-factor of 5.58. The neutron, which is electrically neutral, has a g-factor of −3.83. These "anomalous magnetic moments" were the first experimental indication that the proton and neutron are not elementary particles. They are in fact composed of smaller particles called quarks. Incidentally, quarks are spin-½ particles, which are exactly described by the Dirac equation !
Interaction Hamiltonian
It is noteworthy that the Hamiltonian can be written as the sum of two terms:
- [H = H_} + H_} \,]
- [H_} = e \varphi(\mathbf, t) - ec \sum_^3 \alpha_j A_j(\mathbf, t). ]
- [\langle H \rangle = \int \, \psi^\dagger H_} \psi \, d^3x = \int \, \left(\rho \varphi - \sum_^3 j_i A_i \right) \, d^3x ]
- [\langle H \rangle = \int \, \left( \sum_^3 \eta^ j_\mu A_\nu \right) \; d^3r]
- [\eta^ = 1, ]
- [\eta^ \;= -1 \quad\, \forall \, i=1,2,3, ]
- [\eta^ = 0 \qquad \forall \, \mu \ne \nu. ]
Relativistically covariant notation
Let us return to the Dirac equation for the free electron. It is often useful to write the equation in a relativistically covariant form, in which the derivatives with time and space are treated on the same footing.
To do this, first recall that the momentum operator p acts like a spatial derivative:
- [\mathbf \psi(\mathbf,t) = - i \hbar \nabla \psi(\mathbf,t).]
- [ \left[ ihbar c left(alpha_0 frac + sum_^3 alpha_0 alpha_j frac right) - mc^2 right] \psi = 0. ]
- [ \gamma^0 \equiv \alpha_0 \,,\quad \gamma^j \equiv \alpha_0 \alpha_j. ]
- [\left\ = 2\eta^ \cdot I\,,\quad \mu,\nu = 0, 1, 2, 3]
The Dirac equation may now be written, using the position-time four-vector x = (ct,x), as
- [\left(i\hbar c \, \sum_^3 \; \gamma^\mu \, \partial_\mu - mc^2 \right) \psi = 0.]
- [\mathcal = \int \bar\psi(i \hbar c \, \sum_\mu \gamma^\mu \partial_\mu - mc^2)\psi \, d^4 x ]
- [\bar\psi \equiv \psi^\dagger \gamma_0 ]
A notation called the "Feynman slash" is sometimes used. Writing
- [a\!\!\!/ \leftrightarrow \sum_\mu \gamma^\mu a_\mu]
- [(i \hbar c \, \partial\!\!\!/ - mc^2) \psi = 0]
- [\mathcal = \int \bar\psi(i \hbar c \, \partial \!\!\!/ - mc^2)\psi \, d^4 x. ]
- [\partial_\mu \rightarrow D_\mu = \partial_\mu - i e A_\mu. ]
Dirac bilinears
There are five different (neutral) Dirac bilinear terms not involving any derivatives:
- (S)calar: [\bar \psi] (scalar, P-even)
- (P)seudoscalar: [\bar \gamma^5 \psi] (scalar, P-odd)
- (V)ector: [\bar \gamma^\mu \psi] (vector, P-even)
- (A)xial: [\bar \gamma^\mu \gamma^5 \psi] (vector, P-odd)
- (T)ensor: [\bar \sigma^ \psi] (antisymmetric tensor)
See also
References
Selected papers
- P.A.M. Dirac, Proc. R. Soc. A117 610 (1928)
- P.A.M. Dirac, Proc. R. Soc. A126 360 (1930)
- C.D. Anderson, Phys. Rev. 43, 491 (1933)
- R. Frisch and O. Stern, Z. Phys. 85 4 (1933)
Textbooks
- Dirac, P.A.M., Principles of Quantum Mechanics, 4th edition (Clarendon, 1982)
- Shankar, R., Principles of Quantum Mechanics, 2nd edition (Plenum, 1994)
- Bjorken, J D & Drell, S, Relativistic Quantum mechanics
- Thaller, B., The Dirac Equation, Texts and Monographs in Physics (Springer, 1992)
- Schiff, L.I., Quantum Mechanics, 3rd edition (McGraw-Hill, 1955)
From Wikipedia, the Free Encyclopedia. Original article here. Support Wikipedia by contributing or donating.
All text is available under the terms of the GNU Free Documentation License See Wikipedia Copyrights for details.
