15. Macroscopic potential ($\bphi$) and field ($\E$)

https://doi.org/10.48550/arXiv.2403.13981

It is well known that the electric potential is a relative quantity and that, when its value at a point is quoted, this value is always the difference between the potential at the point and a reference potential. In theoretical work the reference point is often taken to be a notional point in the vacuum at infinity, in experimental work it may be a particular electrode, and in engineering it is common to reference potentials to the ‘ground’ or ‘earth’. The difference in meaning between the terms mean inner potential and macroscopic potential is of little relevance to this work. The MIP is the average of $\phi$ over all points in a material; therefore it is a scalar constant. The macroscopic potential is a scalar field, defined at all points in a material, but defined only to a finite precision ${\precphi}$. To within this precision its value at each point in the bulk is equal to the MIP. I mostly refer to $\bphi$ as the macroscopic potential in this section and as the MIP when discussing Bethe’s approximate expression for it in Sec. 16.

It is very important to be able to calculate changes in macroscopic potential. From a conceptual and theoretical viewpoint, our understanding of the relationship between macroscale and microscale electrostatics cannot be considered complete if we do not know how, in principle, to calculate the potential at the macroscale from the charge density or potential at the microscale. From a practical perspective, the MIP, which is believed to be positive, is a key quantity in several areas of experimental and computational science. For example, computer simulations can be used to calculate microscopic charge densities and, up to an unknown constant, microscopic potentials. To improve the designs of devices, such as batteries, fuel cells, chemical sensors, and solar cells, computational scientists simulate their constituent materials independently and, from those simulations, try to calculate the change in the average potential that an electron or ion would experience if it moved from one device component to another [Blumenthal et al., 2017; Hörmann et al., 2019]. The MIP is also used in electron microscopy to analyse and interpret electron diffraction images [Yesibolati et al., 2020]. In both of these contexts, the distinction between the MIP and the time average of the potential felt by the charge carrier is rarely made, but it is the latter that is of interest and the MIP is used as an approximation to it.

In this section I show how to calculate the change ${\Delta\bphi}$ in the macroscale electric potential $\bphi$ between the vacuum above a material’s surface and its bulk from the microscopic charge density $\rho$. The results are easy to generalize to the change in potential across an interface between two materials by treating it as a pair of adjoined surfaces. They also allow the value of $\bphi$ in an isolated material to be calculated relative to a distant point in the vacuum surrounding it.

Some of my results, such as the macroscale potential within a macroscopically-thin film whose surfaces are equally- and oppositely-charged, are well known and serve as a sanity check on my theory and reasoning. However, the main result, which I justified on symmetry grounds in Sec. 6, contradicts most of the literature on this subject over the past century. This result is that ${\bphi=0}$ in the bulk $\bulk$ of an isolated material unless it has charged surfaces or unless its bulk contains charged macroscopic heterogeneities.

It is important to note that ${\Delta\bphi}$ is the difference in potential across a surface at the macroscale. Therefore the derivation that follows is valid in the limit ${a/l\to 0}$, which is the limit in which surfaces become truly planar. Many similar derivations and calculations in existing literature do not assume this limit [Baldereschi et al., 1988; Colombo et al., 1991; Junquera et al., 2007], and this limit is not appropriate for many purposes. For example, it would not be appropriate to assume this limit when calculating variations in the average potential on the nanoscale [Junquera and Ghosez, 2003; Bernardini and Fiorentini, 1998], because it is the limit in which all such variations vanish.

15.1 Change in potential across a surface (${\Delta\bphi}$)

To calculate ${\Delta\bphi}$, I will consider the microscopic potential at an arbitrary position ${\rvec_b\in\bulk}$ deep below the surface at ${x=x_L}$. I will calculate the potential within a finite chunk of material and then take the large size limit. It is important, when doing this, to order the limits appropriately. To illustrate the possible pitfalls, consider the well-known example of the macroscopic potential $\bphi$ on the plane ${\bx=\mxb\approx (\mxl+\mxr)/2}$ from equally- and oppositely-charged surfaces at $\mxl$ and $\mxr$. If I calculate $\bphi$ for an isolated material that is finite in all directions and take the limit ${\abs{\mxl-\mxr}\to\infty}$ before I take the limit of large size in the lateral ($yz$) directions, I find that $\bphi$ vanishes. However, if I take the limit of large cross-sectional area first, I find that $\bphi$ is linear in $\bx$ and that $\E$ is constant. The appropriate order to choose for the limits depends on the aspect ratio of the material and on the position within the material at which the potential is being calculated.

I want to calculate the average potential in a bulk-like region of the material that is much closer to one surface than any other. The plane ${x=x_L}$ is parallel to this surface and in the vacuum just beyond it. Because all other surfaces are further away, it is appropriate to assume that $\abs{x-x_L}$ is much smaller than the material’s lateral dimensions. I will first calculate the microscopic potential due to the charge within a cylindrical region of the material, of radius $R$, whose axis is normal to the surface. I will use the cylindrical coordinates $\rvec=(x,s,\phi)$ or $\rvec=(x,\svec)$, where $\svec\in\realtwo$ is a vector in the plane parallel to the surface, $s=\abs{\svec}$, and $\phi$ is the azimuth.

The potential at ${\rvec_b\equiv(x_b,s_b,\phi_b)=(x_b,\svec_b)}$ due to the charge density within a cylinder bounded by the surfaces ${\abs{\svec-\svec_b}=R}$, ${x=x_L}$, and ${x=x_b}$ is

\begin{align*} \Phi_L&(x_b,\svec_b;R) \\ = & \kappa\, \int_{x_L}^{x_b} \left( \iint_{\abs{\svec-\svec_b}<R} \frac{\rho(x,\svec)}{\sqrt{(x-x_b)^2+\abs{\svec-\svec_b}^2}}\ddpow{2}{s}\right)\dd{x} \end{align*}

where ${\kappa\equiv \left(4\pi\epsilon_0\right)^{-1}}$. I will first cast this expression into a more convenient form. Then I will calculate its mesoscale average over a range of positions $(x_b,\svec_b)$ in the limit of large $R$. Finally, I will add the mesoscale average of the potential, $\Phi_r$, from charge deeper below the surface than $\rvec_b$. The right-hand boundary of the region whose charge density contributes to $\Phi_r$ is the plane ${x=x_r}$, where ${x_b<x_r<x_R}$; I use a lower case subscript for $\Phi_r$ to distinguish the plane ${x=x_r}$ from the right hand surface plane ${x=x_R}$. I will be considering the cases ${x_r=x_R}$ and ${\abs{x_r-x_b}\ll\abs{x_R-x_r}}$ separately.

The average volumetric charge density on a disc of radius $R$, which is parallel to the surface and centered at $(x,\svec)$, is

\begin{align*} \bar{\rho}(x,\svec\,;R) & \equiv \frac{1}{\pi R^2} \iint_{\abs{\uvec}<R} \rho(x,\svec+\uvec) \ddpow{2}{u} \end{align*}

Defining $\displaystyle \Delta\rho(\uvec;x,\svec,R) \equiv \rho(x,\svec+\uvec)-\bar{\rho}(x,\svec;R)$ allows $\Phi_L$ to be split into two terms:

\begin{align*} \Phi_L(x_b,\svec_b;R) = \overbrace{\frac{1}{2\epsilon_0} \int_{x_L}^{x_b} \bar{\rho}(x,\svec_b;R) \left( \int_0^R \frac{u}{\sqrt{(x-x_b)^2+u^2}}\dd{u}\right)\dd{x}}^{\displaystyle \Phi_L^{[\rhobar]}(x_b,\svec_b;R)} + \overbrace{ \vphantom{\left(\int_0^R\frac{u}{\sqrt{(x-x_b)^2+u^2}}\dd{u}\right)} \kappa \int_{x_L}^{x_b} \iint_{\abs{\uvec}<R} \frac{\Delta\rho(\uvec;x,\svec_b,R)}{\sqrt{(x-x_b)^2+\abs{\uvec}^2}}\ddpow{2}{u}\dd{x}}^{\displaystyle \Phi_L^{[\Delta\rho]}(x_b,\svec_b;R)} \tag{78} \end{align*}

and similarly for $\Phi_r$. It is easy to see that the surface excess of ${\Delta \rho(\uvec;x,\svec_b,R)}$ vanishes when averaged over the plane parallel to the surface. The planar averages of ${\Phi^{[\Delta\rho]}_L}$ and ${\Phi^{[\Delta\rho]}_r}$ also vanish in the large $R$ limit, i.e.,

\begin{align*} \lim_{\depth/R\to 0} \left\{\frac{1}{\pi R^2}\iint_{\abs{\uvec}<R} \Phi_L^{[\Delta\rho]}(x_b,\svec_b+\uvec;R)\ddpow{2}{u}\right\} & =0, \\ \lim_{\depth/R\to 0} \left\{\frac{1}{\pi R^2}\iint_{\abs{\uvec}<R} \Phi_r^{[\Delta\rho]}(x_b,\svec_b+\uvec;R)\ddpow{2}{u}\right\} & = 0, \end{align*}

where ${\depth\equiv\abs{x_L-x_b}}$ is the depth of ${\rvec_b}$ below the surface. Therefore, the only contributors to the bulk average of the microscopic potential are $\Phi^{[\rhobar]}_L$ and $\Phi_r^{[\rhobar]}$.

15.1.1 Mesoscale average of ${\Phi_L^{[\rhobar]}}$

Integrating over $u$ in the expression for ${\Phi_L^{[\rhobar]}}$, choosing ${R>\depth}$, and using a Taylor expansion gives

\begin{align*} \Phi_L^{[\rhobar]}(x_b,\svec_b;R) & = \frac{1}{2\epsilon_0} \int_{x_L}^{x_b} \bar{\rho}(x,\svec_b;R) \left( \sqrt{(x-x_b)^2+R^2}+(x-x_b)\right)\dd{x} \\ & = \frac{1}{2\epsilon_0} \int_{x_L}^{x_b} \bar{\rho}(x,\svec_b;R) (x-x_b) \dd{x} + \frac{R}{2\epsilon_0} \int_{x_L}^{x_b} \bar{\rho}(x,\svec_b;R) \left[ 1 + \frac{1}{2}\left(\frac{x-x_b}{R}\right)^2 +\order{\left(\frac{x-x_b}{R}\right)^4} \right] \dd{x} \tag{79} \end{align*}

I assume that there exists a well-defined macroscopic average of the volumetric charge density on every plane parallel to the surface. By this I mean that, although $\bar{\rho}(x,\svec_b;R)$ may exhibit microscopic fluctuations as $R$ increases, it converges to a well-defined value rather than systematically growing or shrinking. Furthermore, as $\svec_b$ is varied at fixed $R$, ${\bar{\rho}(x,\svec_b;R)}$ fluctuates microscopically about the value to which it converges in the large $R$ limit. If I also assume that the bulk of the material is charge-neutral, only the first term of the series expansion in Eq. (79) can survive the large $R$ limit. In anticipation of this limit, and with the understanding that ‘$=$’ means ‘$\approx$’ until the limit is taken, I write

\begin{align*} \Phi_L(x_b,\svec_b;R) = \frac{1}{2\epsilon_0} \bigg[ \left(R-x_b\right) &\int_{x_L}^{x_b} \bar{\rho}(x,\svec_b\,;R)\dd{x} \\ + \int_{x_L}^{x_b} \, x \, &\bar{\rho}(x,\svec_b\,;R) \dd{x} \bigg] \tag{80} \end{align*}

$\Phi_L^{[\rhobar]}(x_b,\svec_b;R)$ depends sensitively on $x_b$ and so do both of its constituent terms on the right hand side. Therefore, I will average over $x_b$. Before doing so, let us consider the average over $\svec_b$, which we should perform to calculate the three dimensional macroscopic bulk average. Because we will be taking the limit of large $R$, we can assume that at every value of ${\svec_b}$ the planar averages of ${\bar{\rho}(x,\svec_b;R)}$ and ${\Phi_L(x_b,\svec_b;R)}$ are converged, to our desired precisions, on an area much smaller than ${\pi R^2}$. Because they are now insensitive to $\svec_b$, I will drop it from their list of arguments. The mesoscale average over $x_b$ of $\int_{x_L}^{x_b}\bar{\rho}(x\,;R)\dd{x}$ is the average areal surface charge density, ${\bsigma_L(R)}$.

The mesoscale average over $x_b$ of $\int_{x_L}^{x_b}x\,\bar{\rho}(x\,;R)\dd{x}$ is $X_{\bsigma}^L \,\bsigma_L(R)$, where

\begin{align*} X_{\bsigma}^L &\equiv \frac{\expval{\int_{x_L}^{x_b} \,x\, \bar{\rho}(x;R)\, \dd{x}}_\prectheo}{\expval{\int_{x_L}^{x_b} \bar{\rho}(x;R)\, \dd{x}}_\prectheo}, \end{align*}

is the center of the microscale distribution of excess surface charge. The microscale charge density is bulk-like at mesoscopic depths and so $X_\bsigma^L$ is within ${\prectheo/2}$ of $x_L$, which means that ${X_\bsigma^L\in\mxl\equiv\interval(x_L,\prectheo)}$. When working at the macroscale, $\mxl$ and $\mxb$ are treated as having precise values. Therefore, taking the mesoscale average of both sides of Eq. (80) gives

\begin{align*} \bphi_L(R) & = \frac{1}{2\epsilon_0}\bsigma_L(R) \left(R - \abs{\mxb-\mxl}\right) \tag{81} \end{align*}

15.1.2 Mesoscale average of $\Phi_r$

By the same approach that led to Eq. (80), we can find the potential at ${(x_b,\bolds_b)}$ from charge within the cylinder bounded by the surfaces ${\abs{\svec-\svec_b}=R}$, ${x=x_b}$, and ${x=x_{r}}$, where ${x_{r}>x_b}$ is any position to the right of $x_b$ such that ${\abs{x_r-x_b}\ll R}$. This potential is

\begin{align*} \Phi_r(x_b,x_r;R) = \frac{1}{2\epsilon_0}\bigg[(R+x_b) \int_{x_b}^{x_r}&\bar{\rho}(x;R)\dd{x} \\ - \int_{x_b}^{x_r} \, x \, &\bar{\rho}(x;R) \dd{x} \bigg] \tag{82} \end{align*}

Except in the case of thin material films, if ${\abs{x_L-x_b}\ll R}$ then ${\abs{x_R-x_b}\not\ll R}$, which invalidates the derivation of Eq. (82) for $x_r=x_R$. Therefore, I will separately treat two cases. First, I will consider the case of a thin film, for which $\abs{x_L-x_R}\ll R$. I will set $x_r=x_R$, and calculate the mesoscale average of the potential in the center of the film from the charge density of the entire film. In the second case, I will assume that the surface at ${x_R}$ is far away and that its macroscopic surface charge density $\bsigma_R$ is zero. In this case I will add to $\bphi_L$ the contribution to the mesoscale average of the potential from charge density at positions ${x>x_b}$ which are still within the bulk of the material.

15.1.3 Test case I: Thin film, ${\abs{x_L-x_R}\ll R}$

Defining ${\bsigma_R(R)}$ as the average areal charge density of the surface at $\mxr$, and ${X_\bsigma^R\in\mxr}$ as the center of the microscale distribution of excess charge at the right-hand surface, allows me to express the mesoscale average of Eq. (82), when ${x_r=x_R}$, as

\begin{align*} \bphi_R(R)=\frac{1}{2\epsilon_0}\bsigma_R(R)\left(R-\abs{\mxr-\mxb}\right) \end{align*}

The total potential at $\mxb$ is the sum of $\bphi_L$ and $\bphi_R$ in the large $R$ limit. That is,

\begin{align*} \bphi(x_b)& = \lim_{R\to \infty} \bigg\{\frac{R}{\epsilon_0}\left(\frac{\bsigma_L(R)+\bsigma_R(R)}{2}\right) \\ &- \frac{1}{2\epsilon_0}\left[\sigma_L(R)\,\abs{\mxb-\mxl} + \bsigma_R(R)\abs{\mxb-\mxr}\right]\bigg\} \tag{83} \end{align*}

The term proportional to $R$ becomes infinite in this limit unless ${\bsigma_L+\bsigma_R=0}$. Therefore, I assume that the surfaces have equal and opposite average areal charge densities and I define ${\bsigma\equiv \bsigma_L=-\bsigma_R}$. Then Eq. (83) becomes

\begin{align*} \bphi(\mxb)& = \frac{\bsigma}{\epsilon_0}\left[\frac{1}{2}\left(\mxl+\mxr\right)-\mxb\right] \tag{84} \end{align*}

From Eq. (84), we can immediately calculate the change in the macroscopic potential between the charged surfaces at $\mxl$ and $\mxr$ as

\begin{align*} \Delta\bphi\equiv\bphi(\mxl)-\bphi(\mxr) = \frac{\bsigma}{\epsilon_0}\left(\mxr-\mxl\right) \end{align*}

For very large finite values of $R$ we can write this as ${\Delta\bphi = Q/(\epsilon_0 A)}$, where ${Q\equiv A\,\bsigma\left(\mxr-\mxl\right)}$ and ${A\equiv \pi R^2}$. This is the familiar formula for the magnitude of the potential difference between parallel plates carrying equal and opposite charges. Therefore an important sanity check on the theory has been passed.

15.1.4 Test case II: Macroscopic sample, $\abs{x_L-x_R}\not\ll R$

When the surface at $x_R$ is not charged and is sufficiently far away that it does not contribute to the potential at $x_b$, the total microscopic potential at $x_b$ from all charge at greater depths ($x>x_b$) can be assumed to emanate from bulk regions where the charge density is macroscopically uniform and neutral.

If the bulk of the material is charge neutral, there is no contribution to the potential in the vicinity of $x_b$ from points $x_r>x_b$ sufficiently far from it. However, the potential at $x_b$ will depend on the precise choice of $x_r$ because the integrated charge between $x_b$ and $x_r$ depends sensitively on its value. Therefore, as before, I will take the mesoscale average over $x_r$ of the microscopic potential, $\Phi_r(x_b,x_r;R)$, at $x_b$. Eq. (99) and Eq. (101), which are derived in Sec. 9, can be used in Eq. (82) to write the following expression for the mesoscale average of $\Phi_r(x_b,x_r;R)$ over $x_r$.

\begin{align*} &\overline{\Phi_r(x_b;R)} = \frac{1}{2\epsilon_0} \Bigg[ (R+x_b)\left( \int_{x_b}^{x_r}\bar{\rho}(x;R)\dd{x} - \momone{x_r} \right) \\ & \qquad - \left( \int_{x_b}^{x_r} x \, \bar{\rho}(x;R) \dd{x} - x_r\momone{x_r} - \momtwo{x_r}\right) \Bigg] \tag{85} \end{align*}

This is independent of the choice of $x_r$, and so $x_r$ may be chosen such that $\int_{x_b}^{x_r}\bar{\rho}(x;R)\dd{x} = 0$. The reason for making this choice is that it implies the following relationships.

\begin{align*} \momone{x_r} & =\momone{x_b} \\ \momtwo{x_r} & =\momtwo{x_b} \\ \int_{x_b}^{x_r}\,x\,\bar{\rho}(x;R)\dd{x} & = (x_r-x_b) \momone{x_b} \end{align*}

Using these formulae, Eq. (85) simplifies to

\begin{align*} \overline{\Phi_r(x_b;R)} = \frac{1}{2\epsilon_0}\left(\momtwo{x_b} - R\momone{x_b}\right) \end{align*}

and so the total microscopic potential at $x_b$ is

\begin{align*} \Phi&(x_b;R) = \Phi_L(x_b;R)+\overline{\Phi_r(x_b;R)} \\ & = \frac{1}{2\epsilon_0} \Bigg[ (R-x_b)\int_{x_L}^{x_b}\bar{\rho}(x_b;R)\dd{x} \\ & + \int_{x_L}^{x_b}\,x\,\bar{\rho}(x_b;R)\dd{x} + \momtwo{x_b} - R\momone{x_b} \Bigg] \tag{86} \end{align*}

It is straightforward to show that the average of $\momone{x_b}$ over $x_b$ is zero for any macroscopically-uniform charge density (see Eq. (56) of Sec. 10). Making use of Eq. (81) we can write down the mesoscale average of $\Phi(x_b;R)$, which is

\begin{align*} \bphi(R) & = \frac{1}{2\epsilon_0} \left[\bsigma_L(R) \left(R - \abs{\mxb-\mxl}\right) + \boldmtworho(R)\right] \\ & = \frac{\bsigma_L(R)}{2\epsilon_0} \left(R - \abs{\mxb-\mxl}\right) \tag{87} \end{align*}

where ${\boldmtworho(R)}$ is the mesoscale average over $x_b$ of $\momtwo{x_b}$, which is shown in Appendix J.3 to be zero. The remaining term diverges in the limit ${R\to\infty}$, which demonstrates that charged surfaces are not stable.

Now let us assume that ${\bsigma_L=0}$. Eq. (87) becomes ${\bphi=0}$, which implies that the macroscopic potential, which is the mesoscale average of the microscopic potential, is zero in the bulk of any material that is mesoscopically charge-neutral in the bulk and which does not have charged surfaces or contain any charged macroscale heterogeneities. There are two very important points to note about this result.

The first is that it contradicts the prevailing view that the MIP is finite and positive [Bethe, 1928; Yesibolati et al., 2020]. It also contradicts a view commonly expressed or implied in textbooks on electromagnetism [Jackson, 1998] and solid state physics [Ashcroft and Mermin, 1976; Kittel, 2004], namely, that it is possible for the symmetry of a crystalline microstructure to endow a material with a macroscale electric field.

The second important point is that, because $\Phi[\rho]$ is a linear functional of $\rho$, this result is the only result that could emerge from an internally-consistent theory of structure homogenization. Mathematically, the spatial average ${\Rho}$ of $\rho$ is simply the weighted sum (integral) of the infinite set of charge densities ${\{\rho_u: \rho_u(x+u)\equiv \rho(x), \forall u\in\realone \;\;\text{and}\; \forall x \in \realone\}}$. It follows from linearity that ${\bphi=\expval{\Phi[\rho]}_\prectheo = \Phi[\expval{\rho}_\prectheo] = \Phi[\Rho]}$. Therefore, if ${\Rho=0}$ everywhere, as is the case for an isolated uniform material whose surfaces are uncharged, then ${\bphi=0}$ everywhere.

15.2 Lorentz's fallacy: the macroscopic local field

As discussed in Sec. 6.5, unless there exist sources of macroscopic fields that are external to the material’s bulk (e.g., an applied field $\Eext$ or a net charge at one or more of its surfaces) the isotropy and homogeneity of its macrostructure $\Rho$, which vanishes everywhere in the bulk, preclude the existence of a non-vanishing $\E$ field. Isotropy is incompatible with the existence of a vector field. As discussed in Sec. 6.6 and Sec. 15, if the macroscopic charge density is zero in the bulk of an isolated material whose surfaces are uncharged, there is no source of macroscopic potential $\bphi$. If $\bphi=0$ throughout the bulk, $\E=0$ throughout the bulk.

Nevertheless, most textbooks posit the existence of a non-vanishing $\E$ emanating from the bulks of crystals that lack inversion symmetry at the microscale. Furthermore, it is commonly believed that the net field acting at each point $x$ in a material’s bulk is ${\E + \pp/3\epsilon_0 + \me(x)}$, where $\me$ is a microscopic field and ${\pp/3\epsilon_0}$ is another macroscopic (${\bk=0}$) contribution. The purpose of this section is to critically examine the reasoning used to infer the existence of the contribution ${\pp/3\epsilon_0}$. Almost all derivations of this term are based on a construction and line of reasoning first presented by Hendrik A. Lorentz in a series of lectures given at Columbia University in 1906, which were subsequently published in book form [Lorentz, 1916] (p137). His construction, which is illustrated in Fig. 15, is sometimes known as the Lorentz cavity. This construction has been used in many textbooks [Griffiths, 1999; Jackson, 1998; Born and Huang, 1954; Kirkwood, 1936; van Vleck, 1937; Kirkwood, 1940], but as I will now explain, both Lorentz’s original argument and all of its descendents that I am aware of are fatally flawed.

**Figure 15.** The *Lorentz cavity* [Lorentz, 1916]. See Sec. 15.2.

Lorentz set out to calculate the average force or electric field acting on a microscopic particle $A$ in the bulk of a dielectric in which there exists a macroscopic electric field $\E$. The particle could be a molecule, an atom, or an electron. He expressed the electric field at $A$ as ${\me_A = \EAfar + \EAnear}$, where $\EAfar$ is the field emanating from all material beyond a mesoscopic spherical region of radius $R$ centered at $A$ and $\EAnear$ is the field emanating from all charges within the region, except $A$ itself. As the boldface notation suggests, $\EAfar$ is calculated by treating the material as a continuum; therefore it is a macroscopic quantity. This is reasonable because the length scale on which fluctuations of the microscopic charge density occur is much smaller than the distance ($>R \sim l$) to $A$. On the other hand $\EAnear$ can be expressed as ${\EAnear = \EAnearmacro + \Delta\EAnearmicro}$, where ${\EAnearmacro}$ and ${\Delta\EAnearmicro}$ are macroscopic and microscopic contributions, respectively. It is a microscopic quantity.

In some presentations of this approach, and as illustrated in Fig. 15, these two separate contributions are imagined in different and separated material systems. To calculate $\EAfar$ a continuous material with a cavity in its bulk is imagined, with $A$ at the center of the cavity. To calculate $\EAnear$ a microscopically-varying spherical charge distribution is imagined, with $A$ at its center. This is the charge that was evacuated to form the cavity and it is frozen in the arrangement it had prior to being evacuated.

From here, different authors have derived the term ${\pp/3\epsilon_0}$ in different ways, but it tends to be thought of as arising either from the charge on the cavity’s surfaces or from the dipole moment of the material evacuated from it. It is interesting that the version of the argument that appears in the first edition of Jackson’s book [Jackson, 1962] differs substantially from the one appearing in its 1975 second edition [Jackson, 1975] and that the latter is very similar to the one in Ashcroft and Mermin’s 1976 book [Ashcroft and Mermin, 1976]. However, it is not necessary to go in detail into these differences because we have already introduced the fatal flaw in Lorentz’s reasoning and, to my knowledge, all variants of his derivation suffer from it.

Just as the charges at the surfaces of the materials depicted in Fig. 11 and Fig. 12 depend sensitively, in magnitude and sign, on how the surfaces are terminated, so too does the charge on the surface of the cavity and the dipole moment of the evacuated material. They both depend sensitively and microscopically on the cavity radius $R$ and vanish when averaged over a continuous mesoscopic range of radii. Therefore the true value of the macroscopic field at $A$ is the sum of only two contributions: the applied field and the field from charge at the material’s surfaces.

15.3 LO-TO splitting

I have argued that inversion asymmetry of a crystal’s microstructure does not endow it with a macroscopic $\E$ field. This implies that a macroscopic field is not created when the sublattices of an inversion symmetric crystal are relatively displaced. An oscillating rigid relative displacement of a crystal’s sublattices can be regarded as a ${\bk=0}$ phonon, so another way of saying that ${\E}$ vanishes is to say that a ${\bk=0}$ phonon does not have an intrinsic electric field.

However, it is well known that the frequency of a ${\bk\to 0}$ longitudinal optical (LO) phonon is increased by the electric field that is intrinsic to it, and which opposes its motion [Royo and Stengel, 2021; Pick et al., 1970; Cochran and Cowley, 1962; Born and Huang, 1954; Ashcroft and Mermin, 1976; Jones and March, 1973; Coiana et al., 2024; Lyddane et al., 1941]. Were it not for this field, the frequencies of some crystals’ LO and TO phonons would be equal, by symmetry, in the long wavelength limit (${\bk\to 0}$). The breaking of this degeneracy by the LO phonon’s intrinsic field is commonly referred to as LO-TO splitting [Born and Huang, 1954; Ashcroft and Mermin, 1976; Jones and March, 1973].

Therefore I am claiming that, in the ${\bk\to 0}$ limit, an LO phonon of wavevector $\bk$ creates an electric field of wavevector $\bk$, but that a ${\bk=0}$ phonon does not create an electric field in a crystal whose surfaces are earthed.

A ${\bk=0}$ phonon does create a uniform (i.e., ${\bk=0}$) electric field if the surfaces are not earthed, because the polarization current that flows during the rigid relative motion of sublattices changes the areal charge densities on parallel opposing surfaces at equal and opposite rates. If this charge accumulates, a field emanates from it.

15.3.1 Why finite-wavevector LO phonons create electric fields

**Figure 15.** See text of Sec. 15.3.1. The net charge within the pink-shaded interval of width $a$ is ${+q}$. In the unperturbed crystal, the net charge in every interval of width $a$ is zero.

Suppose that the crystal’s microstructure ($\rho$) is modulated along $\hat{x}$ by an LO phonon, of finite wavelength $\lambda$, propagating along $\hat{x}$. At each instant, this modulation creates regions of excess positive charge and regions of excess negative charge, which alternate along $\hat{x}$ with a wavelength of $\lambda$. This excess charge density wave, in turn, creates an electric field of the same wavelength, $\lambda$, which opposes the LO mode’s motion.

To help understand why excesses of charge are created, it is instructive to consider the perfect crystal, without any perturbation, and to calculate the excess charge, ${\bsigma(\tx)}$, on plane ${\plane(\tx)}$, which is perpendicular to ${\hat{x}}$ at ${\tx\in\bulk}$. To do so, we can treat the plane as a pair of adjoined surfaces, use Eq. (60) to calculate the excess charge on each one, and add them to give ${\bsigma(\tx)}$. Because ${\tx}$ is in the bulk, and the bulk of an unperturbed crystal is uniform, we can choose ${x_L=x_b=\tilde{x}}$. We find that the excess charge at $\tx$ is

\begin{align*} \bsigma(\tx)=\bsigma_+(\tx) +\bsigma_-(\tx) & = \frac{1}{a}\int_{-a}^{0}\rho(\tx+u)\,u\,\dd{u} \\ & - \frac{1}{a}\int_{0}^{a}\rho(\tx+u)\,u\,\dd{u} = 0, \end{align*}

where ${\bsigma_+}$ and ${\bsigma_-}$ are the areal charge densities on the ‘surfaces’ at $\tx$ whose outward normals are ${\hat{x}}$ and ${-\hat{x}}$, respectively, and their cancellation follows from the periodicity of the crystal.

Now let us consider what happens when an LO phonon breaks periodicity by modulating the structure along the $x$ axis. When this happens, ${\bsigma_+(\tx)}$ and ${\bsigma_-(\tx)}$ are no longer exactly equal in magnitude, in general, which means that ${\bsigma(\tx)}$ does not vanish. Calculating its value is more complicated than in the periodic case because the crystal is no longer uniform. Therefore it is no longer valid to regard the point $\tx$ as both defining the position of our imaginary surfaces and as points in the ‘bulk’ beneath them. However, as an illustration, let us calculate the net charge in the interval ${\interval(\tx+u,a)}$ averaged over all ${u}$ between ${-a/2}$ and ${a/2}$.

For the purpose of this illustration, let us suppose that each unit cell contains a single anion-cation pair and that the distance between the pair in interval ${[\tx-a,\tx]}$ is smaller by ${\delta x}$ than the distance between the pair in interval ${[\tx,\tx+a]}$, such that the difference between the dipole moments of these unit cells is ${\Delta d = q\delta x}$, where ${q}$ is the cation’s charge. Then, the average over ${u\in(-a/2,a/2)}$ of the net charge in interval ${\interval(\tx+u,a)}$ is ${q\delta x/ a =\Delta d/a}$.

Now suppose that we have calculated the same quantity for every pair of adjacent unit cells in an interval ${\interval(\tx,\ell)}$, where ${a\ll \ell\ll\lambda}$, and then repeated this calculation for a continuous range of values of $\tx$. Let us denote ${1/\ell}$ times the sum of all net charges in interval ${\interval(x,\ell)}$ by ${\expval{\rho}^*(x)}$ and ${1/\ell}$ times the sum of the cells’ dipole moments by ${\expval{\mpp}^*(x)}$. Then, it can be shown that ${\expval{\rho}^*(x)=-\div \expval{\mpp}^*(x)}$. The similarity of this expression to the relation ${\rho=-\div\pp}$ is not coincidental: Maxwell used a similar line of reasoning to deduce it, albeit with displacements of charges replaced by displacements of the ether.

This example illustrates that ${\bsigma(\tx)}$ does not vanish in the presence of an LO perturbation because the symmetry reason for it vanishing no longer exists. Furthermore, because a more realistic charge density $\rho$ would be a smooth function of position, ${\bsigma(\tx)}$ would be a smooth function of $\tx$, with the same periodicity $\lambda$ as the LO perturbation that created it. Therefore, there would be an excess charge density wave of periodicity $\lambda$, from which would emanate an electric field of periodicity $\lambda$.

15.3.2 Zero-wavevector LO phonons

If ${\melo(\bk,u)}$ denotes the electric field created by displacing a crystal by $u$ along the eigenvector of an LO phonon of wavevector $\bk$, my claim about the difference between the point ${\bk=0}$ in reciprocal space and the ${\bk\to 0}$ limit can be stated as follows:

\begin{align*} 0\neq \lim_{\bk\to 0} \melo(\bk,u)\neq \melo(0,u)=0. \end{align*}

In other words, the limit ${\bk\to 0}$ is a singular limit of ${\melo(\bk,u)}$.

The fact that ${\melo(\bk,u)}$ is discontinuous at ${\bk=0}$ is well known when expressed in a different way: Squared phonon frequencies are eigenvalues of a crystal’s dynamical matrix. Therefore, if ${\melo(\bk,u)}$ vanishes suddenly at ${\bk=0}$, causing LO phonon frequencies at ${\bk=0}$ to be smaller than their values in the ${\bk\to 0}$ limit, the dynamical matrix must be discontinuous at ${\bk=0}$. It is very well known that it is discontinuous, and non-analytic corrections are commonly applied to the ${\bk=0}$ dynamical matrix to calculate the ${\bk\to 0}$ dynamical matrix [Pick et al., 1970; Cochran and Cowley, 1962; Cochran, 1960; Giannozzi et al., 1991; Gonze and Lee, 1997; Baroni et al., 2001; Born and Huang, 1954; Jones and March, 1973]. The ${\bk\to 0}$ dynamical matrix is not one matrix, in general, because both it, and the LO phonon frequency, depend on the direction in reciprocal space from which the point ${\bk=0}$ is approached.

To understand why the ${\bk\to 0}$ limit is singular, it is easier to think about the LO phonon’s wavelength in the ${\lambda\to\infty}$ limit than its wavevector in the ${\bk\to 0}$ limit: Imagine a microscopic or mesoscopic neighbourhood of a point in the bulk of an arbitrarily large perfect crystal, and then imagine that the crystal is perturbed by displacing it from equilibrium along the eigenvector of an LO phonon of wavelength $\lambda$. Now imagine increasing $\lambda$.

As ${\lambda}$ becomes much larger than the size of the neighbourhood, and continues to increase, the microstructure within the neighbourhood looks more and more like it would look if the crystal’s sublattices had been displaced rigidly relative to one another. Therefore it looks more and more like a crystal that has been perturbed by displacing it along the eigenvector of a ${\bk=0}$ phonon. Nevertheless, no matter how large $\lambda$ becomes, if one moves a distance ${\lambda/2}$ in the direction of ${\bk}$, the relative displacements of the atoms in the direction of $\bk$ are reversed.

In other words, microscopically, increasing $\lambda$ brings the structure closer to the ${\bk=0}$ structure, but macroscopically it does not; and it is the macroscopic structure that determines whether or not there is a macroscopic $\E$ field.