Appendix I. Energies of pure states in their natural bases

https://doi.org/10.48550/arXiv.2403.13981

One purpose of this appendix is to derive some illuminating, and possibly useful, expressions for the expectation values of the energies of two kinds of physical systems. The first comprises multiple interacting indistinguishable particles, such as electrons, confined by a potential, such as the electrostatic potential from a set of quasi-static nuclei. The second physical system is a non-overlapping interacting pair of charge-neutral composite particles, such as nanoparticles or noble gas atoms.

A second purpose of this appendix is to use these examples to illustrate how much can be learned about the structure of a generic pure statistical state $\Psi$ of a physical system or subsystem $\subject$ by expressing the expectation values of $\subject$’s observables in basis sets that are intrinsic properties of state ${\Psi}$, rather than eigenstates of the observables’ operators.

The eigenstates of the operator $\hObs$ of an observable $\Obs$ inherit their characteristics from three sources: Namely, the observable $\Obs$, the physical system $\subject$ of which they are statistical states, and the apparatus or probe to which $\subject$ is coupled during the measurement of $\Obs$. Therefore, they are not necessarily representative of an arbitrary pure state of $\subject$. However, even if they were, there is nothing general to be learned about pure states from an expression for the expectation value ${\expval{\Obs}}$ of $\Obs$ that is derived by expanding an arbitrary pure state $\Psi$ in a basis of eigenstates of ${\hObs}$: For every observable, this procedure leads to the classical expression,

\begin{align*} \expval{\Obs}=\sum_\alpha \Pr(\Obs=\obs_\alpha)\obs_\alpha, \end{align*}

where each ${\obs_\alpha}$ is one of the possible results of measuring ${\Obs}$.

This work demonstrates by example that it is possible to gain insight into characteristics of an arbitrary pure state $\Psi$ by expressing expectation values of its observables in terms of its natural basis sets, which are basis sets whose elements are natural states. A pure state’s natural states are the eigenstates of its reduced density matrices; and their eigenvalues are their occupation numbers [Coleman, 1963; Löwdin, 1955; Davidson, 1972; Ando, 1963; McWeeny, 1960].

All of the theory that is presented or developed in this appendix is applicable to a pure state of a set of classical particles whose energy expectation value can be expressed as the expectation value of a sum of $1$-particle energies and $2$-particle interaction energies. Although, I briefly express the $1$-particle energy in the form that it would take for a set of quantum mechanical particles in an external potential, my derivations do not require it to have this form.

I.1 Pure states and mixed states

I.1.1 Pure states

A pure state of a classical or quantum mechanical system $\subject$ that comprises $N$ indistinguishable particles is a function ${\Psi\in\lebesgue(\domain^N)}$ such that

\begin{align*} \pdf(x_1\cdots x_N) = \abs{\Psi(x_1\cdots x_N)}^2 \end{align*}

is the probability distribution for the system’s microstructure, ${(x_1\cdots x_N)}$, where ${x_i\in\domain}$ specifies the coordinates of the ${i^\text{th}}$ particle; and ${\domain}$ is the set of all possible coordinates of a single particle. For example, if ${\domain=\realone^3}$, ${\pdf(x_1\cdots x_N)}$ is a position probability density function and each element of ${\domain}$ is a point in physical space and a possible location of a particle.

The pure state specified by $\Psi$ will sometimes be represented by an element ${\ket{\Psi}}$ of an abstract Hilbert space ${\hilbert_N}$, whose elements are in one to one correspondence with the elements of ${\lebesgue(\domain^N)}$. It can also be specified by an idempotent density operator,

\begin{align*} \Dop\equiv\dyad{\Psi} \implies \Dop\Dop = \ket{\Psi}\braket{\Psi}{\Psi}\bra{\Psi} =\dyad{\Psi}=\Dop. \end{align*}

Idempotency of the density operator can be regarded as the defining property of a pure state.

I.1.2 Mixed states

A mixed state is a more general kind of statistical state. A mixed state density operator is not idempotent and has the mathematical form,

\begin{align*} \Dop =\sum_\alpha \Pr(\Psi=\Psi_\alpha)\Dop_\alpha =\sum_\alpha\Pr(\Psi=\Psi_\alpha)\dyad{\Psi_\alpha}, \end{align*}

where ${\Pr(\Psi=\Psi_\alpha)}$ is the probability that the physical system is in pure state ${\Psi_\alpha}$, and

\begin{align*} \sum_\alpha \Pr(\Psi=\Psi_\alpha)=1. \end{align*}

I.1.3 Pure states define the set of all states

**Figure 20.** Schematic depiction of the set $\ddomain$ of all density operators. Its vertices are idempotent, and therefore pure states; and every interior point ${\Dop}$ is a mixed state, which can be expressed as ${\Dop=\weight_1\Dop_1+\weight_2\Dop_2+\weight_3\Dop_3}$, where ${\weight_1,\weight_2,\weight_3\in(0,1)}$ and ${\weight_1+\weight_2+\weight_3=1}$. Set $\ddomain$ is defined by the pure states, just as a triangle is defined by its vertices.

The set of all density operators, $\ddomain$, is the set of all positive Hermitian operators ${\hilbert_N\to\hilbert_N}$ with unit trace. It is a convex set [Coleman, 1963; von Neumann et al., 2018; von Neumann, 1955], which means that

\begin{align*} \weight\Dop_1+(1-\weight)\Dop_2\in\ddomain, \;\forall \Dop_1,\Dop_2\in\ddomain, \;\forall \weight\in[0,1]. \end{align*}

Convexity of ${\ddomain}$ implies that if ${\ddomain}$ is visualized as a simplex, as it is depicted in Fig. 20, an element of ${\ddomain}$ is a pure state if and only if it is at a vertex of the simplex. Therefore the set of pure state density operators is the set of extreme points of $\ddomain$; and convexity implies that the set of extreme points (vertices) defines the entire set. Therefore, as pointed out by [Coleman, 1963], we can hope to understand a great deal about an arbitrary mixed state of $\subject$ by understanding an arbitrary pure state of ${\subject}$.

I.2 Notation, definitions, and assumptions

The notation in this appendix deviates from the convention, introduced in Appendix I.2 and used throughout the rest of this work, that ${x}$ or ${x_i}$ represents a single coordinate of a particle’s position vector. In this appendix a single variable, such as $x$, $y$, $x_i$ or $y_j$ denotes all of a single particle’s coordinates; and in Appendix I.4 I will often use $i$ to denote $x_i$.

For simplicity it is assumed in Appendix I.5 that all particles are either spin-0 electrons or spin-0 nuclei, whose coordinates are simply their positions. However, no assumptions are made that are inconsistent with classical physics. Therefore the particles referred to as ‘electrons’ represent any set of indistinguishable charged light particles, and those referred to as ‘nuclei’ represent any set of oppositely-charged indistinguishable heavy particles.

In Appendix I.3 and Appendix I.4, if the particles have spins the $i^\text{th}$ particle’s coordinates will be ${i\equiv x_i\equiv(\rvecsub{i},\spin_i)}$, where $\rvecsub{i}$ is its position and $\spin_i$ is its spin; and I will often use the abbreviation ${\dmeasure{1\cdots p}\equiv\dd{x_1}\cdots\dd{x_p}}$. I do not discuss magnetization, particles’ spins, or magnetic interactions explicitly; and all of the theory presented is applicable to charged spin-$0$ particles. However, if the theory is applied to particles with spins, integrals over the coordinates of one or more particles denote a sum over all possible (sets of) spins of the integral over all possible (sets of) positions. For example,

\begin{align*} \int f(i,j)\dmeasure{i,j} &\equiv \int f(x_i,x_j)\dd{x_i}\dd{x_j} \\ &\equiv \sum_{\spin_i,\spin_j}\int f(\rvecsub{i},\spin_i,\rvecsub{j},\spin_j)\ddpow{3}{r_i}\ddpow{3}{r_j}. \end{align*}

I.2.1 Basis bras and kets

The meaning of ${\ket{x}}$ was explained in Appendix C the general case in which $x$ represents $\Ndof$ degrees of freedom, but no account was taken of the degrees of freedom of a single particle it can be thought of as the tensor product of the state vectors of those degrees of freedom.

For example, if ${\rvec\in\realone^3}$, and if $x$ momentarily represents the $x$-coordinate of $\rvec$, then ${\ket{\rvec}}$ should be thought of as

\begin{align*} \ket{\rvec}=\ket{x}\otimes\ket{y}\otimes\ket{z} \end{align*}

or as

\begin{align*} \ket{\rvec}=\ket{z}\otimes\ket{x}\otimes\ket{y} \end{align*}

or as a tensor product, in any other order, of the three states that represent the same particle’s Cartesian position coordinates. The order does not matter as long as the same order is used consistently for all particles.

Therefore, from now on, ${\ket{x}}$ is an element of ${\hilbert_1}$ that represents a possible configuration of a single particle; and just as ${\hilbert_N}$ is an abstract representation of ${\lebesgue(\domain^N)}$, ${\hilbert_1}$ is an abstract representation of ${\lebesgue(\domain)}$. Therefore if, for example, ${\domain\equiv\onetorus}$, then ${\ket{x}}$ also represents a square integrable function that is localized around ${x\in\onetorus}$. If the width $\dwidth$ of that function vanished, it would be the Dirac delta distribution. However its width is arbitrarily small, but finite.

The vector ${\ket{x_1\cdots x_p}\in\hilbert_p}$, where ${\hilbert_p}$ is an abstract representation of ${\lebesgue(\domain^p)}$, is defined as

\begin{align*} \ket{x_1\cdots x_p}\equiv \antisymmetrizer\left\{\ket{x_1}\otimes\ket{x_2}\cdots\otimes\ket{x_p}\right\}, \end{align*}

where I will be using operator $\antisymmetrizer$ in multiple vector spaces to denote the norm-preserving antisymmetriser.

Note that a tensor product of multiple states is only antisymmetrized if each state refers to a different particle. It was not needed for the definition ${\ket{\rvec}=\ket{x}\otimes\ket{y}\otimes\ket{z}}$.

Since the space $\domain$ has not been specified, it would be unnecessarily complicated to be specific and rigorous about the normalizations of states like ${\ket{x_1\cdots x_p}}$. It suffices to consider the expression,

\begin{align*} \braket{x_1\cdots x_p}{x'_1\cdots x'_p} = \delta(x_1-x'_1)\cdots \delta(x_p-x'_p), \end{align*}

where ${\delta}$ plays the same role within integrals over ${\domain}$ that is played by Dirac’s delta function, when it is in the hands of physicists. This expression is inappropriate and problematic in several ways, but it is fine for present purposes. It is implied by the equally-problematic expression ${\braket{x}{x'}=\delta(x-x')}$ and the fact that $\antisymmetrizer$ preserves normalizations.

I.2.2 Contractions

If ${\ket{f}}$ is a $p$-state and ${\ket{F}}$ is a $q$-state, then the contraction [Lounesto, 2001; Chisolm, 2012; Dorst, 2002; Vaz and da Rocha, 2016; Doran and Lasenby, 2003] of ${\ket{F}}$ by ${\ket{f}}$ is the ${(q-p)}$-state,

\begin{align*} \lcontract{f}{F} \equiv \int \bar{f}(1\cdots p) F(1\cdots p\cdots q)\ket{p+1\cdots q} \dmeasure{1\cdots q}, \end{align*}

and the contraction of ${\bra{F}}$ by ${\bra{f}}$ is the dual of $\lcontract{f}{F}$,

\begin{align*} \rcontract{F}{f} \equiv \int \bar{F}(1\cdots p\cdots q) f(1\cdots p) \bra{p+1\cdots q} \dmeasure{1\cdots q}. \end{align*}

Note that ${\ket{f\rfloor F}}$ and ${\bra{f\rfloor F}}$ can also be denoted as ${\ket{F\lfloor f}}$ and ${\bra{F\lfloor f}}$, respectively.

It will be useful to denote the unit-normalized contractions with a ‘$1$’ subscript, i.e.,

\begin{align*} \lcontractN{f}{F}\equiv \frac{\lcontract{f}{F}}{\sqrt{\braket{F\lfloor f}{f\rfloor F}}} = \rcontractN{F}{f}^\dagger, \\ \rcontractN{F}{f}\equiv \frac{\rcontract{F}{f}}{\sqrt{\braket{f\rfloor F}{F\lfloor f}}} = \lcontractN{f}{F}^\dagger, \end{align*}

where, for example,

\begin{align*} \norm{\lcontract{f}{F}}=\sqrt{\braket{F\lfloor f}{f\rfloor F}}. \end{align*}

I.3 Natural states

A natural $p$-state ${\mathcal{X}_\alpha(x_1,\cdots,x_p)}$ of an isolated system of ${N=p+q}$ identical particles in a pure state, ${\Psi(x_1,\cdots,x_{N})\in\hilbert_N}$, is an eigenstate of its $p^\text{th}$-order reduced density matrix (or simply ${p}$-matrix). That is,

\begin{align*} \int \dmatrixarg{\Psi}_p(x_1\cdots x_p; x_1'\cdots x_p') \mathcal{X}_\alpha&(x'_1,\cdots,x'_p)\dd{x'_1}\cdots\dd{x'_p}\\ &= \lambda_\alpha \mathcal{X}_\alpha(x_1,\cdots,x_p), \end{align*}

where $\lambda_\alpha$ is a nonnegative real number and

\begin{align*} \dmatrixarg{\Psi}_p(x_1\cdots x_p;& x_1'\cdots x_p') \equiv \int \Psi(x_1\cdots x_p,x_{p+1}\cdots x_N) \\ &\times\bar{\Psi}(x'_1\cdots x'_p, x_{p+1}\cdots x_N) \dd{x_{p+1}}\cdots \dd{x_N}. \end{align*}

Natural states have many nice properties. For example, if ${\{\tilde{\mathcal{X}}_\alpha\}}$ and ${\{\tilde{\mathcal{Y}}_\beta\}}$ are not sets of natural states, but are any other complete orthonormal bases of the ${p}$-particle and ${q}$-particle Hilbert spaces, respectively, then ${\Psi}$ can be expressed exactly as the double infinite sum

\begin{align*} \Psi(x_1\cdots &x_{N}) \\ &= \sum_{\alpha,\beta} \tilde{C}_{\alpha\beta} \tilde{\mathcal{X}}_\alpha(x_1\cdots x_p) \tilde{\mathcal{Y}}_\beta(x_{p+1}\cdots x_{N}), \tag{161} \end{align*}

for some set of constants ${\tilde{C}_{\alpha\beta}\in \mathbb{C}}$. However if ${\{\mathcal{X}_\alpha\}}$ and ${\{\mathcal{Y}_\beta\}}$ are the sets of natural $p$-states and $q$-states, this expression simplifies to the single infinite sum,

\begin{align*} \Psi(x_1\cdots &x_{N}) \\ &= \sum_{\alpha} C_{\alpha} \mathcal{X}_\alpha(x_1\cdots x_p) \mathcal{Y}_\alpha(x_{p+1}\cdots x_{N}), \tag{162} \end{align*}

where ${C_\alpha\in\complex}$ and ${\mathcal{X}_\alpha}$ and ${\mathcal{Y}_\alpha}$ are eigenstates of the $p$-matrix and the $q$-matrix, respectively, with the same eigenvalue, ${\lambda_\alpha\equiv \abs{C_{\alpha}}^2}$. Furthermore,

\begin{align*} C_\alpha &\mathcal{Y}_\alpha(x_{p+1}\cdots x_{N}) = \left(\bra{x_{p+1}\cdots x_N}\otimes\bra{\mathcal{X}_\alpha}\right)\ket{\Psi} \\ & \equiv \int \bar{\mathcal{X}}_\alpha(x_1\cdots x_p) \Psi(x_1\cdots x_{N}) \dd{x_1}\cdots\dd{x_p}, \tag{163} \end{align*}

which means that ${C_\alpha\mathcal{Y}_\alpha}$ is the contraction of ${\bar{\mathcal{X}}_\alpha}$ onto ${\Psi}$.

Therefore ${\mathcal{Y}_\alpha}$ resides in the Hilbert subspace that is orthogonal to $\mathcal{X}_\alpha$. Eq. (163) also means that both ${\mathcal{X}_\alpha}$ and ${\mathcal{Y}_\alpha}$ inherit from $\Psi$ its symmetry or antisymmetry with respect to exchange of positions.

I refer the reader to [Coleman, 1963] for a clear explanation of many of the nice properties of natural states. These properties suggest that natural $p$-states are the only ${p}$-particle states to which physical meaning should be attached in a system comprised of more than $p$ particles. I state only two of these properties here.

Property 1:

It can be shown (see Coleman’s Theorem 3.1) that if ${\Phi}$ is restricted to the mathematical form

\begin{align*} \Phi(x_1\cdots &x_N) =\sum_{\alpha\leq u,\, \beta\leq v} \tilde{C}_{\alpha\beta}\tilde{\mathcal{X}}_\alpha(x_1\cdots x_p)\tilde{\mathcal{Y}}_\beta(x_{p+1}\cdots x_N), \end{align*}

where ${u\leq v<\infty}$, and if ${\norm{\Psi-\Phi}^2}$ is minimized with respect to the set of coefficients ${\{\tilde{C}_{\alpha\beta}\}}$ and the sets of functions, ${\{\tilde{\mathcal{X}}_{\alpha}\}_{\alpha\leq u}}$ and ${\{\tilde{\mathcal{Y}}_{\beta}\}_{\beta\leq v}}$, the minimum is obtained by the following truncation of the sum in Eq. (162):

\begin{align*} \Phi(x_1\cdots &x_N) =\sum_{\alpha\leq u} C_{\alpha}\mathcal{X}_\alpha(x_1\cdots x_p)\mathcal{Y}_\alpha(x_{p+1}\cdots x_N), \end{align*}

where the coefficients are indexed such that ${\alpha<\beta\implies \abs{C_{\alpha}}\geq \abs{C_\beta}}$.

Property 2:

It can also be shown (see Coleman’s Theorem 3.3) that if ${p}$ is odd and ${2p<N}$, then

\begin{align*} \int \bar{\mathcal{X}}_\alpha(x_1\cdots x_p) \mathcal{Y}_\alpha(x_1\cdots x_{N}) \dd{x_1}\cdots\dd{x_p}= 0. \tag{164} \end{align*}

As mentioned in Appendix I.2, I will often use $j$ as shorthand for $x_j$. Therefore,

\begin{align*} \ket{1\cdots N} &= \ket{1\cdots p}\otimes\ket{p+1\cdots N}\\ &\equiv \ket{x_1\cdots x_p}\otimes\ket{x_{p+1}\cdots x_{N}}= \ket{x_1\cdots x_N}. \end{align*}

I.4 Natural orbitals

There is a long history of simplifying many-particle states and many-particle energetics by treating the particles as quasi-independent; and approximations based on this simplification are widely used. The purpose of this subsection is to present a rigorous theoretical justification of the concept of a quasi-independent particle state in some many-particle systems; and to provide insight into the conditions under which this concept ceases to be meaningful or justified.

It is hoped that this may lead to a better understanding of how Bloch functions, Wannier functions, and other kinds of single particle states should be interpreted; and a better understanding of the validity of the assumption that the electron densities of atoms and chemical bonds have substructures of atomic and molecular orbitals.

The justification that is presented consists of a derivation of a few closely-related exact expressions for the energy

\begin{align*} E=\expvaltwo{\Ham}{\Psi} \end{align*}

of a set of $N$ interacting indistinguishable particles with wavefunction $\Psi$. One of those expressions is

\begin{align*} E& = \sum_\alpha \occ_\alpha \energy_\alpha + \sum_{\{\alpha,\beta\}} \sqrt{\occ_\alpha \occ_\beta}\,w_{\alpha\beta}, \tag{165} \end{align*}

where the first sum is over the set ${\{\varphi_\alpha\}}$ of all of $\Psi$’s natural orbitals (natural $1$-states); the second sum is over all distinct pairs of natural orbitals; the set ${\{\occ_\alpha\}}$, which is uniquely determined by $\Psi$, has the properties ${\occ_\alpha\in[0,1]}$ and ${\sum_\alpha \occ_\alpha =N}$ that would be required of orbital occupation probabilities; ${\energy_\alpha}$ denotes ${\expvaltwo{\hamsmall}{\varphi_\alpha}}$, where ${\hamsmall}$ is a $1$-particle Hamiltonian; and

\begin{align*} w_{\alpha\beta}\equiv 2\Re\left\{\mel{\varphi_\alpha}{\what_{\alpha\beta}}{\varphi_\beta}\right\}, \end{align*}

where ${\what_{\alpha\beta}}$ can be viewed as a coupling between natural orbitals ${\varphi_\alpha}$ and ${\varphi_\beta}$ that is mediated by the ${(N-1)}$-particle states ${\ket{\Theta_\alpha}\equiv\lcontractN{\varphi_\alpha}{\Psi}}$ and ${\ket{\Theta_\beta}\equiv\lcontractN{\varphi_\beta}{\Psi}}$.

Eq. (165) is an exact expression for the energy of a set of $N$ interacting indistinguishable particles in a pure state as a weighted sum of the energies ${\{\energy_\alpha\equiv\expvaltwo{\hamsmall}{\varphi_\alpha}\}}$ of independent particles whose wavefunctions are the natural orbitals (natural $1$-states) ${\{\varphi_\alpha\}}$ of $\Psi$, plus an interaction term.

The interaction term is a weighted sum of the terms ${\left\{w_{\alpha\beta}\right\}}$, which have the appearance of pairwise couplings between orbitals. However $\what_{\alpha\beta}$ depends on the natural ${(N-1)}$-states ${\Theta_\alpha}$ and ${\Theta_\beta}$ that are the dual states of ${\varphi_\alpha}$ and ${\varphi_\beta}$, respectively. Therefore ${\mel{\varphi_\alpha}{\what_{\alpha\beta}}{\varphi_\beta}}$ is not a $2$-particle term, but an $N$-particle term, which it might be appropriate to interpret as a mediated coupling between orbitals $\varphi_\alpha$ and $\varphi_\beta$, or as a mediated interaction between two particles occupying them.

If all of the interaction energies were sufficiently small, $E$ could be interpreted as approximately a weighted sum of the energies of independent particles occupying different orbitals, where the weight given to each energy is both the probability of the corresponding orbital being occupied at a particular instant and the fraction of time for which it is occupied. Then ${\{w_{\alpha\beta}\}}$ could be interpreted as the set of energies of the interactions responsible for moving particles between orbitals.

Therefore, when interaction energies are small, Eq. (165) is consistent with the physical picture of each particle occupying an orbital for a long period until, eventually, its weak or rare interactions with other particles move it to a different orbital. On the other hand, the fact that ${w_{\alpha\beta}}$ is an $N$-particle energy means that when interaction energies are large, $E$ is not approximately a sum of single particle energies. In that case, Eq. (165) is consistent with the residence times of particles in orbitals being too short for the concept of orbital occupation to be valid.

The derivation of Eq. (165) makes use of properties possessed only by the set natural orbitals. Therefore it strengthens the case for natural orbitals being the most ‘physical’ $1$-particle states in a many particle system, and suggests that comparisons with natural orbitals might shed light on how other sets $1$-particle states should be interpreted.

I.4.1 Theoretical setup

Let us begin with the exact expression

\begin{align*} \Psi(1\cdots N) = \sum_\alpha c_\alpha \varphi_\alpha(1)\Theta_\alpha(2\cdots N), \tag{166} \end{align*}

where ${\{\varphi_\alpha\}}$ and ${\{\Theta_\alpha\}}$ are the sets of natural orbitals and natural ${(N-1)}$-states, respectively, and the functions in each set are mutually orthogonal and have been chosen to be normalized to one, i.e., ${\braket{\varphi_\alpha}{\varphi_\beta}=\delta_{\alpha\beta}}$ and ${\braket{\Theta_\alpha}{\Theta_\beta}=\delta_{\alpha\beta}}$. Let us also choose ${\Psi}$ to be normalized to one, which implies that ${\sum_\alpha\lambda_\alpha=1}$, where ${\lambda_\alpha\equiv\abs{c_\alpha}^2}$.

We will make use of the function

\begin{align*} \densityn_\alpha(1)\equiv (N-1) \int\abs{\Theta_\alpha(1\cdots N-1)}^2 \dmeasure{2\cdots N-1}, \end{align*}

which would be the number density of ${N-1}$ particles whose state was the ${\alpha^\text{th}}$ natural ${(N-1)}$-state, $\Theta_\alpha$.

The $N$-particle Hamiltonian is the following sum of an independent-particle operator, $\Hamone$, and an interaction operator, ${\Hamtwo}$:

\begin{align*} \Ham \equiv \overbrace{\sum_{i=1}^N\hamone(i)}^{\displaystyle \Hamone} + \overbrace{\sum_{i=1}^{N}\sum_{j=i+1}^N\hamtwo(i,j)}^{\displaystyle \Hamtwo}. \tag{167} \end{align*}

Both $\Hamone$ and $\Hamtwo$ operate on $N$-particle states, but are sums of $1$-particle operators and $2$-particle operators, respectively. The independent-particle Hamiltonian, $\Hamone$, is a sum over $i$ of the single-particle Hamiltonian, ${\hamsmall(i)}$, which operates on the coordinates of the ${i^\text{th}}$ particle.

The interaction term has the form ${\Hamtwo=\sum_{i,j>i}\hamtwo(i,j)}$, where ${\hamsmalltwo(i,j)}$ is the interaction between particles with coordinates $i$ and $j$. I will use ${\Hamtwo}$ more generally to denote the interaction operator of a system with $M$ particles, where $M$ is the number of particles of the state on which ${\Hamtwo}$ acts.

Eq. (165) and the other expressions for ${\expvaltwo{\Ham}{\Psi}}$ that will be derived in this section (Appendix I.4), are quite general. They are valid for any set of indistinguishable classical or quantum mehcanical particles and any operator $\Ham$ that can be expressed as a sum of $1$-particle and $2$-particle terms. Nevertheless, I will sometimes refer to the particles as electrons and I will assume that the $1$-particle operator has the form, ${\hamsmall(i)= \kinetic(i) + \vextop(i)}$, where $\kinetic$ is the $1$-particle kinetic energy operator; and $\vextop$ is the operator for the energy of a single particle in an external potential.

I.4.2 Single particle energy

Eq. (166) can be used to express the expectation value of the $1$-particle energy as

\begin{align*} E_1\equiv\expvaltwo{\Hamone}{\Psi} & = \sum_{\alpha,\beta} \bar{c}_\alpha c_\beta \int \bar{\varphi}_\alpha(1)\bar{\Theta}_\alpha(2\cdots {N}) \\ &\times \left(\sum_i \hat{h}(i)\right) \varphi_\beta(1)\Theta_\beta(2\cdots {N}) \dmeasure{1\cdots N}. \end{align*}

Using the orthonormality of the natural $(N-1)$-states and the antisymmetry of $\Psi$, this can be simplified to

\begin{align*} E_1= \sum_\alpha \occ_\alpha\left(t_\alpha + \vext_\alpha\right) = \sum_\alpha\occ_\alpha\energy_\alpha, \tag{168} \end{align*}

where ${\occ_\alpha \equiv N\lambda_\alpha \equiv N\abs{c_\alpha}^2}$; ${\energy_\alpha\equiv t_\alpha+\vext_\alpha}$; and I have introduced the $1$-particle energy expectation values, ${t_\alpha \equiv \expvaltwo{\,\hat{t}\,}{\varphi_\alpha}}$ and

\begin{align*} \vext_\alpha &\equiv \expvaltwo{\vextop}{\varphi_\alpha} = \int \vext(x) n_\alpha(x)\dd{x}, \end{align*}

where ${n_\alpha(x)\equiv\abs{\varphi_\alpha(x)}^2}$; ${\vextop_\alpha\equiv \int \vext(1)\dyad{1}\dmeasure{1}}$; and ${\vext(1)=\vext(x_1)}$ is the external potential felt by a single particle whose coordinates are $x_1$.

I.4.3 Interaction energy - Expression 1

The interaction energy is

\begin{align*} \Energy &\equiv\expvaltwo{\Hamtwo}{\Psi} \\ & = \int\Psi^*(1\cdots N)\left(\sum_{i,j>i}\hamtwo(i,j)\right)\Psi(1\cdots N)\dmeasure{1\cdots N}; \end{align*}

and if ${\Psi}$ is expressed as in Eq. (166), it becomes

\begin{align*} \Energy \equiv\sum_{\alpha\beta}&\bar{c}_\alpha c_\beta \int\bar{\varphi}_\alpha(1)\bar{\Theta}_\alpha(2\cdots N) \\ \times&\left(\sum_{i,j>i}\hamtwo(i,j)\right)\varphi_\beta(1)\Theta_\beta(2\cdots N)\dmeasure{1\cdots N}; \tag{169} \end{align*}

The exchange symmetries of ${\Psi^*\Psi}$ and ${\bar{\Theta}_\alpha\Theta_\beta}$ allow the sum of interactions ${\hamtwo(i,j)}$ in parentheses to be replaced by any of the following three expressions:

\begin{align*} \sum_{i,j>i} \hamtwo(i,j) &= \sum_{j>1}\hamtwo(1,j) + \sum_{i>1,j>i}\hamtwo(i,j) \\ &= (N-1)\hamtwo(1,2) + \sum_{i>1,j>i}\hamtwo(i,j) \tag{170} \\ &= N\sum_{j>1}\hamtwo(1,j) = N(N-1)\hamtwo(1,2). \tag{171} \end{align*}

Eq. (170) will be used in Appendix I.4.4. We will use Eq. (171) in this subsection. Let us define the $1$-particle operator,

\begin{align*} \what_{\alpha\beta}(1) &\equiv \int\bar{\Theta}_\alpha(2\cdots N)\left(\sum_{j>1}\hamtwo(1,j)\right)\Theta_\beta(2\cdots N)\dmeasure{2\cdots N} \\ &= (N-1)\int\bar{\Theta}_\alpha(2\cdots N)\hamtwo(1,2)\Theta_\beta(2\cdots N)\dmeasure{2\cdots N}. \end{align*}

This allows Eq. (169) to be expressed as

\begin{align*} W &=N\sum_{\alpha,\beta} \bar{c}_\alpha c_\beta\mel{\varphi_\alpha}{\what_{\alpha\beta}}{\varphi_\beta}. \tag{172} \end{align*}

Now let us denote the argument of ${c_\alpha}$ by $\vartheta_\alpha$, i.e., ${c_\alpha=\abs{c_\alpha}e^{i\vartheta_\alpha}}$. Then we could define

\begin{align*} w_{\alpha\beta}&\equiv 2\Re\left\{e^{i(\vartheta_\beta-\vartheta_\alpha)}\mel{\varphi_\alpha}{\what_{\alpha\beta}}{\varphi_\beta}\right\}, \tag{173} \end{align*}

and this definition would lead to the expressions presented in Appendix I.4.5. However, because the magnitudes of the coupling energies ${\{w_{\alpha\beta}\}}$ determine whether or not orbital occupation is a valid concept, let us choose a path by which it can be expressed in a simpler mathematical form. Without losing generality, let us choose the coefficients ${\{c_\alpha\}}$ to be real. This is possible because we can express ${c_\alpha\varphi_\alpha}$ as

\begin{align*} c_\alpha\varphi_\alpha&=\abs{c_\alpha}e^{i\vartheta_\alpha}\varphi_\alpha = \abs{c_\alpha}\left(e^{i\vartheta_\alpha}\varphi_\alpha\right) =c_\alpha \left(e^{i\vartheta_\alpha}\varphi_\alpha\right) \end{align*}

or, if we want $c_\alpha$ to be negative, as

\begin{align*}c_\alpha\varphi_\alpha&=-\abs{c_\alpha}\left(e^{i(\vartheta_\alpha+\pi)}\varphi_\alpha\right) = c_\alpha\left(e^{i(\vartheta_\alpha+\pi)}\varphi_\alpha\right). \end{align*}

Therefore the phase factors of the coefficients ${\{c_\alpha\}}$ can be merged into the natural orbitals. Having done so, we can define the coupling energy between ${\varphi_\alpha}$ and ${\varphi_\beta}$ as

\begin{align*} w_{\alpha\beta}&\equiv 2\Re\left\{\mel{\varphi_\alpha}{\what_{\alpha\beta}}{\varphi_\beta}\right\}. \end{align*}

Now let us denote ${\frac{1}{2}w_{\alpha\alpha}=\frac{1}{2}\expvaltwo{\what_{\alpha\alpha}}{\varphi_\alpha}}$ as $\vmf_\alpha$ and express it as

\begin{align*} \vmf_\alpha &= (N-1)\int \abs{\varphi_\alpha(1)}^2\hamtwo(1,2)\abs{\Theta_\alpha(2\cdots N)}^2\dmeasure{1\cdots N} \\ &= \int\int n_\alpha(x)\hamtwo(x,x')\densityn_\alpha(x')\dd{x}\dd{x'}. \tag{174} \end{align*}

This expression makes it clear that ${\vmf_\alpha}$ is the mean field interaction between an electron in orbital ${\varphi_\alpha}$ and ${N-1}$ electrons in $\varphi_\alpha$’s dual ${(N-1)}$-state, $\Theta_\alpha$.

Now $W$ can be expressed as

\begin{align*} W&=\sum_\alpha\left(\occ_\alpha \vmf_{\alpha} + \frac{1}{2}\sum_{\beta\neq\alpha} \sqrt{\occ_\alpha\occ_\beta}\,w_{\alpha\beta}\right) \\ &=\sum_\alpha\occ_\alpha\vmf_\alpha + \sum_{\alpha,\beta>\alpha}\sqrt{\occ_\alpha\occ_\beta}\,w_{\alpha\beta} \tag{175} \\ &=\sum_{\alpha}\sum_{\beta\geq\alpha}\sqrt{\occ_\alpha\occ_\beta} \,w_{\alpha\beta} =N\sum_{\alpha}\sum_{\beta\geq\alpha}c_\alpha c_\beta \,w_{\alpha\beta}. \end{align*}

If the coefficients ${\{c_\alpha\}}$ are real, expressing ${N c_\alpha c_\beta}$ as ${\sqrt{\occ_\alpha\occ_\beta}}$ implies that they are positive as well as real. However, none of these expressions, except the final one in which ${\sqrt{\occ_\alpha\occ_\beta}}$ has been replaced by ${ N c_\alpha c_\beta}$ rather than by ${N \bar{c}_\alpha c_\beta}$, would be different if the coefficients ${\{c_\alpha\}}$ were complex and ${w_{\alpha\beta}}$ had been defined as in Eq. (173).

Note that if the phase factors of the natural ${(N-1)}$-states were independent of particles’ positions and spins, they could be merged into the natural orbitals. Then ${\what_{\alpha\beta}}$ could be expressed as

\begin{align*} \what_{\alpha\beta}(1) &= (N-1)\int\sqrt{\pdfarg{\Theta}_\alpha(2\cdots N)\pdfarg{\Theta}_\beta(2\cdots N)} \hamtwo(1,2) \dmeasure{2\cdots N}, \tag{176} \end{align*}

where

\begin{align*} \pdfarg{\Theta}_\alpha(x_2\cdots x_N)\equiv\abs{\Theta_\alpha(x_2\cdots x_N)}^2 \end{align*}

is the probability density that a set of ${(N-1)}$ particles whose wavefunction is ${\Theta_\alpha}$ have configuration ${(x_2\cdots x_N)}$. The reason to express ${\what_{\alpha\beta}}$ in this form is to show that the integrand on the right hand side of Eq. (176) is the product of ${\hamtwo(x_1,x_2)}$ and the geometric mean of ${\pdenarg{\Theta}_\alpha(x_2\cdots x_N)}$ and ${\pdenarg{\Theta}_\beta(x_2\cdots x_N)}$.

I.4.4 Interaction energy - Expression 2

Another expression for $W$ can be found by inserting Eq. (170) into Eq. (169). Then the definition of ${\what_{\alpha\beta}}$ can be used to simplify the first term, and ${\braket{\varphi_\alpha}{\varphi_\beta}=\delta_{\alpha\beta}}$ can be used to simplify the second term, to give

\begin{align*} \Energy &= \sum_{\alpha\beta}\bar{c}_\alpha c_\beta\mel{\varphi_\alpha}{\what_{\alpha\beta}}{\varphi_\beta} \\ &+\sum_\alpha \lambda_\alpha \int\bar{\Theta}_\alpha(2\cdots N)\left(\sum_{i>1,j>i}\hamtwo(i,j)\right)\Theta_\alpha(2\cdots N)\dmeasure{2\cdots N} \\ &= \sum_{\alpha\beta}\bar{c}_\alpha c_\beta \mel{\varphi_\alpha}{\what_{\alpha\beta}}{\varphi_\beta} + \sum_\alpha \lambda_\alpha \underbrace{\expvaltwo{\Hamtwo}{\Theta_\alpha}}_{\displaystyle \Walpha} \end{align*}

Note that the first term on the right hand side only differs from right hand side of Eq. (172) by a factor of $N$. Therefore we can replace it with the right hand side of Eq. (175) divided by $N$, i.e.,

\begin{align*} W&= \sum_\alpha \lambda_\alpha\left(\vmf_\alpha + \Walpha\right)+ \frac{1}{2}\sum_{\alpha,\beta\neq\alpha}\sqrt{\lambda_\alpha\lambda_\beta} w_{\alpha\beta} \tag{177} \\ &= \sum_\alpha\left(\lambda_\alpha\Walpha + \sum_{\beta\geq\alpha}\sqrt{\lambda_\alpha\lambda_\beta}w_{\alpha\beta}\right). \tag{178} \end{align*}

I.4.5 Total energy

By combining Eq. (168), Eq. (174) and Eq. (175), the total energy can be expressed as

\begin{align*} E &=\sum_\alpha \occ_\alpha \left( \energy_{\alpha} +\vmf_\alpha\right) + \sum_\alpha \sum_{\beta>\alpha}\sqrt{\occ_\alpha\occ_\beta} \, w_{\alpha\beta} \tag{179} \\ &=\sum_\alpha \occ_\alpha \energy_{\alpha} + \sum_\alpha \sum_{\beta\geq\alpha}\sqrt{\occ_\alpha\occ_\beta} \, w_{\alpha\beta}. \tag{180} \end{align*}

Now let us define ${\hamsmall_\alpha\equiv \hamsmall + \vmfop}$, where

\begin{align*} \vmfop(x)\equiv \int \what(x,x') \densityn_\alpha(x')\dd{x'}. \end{align*}

This allows us to write

\begin{align*} \expvaltwo{\hamsmall_\alpha}{\varphi_\alpha} = \expvaltwo{\hamsmall}{\varphi_\alpha}+\expvaltwo{\vmfop}{\varphi_\alpha}= \energy_\alpha + \vmf_\alpha. \end{align*}

Therefore, the following exact expressions are equivalent:

\begin{align*} E &= \sum_\alpha\occ_\alpha\expvaltwo{\hamsmall_\alpha}{\varphi_\alpha} +\sum_{\alpha}\sum_{\beta>\alpha}\sqrt{\occ_\alpha\occ_\beta}\mel{\varphi_\alpha}{\what_{\alpha\beta}}{\varphi_\beta} \\ &= \sum_\alpha\occ_\alpha\expvaltwo{\hamsmall}{\varphi_\alpha} +\sum_{\alpha}\sum_{\beta\geq\alpha}\sqrt{\occ_\alpha\occ_\beta}\mel{\varphi_\alpha}{\what_{\alpha\beta}}{\varphi_\beta}. \end{align*}

If Eq. (177) and Eq. (178) are used instead of Eq. (175), the total energy can be expressed as

\begin{align*} E = \sum_\alpha \occ_\alpha\bigg[\energy_\alpha + \frac{1}{N}\big(\vmf_\alpha &+\Walpha \big)\bigg] \\ &+ \sum_{\alpha,\beta>\alpha}\sqrt{\lambda_\alpha\lambda_\beta} w_{\alpha\beta}, \tag{181} \end{align*}

where ${\Walpha}$ is the total interaction energy of ${N-1}$ particles whose state is ${\Theta_\alpha}$. The first part of this expression is an occupation-weighted sum of the natural orbital energy $\energy_\alpha$ plus one electron’s share of (i.e., ${\frac{1}{N}\times}$) the mean field interaction between an electron in orbital $\varphi_\alpha$ and the ${N-1}$ remaining electrons in state ${\Theta_\alpha}$ plus one electron’s share of the energy ${\Walpha}$ of interaction between the electrons in state ${\Theta_\alpha}$. The second part of the expression could also be expressed as

\begin{align*} \sum_{\alpha,\beta>\alpha}\sqrt{\lambda_\alpha\lambda_\beta} w_{\alpha\beta} = \frac{1}{N}\sum_{\alpha,\beta>\alpha}\sqrt{\occ_\alpha\occ_\beta} w_{\alpha\beta}. \end{align*}

It can be interpreted as a correlation term to correct the sum ${\sum_\alpha \lambda_\alpha \vmf_\alpha}$ of mean field interactions between the electron in orbital $\varphi_\alpha$ and the electrons in state ${\Theta_\alpha}$.

I.4.6 Hartree-Fock approximation

It can be shown [Coleman, 1963] that

\begin{align*} \lambda_\alpha\leq \frac{1}{N}\implies \occ_\alpha \leq 1, \end{align*}

with equality if and only if $\Psi$ has the form,

\begin{align*} \Psi(1\cdots {N}) &= \antisymmetrizer \left\{\varphi_\alpha(1) \Theta_\alpha(2\cdots {N})\right\} \\ &= \frac{1}{\sqrt{N}}\bigg[\varphi_\alpha(1)\Theta_\alpha(2\cdots i-1, i, i+1 \cdots N) \\ &\quad-\sum_{i=2}^N \varphi_\alpha(i)\Theta_\alpha(2\cdots i-1, 1, i+1 \cdots N)\bigg]. \end{align*}

The Hartree-Fock wavefunction is the simplest wavefunction with this form, as it can be expressed as

\begin{align*} \PsiHF(1\cdots N)=\antisymmetrizer\{\varphi_{\alpha_1}(1)\cdots\varphi_{\alpha_N}(N)\}. \end{align*}

Therefore, in the Hartree-Fock approximation, ${\varphi_\alpha}$’s dual ${(N-1)}$-state is

\begin{align*} \Theta_\alpha(1\cdots N-1) &= \antisymmetrizer\bigg\{\varphi_1(1)\cdots \breve{\varphi}_\alpha(\alpha)\cdots \varphi_{N}(N-1)\bigg\}, \end{align*}

where ${\breve{\varphi}_\alpha(\alpha)}$ denotes the absence of ${\varphi_\alpha(\alpha)}$ in the product, e.g., ${\varphi_1\breve{\varphi}_2\varphi_3=\varphi_1\varphi_3}$.

It follows that ${\what_{\alpha\beta}}$ is

\begin{align*} \what_{\alpha\beta}(x) &= \sum_{\eta\notin\{\alpha,\beta\}} \int \what(x,x')\abs{\varphi_\eta(x')}^2\dd{x'} \\ &= \int\what(x,x')\left(\sum_{\eta\notin\{\alpha,\beta\}}n_\eta(x')\right)\dd{x'} \\ &= \int\what(x,x')\left[n(x')-n_\alpha(x')-n_\beta(x')\right]\dd{x'}, \end{align*}

where

\begin{align*} n(x)\equiv\sum_{\alpha=1}^N\abs{\varphi_\alpha(x)}^2=\sum_{\alpha=1}^Nn_\alpha(x) \end{align*}

is the number density of $\Psi$.

This demonstrates that, within the Hartree-Fock approximation, ${\what_{\alpha\beta}(x)}$ is the mean field potential at $x$ from the density ${n-n_\alpha-n_\beta}$, which is the number density of $\Psi$ minus the contribution to it from orbitals $\varphi_\alpha$ and ${\varphi_\beta}$. In other words, the coupling between state ${\ket{\varphi_\alpha}}$ and ${\ket{\varphi_\beta}}$ is mediated by a mean field potential, which does not include a self interaction.

I.4.7 Summary

The theory presented in Appendix I.4 may be important in many contexts, but it has been developed and presented with chemical bonding in mind, where the term chemical is intended to mean that the attraction between the bonded atoms occurs due to a substantial redistribution of the atoms’ electron densities.

The next section is concerned with attractions between atoms, surfaces, nanoparticles, or other objects that can occur without substantial redistribution of the objects’ electron densities, because it is the attraction due to dynamical correlation of the objects’ constituent particles.

I.5 Non-overlapping bodies

This section presents one way to understand the forces and torques exerted by two unmagnetized charge-neutral bodies on one another when they do not overlap spatially and do not exchange particles. The bodies could be atoms, molecules, nanoparticles, or any other objects composed of more than one charged particle. Therefore they will be referred to as C-particles, where ‘C’ abbreviates composite, and they will be identified individually as CP1 and CP2. The isolated system comprised only of CP1 and CP2 will be referred to as CP1+CP2.

For simplicity it will be assumed that each C-particle is composed of only two species of more elementary particle; namely, nuclei of atomic number $Z$ and electrons.

The only approximation made in this section is the neglect of overlap of the CP’s wavefunctions.

**Figure 21.** A pair of charge-neutral composite particles polarize to lower their potential energy.

I.5.1 Macroscopic charge distributions and their ensembles

Since CP1 and CP2 are not charged, the forces and torques they exert on one another arise from non-uniformities of their charge distributions. For example, the C-particles depicted in Fig. 21 attract one another because they have polarized such that each one has a dipole moment that is directed from left to right. They would also attract one another if they both had opposite (left to right) linear polarizations, or for any number of other more complex charge distributions, ${\pden=\pdenup{1}+\pdenup{2}}$, of CP1+CP2.

In general, non-uniformities can be either static or dynamic. Static non-uniformities are non-uniformities of their time-averaged charge distributions, and dynamic non-uniformities are transient and arise from interaction-biased quasi-random fluctuations of the CP’s microstructures. The focus of this section is on dynamic uniformities, and I will refer to the energy and force of interaction between the CPs arising from the dynamic uniformities as the correlation energy and force, despite the fact that there also exist intra-CP correlations.

If the fluctuations of the microstructures of CP1 and CP2 were independent of one another, the probabilities of the net correlation force being repulsive and attractive at a given instant would be equal, and the time average of the net correlation force would vanish.

However, their charge distributions do not change independently of one another, and we will see that the energy of CP1+CP2 can be expressed as

\begin{align*} E= E_1+E_2+\Eint, \end{align*}

where ${\Eint}$ is the energy of interaction between them.

I.5.2 Notation

The variables $x_i$ and $y_j$ specify the cooordinates of the $i^\text{th}$ constituent particle of CP1 and the ${j^\text{th}}$ constituent particle of CP2, respectively, and $\interact{x_i}{y_j}$ denotes the Coulomb repulsion between particles with coordinates $x_i$ and $y_j$ if their charges are both either $e$ or $-e$.

The vector ${X\equiv (x_1\cdots x_p)}$ specifies the coordinates of all $p$ constituent particles of CP1 and ${Y\equiv (y_1\cdots y_q)}$ specifies the coordinates of all $q$ constituent particles of CP2.

Integrals will continue to incorporate sums over spin configurations and the abbreviations ${\dmeasureA{i_1\cdots i_m}\equiv\dd{x_{i_1}}\cdots\dd{x_{i_m}}}$ and ${\dmeasureB{i_1\cdots i_m}\equiv\dd{y_{j_1}}\cdots\dd{y_{j_n}}}$ will be used. For example,

\begin{align*} \int f(x_1,x_2)\dmeasureA{1,2} \equiv \int f(x_1,x_2)\dd{x_{1}}\dd{x_{2}}. \end{align*}

I.5.3 Wavefunction

The wavefunction of CP1+CP2 is

\begin{align*} \Psi(s_1\cdots s_N)=\sum_\alpha C_\alpha\wx_\alpha(s_1\cdots s_p)\wy_\alpha(s_{p+1}\cdots s_N), \end{align*}

where ${\{s_1\cdots s_N\}\equiv\{x_1,x_2\cdots x_p,y_1\cdots y_q\}}$; ${\{\wx_\alpha\}}$ and ${\{\wy_\alpha\}}$ are the sets of all natural $p$-states and $q$-states, respectively; and the dual of $p$-state $\wx_\alpha$ is the $q$-state $\wy_\alpha$.

Although $\Psi$ is antisymmetric, the C-particles do not overlap significantly. Furthermore, as the distance between them increases, the rates at which particles move between them decrease, while the characteristic time and length scales of fluctuations of their charge distributions that are capable of producing significant relative forces and torques increase. Therefore let us make the physical assumption that the particles are at a separation $r$ for which there exists a time scale $\tau$ such that the average frequency with which particles travel between them is much smaller than ${1/\tau}$, and the time scale of the charge redistribution processes responsible for their relative forces and torques is much smaller than $\tau$.

Under this assumption, and since the degree of overlap between CP1 and CP2 is negligible, it does not change the energy or the expectation value of any observable if $\Psi$ is chosen to not have the correct (anti-)symmetry with respect to exchange of coordinates between CP1 and CP2. Therefore $\Psi$ can be expressed as (see Appendix I.3 or [Coleman, 1963])

\begin{align*} \Psi(X,Y) & = \sum_{\alpha} C_\alpha \wx_\alpha(X)\wy_\alpha(Y), \tag{182} \end{align*}

where each $\wx_\alpha$ is an eigenfunction of the integral operator with kernel

\begin{align*} \dmatrixarg{\wx}_p(X;X') &\equiv \int \Psi(X,Y) \Psi^*(X',Y) \dmeasureB{1\cdots q} \end{align*}

and each $\wy_\alpha$ is an eigenfunction of the integral operator with kernel

\begin{align*} \dmatrixarg{\wy}_q(Y;Y') &\equiv \int \Psi(X,Y) \Psi^*(X,Y') \dmeasureA{1\cdots p}. \end{align*}

That is,

\begin{align*} \int \dmatrixarg{\wx}_p(X;X')\wx_\alpha(X')\dmeasureA{1\cdots p} &= \lambda_\alpha \wx_\alpha(X), \\ \int \dmatrixarg{\wy}_q(Y;Y')\wy_\alpha(Y')\dmeasureB{1\cdots q} &= \lambda_\alpha \wy_\alpha(Y), \end{align*}

where ${\lambda_\alpha =\abs{C_\alpha}^2}$.

The sets ${\{\wx_\alpha(X)\}}$ and ${\{\wy_\alpha(Y)\}}$ are orthonormal, meaning that ${\braket{\wx_\alpha}{\wx_\beta}=\delta_{\alpha\beta}}$ and ${\braket{\wy_\alpha}{\wy_\beta}=\delta_{\alpha\beta}}$, and their elements have the appropriate symmetry with respect to interchange of any two identical particles on the same C-particle. For example if $x_i$ and $x_j$ are the coordinates of electrons on CP1, then

\begin{align*} \wx_\alpha(x_1\cdots x_i \cdots x_j \cdots x_p)= -\wx_\alpha( x_1\cdots x_j \cdots x_i \cdots x_p). \end{align*}

I.5.4 Energy

The Hamiltonian of CP1+CP2 can be expressed as ${\Ham = \Ham_1 + \Ham_2 + \Hamint}$, where $\Ham_1$ and $\Ham_2$ are the Hamiltonians of CP1 and CP2, respectively, and $\Hamint$ is the interaction between them. Using the notation

\begin{align*} &\mel{\wx_\alpha\wy_\alpha}{\Ham}{\wx_\beta\wy_\beta} \\ &\qquad\quad\equiv \int\int \bar{\wx}_\alpha( X)\bar{\wy}_\alpha( Y)\,\Ham\, \wx_\beta(X)\wy_\beta(Y) \dmeasureA{1\cdots p}\dmeasureB{1\cdots q}, \end{align*}

the energy of CP1+CP2 can be expressed as

\begin{align*} E &= \expvaltwo{\Ham}{\Psi} = \sum_{\alpha\beta} \bar{C}_\alpha C_\beta \mel{\wx_\alpha\wy_\alpha}{\Ham}{\wx_\beta\wy_\beta} \\ &= \sum_\alpha\lambda_\alpha\left(\CPenergy{1}_\alpha+\CPenergy{2}_\alpha\right) + \expvaltwo{\Hamint}{\Psi}, \end{align*}

where ${\Eint\equiv\expvaltwo{\Hamint}{\Psi}}$ is the energy of interaction between CP1 and CP2; and ${\CPenergy{1}_\alpha\equiv\expvaltwo{\Ham_1}{\wx_\alpha}}$ and ${\CPenergy{2}_\alpha\equiv\expvaltwo{\Ham_2}{\wy_\alpha}}$ are the expectation values of $\Ham_1$ and ${\Ham_2}$, respectively, when the state $\ket{\wx_\alpha}$ of CP1 is the $\alpha^\text{th}$ natural $p$-state of $\ket{\Psi}$ and the state ${\ket{\wy_\alpha}}$ of CP2 is the natural ${q}$-state of ${\ket{\Psi}}$ that is ${\ket{\wx_\alpha}}$’s dual state, i.e.,

\begin{align*} \lcontractN{\wx_\alpha}{\Psi}&=\ket{\wy_\alpha}, & \lcontractN{\wy_\alpha}{\Psi}&=\ket{\wx_\alpha}. \end{align*}

The remainder of Appendix I.5 will focus on the interaction energy, $\Eint$.

I.5.5 Interaction energy

In what follows, the interaction energy will be expressed as

\begin{align*} \Eint\equiv \expvaltwo{\Hamint}{\Psi}\equiv \Eintcp{+-}+\Eintcp{-+}+\Eintcp{--}+\Eintcp{++}, \end{align*}

where ${\Eintcp{+-}}$ denotes the energy of interaction between the nuclei of CP1 and the electrons of CP2; ${\Eintcp{-+}}$ denotes the energy of interaction between the electrons of CP1 and the nuclei of CP2; and ${\Eintcp{--}}$ and ${\Eintcp{++}}$ denote the energies of interaction between CP1 and CP2 that only involve electrons and nuclei, respectively. An expression for ${\Eintcp{+-}}$ will now be derived, and expressions for ${\Eintcp{-+}}$, ${\Eintcp{--}}$, and ${\Eintcp{++}}$ that could be derived by a similar route will then be presented.

Let ${\sxn}$ and ${\sxe}$ denote the sets of all indices $i$ for which ${x_i}$ is the coordinate of one of CP1’s nuclei and electrons, respectively; and let ${\syn}$ and ${\sye}$ denote the sets of all indices $j$ for which $y_j$ is one of CP2’s nuclei and electrons, respectively. Then the interaction between CP1’s nuclei and CP2’s electrons can be expressed as

\begin{align*} \Eintcp{+-} &=-Z \int\dd{x}\int\dd{y} w(x,y) \left( \sum_{\substack{i\in \sxn \\ j\in\sye}} \int \int \abs{\Psi(x_1\cdots y_q)}^2 \delta(x-x_i)\delta(y-y_j) \dmeasureA{1\cdots p}\dmeasureB{1\cdots q} \right) \\ &=-Z \int\dd{x}\int\dd{y} w(x,y) \left( \numpx\nummy \int \int \abs{\Psi(x,x_2\cdots x_p,y,y_2\cdots y_q)}^2 \dmeasureA{2\cdots p}\dmeasureB{2\cdots q} \right), \tag{183} \end{align*}

where ${\numpx}$ denotes the number of nuclei in CP1, ${\nummy}$ denotes the number electrons in CP2; and the symmetry of ${\abs{\Psi}^2}$ with respect to exchange of identical particles belonging to the same CP has been used to reach the second expression from the first.

Eq. (183) will be simplified by expressing it in terms of position probability density functions (pdfs). First, the notation used to identify pdfs, joint pdfs, and condition pdfs will be introduced.

Notation for probability density functions (pdfs):

The joint probability density that one of CP1’s nuclei is at $x$ and one of CP2’s electrons is at $y$ is

\begin{align*} &\denpm(x,y)\\ &\quad\equiv \numpx\nummy\int\int \abs{\Psi(x,x_2\cdots y,y_2\cdots y_q)}^2\dmeasureA{2\cdots p}\dmeasureB{2\cdots q}. \end{align*}

More generally, ${\den_{\scriptscriptstyle ss'}(x,y)}$, where ${s,s'\in\{+,-\}}$, will denote the joint probability density that one of CP1’s particles of type ${s}$ is at $x$ and one of CP2’s particles of type ${s'}$ is at $y$, where particles of type ‘$+$’ are nuclei and those of type ‘$-$’ are electrons.

Let ${\denargp{i}}$ and ${\denargm{i}}$ denote the number densities of CPi’s nuclei and electrons, respectively, where ${\text{CPi}\in\{\text{CP1},\text{CP2}\}}$. For example, ${\denargm{2}(y)}$ is the probability density that one of CP2’s electrons is at $y$, which implies that ${\int\dd{y}\denargm{2}(y)=\nummy}$.

Let ${\den^{\scriptscriptstyle (i)}_{\scriptscriptstyle s|s'}(u|v)}$, where ${s, s'\in \{-,+\}}$, denote the conditional probability density that one of CPi’s particles of type $s$ is at $u$ given that one of the other CP’s particles of type $s'$ is at $v$. For example, ${\dencmp{2}(y|x)}$ is the conditional probability density that one of CP2’s electrons is at $y$, given that one of CP1’s nuclei is at $x$; and ${\dencmm{1}(x|y)}$ is the conditional probability density that one of CP1’s electrons is at $x$ given that one of CP2’s electrons is at $y$.

These definitions imply the following relations:

\begin{align*} \denpm(x,y) &= \dencpm{1}(x|y)\denargm{2}(y) = \denargp{1}(x)\dencmp{2}(y|x) \\ \denmp(x,y) &= \dencmp{1}(x|y)\denargp{2}(y) = \denargm{1}(x)\dencpm{2}(y|x) \\ \denpp(x,y) &= \dencpp{1}(x|y)\denargp{2}(y) = \denargp{1}(x)\dencpp{2}(y|x) \\ \denmm(x,y) &= \dencmm{1}(x|y)\denargm{2}(y) = \denargm{1}(x)\dencmm{2}(y|x). \end{align*}

Interaction energies in terms of pdfs

We can now express Eq. (183) as

\begin{align*} \Eintcp{+-} &= -Z\int\dd{x}\int\dd{y} w(x,y)\denpm(x,y). \\ &= -Z\int\dd{x}\int\dd{y} w(x,y)\dencpm{1}(x|y)\denargm{2}(y) \\ &\equiv -Z\bbraket{\dencpm{1}}{\denargm{2}} = -Z\bbraket{\denargp{1}}{\dencmp{2}}, \end{align*}

where I have introduced the shorthand notation,

\begin{align*} \bbraket{f}{g}\equiv \int \dd{x}\int \dd{y} w(x,y) f(x) g(y), \end{align*}

which I will now begin to use extensively. The function $f$ that occupies the first slot of ${\bbraket{\,\cdot\,}{\,\cdot\,}}$ will always be a position pdf for CP1’s nuclei or electrons, and the function $g$ in the second slot will always be a position pdf for CP2’s nuclei or electrons. Either $f$ or $g$ may or may not be a conditional pdf, i.e., ${f=f(x|y)}$ or ${f=f(x)}$ and ${g=g(y|x)}$ or ${g=g(y)}$.

In this notation, the energy of interaction between CP1’s electrons and CP2’s nuclei is

\begin{align*} \Eintcp{-+}\equiv -Z(\dencpm{1}|\denargm{2})= -Z(\denargp{1}|\dencmp{2}); \end{align*}

the energy of interaction between CP1’s electrons and CP2’s electrons is

\begin{align*} \Eintcp{--}\equiv (\dencmm{1}|\denargm{2})=(\denargm{1}|\dencmm{2}); \end{align*}

the energy of interaction between CP1’s nuclei and CP2’s nuclei is

\begin{align*} \Eintcp{++}\equiv Z^2(\dencpp{1}|\denargp{2})=Z^2(\denargp{1}|\dencpp{2}); \end{align*}

and the total interaction energy is

\begin{align*} \Eint &= \Eintcp{++} + \Eintcp{--} +\Eintcp{+-}+\Eintcp{-+} \\ &= Z^2(\denargp{1}|\dencpp{2}) + (\denargm{1}|\dencmm{2}) \\ &\qquad\qquad\qquad-Z\left[(\denargp{1}|\dencmp{2}) +(\denargm{1}|\dencpm{2})\right] \tag{184} \\ &= Z^2(\dencpp{1}|\denargp{2}) + (\dencmm{1}|\denargm{2}) \\ &\qquad\qquad\qquad-Z\left[(\dencpm{1}|\denargm{2}) +(\dencmp{1}|\denargp{2})\right] \tag{185} \end{align*}

Now let us define a set of probability or number density response functions, as follows:

\begin{align*} &\ddencmm{i}\equiv \dencmm{i} - \denargm{i},& &\ddencmp{i}\equiv \dencmp{i} - \denargm{i},& \\ &\ddencpp{i}\equiv \dencpp{i} - \denargp{i},& &\ddencpm{i}\equiv \dencpm{i} - \denargp{i}.& \end{align*}

Then $\Eint$ can be expressed as ${\Eint=\Estatic + \Edyn}$, where $\Estatic$ is the energy of the mean-field interaction between the CPs’ average charge densities,

\begin{align*} \rhoarg{1} &\equiv eZ\denargp{1}-e\denargm{1}, & \rhoarg{2} &\equiv eZ\denargp{2}-e\denargm{2}; \end{align*}

and $\Edyn$ is the contribution to $\Eint$ from dynamical correlation.

I.5.6 Mean field interaction energy

The mean-field interaction energy can be expressed as

\begin{align*} \Estatic&\equiv \overbrace{ Z^2\bbraket{\denargp{1}}{\denargp{2}} }^{\displaystyle \Estatarg{++}} + \overbrace{ \bbraket{\denargm{1}}{\denargm{2}} }^{\displaystyle \Estatarg{--}} \\ &\qquad\qquad\qquad \underbrace{ -Z\bbraket{\denargp{1}}{\denargm{2}} }_{\displaystyle \Estatarg{+-}} \underbrace{-Z \bbraket{\denargm{1}}{\denargp{2}} }_{\displaystyle \Estatarg{-+}} \\ & = Z\bbraket{\denargp{1}}{\pdenarg{2}} - \bbraket{\denargm{1}}{\pdenarg{2}}=\bbraket{\pdenarg{1}}{\pdenarg{2}} \tag{186} \end{align*}

where ${\pdenarg{1}}$, ${\pdenarg{2}}$ and ${\pden\equiv\pdenarg{1}+\pdenarg{2}}$ are the charge densities of CP1, CP2, and CP1+CP2, respectively.

If the separation $r$ between the CPs’ centers is large relative to their linear dimensions (e.g., their diameters, if they are spherical) it may be useful to express ${\pdenarg{1}}$ and ${\pdenarg{2}}$ as multipole expansions. Then $\Estatic$ can be expressed exactly as an infinite sum of multipole-multipole interactions, or approximated by a truncation of the infinite sum.

By assumption, each CP is charge-neutral. Therefore the term in the infinite sum that decays slowest as $r$ increases is the ${1/r^3}$ dipole-dipole term; the next slowest decaying terms are the ${1/r^4}$ dipole-quadrupole and quadrupole-dipole terms; and, in general and in principle, there are terms that decay as ${1/r^m}$ for all integers ${m\geq 3}$.

Note that ${\denargp{1}}$, ${\denargm{1}}$, ${\denargp{2}}$, and ${\denargm{2}}$ are all non-negative. Therefore ${\Estatarg{++}}$ and ${\Estatarg{--}}$ are positive contributions to $\Eint$, which always contribute repulsions to the inter-CP force; and ${\Estatarg{+-}}$ and ${\Estatarg{-+}}$ are negative contributions to $\Eint$, which always contribute attractions to the inter-CP force.

I.5.7 Correlation interaction energy

The correlation interaction energy can be expressed as

\begin{align*} \Edyn &\equiv Z^2\bbraket{\denargp{1}}{\ddencpp{2}} + \bbraket{\denargm{1}}{\ddencmm{2}} \\ &\quad\qquad\quad -Z\left[ \bbraket{\denargp{1}}{\ddencmp{2}} + \bbraket{\denargm{1}}{\ddencpm{2}} \right] \tag{187} \\ &= Z^2\bbraket{\ddencpp{1}}{\denargp{2}} + \bbraket{\ddencmm{1}}{\denargm{2}} \\ &\quad\qquad\quad-Z\left[\bbraket{\ddencpm{1}}{\denargm{2}} +\bbraket{\ddencmp{1}}{\denargp{2}}\right] \tag{188} \\ & = \Edynarg{++}+\Edynarg{--}+\Edynarg{+-}+\Edynarg{-+}, \end{align*}

where

\begin{align*} \Edynarg{++}\equiv Z^2\bbraket{\denargp{1}}{\ddencpp{2}}=Z^2\bbraket{\ddencpp{1}}{\denargp{2}} \end{align*}

is the energy of correlation between CP1’s nuclei and CP2’s nuclei;

\begin{align*} \Edynarg{--}\equiv \bbraket{\denargm{1}}{\ddencmm{2}}=\bbraket{\ddencmm{1}}{\denargm{2}} \end{align*}

is the energy of correlation between CP1’s electrons and CP2’s electrons;

\begin{align*} \Edynarg{+-}\equiv -Z\bbraket{\denargp{1}}{\ddencmp{2}}=-Z\bbraket{\ddencpm{1}}{\denargm{2}} \end{align*}

is the energy of correlation between CP1’s nuclei and CP2’s electrons; and

\begin{align*} \Edynarg{-+}\equiv -Z\bbraket{\denargm{1}}{\ddencpm{2}}=-Z\bbraket{\ddencmp{1}}{\denargp{2}} \end{align*}

is the energy of correlation between CP1’s electrons and CP2’s nuclei.

I.5.8 Density response functions

The response functions ${\ddencpp{i}}$, ${\ddencmm{i}}$, ${\ddencpm{i}}$, and ${\ddencmp{i}}$ are not non-negative everywhere because they are not pdfs. They are differences between pdfs. For example,

\begin{align*} \ddencpp{1}(x|y)\equiv \dencpp{1}(x|y)-\denargp{1}(x) \end{align*}

is the difference between the probability density that one of CP1’s nuclei is at $x$ when one of CP2’s nuclei is at $y$ and the probability density that one of CP1’s nuclei is at $x$ when nothing more specific than ${\denargp{2}}$ is known about the locations of CP2’s nuclei.

Conservation of probability and conservation of the number of nuclei imply that

\begin{align*} \int\dd{x}\denargp{1}(x)=\int\dd{x}\dencpp{1}(x|y)\implies \int\dd{x}\ddencpp{1}(x|y)=0. \end{align*}

Therefore they imply that if the discovery or revelation that there is a nucleus is at $y$ causes the number density of nuclei to decrease in one part of CP1, it must cause it to increase in another part of CP1.

Nuclei repel one another. Therefore, because the points $x_1$ and $y_1$ shown in Fig. 22 are relatively close to one another, it seems reasonable to expect that the probability density of there being a nucleus at $x_1$ would be reduced by Coulomb repulsion if there was a nucleus at $y_1$. If that was the case, ${\dencpp{1}(x_1|y_1)}$ would be less than ${\denargp{1}(x_1)}$, so ${\ddencpp{1}(x_1|y_1)}$ be negative, and the contribution of points $x_1$ and $y_1$ to ${\Edynarg{++}}$ would be negative.

On the other hand, ${\ddencpp{1}(x_2|y_2)}$ might be positive, despite the repulsion that would exist between nuclei at $x_2$ and $y_2$, because probability density is conserved and because the distance between $y_2$ and $x_2$ is larger than the distance between ${y_2}$ and most other points in CP1. Since the electric potential from a nucleus at $y_2$ would be lower at $x_2$ than at most other points in CP1, the presence of a nucleus at $y_2$ might increase the nucleus number density at $x_2$ in order to reduce it elsewhere.

Overall, however, if nuclei were sufficiently mobile, and if other contributions to the correlation energy (${\Edynarg{--}}$, ${\Edynarg{+-}}$, and ${\Edynarg{-+}}$) had negligible effects on the dynamics of nuclei, points in CP1 that are close to CP2 would be more likely to contribute negatively to $\Edynarg{++}$, and points in CP1 that are further away from CP2 would be more likely to contribute positively to ${\Edynarg{++}}$.

We can state this mathematically by defining

\begin{align*} \edynarg{++}(x) &\equiv Z^2\int\dd{y} w(x,y) \ddencpp{1}(x|y) \denargp{2}(y) \\ \implies \Edynarg{++}&= \int\dd{x}\edynarg{++}(x). \end{align*}

Then ${\edynarg{++}(x_1)}$ is likely to be negative and ${\edynarg{++}(x_2)}$ is likely to be positive. Furthermore, since $x_1$ is closer to CP2 than $x_2$ is, it is likely that

\begin{align*} \abs{\edynarg{++}(x_1)}>\abs{\edynarg{++}(x_2)}. \end{align*}

Therefore ${\Edynarg{++}}$ is more likely to be negative than positive. Similar reasoning could be used to argue that ${\Edynarg{--}}$, ${\Edynarg{+-}}$, and ${\Edynarg{-+}}$ are all likely to be negative; and it is known that the sum ${\Edyn}$ of all contributions to the correlation energy is negative.

However, while arguing that $\Edynarg{++}$ is likely to be negative, we have implicitly made some physical assumptions that are not necessarily justified or valid, or even likely to be justified or valid. For example, we have assumed that ${\Edynarg{++}}$ is not changed significantly by correlations between electrons and nuclei. However, we will not discuss this possibility because we will discuss another of our implicit assumptions, which but we will discuss another of our implicit assumptions. Its dubious validity that we have made, and which makes that assumption likely to be valid.

At first sight, and as it is presented above, the reasoning appears to contain a strong and unjustified implicit assumption. Namely, it appears that the fact that one of CP2’s nuclei is at $y$ at time $\tau$ changes the probability density at time $\tau$ that one of CP1’s nuclei is at $x$. If this assumption was being made, it would not be valid because the time taken by nuclei in a neighbourhood of $x$ to respond to the arrival of a nucleus at $y$ is finite. Their response is not instantaneous.

However, let us not forget that ${\Edynarg{++}}$ can be expressed as ${Z^2\bbraket{\dencpp{1}}{\denargp{2}}}$ or as ${Z^2\bbraket{\denargp{1}}{\dencpp{2}}}$, and that the correlation is not the nuclei of CP1 responding to fluctuations in the spatial distribution of CP2’s nuclei, or vice-versa, but the nuclei of each CP moving under the influence of the other CP’s nuclei. The motions of the nuclei and electrons of CP1 are correlated with one another and, to a lesser degree because they are further away, they are correlated with the motions of CP2’s nuclei and electrons.

Therefore it is not appropriate to interpret ${\ddencpp{1}(x|y)}$ as the response of CP1’s number density of nuclei at $x$, ${\denargp{1}(x)}$, to one of CP2’s nuclei suddenly appearing at $y$; and it is not even appropriate to interpret it as a response to the entire history of the nucleus whose position at time $t$ is $y$. It should be interpreted as an average of the responses to the trajectories in the set

\begin{align*} \big\{ \left\{(X(t),Y(t)): t\in (-\infty,\tau]\right\} :\exists j\in\syn \;s.t.\; y_j(\tau)=y\big\} \end{align*}

of all possible trajectories ${(X(t),Y(t))}$ of CP1+CP2 that are consistent with one of CP2’s nuclei being at $y$ at time $\tau$.

I.5.9 Decoupled non-overlapping bodies

It has been shown above that the energy of CP1+CP2 can be expressed as

\begin{align*} E=\sum_\alpha\lambda_\alpha\left(\CPenergy{1}_\alpha + \CPenergy{2}_\alpha\right) + \Emf + \Edyn. \end{align*}

I.5.10 Separation of time scales

We have found the following expression for the total energy of CP1+CP2, which is exact in the limit of zero overlap between CP1 and CP2 if CP1+CP2 is in a pure state $\Psi$.

\begin{align*} E[\Psi] & = \sum_\alpha\lambda_\alpha\left(\Epsilon_1^\alpha +\Epsilon_2^\alpha\right) + \Eintmf[\pdensuper{1},\pdensuper{2}] \\ & +\frac{1}{2}\left[ \expval{\pdensuperm{1},\dbarvm{2}} + \expval{\pdensuperm{2},\dbarvm{1}} \right] \\ &-\frac{Z}{2}\left[ \expval{\pdensuperp{1},\dbarvp{2}} + \expval{\pdensuperp{2},\dbarvp{1}} \right] \tag{189} \end{align*}

Let us assume that an isolated C-particle is approximately spherical but thermally disordered. When two C-particles approach one another the interaction between them can break their near-spherical symmetry. If they are observed on a time scale that is short relative to the time scale on which they rotate about an axis passing through their centers, and that is short relative to the time scale on which the internal structure of a C-particle can rearrange, it is reasonable to assume that they are observed in a pure state. This is because there are no relevant symmetries on such a time scale.

Let us now consider the different types of correlation described by the $\delta\bar{v}$ terms on the right hand side of Eq. (189). The terms $-Z\expval{\pdensuper{1}_\p,\delta\bar{v}_\p^{(2)}}$ and $-Z\expval{\pdensuper{2},\delta\bar{v}_\p^{(1)}}$ account for the energy associated with synchronicity between the motion of nuclei on one C-particle and the motion of nuclei and electrons on the other. If we assume that nuclei move much more slowly that electrons and that electrons are free to move so that, on the time scale of nuclear motion, they perfectly screen any fields from nuclei on the other C-particle, then $\delta\bar{v}_\p{(1)}=\delta\bar{v}_\p{(2)}=0$ and only the synchronous motion of electrons on different C-particles is relevant. Our assumption that electrons move freely also implies that $\Eintmf[\pdensuper{1},\pdensuper{2}] =0$, since both C-particles are globally charge-neutral and since on nuclear time scales electrons move rapidly to ensure local charge-neutrality. Therefore, it is expected that a very good approximation to the energy of CP1+CP2 is provided by

\begin{align*} E & \approx \sum_\alpha\lambda_\alpha\left(\Epsilon^\alpha_1+\Epsilon^\alpha_2\right) +\frac{1}{2} \expval{\pden_\subminus^{(1)},\delta\bar{v}_\subminus^{(2)}} +\frac{1}{2}\expval{\pden_\subminus^{(2)},\delta\bar{v}_\subminus^{(1)}}, \end{align*}

\begin{align*} E[\{\lambda_\alpha,&\wx_\alpha,\wy_\alpha\} ] \approx \sum_\alpha\lambda_\alpha\left(E_1[\wx_\alpha]+E_2[\wy_\alpha]\right) \\ & +\frac{1}{2} \expval{\pden_\subminus^{(1)}[\{\lambda_\alpha,\wx_\alpha\}],\delta\bar{v}_\subminus^{(2)}[\{\lambda_\alpha,\wx_\alpha,\wy_\alpha\}]} \\ &+\frac{1}{2}\expval{\pden_\subminus^{(2)}[\{\lambda_\alpha,\wy_\alpha\}],\delta\bar{v}_\subminus^{(1)}[\{\lambda_\alpha,\wx_\alpha,\wy_\alpha\}]} \tag{190} \end{align*}

I.5.11 Appendix to Appendix (I):Creation and annihilation operators

Note that ${\Wn_\beta}$, where ${\beta\neq\alpha}$, depends indirectly on ${\ket{\varphi_\alpha}}$, and this dependence could be made explicit, but I will not do this. However, I will draw out the dependence of ${\Delta\varepsilon_{\alpha\beta}}$ on orbitals other than ${\ket{\varphi_\alpha}}$ and ${\ket{\varphi_\beta}}$. These dependences enter ${\Delta\varepsilon_{\alpha\beta}}$ via $\hat{\mathcal{V}}_{\alpha\beta}$, because ${\ket{\Theta_\alpha}}$ and ${\ket{\Theta_\beta}}$ both contain finite overlaps with at least ${N-1}$ natural $1$-states.

The overlap of ${\ket{\Theta_\alpha}}$ with ${\ket{\varphi_\alpha}}$ vanishes by Eq. (164); however ${\braket{\varphi_\alpha}{\Theta_\beta}}$ does not vanish, in general, if ${\beta\neq\alpha}$. Therefore, let us express ${\ket{\Theta_\beta}}$ as the sum of a state with finite overlap with ${\ket{\varphi_\alpha}}$ and a state $\ket{\Theta_{\beta\perp\alpha}}$ whose projection onto ${\ket{\varphi_\alpha}}$ vanishes. To facilitate this decomposition, let us define the annihilation operator ${\hat{a}_\alpha}$ and the creation operator ${\hat{a}_\alpha^\dagger}$ by their actions on an $M$-particle state $\chi_M$ and an $(M-1)$-particle state $\chi_{M-1}$, respectively.

\begin{align*} &\left(\hat{a}_\alpha \chi_M\right)(1\cdots M-1) \equiv M^{\frac{1}{2}}\int \chi_M(1\cdots M)\bar{\varphi}_\alpha(M)\dd{x_M} \\ &\left(\hat{a}^\dagger_\alpha \chi_{M-1}\right)(1\cdots M) \equiv M^{-\frac{1}{2}}\hat{\mathcal{A}}\left\{\chi_{M-1}(1\cdots M-1)\varphi_\alpha(M)\right\} \end{align*}

where ${\hat{\mathcal{A}}}$ is the antisymmetrization operator. With a bit of algebra it can be shown that ${\hat{a}_\alpha\hat{a}_\alpha^\dagger + \hat{a}_\alpha^\dagger\hat{a}_\alpha=\identity}$, where $\identity$ is the identity. Note that this notation is a bit sloppy and, as a result, this expression for the identity is misleading. We should really express it as ${\hat{a}_{M+1,\alpha}\hat{a}_{M+1,\alpha}^\dagger + \hat{a}_{M,\alpha}^\dagger\hat{a}_{M,\alpha}=\identity_M}$, where ${\hat{a}_{M,\alpha}}$ acts on $M$-particle states to produce ${(M-1)}$-particle states, ${\hat{a}_{M,\alpha}^\dagger}$ acts on $(M-1)$-particle states to produce ${M}$-particle states, and ${\identity_M}$ is the identity in the $M$-particle Hilbert space. With this in mind, let us proceed with the simpler sloppy notation. We can write

\begin{align*} \Theta_\beta(2\cdots N) &= \hat{a}_\alpha^\dagger\hat{a}_\alpha\Theta_\beta(2\cdots N) + \hat{a}_\alpha\hat{a}^\dagger_\alpha\Theta_\beta(2\cdots N) \\ &= \hat{a}_\alpha^\dagger \Theta_{\beta-\alpha}(2\cdots N-1) +\Theta_{\beta\perp\alpha}(2\cdots N) \end{align*}

where ${\braket{\varphi_\alpha}{\Theta_{\beta-\alpha}}}$ and ${\braket{\varphi_\alpha}{\Theta_{\beta\perp\alpha}}}$ both vanish. Then,

\begin{align*} N\left(\frac{c_\alpha}{c_\beta}\right) &\Delta\varepsilon_{\alpha\beta} \equiv \mel{\Theta_\alpha}{\hat{\mathcal{U}}_{\alpha\beta}}{\hat{a}_\alpha^\dagger\Theta_{\beta-\alpha}} + \mel{\Theta_\alpha}{\hat{\mathcal{U}}_{\alpha\beta}}{\Theta_{\beta\perp\alpha}} \\ & = \int \bar{\varphi}_\alpha(1) \bar{\theta}_{\alpha\beta}(2) \hat{w}(1,2) \varphi_\alpha(2) \varphi_\beta(1) \dmeasure{1,2} \\ &+ \int \bar{\varphi}_\alpha(1) \bar{\Theta}_\alpha(2\cdots N) \hat{w}(1,2) \\ &\qquad\times\Theta_{\beta\perp\alpha}(2\cdots N) \varphi_\beta(1) \dmeasure{1\cdots N} \tag{191} \end{align*}

where ${\ket{\theta_{\alpha\beta}}\equiv \braket{\Theta_{\beta-\alpha}}{\Theta_\alpha}}$ is a $1$-particle state that is orthogonal to $\ket{\varphi_\alpha}$ and, to reach the second equation from the first, I have used the orthogonality of ${\Theta_\alpha}$ to ${\varphi_\alpha}$, as follows: in the expression for ${\hat{a}_\alpha^\dagger\Theta_{\beta-\alpha}}$, I expanded the antisymmetrized product of ${\Theta_{\beta-\alpha}}$ and ${\varphi_\alpha}$ as a sum; then I used the fact that each integral for which the argument of ${\varphi_\alpha}$ is not $2$ vanishes.

Appendix I. Energies of pure states in their natural bases

I.1 Pure states and mixed states

I.1.1 Pure states

I.1.2 Mixed states

I.1.3 Pure states define the set of all states

I.2 Notation, definitions, and assumptions

I.2.1 Basis bras and kets

I.2.2 Contractions

I.3 Natural states

Property 1:

Property 2:

I.4 Natural orbitals

I.4.1 Theoretical setup

I.4.2 Single particle energy

I.4.3 Interaction energy - Expression 1

I.4.4 Interaction energy - Expression 2

I.4.5 Total energy

I.4.6 Hartree-Fock approximation

I.4.7 Summary

I.5 Non-overlapping bodies

I.5.1 Macroscopic charge distributions and their ensembles

I.5.2 Notation

I.5.3 Wavefunction

I.5.4 Energy

I.5.5 Interaction energy

Notation for probability density functions (pdfs):

Interaction energies in terms of pdfs

I.5.6 Mean field interaction energy

I.5.7 Correlation interaction energy

I.5.8 Density response functions

I.5.9 Decoupled non-overlapping bodies

I.5.10 Separation of time scales

I.5.11 Appendix to Appendix (I):Creation and annihilation operators

Comments