Functional (mathematics)

Local vs non-local

If a functional's value can be computed for small segments of the input curve and then summed to find the total value, the functional is called local. Otherwise it is called non-local. For example:

$$F[y] = \int_{x_0}^{x_1} y(x)\, dx$$

is local while

$$F[y] = \frac{\int_{x_0}^{x_1} y(x)\, dx}{\int_{x_0}^{x_1} \left(1 + [y(x)]^2\right) dx}$$

is non-local. This commonly occurs when integrals appear separately in the numerator and denominator of an expression, as in calculations of the center of mass.

Linear functionals

Linear functionals first appeared in functional analysis, the study of vector spaces of functions. A typical example of a linear functional is integration: the linear transformation defined by the Riemann integral

$$I(f) = \int_a^b f(x)\, dx$$

is a linear functional from the vector space C[a, b] of continuous functions on the interval [a, b] to the real numbers. The linearity of I(f) follows from the standard facts about the integral:

$$I(f + g) = \int_a^b \big(f(x) + g(x)\big)\, dx = I(f) + I(g)$$
$$I(\alpha f) = \int_a^b \alpha f(x)\, dx = \alpha\, I(f).$$

Functional derivative

The functional derivative is defined first; then the functional differential is defined in terms of the functional derivative.

Functional derivative

Given a manifold M representing (continuous/smooth/with certain boundary conditions/etc.) functions ρ and a functional F defined as

$$F[\rho] = \int f\big(\boldsymbol{r}, \rho(\boldsymbol{r}), \nabla\rho(\boldsymbol{r})\big)\, d\boldsymbol{r},$$

the functional derivative of F[ρ], denoted δF/δρ, is defined by[1]

$$\int \frac{\delta F}{\delta\rho}(\boldsymbol{r})\,\phi(\boldsymbol{r})\, d\boldsymbol{r} = \lim_{\varepsilon\to 0}\frac{F[\rho + \varepsilon\phi] - F[\rho]}{\varepsilon} = \left[\frac{d}{d\varepsilon} F[\rho + \varepsilon\phi]\right]_{\varepsilon=0},$$

where ϕ is an arbitrary function. The quantity εϕ is called the variation of ρ.
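
The defining relation above can be checked numerically. The following sketch (not part of the original text; the density, test function, and grid are arbitrary illustrative choices) discretizes F[ρ] = ∫ρ³ dx on [0, 1], whose functional derivative is 3ρ²(x), and compares the difference quotient with ∫(δF/δρ)ϕ dx:

```python
import numpy as np

# Discretize F[rho] = ∫ rho(x)^3 dx on [0, 1]; its functional derivative
# is δF/δρ(x) = 3 ρ(x)².  All concrete choices below are illustrative.
x, dx = np.linspace(0.0, 1.0, 2001, retstep=True)
rho = 1.0 + 0.5 * np.sin(2.0 * np.pi * x)     # an arbitrary smooth density
phi = np.exp(-(x - 0.5) ** 2 / 0.02)          # an arbitrary test function

def F(r):
    return np.sum(r ** 3) * dx                # simple quadrature

eps = 1e-6
lhs = (F(rho + eps * phi) - F(rho)) / eps     # d/dε F[ρ + εϕ] at ε ≈ 0
rhs = np.sum(3.0 * rho ** 2 * phi) * dx       # ∫ (δF/δρ) ϕ dx
print(lhs, rhs)                               # agree to discretization error
```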

Functional differential

The differential (or variation or first variation) of the functional F[ρ] is,[2][Note 1]

$$\delta F[\rho; \phi] = \int \frac{\delta F}{\delta\rho}(x)\,\delta\rho(x)\, dx,$$

where δρ(x) = εϕ(x) is the variation of ρ(x). This is similar in form to the total differential of a function F(ρ1, ρ2, ..., ρn),

$$dF = \sum_{i=1}^{n} \frac{\partial F}{\partial\rho_i}\, d\rho_i,$$

where ρ1, ρ2, ..., ρn are independent variables. Comparing the last two equations, the functional derivative δF/δρ(x) has a role similar to that of the partial derivative ∂F/∂ρi, where the variable of integration x is like a continuous version of the summation index i.[3]

Properties

Like the derivative of a function, the functional derivative satisfies the following properties, where F[ρ] and G[ρ] are functionals:

  • Linearity:[4]
    $$\frac{\delta(\lambda F + \mu G)}{\delta\rho(x)} = \lambda\,\frac{\delta F}{\delta\rho(x)} + \mu\,\frac{\delta G}{\delta\rho(x)},$$
    where λ, μ are constants;
  • Product rule:[5]
    $$\frac{\delta(FG)}{\delta\rho(x)} = \frac{\delta F}{\delta\rho(x)}\,G[\rho] + F[\rho]\,\frac{\delta G}{\delta\rho(x)};$$
  • Chain rules:
    If f is a differentiable function, then
    $$\frac{\delta F[f(\rho)]}{\delta\rho(y)} = \frac{\delta F[f(\rho)]}{\delta f(\rho(y))}\; f'(\rho(y))$$[6]
    and
    $$\frac{\delta f(G[\rho])}{\delta\rho(y)} = f'(G[\rho])\;\frac{\delta G[\rho]}{\delta\rho(y)}.$$[7]

Lemmas

$$\frac{\delta F}{\delta\rho(\boldsymbol{r})} = \frac{\partial f}{\partial\rho} - \nabla\cdot\frac{\partial f}{\partial\nabla\rho},$$

where ρ = ρ(r) and f = f (r, ρ, ∇ρ). This formula is for the case of the functional form given by F[ρ] at the beginning of this section. For other functional forms, the definition of the functional derivative can be used as the starting point for its determination. (See the example Coulomb potential energy functional.)

Proof: given a functional

$$F[\rho] = \int f\big(\boldsymbol{r}, \rho(\boldsymbol{r}), \nabla\rho(\boldsymbol{r})\big)\, d\boldsymbol{r}$$

and a function ϕ(r) that vanishes on the boundary of the region of integration, the definition from the previous section gives

$$\begin{aligned}
\int \frac{\delta F}{\delta\rho(\boldsymbol{r})}\,\phi(\boldsymbol{r})\, d\boldsymbol{r}
&= \left[\frac{d}{d\varepsilon}\int f\big(\boldsymbol{r}, \rho + \varepsilon\phi, \nabla\rho + \varepsilon\nabla\phi\big)\, d\boldsymbol{r}\right]_{\varepsilon=0} \\
&= \int \left(\frac{\partial f}{\partial\rho}\,\phi + \frac{\partial f}{\partial\nabla\rho}\cdot\nabla\phi\right) d\boldsymbol{r} \\
&= \int \left[\frac{\partial f}{\partial\rho}\,\phi + \nabla\cdot\left(\frac{\partial f}{\partial\nabla\rho}\,\phi\right) - \left(\nabla\cdot\frac{\partial f}{\partial\nabla\rho}\right)\phi\right] d\boldsymbol{r} \\
&= \int \left[\frac{\partial f}{\partial\rho}\,\phi - \left(\nabla\cdot\frac{\partial f}{\partial\nabla\rho}\right)\phi\right] d\boldsymbol{r} \\
&= \int \left(\frac{\partial f}{\partial\rho} - \nabla\cdot\frac{\partial f}{\partial\nabla\rho}\right)\phi(\boldsymbol{r})\, d\boldsymbol{r}.
\end{aligned}$$

The second line is obtained using the total derivative, where ∂f/∂∇ρ is the derivative of a scalar with respect to a vector.[Note 2] The third line was obtained by use of the product rule for divergence. The fourth line was obtained using the divergence theorem and the condition that ϕ = 0 on the boundary of the region of integration. Since ϕ is an arbitrary function, applying the fundamental lemma of calculus of variations to the last line yields the functional derivative.
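
As a quick numerical sanity check of this lemma (a sketch with arbitrary choices, not from the original text), take f = ½(ρ′)² in one dimension, for which the formula gives δF/δρ = −ρ″; a finite-difference variation with a boundary-vanishing ϕ reproduces ∫(−ρ″)ϕ dx:

```python
import numpy as np

# Check δF/δρ = ∂f/∂ρ − ∇·(∂f/∂∇ρ) for f = ½ ρ'(x)² in one dimension,
# where the formula gives δF/δρ(x) = −ρ''(x).  Choices are illustrative.
x, dx = np.linspace(0.0, 1.0, 4001, retstep=True)
rho = np.sin(np.pi * x)
phi = x ** 2 * (1.0 - x) ** 2            # vanishes on the boundary

def F(r):
    return 0.5 * np.sum(np.gradient(r, dx) ** 2) * dx

eps = 1e-6
lhs = (F(rho + eps * phi) - F(rho)) / eps
rhs = np.sum(np.pi ** 2 * np.sin(np.pi * x) * phi) * dx   # ∫ (−ρ'') ϕ dx
print(lhs, rhs)                                           # close agreement
```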

Examples

Thomas–Fermi kinetic energy functional

The Thomas–Fermi model of 1927 used a kinetic energy functional for a noninteracting uniform electron gas in a first attempt at a density-functional theory of electronic structure:

$$T_{\mathrm{TF}}[\rho] = C_{\mathrm{F}} \int \rho^{5/3}(\boldsymbol{r})\, d\boldsymbol{r}, \qquad C_{\mathrm{F}} = \frac{3}{10}\big(3\pi^2\big)^{2/3} \ \text{(atomic units)}.$$

Since the integrand of TTF[ρ] does not involve derivatives of ρ(r), the functional derivative of TTF[ρ] is,[8]

$$\frac{\delta T_{\mathrm{TF}}}{\delta\rho(\boldsymbol{r})} = \frac{5}{3}\, C_{\mathrm{F}}\, \rho^{2/3}(\boldsymbol{r}).$$

Coulomb potential energy functional

For the electron-nucleus potential, Thomas and Fermi employed the Coulomb potential energy functional

$$V[\rho] = \int \frac{\rho(\boldsymbol{r})}{|\boldsymbol{r}|}\, d\boldsymbol{r}.$$

Applying the definition of functional derivative,

$$\left[\frac{d}{d\varepsilon}\int \frac{\rho(\boldsymbol{r}) + \varepsilon\phi(\boldsymbol{r})}{|\boldsymbol{r}|}\, d\boldsymbol{r}\right]_{\varepsilon=0} = \int \frac{1}{|\boldsymbol{r}|}\,\phi(\boldsymbol{r})\, d\boldsymbol{r}.$$

So,

$$\frac{\delta V}{\delta\rho(\boldsymbol{r})} = \frac{1}{|\boldsymbol{r}|}.$$

For the classical part of the electron-electron interaction, Thomas and Fermi employed the Coulomb potential energy functional

$$J[\rho] = \frac{1}{2}\int\!\!\int \frac{\rho(\boldsymbol{r})\,\rho(\boldsymbol{r}')}{|\boldsymbol{r} - \boldsymbol{r}'|}\, d\boldsymbol{r}\, d\boldsymbol{r}'.$$

From the definition of the functional derivative,

$$\left[\frac{d}{d\varepsilon} J[\rho + \varepsilon\phi]\right]_{\varepsilon=0} = \frac{1}{2}\int\!\!\int \frac{\phi(\boldsymbol{r})\,\rho(\boldsymbol{r}')}{|\boldsymbol{r} - \boldsymbol{r}'|}\, d\boldsymbol{r}\, d\boldsymbol{r}' + \frac{1}{2}\int\!\!\int \frac{\rho(\boldsymbol{r})\,\phi(\boldsymbol{r}')}{|\boldsymbol{r} - \boldsymbol{r}'|}\, d\boldsymbol{r}\, d\boldsymbol{r}'.$$

The first and second terms on the right hand side of the last equation are equal, since r and r′ in the second term can be interchanged without changing the value of the integral. Therefore,

$$\left[\frac{d}{d\varepsilon} J[\rho + \varepsilon\phi]\right]_{\varepsilon=0} = \int \left(\int \frac{\rho(\boldsymbol{r}')}{|\boldsymbol{r} - \boldsymbol{r}'|}\, d\boldsymbol{r}'\right)\phi(\boldsymbol{r})\, d\boldsymbol{r},$$

and the functional derivative of the electron-electron coulomb potential energy functional J[ρ] is,[9]

$$\frac{\delta J}{\delta\rho(\boldsymbol{r})} = \int \frac{\rho(\boldsymbol{r}')}{|\boldsymbol{r} - \boldsymbol{r}'|}\, d\boldsymbol{r}'.$$

The second functional derivative is

$$\frac{\delta^2 J}{\delta\rho(\boldsymbol{r}')\,\delta\rho(\boldsymbol{r})} = \frac{1}{|\boldsymbol{r} - \boldsymbol{r}'|}.$$
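
For a spherically symmetric density, the functional derivative of J[ρ] (the Hartree potential) reduces to one-dimensional radial integrals, which makes it easy to check numerically. The sketch below is an illustration, not from the original text; it uses the hydrogen 1s density and its known Hartree potential as a test case, in atomic units:

```python
import numpy as np

# δJ/δρ(r) = ∫ ρ(r')/|r − r'| d³r' is the Hartree potential; for a
# spherical ρ it reduces to
#   v_H(r) = (1/r) ∫_0^r 4π s² ρ(s) ds + ∫_r^∞ 4π s ρ(s) ds.
# Test case: hydrogen 1s density ρ(r) = e^(−2r)/π (atomic units), with
# the known closed form v_H(r) = 1/r − e^(−2r)(1 + 1/r).
r, dr = np.linspace(1e-4, 20.0, 200000, retstep=True)
rho = np.exp(-2.0 * r) / np.pi

inner = np.cumsum(4.0 * np.pi * r ** 2 * rho) * dr            # ∫_0^r ...
outer = np.cumsum((4.0 * np.pi * r * rho)[::-1])[::-1] * dr   # ∫_r^∞ ...
v_num = inner / r + outer
v_exact = 1.0 / r - np.exp(-2.0 * r) * (1.0 + 1.0 / r)

i = np.searchsorted(r, 1.0)
print(v_num[i], v_exact[i])   # agree to quadrature error at r = 1
```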

Weizsäcker kinetic energy functional

In 1935 von Weizsäcker proposed to add a gradient correction to the Thomas–Fermi kinetic energy functional to make it better suit a molecular electron cloud:

$$T_{\mathrm{W}}[\rho] = \frac{1}{8}\int \frac{\nabla\rho(\boldsymbol{r})\cdot\nabla\rho(\boldsymbol{r})}{\rho(\boldsymbol{r})}\, d\boldsymbol{r} = \int t_{\mathrm{W}}\, d\boldsymbol{r},$$

where

$$t_{\mathrm{W}} = \frac{1}{8}\,\frac{\nabla\rho\cdot\nabla\rho}{\rho}.$$

Using a previously derived formula for the functional derivative,

$$\frac{\delta T_{\mathrm{W}}}{\delta\rho(\boldsymbol{r})} = \frac{\partial t_{\mathrm{W}}}{\partial\rho} - \nabla\cdot\frac{\partial t_{\mathrm{W}}}{\partial\nabla\rho} = -\frac{1}{8}\,\frac{\nabla\rho\cdot\nabla\rho}{\rho^2} - \nabla\cdot\left(\frac{1}{4}\,\frac{\nabla\rho}{\rho}\right),$$

and the result is,[10]

$$\frac{\delta T_{\mathrm{W}}}{\delta\rho(\boldsymbol{r})} = \frac{1}{8}\,\frac{\nabla\rho\cdot\nabla\rho}{\rho^2} - \frac{1}{4}\,\frac{\nabla^2\rho}{\rho}.$$

Entropy

The entropy of a discrete random variable is a functional of the probability mass function.

$$H[p(x)] = -\sum_x p(x)\,\log p(x)$$

Thus,

$$\begin{aligned}
\sum_x \frac{\delta H}{\delta p(x)}\,\phi(x)
&= \left[\frac{d}{d\varepsilon} H[p(x) + \varepsilon\phi(x)]\right]_{\varepsilon=0} \\
&= \left[-\,\frac{d}{d\varepsilon} \sum_x \big(p(x) + \varepsilon\phi(x)\big)\,\log\big(p(x) + \varepsilon\phi(x)\big)\right]_{\varepsilon=0} \\
&= -\sum_x \big(1 + \log p(x)\big)\,\phi(x).
\end{aligned}$$

Thus,

$$\frac{\delta H}{\delta p(x)} = -1 - \log p(x).$$
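
In the discrete setting this is an ordinary gradient, so it can be verified directly. A minimal sketch (the three-point mass function is an arbitrary choice):

```python
import numpy as np

# For H[p] = −Σ p_i log p_i the derivative w.r.t. each p_i is −1 − log p_i.
p = np.array([0.2, 0.3, 0.5])
H = lambda q: -np.sum(q * np.log(q))

eps = 1e-7
grad_fd = np.array([(H(p + eps * np.eye(3)[i]) - H(p)) / eps for i in range(3)])
print(grad_fd)              # finite-difference gradient
print(-1.0 - np.log(p))     # analytic −1 − log p, matching to ~1e-6
```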

Exponential

Let

$$F[\varphi(x)] = e^{\int \varphi(x)\, g(x)\, dx}.$$

Using the delta function as a test function,

$$\begin{aligned}
\frac{\delta F[\varphi(x)]}{\delta\varphi(y)}
&= \lim_{\varepsilon\to 0} \frac{F[\varphi(x) + \varepsilon\delta(x - y)] - F[\varphi(x)]}{\varepsilon} \\
&= e^{\int \varphi(x) g(x)\, dx}\,\lim_{\varepsilon\to 0} \frac{e^{\varepsilon g(y)} - 1}{\varepsilon} \\
&= e^{\int \varphi(x) g(x)\, dx}\, g(y).
\end{aligned}$$

Thus,

$$\frac{\delta F[\varphi(x)]}{\delta\varphi(y)} = g(y)\, F[\varphi(x)].$$

This is particularly useful in calculating the correlation functions from the partition function in quantum field theory.

Functional derivative of a function

A function can be written in the form of an integral like a functional. For example,

$$\rho(\boldsymbol{r}) = F[\rho] = \int \rho(\boldsymbol{r}')\,\delta(\boldsymbol{r} - \boldsymbol{r}')\, d\boldsymbol{r}'.$$

Since the integrand does not depend on derivatives of ρ, the functional derivative of ρ(r) is,

$$\frac{\delta\rho(\boldsymbol{r})}{\delta\rho(\boldsymbol{r}')} = \delta(\boldsymbol{r} - \boldsymbol{r}').$$

Application in calculus of variations

Consider the functional

$$J[f] = \int_{x_0}^{x_1} L\big(x, f(x), f'(x)\big)\, dx,$$

where f ′(x) ≡ df/dx. If f is varied by adding to it a function δf, and the resulting integrand L(x, f +δf, f ′+δf ′) is expanded in powers of δf, then the change in the value of J to first order in δf can be expressed as follows:[11][Note 3]

$$\delta J = \int_{x_0}^{x_1} \left(\frac{\partial L}{\partial f}\,\delta f(x) + \frac{\partial L}{\partial f'}\,\frac{d}{dx}\,\delta f(x)\right) dx = \int_{x_0}^{x_1} \left(\frac{\partial L}{\partial f} - \frac{d}{dx}\,\frac{\partial L}{\partial f'}\right)\delta f(x)\, dx + \left[\frac{\partial L}{\partial f'}\,\delta f\right]_{x_0}^{x_1}.$$

The coefficient of δf(x), denoted as δJ/δf(x), is called the functional derivative of J with respect to f at the point x.[3] For this example functional, the functional derivative is the left hand side of the Euler–Lagrange equation,[12]

$$\frac{\delta J}{\delta f(x)} = \frac{\partial L}{\partial f} - \frac{d}{dx}\,\frac{\partial L}{\partial f'}.$$
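
The functional derivative δJ/δf is exactly what a gradient-based minimizer descends along. As a sketch (the integrand L = ½f′² + f and all grid parameters are illustrative assumptions, not from the text), gradient flow on the discretized functional converges to the solution of the Euler–Lagrange equation f″ = 1 with f(0) = f(1) = 0, namely f(x) = (x² − x)/2:

```python
import numpy as np

# Minimize J[f] = ∫ (½ f'² + f) dx, f(0) = f(1) = 0, by descending along
# δJ/δf = 1 − f''.  A stable explicit step requires eta/dx² < 1/2.
n = 101
x, dx = np.linspace(0.0, 1.0, n, retstep=True)
f = np.zeros(n)
eta = 2e-5

for _ in range(50000):
    fpp = np.zeros(n)
    fpp[1:-1] = (f[2:] - 2.0 * f[1:-1] + f[:-2]) / dx ** 2
    f[1:-1] -= eta * (1.0 - fpp)[1:-1]      # endpoints stay fixed

exact = (x ** 2 - x) / 2.0                  # solution of f'' = 1
print(np.max(np.abs(f - exact)))            # small residual
```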

Using the delta function as a test function

In physics, it is common to use the Dirac delta function δ(x − y) in place of a generic test function ϕ(x), for yielding the functional derivative at the point y (this is a point of the whole functional derivative as a partial derivative is a component of the gradient):

$$\frac{\delta F[\rho(x)]}{\delta\rho(y)} = \lim_{\varepsilon\to 0}\frac{F[\rho(x) + \varepsilon\delta(x - y)] - F[\rho(x)]}{\varepsilon}.$$

This works in cases when F[ρ(x) + εδ(x − y)] formally can be expanded as a series (or at least up to first order) in ε. The formula is however not mathematically rigorous, since F[ρ(x) + εδ(x − y)] is usually not even defined.

The definition given in a previous section is based on a relationship that holds for all test functions ϕ, so one might think that it should hold also when ϕ is chosen to be a specific function such as the delta function.

Notes

  1. ^ Called differential in (Parr & Yang 1989, p. 246), variation or first variation in (Courant & Hilbert 1953, p. 186), and variation or differential in (Gelfand & Fomin 2000, p. 11, § 3.2).
  2. ^ For a three-dimensional Cartesian coordinate system,
     $$\frac{\partial f}{\partial\nabla\rho} = \frac{\partial f}{\partial\rho_x}\,\hat{\boldsymbol{x}} + \frac{\partial f}{\partial\rho_y}\,\hat{\boldsymbol{y}} + \frac{\partial f}{\partial\rho_z}\,\hat{\boldsymbol{z}}, \qquad \text{where}\ \rho_x = \frac{\partial\rho}{\partial x},\ \text{etc.}$$
  3. ^ According to Giaquinta & Hildebrandt (1996, p. 18), this notation is customary in physical literature.


Thomas–Fermi model

The predecessor to density functional theory was the Thomas–Fermi model, developed independently by both Thomas and Fermi in 1927. They used a statistical model to approximate the distribution of electrons in an atom. The mathematical basis postulated that electrons are distributed uniformly in phase space, with two electrons in every h³ of volume.[13] For each element of coordinate space volume d³r we can fill out a sphere of momentum space up to the Fermi momentum pF.[14]


Kinetic energy

For a small volume element ΔV, and for the atom in its ground state, we can fill out a spherical momentum space volume VF up to the Fermi momentum pF, and thus,[15]

$$V_{\mathrm{F}} = \frac{4}{3}\pi\, p_{\mathrm{F}}^3(\boldsymbol{r}),$$

where r is a point in ΔV.

The corresponding phase space volume is

$$\Delta V_{\mathrm{ph}} = V_{\mathrm{F}}\,\Delta V = \frac{4}{3}\pi\, p_{\mathrm{F}}^3(\boldsymbol{r})\,\Delta V.$$

The electrons in ΔVph are distributed uniformly with two electrons per h3 of this phase space volume, where h is Planck's constant.[16] Then the number of electrons in ΔVph is

$$\Delta N_{\mathrm{ph}} = \frac{2}{h^3}\,\Delta V_{\mathrm{ph}} = \frac{8\pi}{3h^3}\, p_{\mathrm{F}}^3(\boldsymbol{r})\,\Delta V.$$

The number of electrons in ΔV is

$$\Delta N = \frac{8\pi}{3h^3}\, p_{\mathrm{F}}^3(\boldsymbol{r})\,\Delta V,$$

where ρ(r) = ΔN/ΔV = (8π/3h³) pF³(r) is the electron density.

The fraction of electrons at r that have momentum between p and p + dp is,

$$F_{\boldsymbol{r}}(p)\, dp = \frac{4\pi p^2\, dp}{\frac{4}{3}\pi\, p_{\mathrm{F}}^3(\boldsymbol{r})}, \qquad p \le p_{\mathrm{F}}(\boldsymbol{r}).$$

Using the classical expression for the kinetic energy of an electron with mass me, the kinetic energy per unit volume at r for the electrons of the atom is,

$$t(\boldsymbol{r}) = \int_0^{p_{\mathrm{F}}} \frac{p^2}{2m_e}\,\rho(\boldsymbol{r})\, F_{\boldsymbol{r}}(p)\, dp = C_{\mathrm{F}}\,\big[\rho(\boldsymbol{r})\big]^{5/3},$$

where a previous expression relating ρ(r) to pF(r) has been used and,

$$C_{\mathrm{F}} = \frac{3h^2}{40 m_e}\left(\frac{3}{\pi}\right)^{2/3}.$$

Integrating the kinetic energy per unit volume t(r) over all space, results in the total kinetic energy of the electrons,[17]

$$T_{\mathrm{TF}}[\rho] = C_{\mathrm{F}} \int \big[\rho(\boldsymbol{r})\big]^{5/3}\, d^3r.$$

This result shows that the total kinetic energy of the electrons can be expressed in terms of only the spatially varying electron density ρ(r) according to the Thomas–Fermi model. As such, they were able to calculate the energy of an atom using this expression for the kinetic energy combined with the classical expressions for the nuclear-electron and electron-electron interactions (which can both also be represented in terms of the electron density).
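
As a numerical illustration (a sketch in atomic units; the hydrogen 1s density is used as a test case and is not part of the original text), one can evaluate T_TF[ρ] on a radial grid and compare it with the exact kinetic energy of the hydrogen ground state, 0.5 hartree:

```python
import numpy as np

# Evaluate T_TF[ρ] = C_F ∫ ρ^(5/3) d³r (atomic units) for the hydrogen
# ground-state density ρ(r) = e^(−2r)/π and compare with the exact
# kinetic energy of 0.5 hartree.
C_F = (3.0 / 10.0) * (3.0 * np.pi ** 2) ** (2.0 / 3.0)
r, dr = np.linspace(1e-6, 30.0, 100000, retstep=True)
rho = np.exp(-2.0 * r) / np.pi

T_TF = C_F * np.sum(rho ** (5.0 / 3.0) * 4.0 * np.pi * r ** 2) * dr
print(T_TF)   # ≈ 0.29 hartree: the Thomas–Fermi value underestimates 0.5
```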

Potential energies

The potential energy of an atom's electrons, due to the electric attraction of the positively charged nucleus, is

$$U_{\mathrm{eN}}[\rho] = \int \rho(\boldsymbol{r})\, V_{\mathrm{N}}(\boldsymbol{r})\, d^3r,$$

where VN(r) is the potential energy of an electron at r that is due to the electric field of the nucleus. For the case of a nucleus centered at r = 0 with charge Ze, where Z is a positive integer and e is the elementary charge,

$$V_{\mathrm{N}}(\boldsymbol{r}) = -\frac{Ze^2}{|\boldsymbol{r}|}.$$

The potential energy of the electrons due to their mutual electric repulsion is,

$$U_{\mathrm{ee}}[\rho] = \frac{1}{2}\int\!\!\int \frac{e^2\,\rho(\boldsymbol{r})\,\rho(\boldsymbol{r}')}{|\boldsymbol{r} - \boldsymbol{r}'|}\, d^3r\, d^3r'.$$

Total energy

The total energy of the electrons is the sum of their kinetic and potential energies,[18]

$$E[\rho] = T_{\mathrm{TF}}[\rho] + U_{\mathrm{eN}}[\rho] + U_{\mathrm{ee}}[\rho] = C_{\mathrm{F}}\int \big[\rho(\boldsymbol{r})\big]^{5/3}\, d^3r + \int \rho(\boldsymbol{r})\, V_{\mathrm{N}}(\boldsymbol{r})\, d^3r + \frac{1}{2}\int\!\!\int \frac{e^2\,\rho(\boldsymbol{r})\,\rho(\boldsymbol{r}')}{|\boldsymbol{r} - \boldsymbol{r}'|}\, d^3r\, d^3r'.$$

Inaccuracies and improvements

Although this was an important first step, the Thomas–Fermi equation's accuracy is limited because the resulting expression for the kinetic energy is only approximate, and because the method does not attempt to represent the exchange energy of an atom as a consequence of the Pauli principle. A term for the exchange energy was added by Dirac in 1928.

However, the Thomas–Fermi–Dirac theory remained rather inaccurate for most applications. The largest source of error was in the representation of the kinetic energy, followed by the errors in the exchange energy and the complete neglect of electron correlation.

In 1962, Edward Teller showed that Thomas–Fermi theory cannot describe molecular bonding – the energy of any molecule calculated with TF theory is higher than the sum of the energies of the constituent atoms. More generally, the total energy of a molecule decreases when the bond lengths are uniformly increased.[19][20][21][22] This can be overcome by improving the expression for the kinetic energy.[23]

The Thomas–Fermi kinetic energy can be improved by adding to it the Weizsäcker (1935) correction,[24] which yields a much improved Thomas–Fermi–Dirac–Weizsäcker density functional theory (TFDW-DFT). This theory is roughly comparable to the Hartree and Hartree–Fock mean-field theories, which treat neither static electron correlation (treated by the CASSCF theory developed by Björn Roos' group in Lund, Sweden) nor dynamic correlation (treated by Møller–Plesset perturbation theory to second order (MP2), or by CASPT2, the extension of MP2 theory to systems not well treated by simple single-reference/configuration methods such as Hartree–Fock theory and Kohn–Sham DFT). Note that KS-DFT has also been extended to treat systems for which the ground electronic state is not well represented by a single Slater determinant of Hartree–Fock or "Kohn–Sham" orbitals, in the so-called CAS-DFT method, also developed in Björn Roos' group in Lund.

 



Pauli exclusion principle: connection to quantum state symmetry

The Pauli exclusion principle with a single-valued many-particle wavefunction is equivalent to requiring the wavefunction to be antisymmetric. An antisymmetric two-particle state is represented as a sum of states in which one particle is in state |x⟩ and the other in state |y⟩:

$$|\psi\rangle = \sum_{x,y} A(x, y)\, |x, y\rangle,$$

and antisymmetry under exchange means that

$$A(x, y) = -A(y, x).$$

This implies A(x,y) = 0 when x=y, which is Pauli exclusion. It is true in any basis, since unitary changes of basis keep antisymmetric matrices antisymmetric, although strictly speaking, the quantity A(x,y) is not a matrix but an antisymmetric rank-two tensor.

Conversely, if the diagonal quantities A(x,x) are zero in every basis, then the wavefunction component

$$A(x, y) = \langle x, y\,|\,\psi\rangle$$

is necessarily antisymmetric.

Quantum mechanical description of identical particles

Symmetrical and anti-symmetrical states

 
Antisymmetric wavefunction for a (fermionic) 2-particle state in an infinite square well potential.

Let us define a linear operator P, called the exchange operator. When it acts on a tensor product of two state vectors, it exchanges the values of the state vectors:

$$P\,\big(|\psi\rangle \otimes |\phi\rangle\big) = |\phi\rangle \otimes |\psi\rangle.$$

P is both Hermitian and unitary. Because it is unitary, we can regard it as a symmetry operator. We can describe this symmetry as the symmetry under the exchange of labels attached to the particles (i.e., to the single-particle Hilbert spaces).

Clearly, P² = 1 (the identity operator), so the eigenvalues of P are +1 and −1. The corresponding eigenvectors are the symmetric and antisymmetric states:

$$|\psi\phi\rangle_S = \frac{1}{\sqrt{2}}\,\big(|\psi\rangle\,|\phi\rangle + |\phi\rangle\,|\psi\rangle\big)$$
$$|\psi\phi\rangle_A = \frac{1}{\sqrt{2}}\,\big(|\psi\rangle\,|\phi\rangle - |\phi\rangle\,|\psi\rangle\big)$$

In other words, symmetric and antisymmetric states are essentially unchanged under the exchange of particle labels: they are only multiplied by a factor of +1 or −1, rather than being "rotated" somewhere else in the Hilbert space. This indicates that the particle labels have no physical meaning, in agreement with our earlier discussion on indistinguishability.

We have mentioned that P is Hermitian. As a result, it can be regarded as an observable of the system, which means that we can, in principle, perform a measurement to find out if a state is symmetric or antisymmetric. Furthermore, the equivalence of the particles indicates that the Hamiltonian can be written in a symmetrical form, such as

$$H = \frac{\boldsymbol{p}_1^2}{2m} + \frac{\boldsymbol{p}_2^2}{2m} + U\big(|\boldsymbol{x}_1 - \boldsymbol{x}_2|\big) + V(\boldsymbol{x}_1) + V(\boldsymbol{x}_2).$$

It is possible to show that such Hamiltonians satisfy the commutation relation

$$[P, H] = 0.$$

According to the Heisenberg equation, this means that the value of P is a constant of motion. If the quantum state is initially symmetric (antisymmetric), it will remain symmetric (antisymmetric) as the system evolves. Mathematically, this says that the state vector is confined to one of the two eigenspaces of P, and is not allowed to range over the entire Hilbert space. Thus, we might as well treat that eigenspace as the actual Hilbert space of the system. This is the idea behind the definition of Fock space.
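
These statements are easy to verify in a finite-dimensional model. The sketch below (the dimension and the random matrices are arbitrary stand-ins, not from the text) builds P on a d²-dimensional two-particle space and checks P² = 1, the ±1 spectrum, and [P, H] = 0 for an exchange-symmetric H:

```python
import numpy as np

# Exchange operator P on a two-particle product space of single-particle
# dimension d:  P (e_i ⊗ e_j) = e_j ⊗ e_i.
d = 3
P = np.zeros((d * d, d * d))
for i in range(d):
    for j in range(d):
        P[j * d + i, i * d + j] = 1.0          # swap the tensor factors

print(np.allclose(P @ P, np.eye(d * d)))        # P² = 1
print(sorted(set(np.round(np.linalg.eigvalsh(P), 6))))  # eigenvalues ±1

# A symmetric two-particle Hamiltonian commutes with P:
h = np.random.rand(d, d); h = h + h.T           # single-particle part
u = np.random.rand(d * d, d * d); u = u + u.T   # interaction term
U = 0.5 * (u + P @ u @ P)                       # symmetrize under exchange
H = np.kron(h, np.eye(d)) + np.kron(np.eye(d), h) + U
print(np.allclose(P @ H - H @ P, 0.0))          # [P, H] = 0
```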

 
Symmetric wavefunction for a (bosonic) 2-particle state in an infinite square well potential.

We will now make the above discussion concrete, using the formalism developed in the article on the mathematical formulation of quantum mechanics.

Let n denote a complete set of (discrete) quantum numbers for specifying single-particle states (for example, for the particle in a box problem we can take n to be the quantized wave vector of the wavefunction.) For simplicity, consider a system composed of two identical particles. Suppose that one particle is in the state n1, and another is in the state n2. What is the quantum state of the system? Intuitively, it should be

$$|n_1\rangle\, |n_2\rangle,$$

which is simply the canonical way of constructing a basis for a tensor product space H ⊗ H of the combined system from the individual spaces. However, this expression implies the ability to identify the particle with n1 as "particle 1" and the particle with n2 as "particle 2". If the particles are indistinguishable, this is impossible by definition; either particle can be in either state. It turns out that we must have:[25]

$$\frac{1}{\sqrt{2}}\,\big(|n_1\rangle\,|n_2\rangle \pm |n_2\rangle\,|n_1\rangle\big).$$

To see this, imagine a system of two identical particles. Suppose we know that one of the particles is in state |n₁⟩ and the other is in state |n₂⟩. Prior to the measurement, there is no way to know whether particle 1 is in state |n₁⟩ and particle 2 in state |n₂⟩, or the other way around, because the particles are indistinguishable. There are therefore equal probabilities for each of the possibilities, meaning that the system is in a superposition of both states prior to the measurement.

States where this is a sum are known as symmetric; states involving the difference are called antisymmetric. More completely, symmetric states have the form

$$|n_1, n_2; S\rangle = \text{constant} \times \big(|n_1\rangle\,|n_2\rangle + |n_2\rangle\,|n_1\rangle\big),$$

while antisymmetric states have the form

$$|n_1, n_2; A\rangle = \text{constant} \times \big(|n_1\rangle\,|n_2\rangle - |n_2\rangle\,|n_1\rangle\big).$$

Note that if n1 and n2 are the same, the antisymmetric expression gives zero, which cannot be a state vector as it cannot be normalized. In other words, in an antisymmetric state two identical particles cannot occupy the same single-particle states. This is known as the Pauli exclusion principle, and it is the fundamental reason behind the chemical properties of atoms and the stability of matter.

Exchange symmetry

The importance of symmetric and antisymmetric states is ultimately based on empirical evidence. It appears to be a fact of nature that identical particles do not occupy states of a mixed symmetry, such as

$$|\psi\rangle = \alpha\,|n_1\rangle\,|n_2\rangle + \beta\,|n_2\rangle\,|n_1\rangle, \qquad |\alpha| \neq |\beta|.$$

There is actually an exception to this rule, which we will discuss later. On the other hand, we can show that the symmetric and antisymmetric states are in a sense special, by examining a particular symmetry of the multiple-particle states known as exchange symmetry.

N particles

The above discussion generalizes readily to the case of N particles. Suppose we have N particles with quantum numbers n1, n2, ..., nN. If the particles are bosons, they occupy a totally symmetric state, which is symmetric under the exchange of any two particle labels:

$$|n_1 n_2 \cdots n_N; S\rangle = \sqrt{\frac{\prod_j n_j!}{N!}}\; \sum_p |n_{p(1)}\rangle\, |n_{p(2)}\rangle \cdots |n_{p(N)}\rangle.$$

Here, the sum is taken over all different states under permutation p of the N elements. The square root left to the sum is a normalizing constant. The quantity nj stands for the number of times each of the single-particle states appears in the N-particle state. In the following matrix each row represents one permutation of N elements.

 

If we choose the first row as a reference, the next   rows imply one permutation, the next   rows imply two permutations, and so on. So the number of rows with k permutations with regard to the first row would be  .

In the same vein, fermions occupy totally antisymmetric states:

$$|n_1 n_2 \cdots n_N; A\rangle = \frac{1}{\sqrt{N!}}\; \sum_p \mathrm{sgn}(p)\, |n_{p(1)}\rangle\, |n_{p(2)}\rangle \cdots |n_{p(N)}\rangle.$$

Here, sgn(p) is the signature of each permutation (i.e. +1 if p is composed of an even number of transpositions, and −1 if odd.) Note that we have omitted the ∏j nj! term, because each single-particle state can appear only once in a fermionic state. Otherwise the sum would again be zero due to the antisymmetry, thus representing a physically impossible state. This is the Pauli exclusion principle for many particles.

These states have been normalized so that

$$\langle n_1 n_2 \cdots n_N; S\,|\,n_1 n_2 \cdots n_N; S\rangle = 1, \qquad \langle n_1 n_2 \cdots n_N; A\,|\,n_1 n_2 \cdots n_N; A\rangle = 1.$$

Measurements of identical particles

Suppose we have a system of N bosons (fermions) in the symmetric (antisymmetric) state

$$|n_1 n_2 \cdots n_N; S/A\rangle,$$

and we perform a measurement of some other set of discrete observables, m. In general, this would yield some result m1 for one particle, m2 for another particle, and so forth. If the particles are bosons (fermions), the state after the measurement must remain symmetric (antisymmetric), i.e.

$$|m_1 m_2 \cdots m_N; S/A\rangle.$$

The probability of obtaining a particular result for the m measurement is

$$P_{S/A}(m_1, \ldots, m_N) \equiv \big|\langle m_1 m_2 \cdots m_N; S/A\,|\,n_1 n_2 \cdots n_N; S/A\rangle\big|^2.$$

We can show that

$$\sum_{m_1 \le m_2 \le \cdots \le m_N} P_{S/A}(m_1, \ldots, m_N) = 1,$$

which verifies that the total probability is 1. Note that we have to restrict the sum to ordered values of m1, ..., mN to ensure that we do not count each multi-particle state more than once.

Wavefunction representation

So far, we have worked with discrete observables. We will now extend the discussion to continuous observables, such as the position x.

Recall that an eigenstate of a continuous observable represents an infinitesimal range of values of the observable, not a single value as with discrete observables. For instance, if a particle is in a state |ψ⟩, the probability of finding it in a region of volume d3x surrounding some position x is

$$|\langle \boldsymbol{x}\,|\,\psi\rangle|^2\, d^3x.$$

As a result, the continuous eigenstates |x⟩ are normalized to the delta function instead of unity:

$$\langle \boldsymbol{x}\,|\,\boldsymbol{x}'\rangle = \delta^3(\boldsymbol{x} - \boldsymbol{x}').$$

We can construct symmetric and antisymmetric multi-particle states out of continuous eigenstates in the same way as before. However, it is customary to use a different normalizing constant:

$$|x_1 x_2 \cdots x_N; S\rangle = \frac{1}{\sqrt{N!}} \sum_p |x_{p(1)}\rangle\, |x_{p(2)}\rangle \cdots |x_{p(N)}\rangle$$
$$|x_1 x_2 \cdots x_N; A\rangle = \frac{1}{\sqrt{N!}} \sum_p \mathrm{sgn}(p)\, |x_{p(1)}\rangle\, |x_{p(2)}\rangle \cdots |x_{p(N)}\rangle$$

We can then write a many-body wavefunction,

$$\Psi^{(S)}_{n_1 n_2 \cdots n_N}(x_1, x_2, \ldots, x_N) \equiv \langle x_1 x_2 \cdots x_N; S\,|\,n_1 n_2 \cdots n_N; S\rangle$$
$$\Psi^{(A)}_{n_1 n_2 \cdots n_N}(x_1, x_2, \ldots, x_N) \equiv \langle x_1 x_2 \cdots x_N; A\,|\,n_1 n_2 \cdots n_N; A\rangle$$

where the single-particle wavefunctions are defined, as usual, by

$$\psi_n(x) = \langle x\,|\,n\rangle.$$

The most important property of these wavefunctions is that exchanging any two of the coordinate variables changes the wavefunction by only a plus or minus sign. This is the manifestation of symmetry and antisymmetry in the wavefunction representation:

$$\Psi^{(S)}(\ldots, x_i, \ldots, x_j, \ldots) = \Psi^{(S)}(\ldots, x_j, \ldots, x_i, \ldots)$$
$$\Psi^{(A)}(\ldots, x_i, \ldots, x_j, \ldots) = -\,\Psi^{(A)}(\ldots, x_j, \ldots, x_i, \ldots)$$

The many-body wavefunction has the following significance: if the system is initially in a state with quantum numbers n1, ..., nN, and we perform a position measurement, the probability of finding particles in infinitesimal volumes near x1, x2, ..., xN is

$$N!\; \big|\Psi_{n_1 \cdots n_N}(x_1, \ldots, x_N)\big|^2\; d^3x_1\, d^3x_2 \cdots d^3x_N.$$

The factor of N! comes from our normalizing constant, which has been chosen so that, by analogy with single-particle wavefunctions,

$$\int\!\!\int\!\cdots\!\int \big|\Psi_{n_1 \cdots n_N}(x_1, \ldots, x_N)\big|^2\; d^3x_1\, d^3x_2 \cdots d^3x_N = 1.$$

Because each integral runs over all possible values of x, each multi-particle state appears N! times in the integral. In other words, the probability associated with each event is evenly distributed across N! equivalent points in the integral space. Because it is usually more convenient to work with unrestricted integrals than restricted ones, we have chosen our normalizing constant to reflect this.

Finally, it is interesting to note that the antisymmetric wavefunction can be written as the determinant of a matrix, known as a Slater determinant:

$$\Psi^{(A)}_{n_1 \cdots n_N}(x_1, \ldots, x_N) = \frac{1}{\sqrt{N!}}
\begin{vmatrix}
\psi_{n_1}(x_1) & \psi_{n_1}(x_2) & \cdots & \psi_{n_1}(x_N) \\
\psi_{n_2}(x_1) & \psi_{n_2}(x_2) & \cdots & \psi_{n_2}(x_N) \\
\vdots & \vdots & \ddots & \vdots \\
\psi_{n_N}(x_1) & \psi_{n_N}(x_2) & \cdots & \psi_{n_N}(x_N)
\end{vmatrix}$$

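The determinant form makes the antisymmetry mechanical: exchanging two coordinates exchanges two columns and flips the sign, and a repeated coordinate gives two equal columns and hence zero. A minimal sketch (the 1D Hermite–Gaussian orbitals are a hypothetical choice, not from the text):

```python
import numpy as np
from math import factorial, sqrt, exp

def orbital(n, x):
    # first three (unnormalized) Hermite–Gaussian orbitals — a toy choice
    return [1.0, 2.0 * x, 4.0 * x * x - 2.0][n] * exp(-x * x / 2.0)

def psi_A(xs, ns=(0, 1, 2)):
    # Slater determinant: rows are orbitals, columns are particle coordinates
    M = np.array([[orbital(n, x) for x in xs] for n in ns])
    return np.linalg.det(M) / sqrt(factorial(len(ns)))

print(psi_A([0.1, 0.5, -0.3]))
print(psi_A([0.5, 0.1, -0.3]))   # coordinates exchanged: opposite sign
print(psi_A([0.4, 0.4, -0.3]))   # repeated coordinate: zero (Pauli)
```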

Hartree–Fock (HF)

Hartree–Fock algorithm

The Hartree–Fock method is typically used to solve the time-independent Schrödinger equation for a multi-electron atom or molecule as described in the w:Born–Oppenheimer approximation. Since there are no known solutions for many-electron systems (hydrogenic atoms and the diatomic hydrogen cation being notable one-electron exceptions), the problem is solved numerically. Due to the nonlinearities introduced by the Hartree–Fock approximation, the equations are solved using a nonlinear method such as w:iteration, which gives rise to the name "self-consistent field method."

 
Greatly simplified algorithmic flowchart illustrating the Hartree–Fock method

Approximations

The Hartree–Fock method makes five major simplifications in order to deal with this task:

  • The w:Born–Oppenheimer approximation is inherently assumed. The full molecular wave function is actually a function of the coordinates of each of the nuclei, in addition to those of the electrons.
  • Typically, relativistic effects are completely neglected. The momentum operator is assumed to be completely non-relativistic.
  • The variational solution is assumed to be a w:linear combination of a finite number of basis functions, which are usually (but not always) chosen to be w:orthogonal. The finite basis set is assumed to be approximately complete.
  • Each w:energy eigenfunction is assumed to be describable by a single w:Slater determinant, an antisymmetrized product of one-electron wave functions (i.e., orbitals).
  • The mean field approximation is implied. Effects arising from deviations from this assumption, known as w:electron correlation, are completely neglected for the electrons of opposite spin, but are taken into account for electrons of parallel spin.[26][27] (Electron correlation should not be confused with electron exchange, which is fully accounted for in the Hartree–Fock method.)[27]

Relaxation of the last two approximations gives rise to many so-called w:post-Hartree–Fock methods.

The Fock operator

Because the electron-electron repulsion term of the w:electronic molecular Hamiltonian involves the coordinates of two different electrons, it is necessary to reformulate it in an approximate way. Under this approximation (outlined under Hartree–Fock algorithm), all of the terms of the exact Hamiltonian except the nuclear-nuclear repulsion term are re-expressed as the sum of one-electron operators outlined below, for closed-shell atoms or molecules (with two electrons in each spatial orbital).[28] The "(1)" following each operator symbol simply indicates that the operator is 1-electron in nature.

$$\hat{F}(1) = \hat{H}^{\mathrm{core}}(1) + \sum_{j=1}^{n/2}\big[2\hat{J}_j(1) - \hat{K}_j(1)\big],$$

where F̂(1) is the one-electron Fock operator generated by the orbitals ϕj, and

$$\hat{H}^{\mathrm{core}}(1) = -\frac{1}{2}\nabla_1^2 - \sum_\alpha \frac{Z_\alpha}{r_{1\alpha}}$$

is the one-electron core Hamiltonian (in atomic units). Also, Ĵj(1) is the w:Coulomb operator,

defining the electron-electron repulsion energy due to each of the two electrons in the jth orbital.[28]

Finally, K̂j(1) is the w:exchange operator, defining the electron exchange energy due to the antisymmetry of the total n-electron wave function.[28]

This so-called "exchange energy" operator, K, is simply an artifact of the Slater determinant.

Finding the Hartree–Fock one-electron wave functions is now equivalent to solving the eigenfunction equation

$$\hat{F}(1)\,\phi_i(1) = \varepsilon_i\,\phi_i(1),$$

where ϕi(1) are a set of one-electron wave functions, called the Hartree–Fock molecular orbitals.

Fock matrix

In the w:Hartree–Fock method of w:quantum mechanics, the Fock matrix is a matrix approximating the single-electron w:energy operator of a given quantum system in a given set of basis vectors.[29]

It is most often formed in w:computational chemistry when attempting to solve the w:Roothaan equations for an atomic or molecular system. The Fock matrix is actually an approximation to the true Hamiltonian operator of the quantum system. It includes the effects of electron-electron repulsion only in an average way. Importantly, because the Fock operator is a one-electron operator, it does not include the w:electron correlation energy.

The Fock matrix is defined by the Fock operator. For the restricted case which assumes w:closed-shell orbitals and single-determinantal wavefunctions, the Fock operator for the i-th electron is given by:[30]

$$\hat{F}(i) = \hat{h}(i) + \sum_{j=1}^{n/2}\big[2\hat{J}_j(i) - \hat{K}_j(i)\big],$$

where:

  F̂(i) is the Fock operator for the i-th electron in the system,
  ĥ(i) is the w:one-electron hamiltonian for the i-th electron,
  n is the number of electrons and n/2 is the number of occupied orbitals in the closed-shell system,
  Ĵj(i) is the w:Coulomb operator, defining the repulsive force between the j-th and i-th electrons in the system,
  K̂j(i) is the w:exchange operator, defining the quantum effect produced by exchanging two electrons.

The Coulomb operator is multiplied by two since there are two electrons in each occupied orbital. The exchange operator is not multiplied by two since it has a non-zero result only for electrons which have the same spin as the i-th electron.

For systems with unpaired electrons there are many choices of Fock matrices.
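
In a basis-set calculation these operators become the matrix F = Hcore + 2J − K contracted from the density matrix and the two-electron integrals. The following sketch is an illustration only: the symmetric random arrays stand in for integrals that a real program would obtain from an integrals package.

```python
import numpy as np

# Closed-shell Fock build F = Hcore + 2J − K.  Hcore and the two-electron
# integrals eri[p,q,r,s] ≈ (pq|rs) are random symmetric stand-ins here.
nbf, nocc = 4, 1
rng = np.random.default_rng(0)

Hcore = rng.standard_normal((nbf, nbf))
Hcore = 0.5 * (Hcore + Hcore.T)

eri = rng.standard_normal((nbf,) * 4)
perms = [(0,1,2,3), (1,0,2,3), (0,1,3,2), (1,0,3,2),
         (2,3,0,1), (3,2,0,1), (2,3,1,0), (3,2,1,0)]
eri = sum(eri.transpose(p) for p in perms) / 8.0  # 8-fold symmetry of (pq|rs)

C = np.linalg.eigh(Hcore)[1][:, :nocc]    # guess orbitals from Hcore
D = C @ C.T                               # closed-shell density matrix

J = np.einsum('pqrs,rs->pq', eri, D)      # Coulomb matrix
K = np.einsum('prqs,rs->pq', eri, D)      # exchange matrix
F = Hcore + 2.0 * J - K
print(F)
```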

Linear combination of atomic orbitals

Typically, in modern Hartree–Fock calculations, the one-electron wave functions are approximated by a w:linear combination of atomic orbitals. These atomic orbitals are called w:Slater-type orbitals. Furthermore, it is very common for the "atomic orbitals" in use to actually be composed of a linear combination of one or more Gaussian-type orbitals, rather than Slater-type orbitals, in the interests of saving large amounts of computation time.

Various basis sets are used in practice, most of which are composed of Gaussian functions. In some applications, an orthogonalization method such as the w:Gram–Schmidt process is performed in order to produce a set of orthogonal basis functions. This can in principle save computational time when the computer is solving the Roothaan–Hall equations by converting the w:overlap matrix effectively to an w:identity matrix. However, in most modern computer programs for molecular Hartree–Fock calculations this procedure is not followed due to the high numerical cost of orthogonalization and the advent of more efficient, often sparse, algorithms for solving the w:generalized eigenvalue problem, of which the Roothaan–Hall equations are an example.
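
The two routes discussed here can be compared directly: solving the generalized eigenvalue problem FC = SCε as-is, or orthogonalizing first. The sketch below uses random symmetric stand-ins for F and S, and symmetric Löwdin orthogonalization in place of the Gram–Schmidt process mentioned above; both give the same orbital energies:

```python
import numpy as np
from scipy.linalg import eigh

# Two treatments of the overlap matrix S in the Roothaan–Hall equations
# FC = SCε.  F and S are random symmetric stand-ins (S positive definite,
# as a real overlap matrix is).
rng = np.random.default_rng(1)
n = 5
F = rng.standard_normal((n, n)); F = 0.5 * (F + F.T)
A = rng.standard_normal((n, n)); S = A @ A.T + n * np.eye(n)

# 1) solve the generalized eigenvalue problem directly
e1 = eigh(F, S)[0]

# 2) symmetric (Löwdin) orthogonalization X = S^(−1/2); the overlap
#    becomes the identity in the new basis
w, V = np.linalg.eigh(S)
X = V @ np.diag(w ** -0.5) @ V.T
e2 = np.linalg.eigvalsh(X @ F @ X)

print(np.allclose(e1, e2))   # True: identical orbital energies
```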

DFT derivation and formalism

As usual in many-body electronic structure calculations, the nuclei of the treated molecules or clusters are seen as fixed (the Born–Oppenheimer approximation), generating a static external potential V in which the electrons are moving. A stationary electronic state is then described by a wavefunction Ψ(r₁, …, r_N) satisfying the many-electron time-independent Schrödinger equation

$$\hat{H}\,\Psi = \big[\hat{T} + \hat{V} + \hat{U}\big]\Psi = \left[-\sum_{i=1}^{N}\frac{\hbar^2}{2m_i}\nabla_i^2 + \sum_{i=1}^{N} V(\boldsymbol{r}_i) + \sum_{i<j} U(\boldsymbol{r}_i, \boldsymbol{r}_j)\right]\Psi = E\,\Psi,$$

where, for the N-electron system, Ĥ is the Hamiltonian, E is the total energy, T̂ is the kinetic energy, V̂ is the potential energy from the external field due to positively charged nuclei, and Û is the electron-electron interaction energy. The operators T̂ and Û are called universal operators as they are the same for any N-electron system, while V̂ is system dependent. This complicated many-particle equation is not separable into simpler single-particle equations because of the interaction term Û.

There are many sophisticated methods for solving the many-body Schrödinger equation based on the expansion of the wavefunction in Slater determinants. While the simplest one is the Hartree–Fock method, more sophisticated approaches are usually categorized as post-Hartree–Fock methods. However, the problem with these methods is the huge computational effort, which makes it virtually impossible to apply them efficiently to larger, more complex systems.

Here DFT provides an appealing alternative, being much more versatile as it provides a way to systematically map the many-body problem, with Û, onto a single-body problem without Û. In DFT the key variable is the particle density n(r), which for a normalized Ψ is given by

$$n(\boldsymbol{r}) = N \int d^3r_2 \int d^3r_3 \cdots \int d^3r_N\; \Psi^*(\boldsymbol{r}, \boldsymbol{r}_2, \ldots, \boldsymbol{r}_N)\, \Psi(\boldsymbol{r}, \boldsymbol{r}_2, \ldots, \boldsymbol{r}_N).$$

This relation can be reversed, i.e. for a given ground-state density n₀(r) it is possible, in principle, to calculate the corresponding ground-state wavefunction Ψ₀. In other words, Ψ₀ is a unique functional of n₀,[31]

$$\Psi_0 = \Psi[n_0],$$

and consequently the ground-state expectation value of an observable Ô is also a functional of n₀:

$$O[n_0] = \langle \Psi[n_0]\,|\,\hat{O}\,|\,\Psi[n_0]\rangle.$$

In particular, the ground-state energy is a functional of n₀:

$$E_0 = E[n_0] = \langle \Psi[n_0]\,|\,\hat{T} + \hat{V} + \hat{U}\,|\,\Psi[n_0]\rangle,$$

where the contribution of the external potential ⟨Ψ[n₀]|V̂|Ψ[n₀]⟩ can be written explicitly in terms of the ground-state density n₀:

$$V[n_0] = \int V(\boldsymbol{r})\, n_0(\boldsymbol{r})\, d^3r.$$

More generally, the contribution of the external potential ⟨Ψ|V̂|Ψ⟩ can be written explicitly in terms of the density n:

$$V[n] = \int V(\boldsymbol{r})\, n(\boldsymbol{r})\, d^3r.$$

The functionals T[n] and U[n] are called universal functionals, while V[n] is called a non-universal functional, as it depends on the system under study. Having specified a system, i.e., having specified V̂, one then has to minimize the functional

$$E[n] = T[n] + U[n] + \int V(\boldsymbol{r})\, n(\boldsymbol{r})\, d^3r$$

with respect to n(r), assuming one has reliable expressions for T[n] and U[n]. A successful minimization of the energy functional will yield the ground-state density n₀ and thus all other ground-state observables.

The variational problem of minimizing the energy functional E[n] can be solved by applying the Lagrangian method of undetermined multipliers.[32] First, one considers an energy functional that does not explicitly have an electron-electron interaction energy term,

$$E_s[n] = \langle \Psi_s[n]\,|\,\hat{T} + \hat{V}_s\,|\,\Psi_s[n]\rangle,$$

where T̂ denotes the kinetic energy operator and V̂_s is an external effective potential in which the particles are moving, so that n_s(r) = n(r).

Thus, one can solve the so-called Kohn–Sham equations of this auxiliary non-interacting system,

$$\left[-\frac{\hbar^2}{2m}\nabla^2 + V_s(\boldsymbol{r})\right]\varphi_i(\boldsymbol{r}) = \varepsilon_i\,\varphi_i(\boldsymbol{r}),$$

which yields the orbitals φ_i that reproduce the density n(r) of the original many-body system

$$n(\boldsymbol{r}) = n_s(\boldsymbol{r}) = \sum_{i=1}^{N} |\varphi_i(\boldsymbol{r})|^2.$$

The effective single-particle potential can be written in more detail as

$$V_s(\boldsymbol{r}) = V(\boldsymbol{r}) + \int \frac{e^2\, n_s(\boldsymbol{r}')}{|\boldsymbol{r} - \boldsymbol{r}'|}\, d^3r' + V_{\mathrm{XC}}\big[n_s(\boldsymbol{r})\big],$$

where the second term denotes the so-called Hartree term describing the electron-electron Coulomb repulsion, while the last term V_XC is called the exchange-correlation potential. Here, V_XC includes all the many-particle interactions. Since the Hartree term and V_XC depend on n(r), which depends on the φ_i, which in turn depend on V_s, the problem of solving the Kohn–Sham equation has to be done in a self-consistent (i.e., iterative) way. Usually one starts with an initial guess for n(r), then calculates the corresponding V_s and solves the Kohn–Sham equations for the φ_i. From these one calculates a new density and starts again. This procedure is then repeated until convergence is reached. A non-iterative approximate formulation called Harris functional DFT is an alternative approach to this.
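
The structure of this self-consistency loop is easy to exhibit in a toy model. In the sketch below every ingredient is an illustrative assumption, not from the text: a 1D grid, a harmonic external potential, a soft-Coulomb Hartree kernel, two electrons in one doubly occupied orbital, and no exchange-correlation term. The loop iterates density → potential → orbitals → density until converged:

```python
import numpy as np

# Toy 1D "Kohn–Sham-like" self-consistency loop (all choices illustrative).
n_grid = 201
x, dx = np.linspace(-8.0, 8.0, n_grid, retstep=True)
V_ext = 0.5 * x ** 2                                   # external potential
kernel = 1.0 / np.sqrt((x[:, None] - x[None, :]) ** 2 + 1.0)  # soft Coulomb

# kinetic energy by second-order finite differences
T = (2.0 * np.eye(n_grid) - np.eye(n_grid, k=1) - np.eye(n_grid, k=-1)) \
    / (2.0 * dx ** 2)

n = np.exp(-x ** 2); n *= 2.0 / (n.sum() * dx)         # guess, ∫n dx = 2
for it in range(200):
    V_H = kernel @ n * dx                              # Hartree potential
    H = T + np.diag(V_ext + V_H)                       # effective Hamiltonian
    e_vals, phi = np.linalg.eigh(H)
    phi0 = phi[:, 0] / np.sqrt(dx)                     # normalize ∫|φ|² = 1
    n_new = 2.0 * phi0 ** 2                            # doubly occupied
    if np.max(np.abs(n_new - n)) < 1e-8:
        break                                          # self-consistent
    n = 0.5 * n + 0.5 * n_new                          # linear mixing
print(it, e_vals[0])
```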

NOTE: The one-to-one correspondence between the electron density and the single-particle potential is not smooth. It contains kinds of non-analytic structure, and V_s can contain kinds of singularities. This may indicate a limitation of our hope for representing the exchange-correlation functional in a simple form.

Approximations (exchange-correlation functionals)

The major problem with DFT is that the exact functionals for exchange and correlation are not known except for the free electron gas. However, approximations exist which permit the calculation of certain physical quantities quite accurately. In physics the most widely used approximation is the local-density approximation (LDA), where the functional depends only on the density at the coordinate where the functional is evaluated:

$$E_{\mathrm{XC}}^{\mathrm{LDA}}[n] = \int \varepsilon_{\mathrm{XC}}(n)\, n(\boldsymbol{r})\, d^3r.$$

The local spin-density approximation (LSDA) is a straightforward generalization of the LDA to include electron spin:

$$E_{\mathrm{XC}}^{\mathrm{LSDA}}[n_\uparrow, n_\downarrow] = \int \varepsilon_{\mathrm{XC}}(n_\uparrow, n_\downarrow)\, n(\boldsymbol{r})\, d^3r.$$

Highly accurate formulae for the exchange-correlation energy density ε_XC(n↑, n↓) have been constructed from quantum Monte Carlo simulations of jellium.[33]
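
As a concrete example of an LDA ingredient, the exchange part has the closed form of Dirac exchange (atomic units): E_x^LDA[n] = −(3/4)(3/π)^(1/3) ∫ n^(4/3) d³r, with functional derivative v_x(n) = −(3/π)^(1/3) n^(1/3). The sketch below (grid and density profile are arbitrary; correlation, which would come from a jellium fit, is omitted) checks the derivative by finite differences:

```python
import numpy as np

# Dirac (LDA) exchange: E_x[n] = −C_x ∫ n^(4/3),  v_x = −(3/π)^(1/3) n^(1/3).
Cx = (3.0 / 4.0) * (3.0 / np.pi) ** (1.0 / 3.0)
x, dx = np.linspace(0.0, 1.0, 1001, retstep=True)
n = 1.0 + 0.3 * np.cos(2.0 * np.pi * x)          # arbitrary smooth density

Ex = lambda d: -Cx * np.sum(d ** (4.0 / 3.0)) * dx
vx = -(3.0 / np.pi) ** (1.0 / 3.0) * n ** (1.0 / 3.0)

eps = 1e-6
phi = np.sin(np.pi * x) ** 2                     # arbitrary test function
lhs = (Ex(n + eps * phi) - Ex(n)) / eps          # d/dε E_x[n + εϕ]
rhs = np.sum(vx * phi) * dx                      # ∫ v_x ϕ dx
print(lhs, rhs)                                  # agree
```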

Generalized gradient approximations (GGA) are still local but also take into account the gradient of the density at the same coordinate:

$$E_{\mathrm{XC}}^{\mathrm{GGA}}[n_\uparrow, n_\downarrow] = \int \varepsilon_{\mathrm{XC}}(n_\uparrow, n_\downarrow, \nabla n_\uparrow, \nabla n_\downarrow)\, n(\boldsymbol{r})\, d^3r.$$

Using the latter (GGA), very good results for molecular geometries and ground-state energies have been achieved.

Potentially more accurate than the GGA functionals are the meta-GGA functionals, a natural development after the GGA (generalized gradient approximation). In its original form, the meta-GGA DFT functional includes the second derivative of the electron density (the Laplacian), whereas the GGA includes only the density and its first derivative in the exchange-correlation potential.

Functionals of this type are, for example, TPSS and the Minnesota Functionals. These functionals include a further term in the expansion, depending on the density, the gradient of the density and the Laplacian (second derivative) of the density.

Difficulties in expressing the exchange part of the energy can be relieved by including a component of the exact exchange energy calculated from Hartree–Fock theory. Functionals of this type are known as hybrid functionals.

Hohenberg–Kohn theorems

1. If two systems of electrons, one trapped in a potential V₁(r) and the other in V₂(r), have the same ground-state density n(r), then V₁(r) − V₂(r) is necessarily a constant.

Corollary: the ground state density uniquely determines the potential and thus all properties of the system, including the many-body wave function. In particular, the "HK" functional, defined as F[n] = T[n] + U[n], is a universal functional of the density (not depending explicitly on the external potential).

2. For any positive integer N and potential V(r), there exists a density functional F[n] such that E(V,N)[n] = F[n] + ∫ V(r) n(r) d³r obtains its minimal value at the ground-state density of N electrons in the potential V(r). The minimal value of E(V,N)[n] is then the ground state energy of this system.

Pseudo-potentials

The many-electron Schrödinger equation can be very much simplified if electrons are divided into two groups: valence electrons and inner core electrons. The electrons in the inner shells are strongly bound and do not play a significant role in the chemical binding of atoms; they also partially screen the nucleus, thus forming with the nucleus an almost inert core. Binding properties are almost completely due to the valence electrons, especially in metals and semiconductors. This separation suggests that inner electrons can be ignored in a large number of cases, thereby reducing the atom to an ionic core that interacts with the valence electrons. The use of an effective interaction, a pseudopotential, that approximates the potential felt by the valence electrons, was first proposed by Fermi in 1934 and Hellmann in 1935. In spite of the simplification pseudo-potentials introduce in calculations, they remained forgotten until the late 1950s.

Ab initio Pseudo-potentials

A crucial step toward more realistic pseudo-potentials was taken by Topp and Hopfield and more recently Cronin, who suggested that the pseudo-potential should be adjusted such that it describes the valence charge density accurately. Based on that idea, modern pseudo-potentials are obtained by inverting the free-atom Schrödinger equation for a given reference electronic configuration and forcing the pseudo wave-functions to coincide with the true valence wave-functions beyond a certain distance r_l. The pseudo wave-functions are also forced to have the same norm as the true valence wave-functions and can be written as

$$R_l^{\mathrm{PP}}(r) = R_{nl}^{\mathrm{AE}}(r) \qquad \text{for } r > r_l,$$
$$\int_0^{r_l} \big|R_l^{\mathrm{PP}}(r)\big|^2\, r^2\, dr = \int_0^{r_l} \big|R_{nl}^{\mathrm{AE}}(r)\big|^2\, r^2\, dr,$$

where R_l(r) is the radial part of the wavefunction with angular momentum l, and PP and AE denote, respectively, the pseudo wave-function and the true (all-electron) wave-function. The index n in the true wave-functions denotes the valence level. The distance beyond which the true and the pseudo wave-functions are equal, r_l, is also l-dependent.

  1. ^ (Parr & Yang 1989, p. 246, Eq. A.2).
  2. ^ (Parr & Yang 1989, p. 246, Eq. A.1).
  3. ^ a b (Parr & Yang 1989, p. 246).
  4. ^ (Parr & Yang 1989, p. 247, Eq. A.3).
  5. ^ (Parr & Yang 1989, p. 247, Eq. A.4).
  6. ^ (Greiner & Reinhardt 1996, p. 38, Eq. 7).
  7. ^ (Parr & Yang 1989, p. 251, Eq. A.34).
  8. ^ (Parr & Yang 1989, p. 247, Eq. A.6).
  9. ^ (Parr & Yang 1989, p. 248, Eq. A.11).
  10. ^ (Parr & Yang 1989, p. 247, Eq. A.9).
  11. ^ (Giaquinta & Hildebrandt 1996, p. 18)
  12. ^ (Gelfand & Fomin 2000, p. 28)
  13. ^ (Parr & Yang 1989, p. 47)
  14. ^ March, N. H. (1992). Electron Density Theory of Atoms and Molecules. Academic Press. p. 24. ISBN 0-12-470525-1.
  15. ^ March 1992, p.24
  16. ^ Parr and Yang 1989, p.47
  17. ^ March 1983, p. 5, Eq. 11
  18. ^ March 1983, p. 6, Eq. 15
  19. ^ Teller, E. (1962). "On the Stability of molecules in the Thomas–Fermi theory". Rev. Mod. Phys. 34 (4): 627–631. Bibcode:1962RvMP...34..627T. doi:10.1103/RevModPhys.34.627.
  20. ^ Balàzs, N. (1967). "Formation of stable molecules within the statistical theory of atoms". Phys. Rev. 156 (1): 42–47. Bibcode:1967PhRv..156...42B. doi:10.1103/PhysRev.156.42.
  21. ^ Lieb, Elliott H.; Simon, Barry (1977). "The Thomas–Fermi theory of atoms, molecules and solids". Adv. In Math. 23 (1): 22–116. doi:10.1016/0001-8708(77)90108-6.
  22. ^ Parr and Yang 1989, pp.114–115
  23. ^ Parr and Yang 1989, p.127
  24. ^ Weizsäcker, C. F. v. (1935). "Zur Theorie der Kernmassen". Zeitschrift für Physik. 96 (7–8): 431–58. Bibcode:1935ZPhy...96..431W. doi:10.1007/BF01337700.
  25. ^ http://www.tcm.phy.cam.ac.uk/~pdh1001/thesis/node14.html
  26. ^ Hinchliffe, Alan (2000). Modelling Molecular Structures (2nd ed.). Chichester, West Sussex: John Wiley & Sons Ltd. p. 186. ISBN 0-471-48993-X.
  27. ^ a b Szabo, A.; Ostlund, N. S. (1996). Modern Quantum Chemistry. Mineola, New York: Dover Publishing. ISBN 0-486-69186-1.
  28. ^ a b c Levine, Ira N. (1991). Quantum Chemistry (4th ed.). Englewood Cliffs, New Jersey: Prentice Hall. p. 403. ISBN 0-205-12770-3.
  29. ^ Callaway, J. (1974). Quantum Theory of the Solid State. New York: Academic Press. ISBN 9780121552039.
  30. ^ Levine, I.N. (1991) Quantum Chemistry (4th ed., Prentice-Hall), p.403
  31. ^ Hohenberg, P.; Kohn, W. (1964). "Inhomogeneous Electron Gas". Physical Review. 136 (3B): B864–B871. doi:10.1103/PhysRev.136.B864.
  32. ^ Kohn, W.; Sham, L. J. (1965). "Self-consistent equations including exchange and correlation effects". Physical Review. 140 (4A): A1133–A1138. Bibcode:1965PhRv..140.1133K. doi:10.1103/PhysRev.140.A1133.
  33. ^ Perdew, John P.; Ruzsinszky, Adrienn; Tao, Jianmin; Staroverov, Viktor N.; Scuseria, Gustavo; Csonka, Gábor I. (2005). "Prescriptions for the design and selection of density functional approximations: More constraint satisfaction with fewer fits". Journal of Chemical Physics. 123 (6): 062201. Bibcode:2005JChPh.123f2201P. doi:10.1063/1.1904565. PMID 16122287.