Efficient projector expansion in LCAO electronic structure calculations: Ver. 1.0
The original has been published in Phys. Rev. B 72, 045121 (2005).

Taisuke Ozaki (AIST) and Hiori Kino (NIMS)

Abstract

A projector expansion method is presented for an efficient and accurate implementation of the first-principles electronic structure calculations using pseudo potentials and atomic basis functions. By expressing the rapidly varying local potential in the vicinity of nuclei by a separable projector expansion, the difficulty involved in the grid integration using the regular real-space grid is remarkably reduced without increasing the computational effort. To illustrate the capability, it is shown that the proposed method significantly suppresses not only a spurious oscillation in the energy curve for the atomic displacement involved in a weak interaction such as hydrogen bonding, but also the dependence of optimized structure on relative position to the real-space grid in the geometry optimization within a modest grid fineness.

I. INTRODUCTION

The electronic structure method based on a linear combination of numerical atomic local basis orbitals (LCAO)[1, 2, 3, 4, 5] and pseudo potentials [6, 7] is a possible way of extending the applicability of a density functional theory (DFT) [8] to large-scale systems, since the generalized eigenvalue problem with the resultant sparse matrices can be solved by O( $N$ ) methods.[9, 10, 11, 12, 13, 14] The pioneering work by Sankey et al.[1] and its generalization to a fully self-consistent field (SCF) calculation by the SIESTA group [2] enable us to deal with large-scale systems within a fully self-consistent DFT. However, in general, one of difficult problems in the implementation of these LCAO methods is how the matrix elements for the effective potential are accurately evaluated within a modest computational effort.[2, 3, 15, 16, 17, 18] Although this numerical integration for the construction of matrix elements is a highly technical issue, serious problems enough to lose the validity of calculations are often elicited when we calculate weakly interacting systems such as hydrogen bonding molecules and include deep semicore states in pseudo potentials.[2] Due to the non-integrable form of the effective potential in the conventional DFT, the numerical integration is indispensable to construct the matrix elements, even if analytic basis functions such as Gaussian functions are used. While a scheme to avoid the numerical integration is to fit the potential into integrable analytic functions,[17, 20] however, it can be regarded as just a replacement of the numerical integration with the numerical fitting.

In the LCAO methods, usually, two kinds of numerical grids are used for the numerical integration for the construction of matrix elements.[2, 3, 15, 16, 17, 18, 19] One of them is a grid decomposed into radial and angular parts such as Gauss-Legendre and Lebedev grids which are combined with a partitioning scheme to decompose a multicenter integral into one-center integrals.[3, 15, 16, 17, 18, 19] The other is a regular real-space grid similar to that used in conventional plane-wave DFT methods.[2] Although the former possesses a benefit that the total energy is independent of the rotation of system unlike the latter, the latter is superior to the former in terms of computational efficiency. Since all the basis functions use the same regular real-space grid, once the value of basis function on grid is calculated and stored in the computational memory before the SCF calculation, there is no need of the recalculation of the value of basis function on grid during the SCF calculation. Therefore, in the latter, the matrix elements are quite efficiently constructed in an element-by-element fashion for non-zero values of basis functions on grid stored in the computational memory. On the other hand, in the former, it is difficult to store the value of basis function on grid because of the requirement of the extensive memory size, since the quadrature grid is determined for each set of two atoms.

Although the regular real-space grid possesses an advantage in terms of computational efficiency, however, serious problems remain in the grid integration using the regular real-space grid, since the total energy depends on the relative position between the system and the grid. [2, 21, 22] A typical illustration is that the optimized structure depends on the initial arrangement relative to the grid position. This dependence causes a severe problem when weakly interacting systems such as hydrogen bonding molecules are calculated. In such a system a highly fine grid is needed to obtain a convergent result, which demands great computational resources. The same problem involved in the real-space grid integration is also reported in other real-space methods,[21, 22] which implies that the difficulty involved in the grid integration is a major concern in the efficient and accurate implementation of real-space methods. Thus, our aim is to establish a method which addresses the regular real-space grid for the simplicity and overcomes the difficulty involved in the grid integration within a modest computational effort. In this paper, we present a projector expansion method in which a rapidly varying local potential in the vicinity of nuclei is expressed by a separable projector expansion, and thereby the contribution of the rapidly varying local potential to matrix elements is accurately calculated in the momentum space instead of the real space. Since the real-space integration is applied to only slowly varying parts in the effective potential, as a result, the difficulty involved in the grid integration is remarkably reduced. This paper is organized as follows. In Sec. II, a projector expansion method is presented in order to overcome the difficulty involved in the grid integration in the first-principles electronic structure calculations using atomic basis functions. In Sec. III, as illustrations of the accuracy of the proposed method, it is shown that the projector expansion method significantly suppresses not only a spurious oscillation in the energy curve for the atomic displacement involved in a weak interaction such as hydrogen bonding, but also the dependence of optimized structure on relative position to the real-space grid in the geometry optimization within a modest grid fineness. In Sec. IV, we summarize the projector expansion method and the capability. Finally, in the Appendix, the analytic evaluation of force on atom in the LCAO method using the projector expansion method is briefly described.

II. PROJECTOR EXPANSION

Let us start our formulation from the total energy expression of the DFT [8] to elucidate the difficulty involved in the grid integration. In the framework of atomic basis functions $\{\chi_{i\alpha}\}$ ,[1, 2, 4, 5] norm-conserving pseudo potentials,[6, 7] and a local density approximation (LDA) [23] to the exchange-correlation term, the total energy $E_{\rm tot}$ in the DFT is given by the sum of the kinetic energy $E_{\rm kin}$ , the electron-core Coulomb energy $E_{\rm ec}$ , the electron-electron Coulomb energy $E_{\rm ee}$ , the exchange-correlation energy $E_{\rm xc}$ , and the core-core Coulomb energy $E_{\rm cc}$ between pseudo core charges $Z_{i}$ and $Z_{j}$ as follows:

\displaystyle E_{\rm tot}=E_{\rm kin}+E_{\rm ec}+E_{\rm ee}+E_{\rm xc}+E_{\rm cc}

(1)

with

\displaystyle E_{\rm kin}=2\sum_{i\alpha,j\beta}\rho_{i\alpha,j\beta}\int d({% \bf r})\chi_{i\alpha}\hat{T}\chi_{j\beta},

(2)

	$\displaystyle E_{\rm ec}$	$\displaystyle=$	$\displaystyle E_{\rm ec}^{\rm(L)}+E_{\rm ec}^{\rm(NL)}$		(3)
		$\displaystyle=$	$\displaystyle\int d({\bf r})n({\bf r})\sum_{k}V_{{\rm L},k}({\bf r}-{\bf R}_{k% })+\int d({\bf r})n({\bf r})\sum_{k}V_{{\rm NL},k}({\bf r}-{\bf R}_{k}),$		(3)

\displaystyle E_{\rm ee}

\displaystyle=

\displaystyle\frac{1}{2}\int\int d({\bf r})d({\bf r^{\prime}})\frac{n({\bf r})% n({\bf r^{\prime}})}{|{\bf r}-{\bf r^{\prime}}|},

(4)

\displaystyle E_{\rm xc}=\int d({\bf r})n\epsilon_{\rm xc}(n),

(5)

\displaystyle E_{\rm cc}=\frac{1}{2}\sum_{i,j}\frac{Z_{i}Z_{j}}{|{\bf R}_{i}-{% \bf R}_{j}|},

(6)

where $i$ and $\alpha$ are the site and basis function indices, respectively. $\rho_{i\alpha,j\beta}$ is a density matrix associated with two basis functions $\chi_{i\alpha}$ and $\chi_{j\beta}$ , and is defined by $\sum_{\nu}\Theta(\varepsilon_{\nu}-\mu)c_{i\alpha,\nu}c_{j\beta,\nu}$ with the LCAO coefficient $c_{i\alpha,\nu}$ , the one-particle eigenenergy $\varepsilon_{\nu}$ , the chemical potential $\mu$ , and a step function $\Theta(x)$ . $n$ is the electron density defined by $2\sum_{i\alpha,j\beta}\rho_{i\alpha,j\beta}\chi_{i\alpha}\chi_{j\beta}$ . $V_{{\rm L},k}$ and $V_{{\rm NL},k}$ are the local part and non-local part in the norm-conserving pseudo potential of atom $k$ , respectively. The factor 2 in $E_{\rm kin}$ and $n$ is for the spin multiplicity. In this formulation, we consider only non spin-polarized case, non-Bloch expression of one-particle wave functions, and the LDA for simplicity, but the extensions of the below description to the spin polarized case, the Bloch expression, and a generalized gradient approximation (GGA) [24] to the exchange-correlation term are straightforward. Although the expression Eq. (1) is commonly used, it can be transformed into a more tractable form without any approximation.[1, 2] Reorganizing the sum of $E_{\rm ec}^{\rm(L)}$ , $E_{\rm ee}$ and $E_{\rm cc}$ by introducing an atomic electron density $n_{i}^{(a)}$ , we can rewrite the sum $E_{\rm ec0}+E_{\rm ee}+E_{\rm cc}$ by that of two majority contributions $E_{\rm na}$ and $E_{\rm ecc}$ consisting of short-range terms and a minority contribution $E_{\rm\delta ee}$ consisting of a long-range term as follows:

\displaystyle E_{\rm ec}^{\rm(L)}+E_{\rm ee}+E_{\rm cc}=E_{\rm na}+E_{\rm scc}% +E_{\rm\delta ee}

(7)

with

\displaystyle E_{\rm na}=\int d({\bf r})n({\bf r})\sum_{i}V_{{\rm na},i}({\bf r% }-{\bf R}_{i}),

(8)

\displaystyle E_{\rm scc}=\frac{1}{2}\sum_{i,j}\left[\frac{Z_{i}Z_{j}}{|{\bf R% }_{i}-{\bf R}_{j}|}-\int d{\bf r}n^{(a)}_{i}({\bf r}-{\bf R}_{i})V^{(a)}_{{\rm H% },j}({\bf r}-{\bf R}_{j})\right],

(9)

\displaystyle E_{\rm\delta ee}=\frac{1}{2}\int d{\bf r}\delta n({\bf r})\delta V% _{\rm H}({\bf r}),

(10)

where a difference electron density $\delta n({\bf r})$ is defined by the difference between the electron density $n$ and the sum, $n^{(a)}$ , of atomic electron densities $n_{i}^{(a)}$ as:

	$\displaystyle\delta n({\bf r})$	$\displaystyle=$	$\displaystyle n({\bf r})-n^{(a)}({\bf r}),$		(11)
		$\displaystyle=$	$\displaystyle n({\bf r})-\sum_{i}n_{i}^{(a)}({\bf r}).$		(11)

Through this paper, when the site index is dropped in the designation for a quantity specifiable with the site index, it means the inclusion of the summation over the site index except for the symbolic use. The atomic electron density $n_{i}^{(a)}$ is evaluated from pseudo wave functions under the confinement potential in this study.[4, 5] So, $\delta n$ is considerably smaller than $n^{(a)}$ . Also, it should be noted that $n_{i}^{(a)}$ is finite only within the confinement radius $r_{c,i}$ . From the sum of the local part of pseudo potential $V_{{\rm L},i}$ and a Hartree potential $V^{\rm(a)}_{{\rm H},i}$ associated with the atomic electron density, a neutral atom potential $V_{{\rm na},i}$ is defined by $V_{{\rm na},i}=V_{{\rm L},i}+V^{(a)}_{{\rm H},i}$ . The neutral atom potential is spherical because of the spherical atomic electron density, and becomes zero beyond the confinement radius $r_{c,i}$ due to Gauss’s law, indicating that it is a short-range spherical potential. Since the electron density $n$ can be evaluated the short-range quantities $\chi$ and $\rho$ , we see that the neutral atom potential energy $E_{\rm na}$ consists of short-range terms. In addition to this, the screened core-core Coulomb energy $E_{\rm scc}$ is also evaluated by taking account of only the neighboring atoms, since the second term in Eq. (9) becomes exactly equivalent to the first term when $r_{c,i}+r_{c,j}\leq|{\bf R}_{i}-{\bf R}_{j}|$ , and thereby the long-range terms in Eq. (9) vanish. Therefore, the first and second terms being the majority contributions in the right side of Eq. (7) are calculated by making use of information of adjacent atoms in an O( $N$ ) operation. Since the second term in the right side of Eq. (9) is a function of only the distance between two atoms, it is accurately evaluated using a finer real-space grid, and can be tabulated as a function of the distance between two atoms at the first stage of the calculation. So, the evaluation of $E_{\rm scc}$ is far from the difficulty involved in the real-space grid integration. The difference electron-electron Coulomb energy $E_{\rm\delta ee}$ is a minority contribution to the total energy, but is constructed by a long-range term $\delta V_{\rm H}$ , since the difference Hartree potential $\delta V_{\rm H}$ is associated with the every difference electron density $\delta n({\bf r})$ in the real space. It is worth mentioning that the Ewald summation over the core-core Coulomb energy is not required by the reorganization for parts of the total energy, which is one of advantages in the reorganized expression for the total energy. Considering the variation of the total energy with respect to the LCAO coefficient $c_{i\alpha,\nu}$ , we have a well-known generalized eigenvalue equation $Hc_{\nu}=\varepsilon_{\nu}Sc_{\nu}$ , where $H$ and $S$ are a Hamiltonian matrix defined by $\langle\chi_{i\alpha}|\hat{H}|\chi_{j\beta}\rangle$ and an overlap matrix defined by $\langle\chi_{i\alpha}|\chi_{j\beta}\rangle$ , respectively. In the Kohn-Sham (KS) Hamiltonian $\hat{H}\equiv\hat{T}+V_{\rm eff}$ , the effective potential $V_{\rm eff}$ is given by

\displaystyle V_{\rm eff}=\sum_{k}V_{{\rm NL},k}+\sum_{k}V_{{\rm na},k}+\delta V% _{\rm H}+V_{\rm xc},

(12)

where $V_{\rm xc}$ is the exchange-correlation potential. In the matrix generalized eigenvalue equation, the matrix elements for the overlap matrix, the kinetic operator, and the non-local part of pseudo potential $V_{{\rm NL},k}$ can be evaluated by two-center integrals. Since $V_{{\rm NL},k}$ is expressed by a separable projector expansion,[25, 26] the matrix elements are reduced to a product of two-center integrals. The resultant two-center integrals in the evaluation of these matrix elements can be very accurately evaluated using a finer grid in the momentum space within almost the same computational time because they only have to be evaluated once before the SCF loop.[1, 2]

Rather than the matrix elements expressible by two-center integrals, the difficulty involved in the grid integration comes from the remaining contributions in the effective potential. Although the matrix elements for the remaining potentials $V_{{\rm na},k}$ , $\delta V_{\rm H}$ , and $V_{\rm xc}$ has been evaluated by the grid integration using the regular real-space grid,[2] it can be pointed out that the matrix element for the neutral atom potential $V_{{\rm na},k}$ is difficult to be accurately calculated within a modest fineness of the real-space grid. In Fig. 1 shows potentials $V_{\rm na}$ , $\delta V_{\rm H}$ , and $V_{\rm xc}$ in Eq. (12) along the bond axis in a carbon monoxide (CO). We can see that the neutral atom potential $V_{\rm na}$ rapidly varies in the vicinity of nuclei, while the other potentials $\delta V_{\rm H}$ and $V_{\rm xc}$ smoothly vary. The comparison suggests that a finer grid is required to accurately evaluate the matrix elements for the neutral atom potential $V_{\rm na}$ , and that the calculations are not converged when a modest fineness of the real-space grid is used as shown later on.

Figure 1: Potentials $V_{\rm na}$ , $\delta V_{\rm H}$ , and $V_{\rm xc}$ in Eq. (12) along the bond axis in a carbon monoxide (CO).

Thus, in order to suppress the difficulty involved in the real-space grid integration, we propose a projector expansion method to accurately evaluate the matrix elements for the neutral atom potential $V_{{\rm na}}$ . The neutral atom potential $V_{{\rm na},i}$ is spherical and is defined within the finite range determined by the cutoff radius $r_{c,i}$ of the confinement potential. Therefore, the potential $V_{{\rm na},i}$ can be expressed by a projector expansion as follows:

\displaystyle\hat{V}_{{\rm na},k}=\sum_{lm}^{L_{\rm max}}\sum_{\zeta}^{N_{\rm rad% }}|V_{{\rm na},k}\bar{R}_{l\zeta}Y_{lm}\rangle\frac{1}{c_{l\zeta}}\langle Y_{% lm}\bar{R}_{l\zeta}V_{{\rm na},k}|,

(13)

where a set of radial functions $\{\bar{R}_{l\zeta}\}$ is an orthonormal set defined by a norm $\int r^{2}drRV_{{\rm na},k}R^{\prime}$ for radial functions $R$ and $R^{\prime}$ , and is calculated by the following Gram-Schmidt orthogonalization:[26]

	$\displaystyle\bar{R}_{l\zeta}$	$\displaystyle=$	$\displaystyle R_{l\zeta}-\sum_{\eta}^{\zeta-1}\bar{R}_{l\eta}\frac{1}{c_{l\eta% }}\int r^{2}dr\bar{R}_{l\eta}V_{{\rm na},k}R_{l\zeta},$		(14)
	$\displaystyle c_{l\zeta}$	$\displaystyle=$	$\displaystyle\int r^{2}dr\bar{R}_{l\zeta}V_{{\rm na},k}\bar{R}_{l\zeta}.$		(15)

The radial function $R_{l\zeta}$ used in this study is pseudo wave functions for both the ground and excited states under the same confinement potential as used in the calculation of the atomic electron density $n_{i}^{(a)}$ .[4, 5] It is expected that the projector expansion rapidly converges with respect to the summation over $R$ , since radial functions $R_{l\zeta}$ forms an orthonormal set within the finite range of $V_{{\rm na},k}$ . The projector expansion for the neutral atom potential becomes exact when the summation over the angular and the radial parts goes to infinity. The most important feature in the projector expansion defined by Eq. (13) is that the neutral atom potential is expressed by a separable form, and thereby we only have to evaluate the two-center integrals to construct the matrix elements for the neutral atom potential once before the SCF loop as well as the other two-center integrals. As discussed above, the two-center integrals can be accurately evaluated in the momentum space instead of the real space. Therefore, we can avoid the grid integration using the real-space grid for the rapidly varying neutral atom potential in the vicinity of nuclei, indicating that the difficulty involved in the grid integration is almost suppressed. It should be noted that the computational time by the introduction of the projector expansion is not increased within the SCF loop, since the basis functions and the projectors in Eq. (13) are Fourier-transformed once at the first stage of calculations, and the two-center integrals between them are evaluated before the SCF loop. Furthermore, the computational demand before the SCF loop is not large. In fact, the comparison of the total computational time shows that the computational time by the projector expansion method is only 13 % longer than that by the conventional method with the same real-space grid fineness as used in the projector expansion method in a molecular dynamics (MD) simulation (100 MD steps) of a C ${}_{60}$ molecule using 4 processors of Pentium 4. In addition to this, these calculations can be easily parallelized on a parallel computer due to no data-communication in their parallel computation.

There exists another advantage of the projector expansion method. The choice of the local part in the separable pseudo potential is an important factor to enhance the transferability of the pseudo potential. However, choosing a deep local part has tended to be avoided because finer real-space mesh is necessary to calculate converged results in the real-space grid integration. In this projector expansion method, there is no difficulty in the choice of the deep local part. Thus, it is possible not only to choose the deep local part, but also to include deep semicore states in the pseudo potential without increasing computational effort.

If the same radial functions $R_{l\zeta}$ introduced in Eq. (14) are used for the basis functions $\chi_{j\beta}$ ( $\equiv Y_{l^{\prime}m^{\prime}}R_{l^{\prime}\zeta^{\prime}}$ ) in the LCAO method, the matrix elements $\langle\chi_{i\alpha}|\hat{V}_{{\rm na},k}|\chi_{j\beta}\rangle$ for the neutral atom potential are evaluated in case of $i=k$ and/or $j=k$ without depending on the convergence of the summation over the angular and radial parts. Since $R_{l^{\prime}\zeta^{\prime}}=\sum_{\zeta}a_{\zeta}^{(l^{\prime}\zeta^{\prime})% }\bar{R}_{l^{\prime}\zeta}$ , e.g., in case of $j=k$ , we have

	$\displaystyle\hat{V}_{{\rm na},j}\|\chi_{j\beta}\rangle$	$\displaystyle=$	$\displaystyle\sum_{\zeta}a_{\zeta}^{(l^{\prime}\zeta^{\prime})}\|V_{{\rm na},j}% \bar{R}_{l^{\prime}\zeta}Y_{l^{\prime}m^{\prime}}\rangle$		(16)
		$\displaystyle=$	$\displaystyle V_{{\rm na},j}\|\chi_{j\beta}\rangle.$		(16)

Therefore, the one-center integral ( $i=j=k$ ) and the two-center integral ( $i=k$ or $j=k$ ) are highly accurately evaluated in the evaluation of $\langle\chi_{i\alpha}|\hat{V}_{{\rm na},k}|\chi_{j\beta}\rangle$ , where $\langle\chi_{i\alpha}|\hat{V}_{{\rm na},k}|\chi_{j\beta}\rangle$ is referred to as integral to distinguish it from one- and two-center integrals in the projector expansion defined by Eq. (13).

Although we are able to complete our formulation for the projector expansion here, a different scheme is furthermore introduced to accurately calculate the two-center integral ( $i=j\neq k$ ). In this case, the neutral atom potential $V_{{\rm na},k}$ and the product of $R_{l\zeta}$ and $R_{l^{\prime}\eta}$ are Fourier-transformed at the first stage of the calculation, and therefore the two-center integral between the neutral atom potential and the product of two basis functions is accurately evaluated in the momentum space apart from the projector expansion by Eq. (13). As a result, all the one- and two-center integrals, which are large components in the matrix elements, are calculated with a high degree of precision without depending on the convergence of the summation over the angular and radial parts. The projector expansion by Eq. (13) is applied only for the three-center integrals which are small components in the matrix elements. Thus, the error associated with the convergence of the summation in the projector expansion is significantly suppressed.

There is another origin of the difficulty involved in the real-space grid integration. The partial core correction (PCC) charge density $n_{\rm pcc}$ is often used in Eq. (5) to take into account the non-linearity in the exchange-correlation terms. In this case, if a highly localized and large PCC charge density in the vicinity of nuclei is employed, the difficulty associated with Eq. (5) appears in the real-space grid integration. However, the difficulty can be avoidable by using a modest PCC charge density. In addition, we can avoid the inclusion of the large PCC charge density by using the pseudo potential including the semicore states, since it is relatively easy to include the semicore states in the projector expansion method as discussed above.

Here, we would like to comment on the evaluation of force on atom in the LCAO method using the projector expansion scheme. The majority contributions to the total energy are expressed by the two-center integrals in the projector expansion method. So, the contribution to the force associated with these parts can be analytically calculated by differentiating the two-center integrals evaluated in the momentum space without any difficulty. On the other hand, the remaining $E_{\rm xc}$ and $E_{\rm\delta ee}$ are evaluated using the real-space grid integration. Consequently, it is not trivial whether the force associated with $E_{\rm xc}$ and $E_{\rm\delta ee}$ is analytically appraisable or not. However, in fact, it is easy to analytically evaluate the force on atom in the regular real-space scheme coupled with the projector expansion method. The details are given in the Appendix.

III. NUMERICAL RESULTS

In this section, we show numerical results on the convergence of the summation over the angular and radial parts in Eq. (13) and demonstrate the capability of the projector expansion method for the suppression of the difficulty involved in the real-space grid integration.

Figure 2: Convergence properties of of the total energy in a water molecule as a function of (a) the max L, $L_{\rm max}$ , and (b) the number of radial projectors, $N_{\rm rad}$ , in the summation of Eq. (13). In the calculations (a) and (b), $N_{\rm rad}$ and $L_{\rm max}$ were fixed at 4 and 6. The cutoff energy of 201 (Ryd) was used for the grid integration using the real-space grid. Non-Projector means that the real-space grid integration is employed for construction of the matrix elements for the neutral atom potential.

In Fig. 2 shows the convergence properties of the total energy in a water molecule as a function of the max $L$ , $L_{\rm max}$ , and the number of radial projectors, $N_{\rm rad}$ , in the summation in Eq. (13). In this study, an optimized double valence plus polarization function (DVP) was used as a basis set.[4, 5] To replace the deep core potential into a shallow potential, a norm-conserving pseudo potential [7] was used in a separable form with multiple projectors.[26] The cutoff radii of pseudo potentials are listed in Table I of Ref.[5]. A generalized gradient approximation (GGA) [24] is used for the exchange-correlation without the nonlinear partial core correction. The cutoff energy of 3600 (Ryd) and the Gauss-Legendre grid [27] of 128 were used for the evaluation of the two-center integrals (integrals) in the momentum space. The cutoff energy for the real-space grid integration is given in the caption of the Figures and Table. All calculations were performed by our DFT code, OpenMX.[28] From the convergence properties, we see that the fully convergent result is achieved when $L_{\rm max}$ and $N_{\rm rad}$ approach 6 and 4, respectively. In this case, we have to use at least 2 for $L_{\rm max}$ and $N_{\rm rad}$ to accurately evaluate the one- and two-center integrals, since the double valence plus polarization function is used as a basis set. If the values are less than 2, the accuracy of the one- and two-center integrals is not assured. Based on the convergence properties, we generally use the following values: $L_{\rm max}=L_{\rm max}^{\rm{basis}}+4$ and $N_{\rm rad}=4$ , where $L_{\rm max}^{\rm{basis}}$ is the maximum angular momentum quantum number of basis functions. From several test calculations, we confirm that the values for $L_{\rm max}$ and $N_{\rm rad}$ are enough to achieve the fully convergent results for systems including other elements, while the results are not shown.

To illustrate the accuracy of the projector expansion method, we show a comparison of the total energies of a water dimer molecule, which is a typical hydrogen bonding system with a weak interaction, calculated by the projector expansion method (Projector) and the grid integration using the regular real-space grid (Non-Projector) in Fig. 3. The total energies are calculated as a function of the distance between two oxygen atoms in water molecules, each of which possesses the experimental geometry. The dihedral angle between the water molecules is fixed at 180 degrees as depicted in Fig. 3. We see that the total energy calculated by the grid integration using the regular real-space grid with a lower cutoff energy considerably oscillates, while the oscillation can be suppressed using the high cutoff energy of 1000 Ryd. This spurious oscillation is due to the inaccurate calculation of the matrix elements for the neutral atom potential around the tail of basis functions in the real-space grid integration. On the other hand, the projector expansion method entirely suppresses the spurious oscillation of the total energy curve even in the lowest cutoff energy, thus indicating that the contribution around the tail of basis functions to the matrix elements associated with the neutral atom potential are accurately calculated. The calculated energy curve by the projector method is completely consistent with that by the non-projector method using 1000 Ryd. It should be noted that the error in the calculation of the total charge is only 1.0e-5 even in the non-projector method using 200 Ryd. Nevertheless, the spurious oscillation appears in amplitude of 0.001 Hartree in the non-projector method. Our analysis clearly shows that the spurious oscillation comes from the deep neutral atom potential $V_{{\rm na}}$ , and is removed by replacing it by the projector expansion.

Figure 3: Total energy (Hartree) of a water dimer as a function of the distance between two oxygen atoms calculated by the projector expansion method (Projector) and the grid integration using the regular real-space grid (Non-Projector) for the neutral atom potential. The value in parenthesis means the cutoff energy for the grid integration using the regular real-space grid. Each energy curve was shifted by adding a constant for ease of comparison.

As shown above, the spurious oscillation appears in the energy curve calculated by the real-space grid integration. Therefore, it would be considered that the optimized structure is trapped in a local minimum near the initial geometry if the geometry optimization is performed using the real-space grid integration, which could be a serious problem in the calculations of weakly interacting systems. In contrast, the difficulty is overcome in the projector expansion method. To study this dependence of the optimized structure on the initial geometry, we optimize the structure of a water molecule with various initial positions relative to the regular real-space grid. The optimized structure by the real-space grid integration strongly depends on the initial geometry relative to the regular real-space grid as shown in Table I, which indicates that the structure is trapped in a local minimum near the initial geometry. On the other hand, the optimized structure by the projector expansion method is almost independent of the initial geometry relative to the regular real-space grid, while the difference in the bond angle is 0.4 degrees at a maximum. These two clearly illustrate that the difficulty involved in the grid integration is successfully suppressed by introducing the projector expansion for the rapidly varying neutral atom potential in the vicinity of nuclei.

TABLE. I. Dependence of optimized structure (Å and degrees) and dipole moment (Debye) of a water molecule on the initial position relative to the regular real-space grid. The initial Cartesian coordinates (Å) are given by (a) O=(0.0,0.0,0.0), H ${}_{1}$ =(0.76,0.59,0.0), H ${}_{2}$ =(-0.76,0.59,0.0). The other initial Cartesian coordinates are generated by rotating the coordinates (a) by (b) $R_{x}(30)$ , (c) $R_{y}(40)R_{x}(30)$ , and (d) $R_{z}(50)R_{y}(40)R_{x}(30)$ , where $R_{x}(30)$ means a rotational matrix which rotates the coordinate by 30 degrees on the x-axis. The cutoff energy of 177 (Ryd) was used for the grid integration using the regular real-space grid. The experimental values are taken from Ref. [29, 30].

Projector
Initial Geo.	$r_{\rm OH}$	$\angle({\rm HOH})$	$\mu_{\rm B}$
a	0.990, 0.990	104.8	1.95
b	0.990, 0.990	104.5	1.95
c	0.990, 0.990	104.8	1.94
d	0.990, 0.990	104.9	1.95
Non-Projector
Initial Geo.	$r_{\rm OH}$	$\angle({\rm HOH})$	$\mu_{\rm B}$
a	0.989, 0.989	101.2	2.00
b	0.982, 0.982	102.0	1.98
c	0.993, 0.993	101.4	1.99
d	1.004, 0.993	107.0	1.92
Expt.	0.957	104.5	1.86

IV. CONCLUSIONS

In the real-space implementation of the DFT using atomic basis functions, one of difficult problems is how the matrix elements for the effective potential are accurately evaluated in an efficient numerical scheme within a modest computation effort. Especially, this problem becomes obvious when weakly interacting systems such as hydrogen bonding molecules are calculated. In this context, to suppress the difficulty involved in the regular real-space grid integration, we have developed a projector expansion method which expresses the rapidly varying neutral atom potential in the vicinity of nuclei by a separable projector. The neutral atom potential being the origin of the difficulty involved in the real-space grid integration is factorized by the separable projectors, and the resultant two-center integrals are accurately evaluated in the momentum space instead of the real space. To illustrate the accuracy of the projector expansion method, we have shown that the spurious oscillation in the energy curve of a water dimer molecule is entirely suppressed and that the optimized structure is almost independent of the initial geometry relative to the regular real-space grid within a modest grid fineness. Thus, we conclude that the projector expansion method is a highly useful technique for the accurate and efficient implementation of the DFT based on the LCAO method.

ACKNOWLEDGMENT

This work was partly supported by NEDO under the Nanotechnology Materials Program, Research and Development for Applying Advanced Computational Science and Technology of Japan Science and Technology Corporation (ACT-JST), and NAREGI Nanoscience Project and Special Coordination Funds of the Ministry of Education, Culture, Sports, Science and Technology, Japan.

Appendix A

In this Appendix, we show an explicit expression for the analytic evaluation of the force on atom in the projector expansion method. On the parts of the total energy expressed by the two-center integrals (integrals), its contributions to the force are analytically calculated by differentiating the two-center integrals (integrals) evaluated in the momentum space. Then, each term in the summation over the discretized radial grid in the momentum space are analytically differentiating using cubic splines without any difficulty. Therefore, we focus on the analytic differentiation of the remaining $E_{\rm\delta ee}$ and $E_{\rm xc}$ with respect to atomic coordinate. So, this derivation is a general formulation for the analytic force evaluation in the LCAO method based on the grid integration using the regular real-space grid. The difference electron-electron Coulomb energy $E_{\rm\delta ee}$ and the exchange-correlation energy $E_{\rm xc}$ can be discretized using the regular real-space grid as follows:

\displaystyle E_{\rm\delta ee}

\displaystyle=

\displaystyle\frac{1}{2}\Delta V\sum_{\bf p}\delta n({\bf r}_{\bf p})\delta V_% {\rm H}({\bf r}_{\bf p}),

(17)

\displaystyle E_{\rm xc}=\Delta V\sum_{\bf p}n({\bf r}_{\bf p})\epsilon_{\rm xc% }(n({\bf r_{p}})),

(18)

where ${\bf p}$ is the index in a vector form for specifying the position in the regular real-space grid, and $\Delta V$ is the volume per grid. Then, the derivative of $E_{\rm\delta ee}$ with respect to the atomic coordinate ${\bf R}_{k}$ is given by

\displaystyle\frac{\partial E_{\rm\delta ee}}{\partial{\bf R}_{k}}=\sum_{\bf p% }\frac{\partial n({\bf r}_{\bf p})}{\partial{\bf R}_{k}}\frac{\partial E_{\rm% \delta ee}}{\partial n({\bf r}_{\bf p})}+\sum_{\bf p}\frac{\partial n^{\rm a}(% {\bf r}_{\bf p})}{\partial{\bf R}_{k}}\frac{\partial E_{\rm\delta ee}}{% \partial n^{\rm a}({\bf r}_{\bf p})}.

(19)

In this study, $\delta V_{\rm H}({\bf r}_{\bf p})$ is defined through the fast Fourier transform (FFT) of $\delta n({\bf r})$ by

	$\displaystyle\delta V_{\rm H}({\bf r}_{\bf p})$	$\displaystyle=$	$\displaystyle\sum_{\bf G}\delta\tilde{V}_{\rm H}({\bf G}){\rm e}^{i{\bf G}% \cdot{\bf r}},$		(20)
		$\displaystyle=$	$\displaystyle\frac{4\pi}{N_{\rm rsg}}\sum_{\bf G}\frac{1}{\|{\bf G}\|^{2}}\sum_{% \bf p}\delta n({\bf r}_{\bf p}){\rm e}^{i{\bf G}\cdot({\bf r}-{\bf r}_{\bf p})},$		(20)

where $N_{\rm rsg}$ is the number of grid points in the regular real-space grid. Considering this expression of $\delta V_{\rm H}({\bf r})$ , we obtain an explicit compact expression for the derivative of $E_{\rm\delta ee}$ with respect to $n({\bf r}_{\bf p})$ as follows:

$\displaystyle\frac{\partial E_{\rm\delta ee}}{\partial n({\bf r}_{\bf p})}$	$\displaystyle=$	$\displaystyle\frac{1}{2}\Delta V\{\delta V_{\rm H}({\bf r}_{\bf p})+\sum_{\bf q% }\delta n({\bf r}_{\bf q})\frac{\partial\delta V_{\rm H}({\bf r}_{\bf q})}{% \partial n({\bf r}_{\bf p})}\},$	(21)
	$\displaystyle=$	$\displaystyle\frac{1}{2}\Delta V\{\delta V_{\rm H}({\bf r}_{\bf p})+\frac{4\pi% }{N_{\rm rsg}}\sum_{\bf G}\frac{1}{\|{\bf G}\|^{2}}\sum_{\bf q}\delta n({\bf r}_% {\bf q}){\rm e}^{i{\bf G}\cdot({\bf r_{q}}-{\bf r_{p}})}\},$
	$\displaystyle=$	$\displaystyle\Delta V\delta V_{\rm H}({\bf r}_{\bf p}).$

Similarly, the derivative of $E_{\rm\delta ee}$ with respect to $n^{\rm a}({\bf r}_{\bf p})$ is given by

$\displaystyle\frac{\partial E_{\rm\delta ee}}{\partial n^{\rm a}({\bf r}_{\bf p% })}$	$\displaystyle=$	$\displaystyle-\frac{1}{2}\Delta V\{\delta V_{\rm H}({\bf r}_{\bf p})-\sum_{\bf q% }\delta n({\bf r}_{\bf q})\frac{\partial\delta V_{\rm H}({\bf r}_{\bf q})}{% \partial n^{\rm a}({\bf r}_{\bf p})}\},$	(22)
	$\displaystyle=$	$\displaystyle-\frac{1}{2}\Delta V\{\delta V_{\rm H}({\bf r}_{\bf p})+\frac{4% \pi}{N_{\rm rsg}}\sum_{\bf G}\frac{1}{\|{\bf G}\|^{2}}\sum_{\bf q}\delta n({\bf r% }_{\bf q}){\rm e}^{i{\bf G}\cdot({\bf r_{q}}-{\bf r_{p}})}\},$
	$\displaystyle=$	$\displaystyle-\Delta V\delta V_{\rm H}({\bf r}_{\bf p}).$

The derivative of $n({\bf r}_{\bf p})$ with respect to ${\bf R}_{k}$ is found by considering the definition of $n({\bf r})$ as follows:

	$\displaystyle\frac{\partial n({\bf r_{p}})}{\partial{\bf R}_{k}}$	$\displaystyle=$	$\displaystyle\sum_{i\alpha,j\beta}\sum_{\nu}\{\frac{\partial c^{}_{i\alpha,% \nu}}{\partial{\bf R}_{k}}c_{j\beta,\nu}\chi_{i\alpha}({\bf r})\chi_{j\beta}({% \bf r})+c^{}_{i\alpha,\nu}\frac{\partial c_{j\beta,\nu}}{\partial{\bf R}_{k}}% \chi_{i\alpha}({\bf r_{p}})\chi_{j\beta}({\bf r_{p}})\}$		(23)
			$\displaystyle+2\sum_{\alpha,j\beta}\rho_{k\alpha,j\beta}\frac{\partial\chi_{k% \alpha}({\bf r_{p}})}{\partial{\bf R}_{k}}\chi_{j\beta}({\bf r_{p}}).$		(23)

The derivative of the LCAO coefficient $c$ with respect to ${\bf R}_{k}$ can be transformed to the derivative of the overlap matrix by taking account of the orthonormality relation in one-particle wave functions as usually made in the LCAO method with the non-orthogonal basis set.[1] Since the overlap matrix element is a two-center integral, the matrix element and its derivatives are evaluated in the momentum space as well as the other two-center integrals. Also, the derivative of the basis function $\chi$ with respect to ${\bf R}_{k}$ is analytically evaluated using cubic splines because of the use of numerical basis function. The derivative of $n^{\rm a}({\bf r}_{\bf p})$ with respect to ${\bf R}_{k}$ is formed by only the contribution from the same atomic site $k$ due to the independent grid position from the atomic positions as follows:

\displaystyle\frac{\partial n^{\rm a}({\bf r_{p}})}{\partial{\bf R}_{k}}

\displaystyle=

\displaystyle\frac{\partial n^{\rm a}_{k}({\bf r_{p}})}{\partial{\bf R}_{k}}.

(24)

The derivative of $E_{\rm xc}$ with respect to the atomic coordinate ${\bf R}_{k}$ is easily evaluated by

	$\displaystyle\frac{\partial E_{\rm xc}}{\partial{\bf R}_{k}}$	$\displaystyle=$	$\displaystyle\sum_{\bf p}\frac{\partial n({\bf r}_{\bf p})}{\partial{\bf R}_{k% }}\frac{\partial E_{\rm xc}}{\partial n({\bf r}_{\bf p})},$		(25)
		$\displaystyle=$	$\displaystyle\Delta V\sum_{\bf p}\frac{\partial n({\bf r}_{\bf p})}{\partial{% \bf R}_{k}}v_{{\rm xc}}(n({\bf r_{p}})).$		(25)

Even for the GGA, the derivative of $E_{\rm xc}$ with respect to the atomic coordinate ${\bf R}_{k}$ is explicitly expressed, while the GGA is implemented by a finite difference scheme in our implementation. As a result, the derivative of the discretized $E_{\rm\delta ee}$ and $E_{\rm xc}$ are analytically evaluated. In addition, considering the localized basis functions $\chi$ , the evaluation of Eqs. (19) and (25) can be quite efficiently performed in an O( $N$ ) operation as well as the other contributions to the force.

The force on atom in the LCAO methods based on the numerical grid integration has been often evaluated by appealing the Hellman-Feymann theorem with the Pulay correction [3] rather than strictly differentiating the discretized energy expression. However, our derivation shows that it is possible to evaluate the analytic force consistent with the total energy in an efficient way, even though both the numerical basis functions and the numerical grid integration are introduced. A defect of the analytic forces derived here is that they do not fulfill the law of action and reaction. The defect comes from the independent determination of the regular real-space grid from the atomic positions. Since the grid position is fixed in the absolute Cartesian coordinate, the calculated force suffers from ’a pinning effect’. Nevertheless, the effect is relatively small since only the minor contributions $E_{\rm\delta ee}$ and $E_{\rm xc}$ to the total energy are evaluated using the real-space grid integration. A way of recovering the law of action and reaction is to introduce an adaptive coordinate [31] which is dependent on the atomic positions, while the simplicity in the regular real-space grid is lost. A study toward this direction will be a future work.

References

[1] O. F. Sankey and D. J. Niklewski, Phys. Rev. B 40, 3979 (1989).
[2] P. Ordejon, E. Artacho, J. M. Soler, Phys. Rev. B 53, R10441 (1996); J. M. Soler, E. Artacho, J. D. Gale, A. Garcia, J. Junquera, P. Ordejon, and D. Sanchez-Portal, J. Phys.:Condens. Matter 14, 2745 (2002) and references therein.
[3] B. Delly, J. Chem. Phys. 92, 508 (1990).
[4] T. Ozaki, Phys. Rev. B 67, 155108 (2003).
[5] T. Ozaki and H. Kino, Phys. Rev. B 69, 195113 (2004).
[6] G. B. Bachelet, D. R. Hamann, and M. Schluter, Phys. Rev. B 26, 4199 (1982).
[7] N. Troullier and J. L. Martins, Phys. Rev. B 43, 1993 (1991).
[8] P. Hohenberg and W. Kohn, Phys. Rev. 136, B864 (1964); W. Kohn and L. J. Sham, Phys. Rev. 140, A1133(1965)
[9] W. Yang, Phys. Rev. Lett. 66, 1438 (1991).
[10] S. Goedecker and L. Colombo, Phys. Rev. Lett. 73, 122 (1994).
[11] G. Galli and M. Parrinello, Phys. Rev. Lett. 69, 3547 (1992).
[12] X.-P. Li, R. W. Nunes, and D. Vanderbilt, Phys. Rev. B 47, 10891 (1993); M. S. Daw, Phys. Rev. B 47, 10895 (1993).
[13] T. Ozaki and K. Terakura, Phys. Rev. B 64, 195126 (2001).
[14] S. Goedecker, Rev. of Mod. Phys. 71, 1085 (1999) and references therein.
[15] J. M. Perez-Jorda, J. Chem. Phys. 101, 1738 (1994).
[16] P. M. W. Gill, B. G. Johnson, and J. A. Pople, Chem. Phys. Lett. 209, 506 (1993).
[17] C. Satoko, Phys. Rev. B 30, 1754 (1984).
[18] A. D. Becke, J. Chem. Phys. 88, 2547 (1988).
[19] M. R. Pederson and K. A. Jackson, Phys. Rev. B 41, 7453 (1990); K. Jackson and M. R. Pederson, Phys. Rev. B 42, 3276 (1990); A. Briley, M. R. Pederson, K. A. Jackson, D. C. Patton, and D. V. Porezag, Phys. Rev. B 58, 1786 (1998); D. Porezag, M. R. Pederson, and A. Y. Liu, Phys. Rev. B 60, 14132 (1999).
[20] B. I. Dunlap and N. Rosch, J. de chimie physique 86, 671 (1989).
[21] T. Ono and K. Hirose, Phys. Rev. Lett. 82, 5016 (1999).
[22] J. R. Chelikowsky, N. Troullier, and Y. Saad, Phys. Rev. Lett. 72, 1240 (1994); J. R. Chelikowsky, X. Jing, K. Wu, and Y. Saad, Phys. Rev. B 53, 12071 (1996) and and references therein.
[23] D. M. Ceperley and B. J. Alder, Phys. Rev. Lett. 45, 566 (1980); J. P. Perdew and A. Zunger, Phys. Rev. B 23, 5048 (1981).
[24] J. P. Perdew, K. Burke, and M. Ernzerhof, Phys. Rev. Lett. 77, 3865 (1996).
[25] L. Kleinman and D. M. Bylander, Phys. Rev. Lett. 48, 1425 (1982).
[26] P. E. Blochl, Phys. Rev. B 41, R5414 (1990).
[27] W. H. Press, B. P. Flannery, S. A. Teukolsky, and W. T. Vetterling, Numerical Recipes (Cambridge University Press, Cambridge, 2002).
[28] Our DFT code, OpenMX, the basis orbitals, and pseudo potentials used in this study are available on a web site (http://www.openmx-square.org/) in the constitution of the GNU General Public Licence.
[29] S. A. Clough, Y. Beers, G. P. Klein, and L. S. Rothman, J. Chem. Phys. 59, 2254 (1973).
[30] W. S. Benedict, N. Gailar, and E. K. Plyler, J. Chem. Phys. 24, 1139 (1956).
[31] F. Gygi, Europhys. Lett. 19, 617 (1992); F. Gygi, Phys. Rev. B 48, 11692 (1993).

Efficient projector expansion in LCAO electronic structure calculations: Ver. 1.0 The original has been published in Phys. Rev. B 72, 045121 (2005).

Abstract

Appendix A

References

Efficient projector expansion in LCAO electronic structure calculations: Ver. 1.0
The original has been published in Phys. Rev. B 72, 045121 (2005).