Numerical sign problem

In applied mathematics, the numerical sign problem is the problem of numerically evaluating the integral of a highly oscillatory function of a large number of variables. Numerical methods fail because of the near-cancellation of the positive and negative contributions to the integral. Each has to be integrated to very high precision in order for their difference to be obtained with useful accuracy.

The sign problem is one of the major unsolved problems in the physics of many-particle systems. It often arises in calculations of the properties of a quantum mechanical system with large number of strongly interacting fermions, or in field theories involving a non-zero density of strongly interacting fermions.

Overview

In physics the sign problem is typically (but not exclusively) encountered in calculations of the properties of a quantum mechanical system with large number of strongly interacting fermions, or in field theories involving a non-zero density of strongly interacting fermions. Because the particles are strongly interacting, perturbation theory is inapplicable, and one is forced to use brute-force numerical methods. Because the particles are fermions, their wavefunction changes sign when any two fermions are interchanged (due to the anti-symmetry of the wave function, see Pauli principle). So unless there are cancellations arising from some symmetry of the system, the quantum-mechanical sum over all multi-particle states involves an integral over a function that is highly oscillatory, hence hard to evaluate numerically, particularly in high dimension. Since the dimension of the integral is given by the number of particles, the sign problem becomes severe in the thermodynamic limit. The field-theoretic manifestation of the sign problem is discussed below.

The sign problem is one of the major unsolved problems in the physics of many-particle systems, impeding progress in many areas:

The sign problem in field theory

Sources:

In a field theory approach to multi-particle systems, the fermion density is controlled by the value of the fermion chemical potential $\mu$ . One evaluates the partition function $Z$ by summing over all classical field configurations, weighted by $\exp(-S)$ where $S$ is the action of the configuration. The sum over fermion fields can be performed analytically, and one is left with a sum over the bosonic fields $\sigma$ (which may have been originally part of the theory, or have been produced by a Hubbard–Stratonovich transformation to make the fermion action quadratic)

$Z=\int D\sigma \;\rho [\sigma ]$ where $D\sigma$ represents the measure for the sum over all configurations $\sigma (x)$ of the bosonic fields, weighted by

$\rho [\sigma ]=\det(M(\mu ,\sigma ))\exp(-S[\sigma ])$ where $S$ is now the action of the bosonic fields, and $M(\mu ,\sigma )$ is a matrix that encodes how the fermions were coupled to the bosons. The expectation value of an observable $A[\sigma ]$ is therefore an average over all configurations weighted by $\rho [\sigma ]$ $\langle A\rangle _{\rho }={\frac {\int D\sigma \;A[\sigma ]\;\rho [\sigma ]}{\int D\sigma \;\rho [\sigma ]}}.$ If $\rho [\sigma ]$ is positive, then it can be interpreted as a probability measure, and $\langle A\rangle _{\rho }$ can be calculated by performing the sum over field configurations numerically, using standard techniques such as Monte Carlo importance sampling.

The sign problem arises when $\rho [\sigma ]$ is non-positive. This typically occurs in theories of fermions when the fermion chemical potential $\mu$ is nonzero, i.e. when there is a nonzero background density of fermions. If $\mu \neq 0$ there is no particle-antiparticle symmetry, and $\det(M(\mu ,\sigma ))$ , and hence the weight $\rho (\sigma )$ , is in general a complex number, so Monte Carlo importance sampling cannot be used to evaluate the integral.

Reweighting procedure

A field theory with a non-positive weight can be transformed to one with a positive weight, by incorporating the non-positive part (sign or complex phase) of the weight into the observable. For example, one could decompose the weighting function into its modulus and phase,

$\rho [\sigma ]=p[\sigma ]\,\exp(i\theta [\sigma ])$ where $p[\sigma ]$ is real and positive, so

$\langle A\rangle _{\rho }={\frac {\int D\sigma A[\sigma ]\exp(i\theta [\sigma ])\;p[\sigma ]}{\int D\sigma \exp(i\theta [\sigma ])\;p[\sigma ]}}={\frac {\langle A[\sigma ]\exp(i\theta [\sigma ])\rangle _{p}}{\langle \exp(i\theta [\sigma ])\rangle _{p}}}$ Note that the desired expectation value is now a ratio where the numerator and denominator are expectation values that both use a positive weighting function, $p[\sigma ]$ . However, the phase $\exp(i\theta [\sigma ])$ is a highly oscillatory function in the configuration space, so if one uses Monte Carlo methods to evaluate the numerator and denominator, each of them will evaluate to a very small number, whose exact value is swamped by the noise inherent in the Monte Carlo sampling process. The "badness" of the sign problem is measured by the smallness of the denominator $\langle \exp(i\theta [\sigma ])\rangle _{p}$ : if it is much less than 1 then the sign problem is severe. It can be shown (e.g.) that

$\langle \exp(i\theta [\sigma ])\rangle _{p}\propto \exp(-fV/T)$ where $V$ is the volume of the system, $T$ is the temperature, and $f$ is an energy density. The number of Monte Carlo sampling points needed to obtain an accurate result therefore rises exponentially as the volume of the system becomes large, and as the temperature goes to zero.

The decomposition of the weighting function into modulus and phase is just one example (although it has been advocated as the optimal choice since it minimizes the variance of the denominator ). In general one could write

$\rho [\sigma ]=p[\sigma ]{\frac {\rho [\sigma ]}{p[\sigma ]}}$ where $p[\sigma ]$ can be any positive weighting function (for example, the weighting function of the $\mu =0$ theory.) The badness of the sign problem is then measured by

$\left\langle {\frac {\rho [\sigma ]}{p[\sigma ]}}\right\rangle _{p}\propto \exp(-fV/T)$ which again goes to zero exponentially in the large-volume limit.

Methods for reducing the sign problem

The sign problem is NP-hard, implying that a full and generic solution of the sign problem would also solve all problems in the complexity class NP in polynomial time. If (as is generally suspected) there are no polynomial-time solutions to NP problems (see P versus NP problem), then there is no generic solution to the sign problem. This leaves open the possibility that there may be solutions that work in specific cases, where the oscillations of the integrand have a structure that can be exploited to reduce the numerical errors.

In systems with a moderate sign problem, such as field theories at a sufficiently high temperature or in a sufficiently small volume, the sign problem is not too severe and useful results can be obtained by various methods, such as more carefully tuned reweighting, analytic continuation from imaginary $\mu$ to real $\mu$ , or Taylor expansion in powers of $\mu$ .

There are various proposals for solving systems with a severe sign problem:

• Meron-cluster algorithms. These achieve an exponential speed-up by decomposing the fermion world lines into clusters that contribute independently. Cluster algorithms have been developed for certain theories, but not for the Hubbard model of electrons, nor for QCD, the theory of quarks.
• Stochastic quantization. The sum over configurations is obtained as the equilibrium distribution of states explored by a complex Langevin equation. So far, the algorithm has been found to evade the sign problem in test models that have a sign problem but do not involve fermions.
• Fixed-node method. One fixes the location of nodes (zeros) of the multiparticle wavefunction, and uses Monte Carlo methods to obtain an estimate of the energy of the ground state, subject to that constraint.
• Majorana algorithms. Using Majorana fermion representation to perform Hubbard-Stratonovich transformations can help to solve the fermion sign problem of a class of fermionic many-body models.