Goal:
So far we have discussed the equilibrium properties of disordered systems, that are encoded in their partition function/free energy. When a system (following Langevin, Monte Carlo dynamics) equilibrates at sufficiently large times, its long-time properties are captured by these equilibrium calculations. In glassy systems the equilibration timescales are extremely large: for very large timescales the system does not visit equilibrium configurations, but rather metastable states. In this set of problems, we characterize the energy landscape of the spherical
-spin by studying its metastable states (local minima).
Techniques: conditional probabilities, saddle point, random matrix theory.
Dynamics, optimization, trapping local minima
Convex and rugged energy landscapes.
- Rugged landscapes. Consider the spherical
-spin model:
is an energy landscape . It is a random function on configuration space (the surface
of the sphere). This landscape has its global minimum(a) at the ground state configuration(s): the energy density of the ground state(s) can be obtained studying the partition function
in the limit
. Besides the ground state(s), the energy landscape can have other local minima; fully-connected models of glasses are characterized by the fact that there are plenty of these local minima: the energy landscape is rugged, see the sketch.
- Optimization by gradient descent. Suppose that we are interested in finding the configurations of minimal energy, starting from an arbitrary configuration
: we can implement a dynamics in which we progressively update the configuration moving towards lower and lower values of the energy, hoping to eventually converge to the ground state(s). The simplest dynamics of this sort is gradient descent,
where
is the gradient of the landscape restricted to the sphere. The dynamics stops when it reaches a stationary point , a configuration where
. If the landscape has a convex structure, this will be the ground state; if the energy landscape is very non-convex like in glasses, the end point of this algorithm will be a local minimum at energies much higher than the ground state (see sketch).
- Stationary points and complexity. To guess where gradient descent dynamics (or Langevin dynamics ) are expected to converge, it is useful to understand the distribution of the stationary points, i.e. the number
of such configuration having a given energy density
. In fully-connected models, this quantity has an exponential scaling,
, where
is the landscape’s complexity. [*] . Stationary points can be stable (local minima), or unstable (saddles or local maxima): their stability is encoded in the spectrum of the Hessian matrix
: when all the eigenvalues of the Hessian are positive, the point is a local minimum (and a saddle otherwise).
- [*] - This quantity looks similar to the entropy
we computed for the REM in Problem 1. However, while the entropy counts all configurations at a given energy density, the complexity
accounts only for the stationary points.
Problems
In these problems, we discuss the computation of the annealed complexity of the spherical
-spin model, which is defined by
Problem 5.1: the Kac-Rice formula and the complexity
- The Kac-Rice formula. Consider first a random function of one variable
defined on an interval
, and let
be the number of points
such that
. Justify why
where
is the probability density that
is a zero of the function.
In particular, why is the derivative of the function appearing in this formula? Consider now the number of stationary points
of the
-spin energy landscape, which satisfy
. Justify why the generalization of the formula above gives
where
is the probability density that
is a stationary point of energy density
, and
is the Hessian matrix of the function
restricted to the sphere.
- Statistical rotational invariance. Recall the expression of the correlations of the energy landscape of the
-spin computed in Problem 3.1: in which sense the correlation function is rotationally invariant? Justify why rotational invariance implies that
where
is one fixed vector belonging to the surface of the sphere. Where does the prefactor arise from?
- Gaussianity and correlations.
- Determine the distribution of the quantity
.
- The entries of
are Gaussian variables. One can show that the
components of
are uncorrelated to
; they have zero mean and covariances
Compute the probability density that
.
- The
matrix
conditioned to the fact that
can be written as
where the matrix
has random entries with zero average and correlations
Combining this with the results above, show that
Problem 5.2: the Hessian and random matrix theory
To get the complexity, it remains to compute the expectation value of the determinant of the Hessian matrix: this is the goal of this problem. We will do this exploiting results from random matrix theory.
- Gaussian Random matrices. Show that the matrix
is a GOE matrix, i.e. a matrix taken from the Gaussian Orthogonal Ensemble, meaning that it is a symmetric matrix with distribution
What is the value of
?
- Eigenvalue density and concentration. Let
be the eigenvalues of the matrix
. Show that the following identity holds:
where
is the empirical eigenvalue density. It can be shown that if
is a GOE matrix, the distribution of the empirical density has a large deviation form (recall TD1) with speed
, meaning that
where now
is a functional (a function of a function). Using a saddle point argument, show that this implies
where
is the typical value of the eigenvalue density, which satisfies
.
- The semicircle and the complexity. The eigenvalue density of GOE matrices is self-averaging, and it equals to
- Check this numerically: generate matrices for various values of
, plot their empirical eigenvalue density and compare with the asymptotic curve. Is the convergence faster in the bulk, or in the edges of the eigenvalue density, where it vanishes?
- Combining all the results, show that the annealed complexity is
The integral
can be computed explicitly, and one finds:
Plot the annealed complexity, and determine numerically where it vanishes: why is this a lower bound or the ground state energy density?
- The threshold and the stability.
Sketch
for different values of
; recalling that the Hessian encodes for the stability of the stationary points, show that there is a transition in the stability of the stationary points at the critical value of the energy density
When are the critical points stable local minima? When are they saddles? Why the stationary points at
are called marginally stable ?
Check out: key concepts
Gradient descent, rugged landscapes, metastable states, Hessian matrices, random matrix theory, landscape’s complexity.