Problem 6: the Hessian at the stationary points, and random matrix theory
This is a continuation of problem 5. To get the complexity of the spherical -spin, it remains to compute the expectation value of the determinant of the Hessian matrix: this is the goal of this problem. We will do this exploiting results from random matrix theory discussion in the Tutorial and Exercise 4 .
Gaussian Random matrices. Show that the matrix , defined in Problem 5, is a GOE matrix, i.e. a matrix taken from the Gaussian Orthogonal Ensemble, meaning that it is a symmetric matrix with distribution
where is a normalization. What is the value of ?
Eigenvalue density and concentration. Let be the eigenvalues of the matrix . Show that the following identity holds:
where is the empirical eigenvalue distribution. It can be shown that if is a GOE matrix, the distribution of the empirical distribution has a large deviation form with speed , meaning that where now is a functional. Using a saddle point argument, show that this implies
where is the typical value of the eigenvalue density, which satisfies .
The semicircle and the complexity. The eigenvalue density of GOE matrices is self-averaging, and it equals to
Combining all the results, show that the annealed complexity is
The integral can be computed explicitly, and one finds:
Plot the annealed complexity, and determine numerically where it vanishes: why is this a lower bound or the ground state energy density?
The threshold and the stability.
Sketch for different values of ; recalling that the Hessian encodes for the stability of the stationary points, show that there is a transition in the stability of the stationary points at the critical value of the energy density
When are the critical points stable local minima? When are they saddles? Why the stationary points at are called marginally stable ?
Back to dynamics: quenches, and dynamical transitions
Through Problems 5 and 6, we have shown that the energy landscape of the spherical -spin model has exponentially many stationary points , and that there is a transition at the energy density : for the stationary points are saddles, for they are local minima. Let us try to deduce something on the systems's dynamics out of this.
Gradient descent dynamics. The local minima are dynamically stable: if we do gradient descent, we get stuck in a local minimum and we exert a small perturbation to the configuration, gradient descent brings us back to the local minimum. These configurations are trapping. If we try to optimize the landscape, i.e. to reach the ground state, with gradient descent dynamics, we expect that we will not be able to reach it easily, as we will be trapped by local minima. In fact, for the spherical -spin model it can be shown that starting from random initial conditions and evolving the configuration with gradient descent (possibly with infinitesimal noise, to be sent to zero with a protocol),
The system gets stuck at the energy density level where local minima start to appear, and does not reach the deeper local minima.
Quenches in temperature and equilibration. We can generalize this protocol to higher : we extract randomly the initial condition of the dynamics, and then we evolve the configuration with Langevin dynamics (gradient descent + noise):
In Langevin dynamics, a Gaussian vector at each time , uncorrelated from the vectors at other times , with zero average and constant variance proportional to temperature. It represents the action of a thermal bath on the system.
This dynamical protocol is called a quench . The question we can ask is: does the system equilibrate with the bath under this dynamics? If yes, we should see that
where is the equilibrium energy density at the temperature , the same one controlling the strength of the noise. Equilibrating with the bath would indeed imply that at large time the system visits uniformly the equilibrium energy shell.
Dynamical transition. Now, in the spherical -spin we know that if , the energy shell has many stationary points, but they are all unstable saddles and do not trap the dynamics. We expect that this energy shell is relatively easy to explore dynamically, and that equilibration takes place. On the other hand, if , in the equilibrium energy shell and at higher energy, there are exponentially many local minima that trap the dynamics, and we expect that reaching equilibrium configurations will be difficult. This tells us that there exists a critical , defined by
such that for
The statement above for gradient descent corresponds to the special case . is called the dynamical transition temperature.
Fig. 6 - Activated jump across an energy barrier.
Equilibration timescales. Does it mean that when , the system never equilibrates? This is true only in the limit . When is finite, there is a timescale beyond which the system equilibrates. However, this equilibration timescale
in the spherical -spin scales as
This is again due to the presence of many local minima/ metastable states, that are separated by extensive energy barriers. So, when we take before taking the large time limit, we are unable to see equilibration and we have a sharp transition, which becomes a crossover for finite .
Activation and Arrhenius law. Why exponential timescales? When the noise in the Langevin dynamics is weak (temperature is small), the dynamics gets stuck in local minima for very large time. This time depends crucially on the energy barrier which separate the minimum from the other configurations (see Fig 6.1). The Arrhenius law states that the typical timescale required to escape from a local minimum through a barrier of height with thermal dynamics with inverse temperature scales as . Since in the spherical -spin we have , then . A dynamics made of jumps from minimum to minimum through the crossing of energy barriers is called activated dynamics .
Check out: key concepts
Metastable states, Hessian matrices, random matrix theory, landscape’s complexity.