T-5: Difference between revisions
| Line 10: | Line 10: | ||
| [[File:Landscapes-GDD.png|thumb|right| | [[File:Landscapes-GDD.png|thumb|right|x200px|Entropy of the Random Energy Model]] | ||
| <br> | <br> | ||
Revision as of 14:14, 5 January 2024
Goal:  
So far we have discussed the equilibrium properties of disordered systems, that are encoded in their partition function and free energy. In this set of problems, we characterize the energy landscape of the spherical -spin, by determining the number of its stationary points.
Key concepts:   gradient descent, out-of-equilibrium dynamics, metastable states, Hessian matrices, random matrix theory, Langevin dynamics,?
Dynamics, optimization, trapping local minima
- Consider the spherical -spin model discussed in the Problems 2 and 3; The function is an energy landscape : it is a random function defined on configuration space, which is the space all configurations belong to. This landscape has its global minimum(a) at the ground state configuration(s): the energy density of the ground state(s) can be obtained studying the partition function in the limit . Besides the ground state(s), the energy landscape can have other local minima; the fully-connected models of glasses are characterized by the fact that there are plenty of these local minima, see SKETCH.
-  Suppose that we are interested in finding the configurations of minimal energy of some model with energy landscape , starting from an arbitrary initial configuration : we can think about a dynamics in which we progressively update the configuration of the system moving towards lower and lower values of the energy, hoping to eventually converge to the ground state(s). The simplest dynamics of this sort is gradient descent,
where the configuration changes in time moving in the direction of the gradient of the energy landscape restricted to the sphere, . The dynamics stops when it reaches a stationary point , i.e. a configuration where . If the landscape has a simple, convex structure, this will be the ground state one is seeking for; if the energy landscape is very non-convex like in glasses, the end point of this algorithm will be a local minimum at energies much higher than the ground state. SKETCH 
- To understand the structure of the energy landscape and to guess where gradient descent dynamics (or its variation) are expected to converge, it is useful to characterize the distribution of the stationary points, i.e. the number of such configuration having a given energy density . In fully-connected models of glasses, this quantity has an exponential scaling, , where is the complexity of the landscape. [1]
ADD HESSIAN
Problem 5.1: the Kac-Rice method and the complexity
In this Problem, we set up the computation of the annealed complexity of the spherical -spin model, which is defined by
-   The Kac-Rice formula. Consider first a random function of one variable  defined on an interval , and let  be the number of points  such that . Justify why
where is the probability density that is a zero of the function. In particular, why is the derivative of the function appearing in this formula? Consider now the number of stationary points of the -spin energy landscape, which satisfy . Justify why the generalization of the formula above gives where is the probability density that is a stationary point of energy density , and is the Hessian matrix of the function restricted to the sphere.[2] 
-  Statistical rotational invariance. Recall the expression of the correlations of the energy landscape of the -spin computed in Problem 2.1: in which sense the correlation function is rotationally invariant? Justify why rotational invariance implies that 
where is one fixed vector belonging to the surface of the sphere. Where does the prefactor arise from? 
-  Gaussianity and correlations. Determine the distribution of the quantity . Show that the components of the vector  are Gaussian random variables with zero mean and covariances
The quantity can be shown to be uncorrelated to . The entries of the matrix are also Gaussian variables. Computing their correlation, one finds that the matrix conditioned to the fact that can be written as where the matrix has random entries with zero average and correlations Combining everything, show that this implies 
Problem 5.2: the Hessian and random matrix theory
To get the complexity, it remains to compute the expectation value of the determinant of the Hessian matrix: this is the goal of this problem. We will do this exploiting results from random matrix theory.
- Gaussian Random matrices. Show that the matrix is a GOE matrix, i.e. a matrix taken from the Gaussian Orthogonal Ensemble, meaning that it is a symmetric matrix with distribution What is the value of ?
-  Eigenvalue density and concentration.  Let Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle  \lambda_\alpha }
 be the eigenvalues of the matrix Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle  G }
. Show that the following identity holds:
Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \overline{|\text{det} \left(G - p \epsilon \mathbb{I} \right)|}= \overline{\text{exp} \left[(N-1) \left( \int d \lambda \, \rho_N(\lambda) \, \log |\lambda - p \epsilon|\right) \right]}, \quad \quad \rho_{N}(\lambda)= \frac{1}{N-1} \sum_{\alpha=1}^{N-1} \delta (\lambda- \lambda_\alpha) } where Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \rho_{N}(\lambda)} is the empirical eigenvalue density. It can be shown that if Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle G } is a GOE matrix, the distribution of the empirical density has a large deviation form (recall TD1) with speed Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle N^2 } , meaning that Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle P_N[\rho] = e^{-N^2 \, g[\rho]} } where now Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle g[\cdot] } is a functional (a function of a function). Using a saddle point argument, show that this implies Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \overline{\text{exp} \left[(N-1) \left( \int d \lambda \, \rho_N(\lambda) \, \log |\lambda - p \epsilon|\right) \right]}=\text{exp} \left[N \left( \int d \lambda \, \rho_{\text{ty}}(\lambda+p \epsilon) \, \log |\lambda|\right)+ o(N) \right] } where Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \rho_{\text{ty}}(\lambda) } is the typical value of the eigenvalue density, which satisfies Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle g[\rho_{\text{ty}}]=0 } . 
-  The semicircle, the threshold and the ground state. The eigenvalue density of GOE matrices is self-averaging, and it equals to 
Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \lim_{N \to \infty}\rho_N (\lambda)=\lim_{N \to \infty} \overline{\rho_N}(\lambda)= \rho_{\text{ty}}(\lambda)= \frac{1}{2 \pi \sigma^2}\sqrt{4 \sigma^2-\lambda^2 } } - Check this numerically: generate matrices for various values of Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle N } , plot their empirical eigenvalue density and compare with the asymptotic curve. Is the convergence faster in the bulk, or in the edges of the eigenvalue density, where it vanishes?
-  Sketch Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle  \rho_{\text{ty}}(\lambda+p \epsilon) }
 for different values of Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle  \epsilon }
; recalling that the Hessian encodes for the stability of the stationary points, show that there is a transition in the stability of the stationary points at a critical value of the energy density 
Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \epsilon_{\text{th}}= -\sqrt{\frac{2(p-1)}{p}} } When are the critical point stable local minima? When are they saddles? Why the stationary points at Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \epsilon= \epsilon_{\text{th}}} are called marginally stable ? 
-  Combining all the results, show that the annealed complexity is
Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \Sigma_{\text{a}}(\epsilon)= \frac{1}{2}\log [4 e (p-1)]- \epsilon^2+ I_p(\epsilon), \quad \quad I_p(\epsilon)= \frac{2}{\pi}\int d x \sqrt{1-\left(x- \frac{\epsilon}{ \epsilon_{\text{th}}}\right)^2}\, \log |x| , \quad \quad \epsilon_{\text{th}}= -\sqrt{\frac{2(p-1)}{p}}. } The integral Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle I_p(\epsilon)} can be computed explicitly, and one finds: Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle I_p(\epsilon)= \begin{cases} &\frac{\epsilon^2}{\epsilon_{\text{th}}^2}-\frac{1}{2} - \frac{\epsilon}{\epsilon_{\text{th}}}\sqrt{\frac{\epsilon^2}{\epsilon_{\text{th}}^2}-1}+ \log \left( \frac{\epsilon}{\epsilon_{\text{th}}}+ \sqrt{\frac{\epsilon^2}{\epsilon_{\text{th}}^2}-1} \right)- \log 2 \quad \text{if} \quad \epsilon \leq \epsilon_{\text{th}}\\ &\frac{\epsilon^2}{\epsilon_{\text{th}}^2}-\frac{1}{2}-\log 2 \quad \text{if} \quad \epsilon > \epsilon_{\text{th}} \end{cases} } Plot the annealed complexity, and determine numerically where it vanishes: why is this a lower bound or the ground state energy density? 
 
Notes
- [1] - This quantity looks similar to the entropy we computed for the REM in Problem 1.1. However, while the entropy counts all configurations at a given energy density, the complexity Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \Sigma(\epsilon) } accounts only for the stationary points.
- [2] - We define with Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \hat \Pi(\vec{\sigma}) } the projector on the tangent plane to the sphere at Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \vec{\sigma}} : this is the plane orthogonal to the vector Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \vec{\sigma}} . The gradient Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \nabla_\perp E(\vec{\sigma}) } is a Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle (N-1)} -dimensional vector that is obtained projecting the gradient Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle [\nabla E(\vec{\sigma})]_i=\partial E/\partial \sigma_i } on the tangent plane, Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \nabla_\perp E(\vec{\sigma})=\hat \Pi \nabla E(\vec{\sigma})} . The Hessian Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \nabla^2_\perp E(\vec{\sigma}) } is a Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle (N-1) \times (N-1)} -dimensional matrix that is obtained from the Hessian Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle [\nabla^2 E(\vec{\sigma})]_{ij}=\partial^2 E/\partial \sigma_i \partial \sigma_j } as Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \nabla^2_\perp E(\vec{\sigma})= \hat \Pi(\vec{\sigma}) \, \nabla^2 E(\vec{\sigma}) \, \hat\Pi(\vec{\sigma}) - N^{-1}\nabla E(\vec{\sigma}) \cdot \vec{\sigma} \mathbb{I}} where Failed to parse (SVG (MathML can be enabled via browser plugin): Invalid response ("Math extension cannot connect to Restbase.") from server "https://wikimedia.org/api/rest_v1/":): {\displaystyle \mathbb{I}} is the identity matrix.
