T-I: Difference between revisions

From Disordered Systems Wiki
 
(224 intermediate revisions by the same user not shown)
Line 1: Line 1:
<strong>Goal: </strong> derive the equilibrium phase diagram of the simplest spin-glass model, the Random Energy Model (REM). The REM is defined assigning to each configuration <math>\alpha</math> of the system a random energy <math>E_\alpha</math>. The random energies are independent, taken from a Gaussian distribution
<strong>Goal: </strong> understanding the energy landscape of the simplest spin-glass model, the Random Energy Model (REM). <br>
<math> p(E) =(2 \pi N)^{-1/2}e^{-\frac{E^2}{2 N}}</math>.
<strong> Techniques: </strong> probability theory, saddle point approximation.
<br>
<br>




<strong>Key concepts: </strong> average value vs typical value, large deviations, rare events, saddle point approximation, self-averaging quantities, freezing transition.
=== Some probability notions relevant at large N ===
<br>
<br>
== A dictionary for large-N disordered systems ==
<br>
<ul>


<ul>
<li> We will discuss disordered systems with <math> N </math> degrees of freedom (for instance, for a spin system on a lattice of size <math>L</math> in dimension <math> d</math>, <math> N = L^d </math>). Since the systems are random, the quantities that describe their properties (the free energy, the number of configurations of the system that satisfy a certain property, the magnetization, etc.) are also random variables, with a distribution. In this discussion we denote these random variables generically with <math> X_N, Y_N </math> (where the subscript denotes the number of degrees of freedom) and with <math> P_{X_N}(x), P_{Y_N}(y) </math> their distributions. The goal of statistical physics is to characterize the behavior of these quantities in the limit <math> N \to \infty</math>.
<li> We will consider random variables <math> X_N </math> which depend on a parameter <math> N \gg 1 </math> (for instance, the size of the system) and which have the scaling <math> X_N \sim O(e^{N}) </math>; the scaling means that the rescaled variable <math> Y_N=N^{-1}\log |X_N| </math> has a well-defined distribution that remains of <math> O(1) </math> when <math> N \to \infty </math>. A standard example is the partition function of a disordered system with <math> N </math> degrees of freedom, <math> Z \sim e^{-\beta N f} </math>: here <math> X_N \to Z </math> and <math> Y_N \to -\beta f </math>, where <math> f </math> is the free energy.
</li>
</li>
<br>
<br>


<li>A random variable depending on a parameter <math> N </math> is <ins>self-averaging </ins> when its distribution converges to a delta function at a single value as <math> N \to \infty </math>. When the random variable is not self-averaging, it remains distributed in the limit <math> N \to \infty </math>. If <math> Y_N </math> is self-averaging, then
<li> <ins>'''Self-averagingness.'''</ins> The physics of disordered systems is described by quantities that are distributed when <math> N </math> is finite (they take different values from sample to sample of the system), but for which sample-to-sample fluctuations are suppressed when <math> N \to \infty</math>. These quantities are said to be <ins>self-averaging</ins>.
A random variable <math> Y_N </math> is self-averaging when, in the limit <math> N \to \infty </math>, its distribution concentrates around the average, collapsing to a deterministic value:
<center>  
<center>  
<math>
<math>
\lim_{N \to \infty} Y_N =\lim_{N \to \infty}  \overline{Y_N}
\lim_{N \to \infty} Y_N =\lim_{N \to \infty}  \overline{Y_N}:= {Y_\infty}, \quad \quad  \overline{Y_N}=\int \, dy\, P_{Y_N}(y)\, y
</math>
</math>
</center>
</center>
This happens when its fluctuations are small compared to the average, meaning that <sup>[[#Notes|[*] ]]</sup>
<center>
<math>
\lim_{N \to \infty} \frac{\overline{Y_N^2}}{\overline{Y_N}^2}=1.
</math>
</center>
When the random variable is not self-averaging, it remains distributed in the limit <math> N \to \infty </math>. When it is self-averaging, <ins><em> sample-to-sample </em> fluctuations are suppressed</ins> when <math>N </math> is large.
<br>
'''Example 1.''' Consider the partition function of a disordered system at inverse temperature <math> \beta</math>,  <math> Z_N(\beta) </math>. When <math> N </math> is large this random variable has an exponential scaling, <math> Z_N(\beta) \sim e^{-\beta N f_N(\beta)} </math>, where the variable <math> f_N(\beta) </math> is the free energy density. This scaling means that the random variable  <math> \beta f_N=-N^{-1}\log Z_N </math> has a well-defined distribution that remains of <math> O(1) </math> when <math> N \to \infty </math>. In all the models of disordered systems we will consider in these lectures, the free energy not only has a well-defined distribution in the limit, but is also self-averaging. This is a very important property: it implies that the free energy (and therefore all the thermodynamic observables, which can be obtained by taking derivatives of the free energy) does not fluctuate from sample to sample when <math> N </math> is large, and so the physics of the system does not depend on the particular sample. While intensive quantities like <math> f_N </math> are self-averaging, quantities scaling exponentially like the partition function <math> Z_N </math> are not necessarily so: in particular, we will see that they are not when the system is in a glassy phase.
<br>
'''Example 2.''' The partition function is an example of exponentially-scaling variable <math> X_N \sim e^{N Y_N} </math>, where the rescaled variable  <math> Y_N </math> is self-averaging while  <math> X_N </math> may not be. Another example is given in Problem 1 below, where <math> X_N \to \mathcal{N}_N(E) </math> and <math> Y_N \to S(E) </math>.
</li>
</li>
<br>
<br>


<li> Let <math> P_N(x), P_N(y) </math> be the distributions of <math> X_N </math> and <math> Y_N </math>. In general, quantities like <math> Y_N </math> have a distribution that for large <math> N </math> takes the form <math> P_N(y) \sim e^{-N^\alpha g(y)+o(N^\alpha)} </math> where <math> g(y) </math> is some non-negative function and <math> \alpha>0 </math>. This is called a <ins> large deviation form </ins> for the probability distribution, with <ins> speed </ins> <math> N^\alpha </math>. This distribution is of <math> O(1) </math> for the value <math> y_{\text{ty}} </math> such that <math> g(y_{\text{ty}})=0 </math>: this value is the <ins> typical value </ins> of <math> Y_N </math>; all the other values of <math> y </math> are associated to a probability that is exponentially small in <math> N^\alpha</math>: they are <ins>exponentially rare</ins>.
 
</li>
 
 
 
<li> <ins>'''Typical and rare.'''</ins> The typical value of a random variable is the value at which its distribution peaks (it is the most probable value). Values in the tails of the distribution, where the probability density is small (for instance, vanishing as <math> N \to \infty </math>), are said to be rare. For self-averaging quantities, in the limit <math> N \to \infty </math> the distribution collapses to a single value, which is both the average and the typical value. In general, the average and the typical value of a random variable may not coincide: this happens when the average is dominated by values that are rare, associated to a small probability of occurrence and thus to the tails of the distribution. Let’s see this with an example.
<br>
<br>


<li>Averages over distributions having a large deviation form can usually be computed with the <ins> saddle point approximation </ins> for large <math> N </math>. Let’s fix <math> \alpha=1 </math>. If <math> f(Y_N) </math> is a function of <math> Y_N </math> which scales slower than exponential of <math> N </math>, then
<ul>'''Example: typical vs average.''' Often, quantities like <math> Y_N </math> have a distribution that for large <math> N </math> takes the form <math> P_{Y_N}(y) \sim e^{-N^\alpha g(y)+ \text{subleading}} </math> where <math> g(y) </math> is some non-negative function and <math> \alpha>0 </math>. This is called a <ins> large deviation form </ins> for the probability distribution, with <ins> speed </ins> <math> N^\alpha </math>. This distribution is of <math> O(1) </math> for the value <math> y^{\text{typ}} </math> such that <math> g(y^{\text{typ}})=0 =g'(y^{\text{typ}})</math>: this value is the <ins> typical value </ins> of <math> Y_N </math> (asymptotically at large <math> N </math>); all the other values of <math> y </math> are associated to a probability that is exponentially small in <math> N^\alpha</math>: they are <ins>exponentially rare</ins>.
Consider now an exponentially scaling quantity like <math> X_N = e^{N Y_N} </math>, and let’s fix <math> \alpha=1 </math>. The asymptotic typical values <math> x^{\text{typ}} </math> and  <math> y^{\text{typ}} </math> are related by:
<center>
<center>
  <math>  \overline{f(Y_N)} =\int dy P_N(y) f(y)= \int dy\, e^{-N g(y)+o(N)} f(y) \sim f(y_{\text{ty}})
  <math>  y^{\text{typ}}=\lim_{N \to \infty} \frac{\log x^{\text{typ}} }{N},
</math></center>
</math></center>
because the integral is dominated by the region where <math> g(y)=0 </math>, since all the other contributions are exponentially suppressed. This also implies
so the scaling of <math> x^{\text{typ}} </math> is <math> x^{\text{typ}}\sim e^{N y^{\text{typ}}} </math>. Let us now look at the scaling of the average.
<center>  
The average of <math> X_N </math> can be computed with the <ins> saddle point approximation </ins> for large <math> N </math>:
<math>
<center>
\lim_{N \to \infty} Y_N =\lim_{N \to \infty} \overline{Y_N}= y_{\text{ty}}
<math> \overline{X_N} =\int dy\, P_{Y_N}(y)\, e^{N y}= \int dy\, e^{N[y- g(y)]+o(N)} =e^{N [y^*-g(y^*)]+ o(N) },
</math>
</math></center>
</center>
where <math> y^* </math> is the point maximising the shifted function <math> \tilde{g}(y)= y-g(y)</math>. In this example, <math> y^* \neq y^{\text{typ}} </math>: the asymptotic behaviour of the average value of <math> X_N </math> is different from that of the typical value. In particular, the average is dominated by rare events, i.e. realisations in which <math> Y_N </math> takes the value <math> y^*</math>, whose probability of occurrence is exponentially small.
</ul>


</li>
</li>
<br>
<br>


<li> We define the typical value of <math> X_N </math> as 
 
 
<li> <ins>'''Quenched averages.'''</ins> Let us go back to <math> X_N </math>: how to get <math> y^{\text{typ}} </math> from it? When <math> Y_N </math> is self-averaging,
<center>
<center>
  <math>   
  <math>   
  x_{\text{ty}} =\text{exp}\left({N y_{\text{ty}}}\right)=\text{exp}\left( \overline{\log |X_N|}\right)
  y^{\text{typ}} =\lim_{N \to \infty} \overline{Y_N}= \lim_{N \to \infty} \frac{\overline{\log X_N}}{N} \equiv \lim_{N \to \infty} \frac{{\log x^{\text{typ}}}}{N}
</math></center>
</math></center>
which are equivalent definitions since
where in the last equality we have used that <math> y^{\text{typ}}= \lim_{N \to \infty} N^{-1} \log x^{\text{typ}}_N </math>.
In the language of disordered systems, computing the typical value of <math> X_N </math> through the average of its logarithm  corresponds to performing a <ins> quenched average</ins>: from this average, one extracts the correct asymptotic value of the self-averaging quantity <math> Y_N </math>.</li><br>
 
<li> <ins>'''Annealed averages.'''</ins> The quenched average does not necessarily coincide with the <ins> annealed average</ins>, defined as:
<center>
<math>   
<math>   
\overline{\log |X_N|}=\int dx P_N(x) \log |x|  = N \int dy P_N(y) \, y= N \int dy\, e^{-N g(y)+o(N)} \, y \sim N  y_{\text{ty}}.
y_{\text{a}} = \lim_{N \to \infty} \frac{\log \overline{X_N}}{N}.
</math>
</math>
Notice that while the average value of <math> Y_N </math> coincides with the typical value (choose <math> f(y)=y</math> in the formula above), this is in general not the case for quantities growing exponentially fast like <math> X_N </math>: the average value of these exponentially scaling quantities is in general much larger than the typical value, meaning that the general inequality
</center>
<center>
In fact, <math>  \overline{\log X_N} \leq \log \overline{X_N}</math> always holds by Jensen's inequality, since the logarithm is concave.
  <math>   
When the inequality is strict and quenched and annealed averages are not the same, it means that <math> X_N </math> is not self-averaging, and its average value is exponentially larger than the typical value (because the average is dominated by rare events). In this case, to get the correct limit of the self-averaging quantity <math> Y_N </math> one has to perform the quenched average.<sup>[[#Notes|[**] ]]</sup> This is what happens in glassy phases.
\frac{\overline{\log |X_N|}}{N} \leq \frac{\log |\overline{X_N}|}{N}
</math></center>
is strict. When <math> N \to \infty </math>, the quantity on the left-hand-side is <math>  y_{\text{ty}} </math>, which we will also call <ins> quenched </ins> in the following; the quantity on the right-hand-side is different: we call it  <ins> annealed</ins> and define it with <math> y_{\text{a}} </math>.
</li>
</li>
<br>
<br>
</ul>


</ul>
<div style="font-size:89%">
: <small>[*]</small> - See  [[Media:SelfAvMathNote.pdf| here]] for a note on the equivalence of these two criteria.
</div>
 
<div style="font-size:89%">
: <small>[**]</small> - Notice that the opposite is not true: one can have situations in which the partition function is not self-averaging, but still the quenched free energy coincides with the annealed one.
</div>
 
 
 
<br>
 
== Problems ==
This problem and the one of next week deal with the Random Energy Model (REM). The REM was introduced in <sup>[[#Notes|[1] ]]</sup>. In the REM the system can take <math> M=2^N </math> configurations <math> \vec \sigma^\alpha=(\sigma^\alpha_1, \cdots, \sigma^\alpha_N)</math> with <math> \sigma^\alpha_i = \pm 1 </math>. Each configuration <math>\alpha=1, \cdots, 2^N</math> is assigned a random energy <math>E_\alpha</math>. The random energies are independent and drawn from a Gaussian distribution
<math> p(E) =( 2 \pi N)^{-1/2}e^{-\frac{E^2}{2 N}}.</math>


=== Problem 1.1: the energy landscape of the REM ===
=== Problem 1: the energy landscape of the REM ===


[[File:Entropy REM.png|thumb|left|x140px|Entropy of the Random Energy Model]]
[[File:Entropy REM.png|thumb|left|x140px|Entropy of the Random Energy Model]]


In this problem we study the random variable <math> \mathcal{N}(E)dE </math>, that is the number of configurations having energy  <math> E_\alpha \in [E, E+dE] </math>. We show that for large <math> N </math> it scales as <math>\mathcal{N}(E) = e^{N S\left( E/N\right) + o(N)}</math>
In this problem we study the random variable <math> \mathcal{N}_N(E)dE </math>, that is the number of configurations having energy  <math> E_\alpha \in [E, E+dE] </math>. For large <math> N </math> this variable scales exponentially <math>\mathcal{N}_N(E) \sim  e^{N S_N\left( E/N\right)}</math>. Let <math> \epsilon=E/N </math>. Through this exercise we show that the asymptotic value of the entropy <math> S_N(\epsilon) </math>, that is self-averaging, is given by:
. We show that the typical value of <math> S(\epsilon) </math>, the quenched entropy of the model (see sketch), is given by:
<center><math>
<center><math>
S(\epsilon)=\begin{cases}
\lim_{N \to \infty} S_N(\epsilon)=S_\infty(\epsilon)=\begin{cases}
  \log 2- \epsilon^2 \quad &\text{ if } |\epsilon| \leq \sqrt{\log 2} \\
  \log 2- \frac{\epsilon^2}{2} \quad &\text{ if } |\epsilon| \leq \sqrt{2 \, \log 2} \\
0 \quad &\text{ if } |\epsilon| >\sqrt{\log 2}
- \infty \quad &\text{ if } |\epsilon| >\sqrt{2 \, \log 2}
\end{cases}
\end{cases}
</math></center>
</math></center>
The point where the entropy vanishes, <math> \epsilon=- \sqrt{\log 2} </math>, is the energy density of the ground state. The entropy is maximal at  <math> \epsilon=0 </math>: the highest number of configurations have vanishing energy density.  
The point where the entropy vanishes, <math> \epsilon=- \sqrt{2 \, \log 2} </math>, is the energy density of the ground state. The entropy is maximal at  <math> \epsilon=0 </math>: the largest number of configurations have vanishing energy density. We set  <math> S(\epsilon):=S_\infty(\epsilon) </math>.
 




<ol>
<ol>
<li> <em> Averages: the annealed entropy.</em> We begin by computing the annealed entropy <math> S_{\text{a}} </math>, which is defined by the average <math> \overline{\mathcal{N}(E)}= \text{exp}\left(N S_{\text{a}}\left( E/N \right)+ o(N)\right) </math>. Compute this function using the representation <math> \mathcal{N}(E)dE= \sum_{\alpha=1}^{2^N} \chi_\alpha(E) dE \;</math>  [with <math> \chi_\alpha(E)=1</math> if  <math> E_\alpha \in [E, E+dE]</math> and  <math> \chi_\alpha(E)=0</math> otherwise]. When does <math> S_{\text{a}}  </math> coincide with <math> S </math>?</li>
<li> <em> Averages: the annealed entropy.</em> We begin by computing the annealed entropy <math> S_{\text{a}} </math>, which is defined by the average <math> \overline{\mathcal{N}_N(E)}= \text{exp}\left(N S_{\text{a}}\left( E/N \right)+ o(N)\right) </math>. Compute this function using the representation <math> \mathcal{N}_N(E)dE= \sum_{\alpha=1}^{2^N} \chi_\alpha(E) dE \;</math>  [with <math> \chi_\alpha(E)=1</math> if  <math> E_\alpha \in [E, E+dE]</math> and  <math> \chi_\alpha(E)=0</math> otherwise]. </li>
</ol>
</ol>
<br>
<br>


<ol start="2">
<ol start="2">
<li><em> Self-averaging.</em> For  <math> |\epsilon| \leq  \sqrt{\log 2} </math> the quantity <math> \mathcal{N} </math> is self-averaging: its distribution concentrates around the average value <math> \overline{\mathcal{N}} </math> when  <math> N \to \infty </math>. Show that <math>\sigma^2= \overline{\mathcal{N}^2}- \overline{\mathcal{N}}^2 \sim \overline{\mathcal{N}}</math>; by the central limit theorem, show that <math> \mathcal{N}  </math> is self-averaging when <math> |\epsilon|< \sqrt{\log 2} </math>. This is no longer true in the region where the annealed entropy is negative: why does one expect fluctuations to be relevant in this region?</li>
<li><em> Self-averaging.</em> For  <math> |\epsilon| \leq  \sqrt{2 \, \log 2} </math> the quantity <math> \mathcal{N}_N </math> is self-averaging: its distribution concentrates around the average value <math> \overline{\mathcal{N}_N} </math> when  <math> N \to \infty </math>. Show this by computing the second moment <math>\overline{\mathcal{N}^2_N}</math>. Deduce that <math> S(\epsilon)= S_{\text{a}}(\epsilon)</math> when <math> |\epsilon| \leq  \sqrt{2 \, \log 2} </math>. The self-averaging property no longer holds in the region where the annealed entropy is negative: why does one expect fluctuations to be relevant in this region?</li>
</ol>
</ol>
<br>
<br>


<ol start="3">
<ol start="3">
<li> <em> Rare events vs typical values.</em> For  <math> |\epsilon| > \sqrt{\log 2} </math> the annealed entropy is negative: the average number of configurations with those energy densities is exponentially small in <math> N </math>. This implies that the probability to get configurations with those energies is exponentially small in <math> N </math>: these configurations are rare. Do you have an idea of how to show this, using the expression for  <math> \overline{\mathcal{N}}</math>? What is the typical value of <math> \mathcal{N} </math> in this region? Justify why the point where the entropy vanishes coincides with the ground state energy of the model.</li>
<li> <em> Rare events.</em> For  <math> |\epsilon| > \sqrt{2 \, \log 2} </math> the annealed entropy is negative: the average number of configurations with those energy densities is exponentially small in <math> N </math>. This implies that the probability to get configurations with those energies is exponentially small in <math> N </math>: these configurations are rare. Do you have an idea of how to show this, using the expression for  <math> \overline{\mathcal{N}_N}</math>? What is the typical value of <math> \mathcal{N}_N </math> in this region? Putting everything together, derive the form of the typical value of the entropy density. Why does the point where the entropy vanishes coincide with the ground state energy of the model?</li>
</ol>
</ol>


Line 94: Line 131:
<!--'''Comment:''' this analysis of the landscape suggests that in the large  <math> N </math> limit, the fluctuations due to the randomness become relevant when one looks at the bottom of their energy landscape, close to the ground state energy. We show below that this intuition is correct, and corresponds to the fact that the partition function <math> Z </math> has an interesting behaviour at low temperature.-->
<!--'''Comment:''' this analysis of the landscape suggests that in the large  <math> N </math> limit, the fluctuations due to the randomness become relevant when one looks at the bottom of their energy landscape, close to the ground state energy. We show below that this intuition is correct, and corresponds to the fact that the partition function <math> Z </math> has an interesting behaviour at low temperature.-->


=== Problem 1.2: the free energy and the freezing transition ===
== Check out: key concepts ==


We now compute the equilibrium phase diagram of the model, and in particular the quenched free energy density <math>f </math> which controls the scaling of the typical value of the partition function, <math>Z \sim e^{-N \beta \, f +o(N) } </math>. We show that the free energy is equal to
Self-averaging, average value vs typical value, large deviations, rare events, saddle point approximation.
<center><math>
f =
\begin{cases}
&- \left( T \log 2 + \frac{1}{4 T}\right) \quad \text{if} \quad T \geq T_c\\
& - \sqrt{\log 2} \quad \text{if} \quad T <T_c
\end{cases} \quad \quad T_c= \frac{1}{2 \sqrt{\log 2}}.
</math></center>
At <math> T_c </math> a transition occurs, often called freezing transition: in the whole low-temperature phase, the free-energy is “frozen” at the value that it has at the critical temperature  <math>T= T_c </math>.
 
<ol>
<li><em> The thermodynamical transition and the freezing.</em>
The partition function of the REM reads
<math>
Z = \sum_{\alpha=1}^{2^N} e^{-\beta E_\alpha}= \int dE \, \mathcal{N}(E) e^{-\beta E}.
</math>
Using the behaviour of the typical value of <math> \mathcal{N} </math> determined in Problem 1.1, derive the free energy of the model (hint: perform a saddle point calculation). What is the order of this thermodynamic transition?
</ol>
</li>
<br>
 
<ol start="2">
<li> <em> Entropy.</em> What happens to the entropy of the model when the critical temperature is reached, and in the low temperature phase? What does this imply for the partition function <math> Z</math>?</li>
</ol>
<br>
 
<ol start="3">
<li><em> Fluctuations, and back to average vs typical.</em> Similarly to what we did for the entropy, one can define an annealed free energy <math> f_{\text{a}} </math> from <math> \overline{Z}=e^{- N \beta f_{\text{a}} + o(N)} </math>: show that in the whole low-temperature phase this is smaller than the quenched free energy obtained above. Putting all the results together, justify why the average of the partition function in the low-T phase is "dominated by rare events".  
</ol></li>


== To know more ==
* Derrida. Random-energy model: limit of a family of disordered models [https://hal.science/hal-03285940v1/document]


'''Comment:''' the low-T phase of the REM is a frozen phase, characterized by the fact that the free energy is temperature independent, and that the typical value of the partition function is very different from the average value. In fact, the low-T phase is also <em> a glass phase </em>: it is a phase where a peculiar symmetry, the so-called replica symmetry, is broken. We will return to these concepts in the next sets of problems.
* A note on terminology:  
<div style="font-size:89%">
The terms “quenched” and “annealed” come from metallurgy and refer to the procedure by which you cool a very hot piece of metal: a system is quenched if it is cooled very rapidly (instantaneously changing its environment by putting it into cold water, for instance) and has to adjust to this new fixed environment; it is annealed if it is cooled slowly, kept in (quasi)equilibrium with its changing environment at all times. Think now about how you compute the free energy of a disordered system, with the disorder playing the role of the environment. In the quenched protocol, you first compute the average over the configurations of the system (with the Boltzmann weight) keeping the disorder (environment) fixed, so the configurations have to adjust to the given disorder. Then you take the log and only afterwards average over the randomness (not even needed, at large <math>N</math>, if the free energy is self-averaging). In the annealed protocol instead, the disorder (environment) and the configurations are treated on the same footing and adjust to each other: you average over both simultaneously. The quenched case corresponds to keeping the environment fixed and looking at how configurations adjust to it, the annealed one to changing the environment fast.
</div>

Latest revision as of 12:47, 11 February 2025

Goal: understanding the energy landscape of the simplest spin-glass model, the Random Energy Model (REM).
Techniques: probability theory, saddle point approximation.



A dictionary for large-N disordered systems


  • We will discuss disordered systems with <math> N </math> degrees of freedom (for instance, for a spin system on a lattice of size <math> L </math> in dimension <math> d </math>, <math> N = L^d </math>). Since the systems are random, the quantities that describe their properties (the free energy, the number of configurations of the system that satisfy a certain property, the magnetization, etc.) are also random variables, with a distribution. In this discussion we denote these random variables generically with <math> X_N, Y_N </math> (where the subscript denotes the number of degrees of freedom) and with <math> P_{X_N}(x), P_{Y_N}(y) </math> their distributions. The goal of statistical physics is to characterize the behavior of these quantities in the limit <math> N \to \infty </math>.

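  • Self-averagingness. The physics of disordered systems is described by quantities that are distributed when <math> N </math> is finite (they take different values from sample to sample of the system), but for which sample-to-sample fluctuations are suppressed when <math> N \to \infty </math>. These quantities are said to be self-averaging. A random variable <math> Y_N </math> is self-averaging when, in the limit <math> N \to \infty </math>, its distribution concentrates around the average, collapsing to a deterministic value:

    <math> \lim_{N \to \infty} Y_N =\lim_{N \to \infty} \overline{Y_N}:= Y_\infty, \quad \quad \overline{Y_N}=\int dy\, P_{Y_N}(y)\, y </math>

    This happens when its fluctuations are small compared to the average, meaning that [*]

    <math> \lim_{N \to \infty} \frac{\overline{Y_N^2}}{\overline{Y_N}^2}=1. </math>

    When the random variable is not self-averaging, it remains distributed in the limit <math> N \to \infty </math>. When it is self-averaging, sample-to-sample fluctuations are suppressed when <math> N </math> is large.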

    Example 1. Consider the partition function of a disordered system at inverse temperature <math> \beta </math>, <math> Z_N(\beta) </math>. When <math> N </math> is large this random variable has an exponential scaling, <math> Z_N(\beta) \sim e^{-\beta N f_N(\beta)} </math>, where the variable <math> f_N(\beta) </math> is the free energy density. This scaling means that the random variable <math> \beta f_N=-N^{-1}\log Z_N </math> has a well-defined distribution that remains of <math> O(1) </math> when <math> N \to \infty </math>. In all the models of disordered systems we will consider in these lectures, the free energy not only has a well-defined distribution in the limit, but is also self-averaging. This is a very important property: it implies that the free energy (and therefore all the thermodynamic observables, which can be obtained by taking derivatives of the free energy) does not fluctuate from sample to sample when <math> N </math> is large, and so the physics of the system does not depend on the particular sample. While intensive quantities like <math> f_N </math> are self-averaging, quantities scaling exponentially like the partition function <math> Z_N </math> are not necessarily so: in particular, we will see that they are not when the system is in a glassy phase.

    Example 2. The partition function is an example of an exponentially scaling variable <math> X_N \sim e^{N Y_N} </math>, where the rescaled variable <math> Y_N </math> is self-averaging while <math> X_N </math> may not be. Another example is given in Problem 1 below, where <math> X_N \to \mathcal{N}_N(E) </math> and <math> Y_N \to S(E) </math>.


  • Typical and rare. The typical value of a random variable is the value at which its distribution peaks (it is the most probable value). Values in the tails of the distribution, where the probability density is small (for instance, vanishing as <math> N \to \infty </math>), are said to be rare. For self-averaging quantities, in the limit <math> N \to \infty </math> the distribution collapses to a single value, which is both the average and the typical value. In general, the average and the typical value of a random variable may not coincide: this happens when the average is dominated by values that are rare, associated to a small probability of occurrence and thus to the tails of the distribution. Let’s see this with an example.
      Example: typical vs average. Often, quantities like <math> Y_N </math> have a distribution that for large <math> N </math> takes the form <math> P_{Y_N}(y) \sim e^{-N^\alpha g(y)+ \text{subleading}} </math> where <math> g(y) </math> is some non-negative function and <math> \alpha>0 </math>. This is called a large deviation form for the probability distribution, with speed <math> N^\alpha </math>. This distribution is of <math> O(1) </math> for the value <math> y^{\text{typ}} </math> such that <math> g(y^{\text{typ}})=0 =g'(y^{\text{typ}}) </math>: this value is the typical value of <math> Y_N </math> (asymptotically at large <math> N </math>); all the other values of <math> y </math> are associated to a probability that is exponentially small in <math> N^\alpha </math>: they are exponentially rare. Consider now an exponentially scaling quantity like <math> X_N = e^{N Y_N} </math>, and let’s fix <math> \alpha=1 </math>. The asymptotic typical values <math> x^{\text{typ}} </math> and <math> y^{\text{typ}} </math> are related by:

      <math> y^{\text{typ}}=\lim_{N \to \infty} \frac{\log x^{\text{typ}}}{N}, </math>

      so the scaling of <math> x^{\text{typ}} </math> is <math> x^{\text{typ}}\sim e^{N y^{\text{typ}}} </math>. Let us now look at the scaling of the average. The average of <math> X_N </math> can be computed with the saddle point approximation for large <math> N </math>:

      <math> \overline{X_N} =\int dy\, P_{Y_N}(y)\, e^{N y}= \int dy\, e^{N[y- g(y)]+o(N)} =e^{N [y^*-g(y^*)]+ o(N) }, </math>

      where <math> y^* </math> is the point maximising the shifted function <math> \tilde{g}(y)= y-g(y) </math>. In this example, <math> y^* \neq y^{\text{typ}} </math>: the asymptotic behaviour of the average value of <math> X_N </math> is different from that of the typical value. In particular, the average is dominated by rare events, i.e. realisations in which <math> Y_N </math> takes the value <math> y^* </math>, whose probability of occurrence is exponentially small.


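  • Quenched averages. Let us go back to <math> X_N </math>: how to get <math> y^{\text{typ}} </math> from it? When <math> Y_N </math> is self-averaging,

    <math> y^{\text{typ}} =\lim_{N \to \infty} \overline{Y_N}= \lim_{N \to \infty} \frac{\overline{\log X_N}}{N} \equiv \lim_{N \to \infty} \frac{\log x^{\text{typ}}}{N} </math>

    where in the last equality we have used that <math> y^{\text{typ}}= \lim_{N \to \infty} N^{-1} \log x^{\text{typ}}_N </math>.

    In the language of disordered systems, computing the typical value of <math> X_N </math> through the average of its logarithm corresponds to performing a quenched average: from this average, one extracts the correct asymptotic value of the self-averaging quantity <math> Y_N </math>.

  • Annealed averages. The quenched average does not necessarily coincide with the annealed average, defined as:

    <math> y_{\text{a}} = \lim_{N \to \infty} \frac{\log \overline{X_N}}{N}. </math>

    In fact, <math> \overline{\log X_N} \leq \log \overline{X_N} </math> always holds by Jensen's inequality, since the logarithm is concave. When the inequality is strict and quenched and annealed averages are not the same, it means that <math> X_N </math> is not self-averaging, and its average value is exponentially larger than the typical value (because the average is dominated by rare events). In this case, to get the correct limit of the self-averaging quantity <math> Y_N </math> one has to perform the quenched average.[**] This is what happens in glassy phases.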


[*] - See here for a note on the equivalence of these two criteria.
[**] - Notice that the opposite is not true: one can have situations in which the partition function is not self-averaging, but still the quenched free energy coincides with the annealed one.
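
The difference between typical and average values in the dictionary above can be illustrated numerically. The following is a minimal Python sketch under an assumption that is not part of the lecture: a toy variable <math> Y_N </math> that is Gaussian with mean `mu` and variance `sigma**2 / N`, so that <math> g(y) = (y-\mu)^2/(2\sigma^2) </math>. All function names are hypothetical. For this toy model the typical value is `mu`, while the saddle point gives <math> y^* = \mu + \sigma^2 </math> and <math> y_{\text{a}} = \mu + \sigma^2/2 </math>.

```python
import math
import random


def rate_function(y, mu=0.0, sigma=1.0):
    """Toy large-deviation function g(y) of a Gaussian Y_N (assumed model)."""
    return (y - mu) ** 2 / (2.0 * sigma ** 2)


def saddle_point(mu=0.0, sigma=1.0, y_min=-5.0, y_max=5.0, steps=100001):
    """Grid search for the maximum of y - g(y).

    Returns (y_star, y_annealed), where y_annealed = y_star - g(y_star).
    """
    best_y, best_val = y_min, -math.inf
    for k in range(steps):
        y = y_min + (y_max - y_min) * k / (steps - 1)
        val = y - rate_function(y, mu, sigma)
        if val > best_val:
            best_y, best_val = y, val
    return best_y, best_val


def monte_carlo(n, samples, mu=0.0, sigma=1.0, seed=0):
    """Empirical quenched and annealed estimates for X_N = exp(N * Y_N)."""
    rng = random.Random(seed)
    ys = [rng.gauss(mu, sigma / math.sqrt(n)) for _ in range(samples)]
    # Quenched: average of log(X_N) / N, which is just the average of Y_N.
    quenched = sum(ys) / samples
    # Annealed: log of the average of X_N, divided by N.
    annealed = math.log(sum(math.exp(n * y) for y in ys) / samples) / n
    return quenched, annealed


if __name__ == "__main__":
    print("saddle point (y*, y_a):", saddle_point())
    print("Monte Carlo (quenched, annealed):", monte_carlo(50, 10000))
```

With a finite number of samples the empirical annealed estimate usually stays below the exact value <math> \mu + \sigma^2/2 </math>, because the realisations near <math> y^* </math> that dominate the average are too rare to be sampled; this is the same mechanism described in the example above.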



Problems

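This problem and the one of next week deal with the Random Energy Model (REM). The REM was introduced in [1]. In the REM the system can take <math> M=2^N </math> configurations <math> \vec \sigma^\alpha=(\sigma^\alpha_1, \cdots, \sigma^\alpha_N) </math> with <math> \sigma^\alpha_i = \pm 1 </math>. Each configuration <math> \alpha=1, \cdots, 2^N </math> is assigned a random energy <math> E_\alpha </math>. The random energies are independent and drawn from a Gaussian distribution <math> p(E) =( 2 \pi N)^{-1/2}e^{-\frac{E^2}{2 N}}. </math>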

Problem 1: the energy landscape of the REM

Entropy of the Random Energy Model

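In this problem we study the random variable <math> \mathcal{N}_N(E)dE </math>, that is the number of configurations having energy <math> E_\alpha \in [E, E+dE] </math>. For large <math> N </math> this variable scales exponentially, <math> \mathcal{N}_N(E) \sim e^{N S_N\left( E/N\right)} </math>. Let <math> \epsilon=E/N </math>. Through this exercise we show that the asymptotic value of the entropy <math> S_N(\epsilon) </math>, which is self-averaging, is given by:

<math>
\lim_{N \to \infty} S_N(\epsilon)=S_\infty(\epsilon)=\begin{cases}
\log 2- \frac{\epsilon^2}{2} \quad &\text{ if } |\epsilon| \leq \sqrt{2 \, \log 2} \\
- \infty \quad &\text{ if } |\epsilon| >\sqrt{2 \, \log 2}
\end{cases}
</math>

The point where the entropy vanishes, <math> \epsilon=- \sqrt{2 \, \log 2} </math>, is the energy density of the ground state. The entropy is maximal at <math> \epsilon=0 </math>: the largest number of configurations have vanishing energy density. We set <math> S(\epsilon):=S_\infty(\epsilon) </math>.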


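  1. Averages: the annealed entropy. We begin by computing the annealed entropy <math> S_{\text{a}} </math>, which is defined by the average <math> \overline{\mathcal{N}_N(E)}= \text{exp}\left(N S_{\text{a}}\left( E/N \right)+ o(N)\right) </math>. Compute this function using the representation <math> \mathcal{N}_N(E)dE= \sum_{\alpha=1}^{2^N} \chi_\alpha(E) dE </math> [with <math> \chi_\alpha(E)=1 </math> if <math> E_\alpha \in [E, E+dE] </math> and <math> \chi_\alpha(E)=0 </math> otherwise].


  2. Self-averaging. For <math> |\epsilon| \leq \sqrt{2 \, \log 2} </math> the quantity <math> \mathcal{N}_N </math> is self-averaging: its distribution concentrates around the average value <math> \overline{\mathcal{N}_N} </math> when <math> N \to \infty </math>. Show this by computing the second moment <math> \overline{\mathcal{N}^2_N} </math>. Deduce that <math> S(\epsilon)= S_{\text{a}}(\epsilon) </math> when <math> |\epsilon| \leq \sqrt{2 \, \log 2} </math>. The self-averaging property no longer holds in the region where the annealed entropy is negative: why does one expect fluctuations to be relevant in this region?


  3. Rare events. For <math> |\epsilon| > \sqrt{2 \, \log 2} </math> the annealed entropy is negative: the average number of configurations with those energy densities is exponentially small in <math> N </math>. This implies that the probability to get configurations with those energies is exponentially small in <math> N </math>: these configurations are rare. Do you have an idea of how to show this, using the expression for <math> \overline{\mathcal{N}_N} </math>? What is the typical value of <math> \mathcal{N}_N </math> in this region? Putting everything together, derive the form of the typical value of the entropy density. Why does the point where the entropy vanishes coincide with the ground state energy of the model?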
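
Before attempting the analytical calculation, the statements of Problem 1 can be explored on a small sample. The Python sketch below is only an illustration at modest <math> N </math>, where finite-size corrections are still sizeable; the function names and the choice of energy windows are assumptions of this example. It draws the <math> 2^N </math> Gaussian energies of variance <math> N </math>, counts configurations in a window of energy densities, and compares the count with its exact disorder average.

```python
import math
import random


def sample_energies(n, rng):
    """Draw the 2**n independent REM energies, Gaussian with variance n."""
    return [rng.gauss(0.0, math.sqrt(n)) for _ in range(2 ** n)]


def count_in_window(energies, n, eps_lo, eps_hi):
    """Number of configurations with energy density in [eps_lo, eps_hi)."""
    return sum(1 for e in energies if n * eps_lo <= e < n * eps_hi)


def mean_count(n, eps_lo, eps_hi):
    """Disorder average of the count: 2**n times the Gaussian probability."""
    def cdf(e):
        # Cumulative distribution of a Gaussian with zero mean and variance n.
        return 0.5 * math.erfc(-e / math.sqrt(2.0 * n))
    return 2 ** n * (cdf(n * eps_hi) - cdf(n * eps_lo))


def annealed_entropy(eps):
    """Leading-order annealed entropy, log 2 - eps**2 / 2."""
    return math.log(2.0) - eps ** 2 / 2.0


if __name__ == "__main__":
    n = 16
    energies = sample_energies(n, random.Random(1))
    for lo, hi in [(-0.5, -0.45), (-1.6, -1.5)]:
        print(lo, hi, count_in_window(energies, n, lo, hi), mean_count(n, lo, hi))
    print("lowest energy density:", min(energies) / n)
    print("sqrt(2 log 2):", math.sqrt(2.0 * math.log(2.0)))
```

In the window where the annealed entropy is positive, the sampled count is expected to lie close to its average; in the window where it is negative, the average is much smaller than one and a typical sample contains no configuration at all. At <math> N=16 </math> the lowest energy density is still visibly above <math> -\sqrt{2 \log 2} </math>, which is a finite-size effect.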


Check out: key concepts

Self-averaging, average value vs typical value, large deviations, rare events, saddle point approximation.

To know more

  • Derrida. Random-energy model: limit of a family of disordered models [1]
  • A note on terminology:

The terms “quenched” and “annealed” come from metallurgy and refer to the procedure by which you cool a very hot piece of metal: a system is quenched if it is cooled very rapidly (instantaneously changing its environment by putting it into cold water, for instance) and has to adjust to this new fixed environment; it is annealed if it is cooled slowly, kept in (quasi)equilibrium with its changing environment at all times. Think now about how you compute the free energy of a disordered system, with the disorder playing the role of the environment. In the quenched protocol, you first compute the average over the configurations of the system (with the Boltzmann weight) keeping the disorder (environment) fixed, so the configurations have to adjust to the given disorder. Then you take the log and only afterwards average over the randomness (not even needed, at large <math> N </math>, if the free energy is self-averaging). In the annealed protocol instead, the disorder (environment) and the configurations are treated on the same footing and adjust to each other: you average over both simultaneously. The quenched case corresponds to keeping the environment fixed and looking at how configurations adjust to it, the annealed one to changing the environment fast.
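
The two protocols described in this note can be written out for a small REM. The following Python sketch is an illustration only, with hypothetical function names and parameter choices: for each disorder sample it computes <math> Z = \sum_\alpha e^{-\beta E_\alpha} </math>, then forms the quenched estimate (average of <math> \log Z </math>) and the annealed estimate (log of the average of <math> Z </math>). The closed form used for comparison, <math> \overline{Z} = 2^N e^{\beta^2 N/2} </math>, follows from the Gaussian energy distribution with variance <math> N </math>.

```python
import math
import random


def partition_function(n, beta, rng):
    """Z for one disorder sample: sum of Boltzmann weights over 2**n energies."""
    return sum(math.exp(-beta * rng.gauss(0.0, math.sqrt(n)))
               for _ in range(2 ** n))


def free_energies(n, beta, samples, seed=0):
    """Empirical quenched and annealed free energy densities."""
    rng = random.Random(seed)
    zs = [partition_function(n, beta, rng) for _ in range(samples)]
    # Quenched: take the log sample by sample, then average over disorder.
    quenched = -sum(math.log(z) for z in zs) / (samples * beta * n)
    # Annealed: average Z over disorder first, then take the log.
    annealed = -math.log(sum(zs) / samples) / (beta * n)
    return quenched, annealed


def exact_annealed(beta):
    """Annealed free energy density from the exact average of Z."""
    return -(math.log(2.0) / beta + beta / 2.0)


if __name__ == "__main__":
    for beta in (0.5, 3.0):
        print(beta, free_energies(12, beta, 20), exact_annealed(beta))
```

At high temperature the quenched estimate should be close to the exact annealed value, while at low temperature it should lie well above it. Note that the empirical annealed estimate itself is unreliable at low temperature with few samples, since it depends on rare realisations that are unlikely to be drawn.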