T-I
Goal:  derive the equilibrium phase diagram of the simplest spin-glass model, the Random Energy Model (REM). 
 Techniques:  saddle point approximation, Legendre transform, probability theory.
A dictionary for large-N disordered systems
- Exponentially scaling variables. We will consider positive random variables which depend on a parameter (the number of degrees of freedom: for a system of size in dimension , ) and which have the scaling : this means that the rescaled variable has a well defined distribution that remains of when . The standard example we have in mind are the partition functions of disordered systems with degrees of freedom, : here and , where is the free energy density. Let be the distributions of and .
-  Self-averaging. A random variable  is self-averaging  when, in the limit , its distribution concentrates around the average, collapsing to a deterministic value:
This happens when its fluctuations are small compared to the average, meaning that [*] When the random variable is not self-averaging, it remains distributed in the limit . When it is self-averaging, sample-to-sample fluctuations are suppressed when is large. This property holds for the free energy of all the disordered systems we will consider. This is very important property: it implies that the free energy (and therefore all the thermodynamics observables, that can be obtained taking derivatives of the free energy) does not fluctuate from sample to sample when is large, and so the physics of the system does not depend on the particular sample. Notice that while intensive quantities like (like the free energy density) are self-averaging, quantities scaling exponentially like (like the partition function) are not necessarily so, see below. 
- Average and typical. The typical value of a random variable is the value at which its distribution peaks (it is the most probable value). For self-averaging quantities, in the limit average and typical value coincide. In general, it might not be so!
-  Quenched averages. Let us go back to : how to get  from it? When  is self-averaging, 
where in the last line we have used that . In the language of disordered systems, computing the typical value of through the average of its logarithm corresponds to performing a quenched average: from this average, one extracts the correct asymptotic value of the self-averaging quantity .
-  Annealed averages. The quenched average does not necessarily coincide with the  annealed average, defined as:
In fact, it always holds because of the concavity of the logarithm. When the inequality is strict and quenched and annealed averages are not the same, it means that is not self-averaging, and its average value is exponentially larger than the typical value (because the average is dominated by rare events). In this case, to get the correct limit of the self-averaging quantity one has to perform the quenched average.[**] This is what happens in the glassy phases discussed in these TDs. 
- We discuss an example to fix the ideas. Often, quantities like  have a distribution that for large  takes the form  where  is some positive function and . This is called a  large deviation form  for the probability distribution, with  speed  . This distribution is of  for the value  such that : this value is the  typical value  of  (asymptotically at large ); all the other values of  are associated to a probability that is exponentially small in : they are exponentially rare. 
Consider now an exponentially scaling quantity like , and let’s fix . The asymptotic typical values  and   are related by:
so the scaling of is . Let us now look at the scaling of the average. The average of can be computed with the saddle point approximation for large :
where is the point maximising the shifted function . In this example, : the asymptotic of the average value of is different from the asymptotic of the typical value. In particular, the average is dominated by rare events, i.e. realisations in which takes the value , whose probability of occurrence is exponentially small.
- [*] - See here for a note on the equivalence of these two criteria.
- [**] - Notice that the opposite is not true: one can have situations in which the partition function is not self-averaging, but still the quenched free energy coincides with the annealed one.
Problems
Problem 1.1: the energy landscape of the REM
The REM has been introduced in [1] . In the REM the system can take configurations with . To each configuration is assigned a random energy . The random energies are independent, taken from a Gaussian distribution In this problem we study the random variable , that is the number of configurations having energy . We show that for large it scales as . We show that the typical value of , the quenched entropy density of the model (see sketch), is given by:
The point where the entropy vanishes, , is the energy density of the ground state. The entropy is maximal at : the highest number of configurations have vanishing energy density.
- Averages: the annealed entropy. We begin by computing the annealed entropy , which is defined by the average . Compute this function using the representation [with if and otherwise]. When does coincide with ?
- Self-averaging. For the quantity is self-averaging: its distribution concentrates around the average value when . Show this by computing the second moment . This property of being self-averaging is no longer true in the region where the annealed entropy is negative: why does one expect fluctuations to be relevant in this region?
- Rare events. For the annealed entropy is negative: the average number of configurations with those energy densities is exponentially small in . This implies that the probability to get configurations with those energy is exponentially small in : these configurations are rare. Do you have an idea of how to show this, using the expression for ? What is the typical value of in this region? Putting everything together, derive the form of the typical value of the entropy density. Why the point where the entropy vanishes coincides with the ground state energy of the model?
Problem 1.2: freezing transition & glassiness
We now compute the equilibrium phase diagram of the model, and in particular the quenched free energy density which controls the scaling of the typical value of the partition function, . We show that the free energy equals to
At a transition occurs, often called freezing transition: in the whole low-temperature phase, the free-energy is “frozen” at the value that it has at the critical temperature .
- The equilibrium transition and the freezing. The partition function the REM reads Using the behaviour of the typical value of determined in Problem 1.1, derive the free energy of the model (hint: perform a saddle point calculation). What is the order of this thermodynamic transition?
- Entropy. What happens to the entropy of the model when the critical temperature is reached, and in the low temperature phase? What does this imply for the partition function ?
- Fluctuations, and back to average vs typical. Similarly to what we did for the entropy, one can define an annealed free energy from : show that in the whole low-temperature phase this is smaller than the quenched free energy obtained above. Putting all the results together, justify why the average of the partition function in the low-T phase is "dominated by rare events".
-  Overlap distribution and glassiness. The overlap between two configurations  is 
. We define the overlap distribution as the thermal average
 
 Comment: the low-T phase of the REM is a frozen phase, characterized by the fact that the free energy is temperature independent, and that the typical value of the partition function is very different from the average value. In fact, the low-T phase is also a glass phase where a peculiar symmetry, the so called replica symmetry, is broken. We go back to this concepts in the next sets of problems. 
 Check out: key conceptsSelf-averaging, average value vs typical value, large deviations, rare events, saddle point approximation, freezing transition, overlap distribution. To know more- Derrida. Random-energy model: limit of a family of disordered models [1]
 - A note on terminology:
 The terms “quenched” and “annealed” come from metallurgy and refer to the procedure in which you cool a very hot piece of metal: a system is quenched if it is cooled very rapidly (istantaneously changing its environment by putting it into cold water, for instance) and has to adjusts to this new fixed environment; annealed if it is cooled slowly, kept in (quasi)equilibrium with its changing environment at all times. Think now at how you compute the free energy, and at disorder as the environment. In the quenched protocol, you compute the average over configurations of the system keeping the disorder (environment) fixed, so the configurations have to adjust to the given disorder. Then you take the log and only afterwards average over the randomness (not even needed, at large , if the free-energy is self-averaging). In the annealed protocol instead, the disorder (environment) and the configurations are treated on the same footing and adjust to each others, you average over both simultaneously. 
