Lecture 8. Games on random graphs (2)

The last part of this lec­ture will take place on Mon­day, May 6 at 3 PM in Bend­heim Class­room 103. The full scribe notes for this lec­ture will be posted some­time after that date.

03. May 2013 by Ramon van Handel
Categories: Random graphs | Leave a comment

Lecture 7. Games on random graphs (1)

The next two lec­tures by Rene Car­mona will be on games on ran­dom graphs.
Many thanks to Patrick Rebes­chini for scrib­ing these lectures!

In the fol­low­ing we dis­cuss some results from the paper “Con­nec­tiv­ity and equi­lib­rium in ran­dom games” by Daskalakis, Dimakis, and Mos­sel. We define a ran­dom game on a ran­dom graph and we char­ac­ter­ize the graphs that are likely to exhibit Nash equi­lib­ria for this game. We show that if the ran­dom graph is drawn from the Erdös-Rényi dis­tri­b­u­tion, then in the high con­nec­tiv­ity regime the law of the num­ber of pure Nash equi­lib­ria con­verges toward a Pois­son dis­tri­b­u­tion, asymp­tot­i­cally, as the size of the graph is increased.

Let G=(V,E) be a sim­ple (that is, undi­rected and with no self-edges) graph, and for each v\in V denote by N(v) the set of neigh­bors of v, that is, N(v):=\{v'\in V : (v,v')\in E\}. We think of each ver­tex in V as a player in the game that we are about to intro­duce. At the same time, we think of each edge (v,v')\in E as a strate­gic inter­ac­tion between play­ers v and v'.

Def­i­n­i­tion (Game on a graph). For each v\in V let S_v rep­re­sent the set of strate­gies for player v, assumed to be a finite set. We nat­u­rally extend this def­i­n­i­tion to include fam­i­lies of play­ers: for each A\subseteq V, let S_A= \times_{v\in A} S_v be the set of strate­gies for each player in A. For each v\in V, denote by u_v:  S_v\times S_{N(v)}\ni(\sigma_v,\sigma_{N(v)})\rightarrow u_v(\sigma_v,\sigma_{N(v)}) \in \mathbb{R} the reward func­tion for player v. A game is a col­lec­tion (S_v,u_v)_{v\in V}.

The above def­i­n­i­tion describes a game that is sta­tic, in the sense that the game is played only once, and local, in the sense that the reward func­tion of each player depends only on its own strat­egy and on the strat­egy of the play­ers in its neigh­bors. We now intro­duce the notion of pure Nash equilibrium.

Def­i­n­i­tion (Pure Nash equi­lib­rium). We say that \sigma \in S_V is a pure Nash equi­lib­rium (PNE) if for each v\in V we have

    \[u_v(\sigma_v,\sigma_{N(v)}) \ge u_v(\tau,\sigma_{N(v)}) 	\qquad\text{for each $\tau\in S_v$}.\]

A pure Nash equi­lib­rium rep­re­sents a state where no player can be bet­ter off by chang­ing his own strat­egy if he is the only one who is allowed to do so. In order to inves­ti­gate the exis­tence of a pure Nash equi­lib­rium it suf­fices to study the best response func­tion defined below.

Def­i­n­i­tion (Best response func­tion). Given a reward func­tion u_v for player v\in V, we define the best response func­tion \operatorname{BR}_v for v as

    \[\operatorname{BR}_v(\sigma_v,\sigma_{N(v)}) := 	\begin{cases}  		1	& \text{if } \sigma_v\in\arg\sup_{\tau\in S_v} u_v(\tau,\sigma_{N(v)}),\\ 		0	& \text{otherwise}. 	\end{cases}\]

Clearly, \sigma is a pure Nash equi­lib­rium if and only if \operatorname{BR}_v(\sigma_v,\sigma_{N(v)})=1 for each v\in V. We now define the type of ran­dom games that we will be inter­ested in; in order to do so, we need to spec­ify the set of strate­gies and the reward func­tion for each player.

Def­i­n­i­tion (Ran­dom game on a fixed graph). For a graph G=(V,E) and an atom­less prob­a­bil­ity mea­sure \mu on \mathbb{R}, let \mathcal{D}_{G,\mu} be the asso­ci­ated ran­dom game defined as follows:

  1. S_v=\{0,1\} for each v\in V;
  2. \{ u_v(\sigma_v,\sigma_{N(v)}) \}_{v\in V, \sigma_v \in S_v, \sigma_{N(v)} \in S_{N(v)}} is a col­lec­tion of inde­pen­dent iden­ti­cally dis­trib­uted ran­dom vari­ables with dis­tri­b­u­tion \mu.

Remark. For each game \mathcal{D}_{G,\mu} the fam­ily \{\operatorname{BR}_v(0,\sigma_{N(v)})\}_{v\in V, \sigma_{N(v)}\in S_{N(v)}} is a col­lec­tion of inde­pen­dent ran­dom vari­ables that are uni­formly dis­trib­uted in \{0,1\}, and for each v\in V, \sigma_{N(v)}\in S_{N(v)} we have \operatorname{BR}_v(1,\sigma_{N(v)}) = 1-\operatorname{BR}_v(0,\sigma_{N(v)}) almost surely. In fact, note that \operatorname{BR}_v(0,\sigma_{N(v)})=1 if and only if u_v(0,\sigma_{N(v)}) \ge u_v(1,\sigma_{N(v)}) and this event has prob­a­bil­ity 1/2 since the two ran­dom vari­ables appear­ing on both sides of the inequal­ity sign are inde­pen­dent with the same law \mu and \mu is atom­less. As far as the analy­sis of the exis­tence of pure Nash equi­lib­ria is con­cerned, we could take the present notion of best response func­tions as the def­i­n­i­tion of our ran­dom game on a fix graph. In fact, note that the choice of \mu in \mathcal{D}_{G,\mu} does not play a role in our analy­sis, and we would obtain the same results by choos­ing dif­fer­ent (atom­less) dis­tri­b­u­tions for sam­pling (inde­pen­dently) the reward func­tion of each player.

Denote by G(n,p) the dis­tri­b­u­tion of a Erdös-Rényi ran­dom graph with n ver­tices where each edge is present inde­pen­dently with prob­a­bil­ity p. We now intro­duce the notion of a ran­dom game on a ran­dom graph.

Def­i­n­i­tion (Ran­dom game on a ramon graph). For each n\in\mathbb{N}, p\in(0,1) and each prob­a­bil­ity mea­sure \mu on \mathbb{R}, do the following:

  1. choose a graph G from G(n,p);
  2. choose a ran­dom game from \mathcal{D}_{G,\mu} for the graph G.

Hence­forth, given a ran­dom vari­able X let \mathcal{L}(X) rep­re­sent its dis­tri­b­u­tion. Given two mea­sures \mu and \nu on a mea­sur­able space, define the total vari­a­tion dis­tance between \mu and \nu as

    \[\| \mu - \nu \|_{TV} := \sup_{f: \| f\| \le 1} | \mu f - \nu f |,\]

where the supre­mum is taken over mea­sur­able func­tions such that the supre­mum norm is less than or equal to 1.

We are now ready to state the main the­o­rem that we will prove in the fol­low­ing (The­o­rem 1.9 in Daskalakis, Dimakis, and Mos­sel).

The­o­rem 1 (High con­nec­tiv­ity regime). Let Z^{n,p} be the num­ber of pure Nash equi­lib­ria in the ran­dom game on ran­dom graph defined above. Let define the high-connectivity regime as

    \[p(n)= \frac{(2+\varepsilon(n)) \log n}{n},\]

where \varepsilon:\mathbb{N}\rightarrow\mathbb{R} sat­is­fies the fol­low­ing two properties:

    \begin{align*} 	\varepsilon (n) &> c &\text{for each $n\in\mathbb{N}$, for some $c>0$,}\\ 	\varepsilon (n) &\le \frac{n}{\log n} - 2 &\text{for each $n\in\mathbb{N}$.} \end{align*}

Then, we have

    \[\mathbf{P}_{G(n,p)} \{ \| \mathcal{L}(Z^{n,p(n)}) - \mathcal{L}(N_1) \|_{TV}  	\le O(n^{-\varepsilon/8}) + e^{-\Omega(n)} \}  	\ge 1- \frac{2}{n^{\varepsilon/8}},\]

where \mathbf{P}_{G(n,p)} denotes the con­di­tional prob­a­bil­ity given the graph G(n,p) and N_1 is a Pois­son ran­dom vari­able with mean 1. In particular,

    \[\lim_{n\rightarrow\infty} \mathcal{L}(Z^{n,p(n)}) = \mathcal{L}(N_1),\]

which shows that in this regime a pure Nash equi­lib­rium exists with prob­a­bil­ity con­verg­ing to 1-\frac{1}{e} as the size of the net­work increases.

Remark. Using the ter­mi­nol­ogy of sta­tis­ti­cal mechan­ics, the first result in The­o­rem 1 rep­re­sents a quenched–type result since it involves the con­di­tional dis­tri­b­u­tion of a sys­tem (i.e., the game) given its envi­ron­ment (i.e., the graph). On the other hand, the sec­ond result rep­re­sents an annealed–type result, where the uncon­di­tional prob­a­bil­ity is considered.

In order to prove The­o­rem 1 we need the fol­low­ing lemma on Pois­son approx­i­ma­tions. The lemma is adapted from the results of R. Arra­tia, L. Gold­stein, and L. Gor­don (“Two moments suf­fice for Pois­son approx­i­ma­tions: the Chen-Stein method”, Ann. Probab. 17, 9–25, 1989) and it shows how the total vari­a­tion dis­tance between the law of a sum of Bernoulli ran­dom vari­ables and a Pois­son dis­tri­b­u­tion can be bounded by the first and sec­ond moments of the Bernoulli ran­dom vari­ables. This result is a par­tic­u­lar instance of the Stein’s method in prob­a­bil­ity theory.

Lemma (Arra­tia, Gold­stein, and Gor­don, 1989). Let \{X_i\}_{i=0,1,\ldots,N} be a col­lec­tion of Bernoulli ran­dom vari­ables with p_i:=\mathbf{P}\{X_i=1\}. For each i\in\{0,1,\ldots,N\} let B_i\subseteq \{0,1,\ldots,N\} be such that \{X_j\}_{j\in B^c_i} is inde­pen­dent of X_i. Define

    \begin{align*} 	b_1 := \sum_{i=0}^N \sum_{j\in B_i} p_ip_j,\qquad\qquad 	b_2 := \sum_{i=0}^N \sum_{j\in B_i\setminus \{i\}} p_{ij}, \end{align*}

where p_{ij} := \mathbf{P}\{X_i=1,X_j=1\}. Define Z:=\sum_{i=0}^N X_i and \lambda := \mathbf{E} Z = \sum_{i=0}^N p_i. If N_{\lambda} is a Pois­son ran­dom vari­able with mean \lambda, then

    \[\| \mathcal{L} (Z) - \mathcal{L} (N_\lambda) \|_{TV} \le 2 (b_1+b_2).\]

Proof. We define the fol­low­ing oper­a­tors that act on each func­tion f : \mathbb{N}\rightarrow\mathbb{R}:

    \begin{align*} 	[Df](n) &:= f(n+1)-f(n)&\text{for each $n\in\mathbb{N}$},\\ 	[Tf](n) &:= nf(n)-\lambda f(n+1)&\text{for each $n\in\mathbb{N}$},\\ 	[Sf](n+1) &:= - \frac{\mathbf{E}[f(N_\lambda) \mathbf{1}_{N_\lambda \le n}]} 	{\lambda \mathbf{P}\{N_\lambda = n\}} &\text{for each $n\in\mathbb{N}$}, 	& &[Sf](0):=0. \end{align*}

We point out that T char­ac­ter­izes N_\lambda in the sense that \mathbf{E} [[Tf](M)] = 0 if and only if M is a Pois­son ran­dom vari­able with mean \lambda; T is an exam­ple of Stein’s oper­a­tor. First of all, we show that for each f : \mathbb{N}\rightarrow\mathbb{R} we have TSf = f. In fact, if n=0 we have

    \[[TSf](0) = T[Sf](0) = - \lambda [Sf](1)  	= \frac{\mathbf{E}[f(N_\lambda) \mathbf{1}_{N_\lambda \le 0}]} 	{\mathbf{P}\{N_\lambda = 0\}} = f(0)\]

and if n\ge 1 we have

    \begin{align*} 	[TSf](n) &= T[Sf](n) =  n[Sf](n) - \lambda [Sf](n+1)\\ 	&= -n \frac{\mathbf{E}[f(N_\lambda) \mathbf{1}_{N_\lambda \le n-1}]} 	{\lambda \mathbf{P}\{N_\lambda = n-1\}}  	+ \frac{\mathbf{E}[f(N_\lambda) \mathbf{1}_{N_\lambda \le n}]} 	{\mathbf{P}\{N_\lambda = n\}}\\ 	&= \frac{\mathbf{E}[f(N_\lambda) \mathbf{1}_{N_\lambda \le n}] 	- \mathbf{E}[f(N_\lambda) \mathbf{1}_{N_\lambda \le n-1}]} 	{\mathbf{P}\{N_\lambda = n\}}\\ 	&= f(n). \end{align*}

For each i\in\{0,1,\ldots,N\} define \tilde Z_i := \sum_{j\in B^c_i}X_j and Y_i := Z-X_i = \sum_{j\in\{0,1,\ldots,N\}\setminus \{i\}} X_j. The fol­low­ing prop­er­ties hold:

  1. \tilde Z_i \le Y_i \le Z,
  2. X_i f(Z) = X_i f(Y_i+1),
  3. f(Y_i+1)-f(Z+1) = X_i[f(Y_i+1)-f(Y_i+2)].

In what fol­lows, con­sider any given func­tion h:\mathbb{N}\rightarrow \mathbb{R} such that \| h\| := \sup_{n\in\mathbb{N}} |h(n)| \le 1. Define the func­tion \tilde h as \tilde h(n):=h(n)-\mathbf{E}h(N_\lambda) for each n\in\mathbb{N}, and let f:= S \tilde h. From what was seen above we have Tf=\tilde h and we get

    \begin{align*} 	\mathbf{E} [h(Z)-h(N_\lambda)] =& \ \mathbf{E} \tilde h (Z) = \mathbf{E} [Tf] (Z) 	= \mathbf{E} [Zf(Z)-\lambda f (Z+1)] \\ 	=& \ \sum_{i=0}^N \mathbf{E} [X_i f(Z) - p_i f(Z+1)] 	\stackrel{\text{(ii)}}{=} \sum_{i=0}^N \mathbf{E} [X_i f(Y_i+1) - p_i f(Z+1)]\\ 	=& \ \sum_{i=0}^N \mathbf{E} [p_i f(Y_i+1) - p_i f(Z+1)]  	+ \sum_{i=0}^N \mathbf{E} [(X_i - p_i)f(Y_i+1)]\\ 	\stackrel{\text{(iii)}}{=}& \ \sum_{i=0}^N p_i \mathbf{E} [X_i (f(Y_i+1) - f(Y_i+2))] \\ 	& \ + \sum_{i=0}^N \mathbf{E} [(X_i - p_i)(f(Y_i+1)-f(\tilde Z_i + 1))]  	+ \sum_{i=0}^N \mathbf{E} [(X_i - p_i)f(\tilde Z_i + 1)]. \end{align*}

The first term is bounded above by \| Df \|\sum_{i=0}^N p^2_i while the third term is equal to 0 since \tilde Z_i is inde­pen­dent of X_i. In order to bound the sec­ond term we want to rewrite each term f(Y_i+1)-f(\tilde Z_i + 1) as a tele­scop­ing sum. In what fol­low fix i\in\{0,1,\ldots,N\}, label the ele­ments of B_i\setminus\{i\} as \{j_1,\ldots,j_K\} and define

    \begin{align*} 	U_0 &:= \tilde Z_i = \sum_{j\in B^c_i} X_{j},\\ 	U_{k} &:= U_{k-1} + X_{j_k} \qquad \text{for $k\in\{1,\ldots,K\}$}. \end{align*}

Notic­ing that U_K=Y_i, we have

    \[f(Y_i+1)-f(\tilde Z_i + 1) = \sum_{k=1}^K [ f(U_{k-1}+X_{j_k}+1)  	- f(U_{k-1}+1)]\]

and we get

    \begin{align*} 	\lefteqn{\mathbf{E} [(X_i - p_i)(f(Y_i+1)-f(\tilde Z_i + 1))]=}\\ 	&\qquad\qquad= 	\sum_{k=1}^K \mathbf{E} [(X_i - p_i)(f(U_{k-1}+X_{j_k}+1) - f(U_{k-1}+1))]\\ 	&\qquad\qquad=  	\sum_{k=1}^K \mathbf{E} [(X_i - p_i) X_{j_k} (f(U_{k-1}+2) - f(U_{k-1}+1))]\\ 	&\qquad\qquad\le \| Df \| \sum_{k=1}^K \mathbf{E} [(X_i + p_i)X_{j_k}] 	= \| Df \| \sum_{k=1}^K (p_{ij_k} + p_ip_{j_k})\\ 	&\qquad\qquad= \| Df \| \sum_{j\in B_i\setminus \{i\}} ( p_{ij} + p_i p_j). \end{align*}

There­fore, com­bin­ing all together we get

    \begin{align*} 	\mathbf{E} [h(Z)-h(N_\lambda)] &\le \| Df \|\sum_{i=0}^N p^2_i  	+ \| Df \| \sum_{i=0}^N \sum_{j\in B_i\setminus \{i\}} ( p_{ij} + p_i p_j)\\ 	&= \| Df \| \bigg( \sum_{i=0}^N \sum_{j\in B_i \setminus \{i\}} p_{ij}  	+ \sum_{i=0}^N\sum_{j\in B_i} p_i p_j \bigg)\\ 	&= \| Df \| (b_1+b_2). \end{align*}

To be continued…

01. May 2013 by Ramon van Handel
Categories: Random graphs | Leave a comment

Giant component: final remarks

The past three lec­tures were devoted to the giant com­po­nent theorem:

The­o­rem Let C_v be the con­nected com­po­nent of G(n, \frac{c}{n}) that con­tains v\in [n].

  1. If c<1, then \max_v|C_v|= O(\log n) in probability.
  2. If c>1, then \max_v|C_v|\sim (1-\rho) n in prob­a­bil­ity, for some 0<\rho<1.
  3. If c=1, then \max_v|C_v|\sim n^{2/3} in distribution.

We proved only the first (sub­crit­i­cal) and sec­ond (super­crit­i­cal) cases: our pre­sen­ta­tion was largely inspired by the treat­ment of Jan­son, Luczak, and Rucin­ski and Dur­rett. We have omit­ted the crit­i­cal case, how­ever, as the last two lec­tures of the semes­ter will be on another topic. The goal of this post is to pro­vide some final remarks and ref­er­ences on the giant com­po­nent theorem.

Ret­ro­spec­tive

At first sight, the “dou­ble jump” in the giant com­po­nent the­o­rem looks quite shock­ing. In hind­sight, how­ever, this does not seem quite so mirac­u­lous, as it mir­rors an ele­men­tary phe­nom­e­non that is cov­ered in many intro­duc­tory prob­a­bil­ity courses: given a (nice) ran­dom walk S_t with ini­tial con­di­tion S_0=a, define the hit­ting time \tau:=\inf\{t:S_t=b\} for some b<a. Then there are three cases:

  1. If S_t has neg­a­tive drift, then \mathbb{E}[\tau]<\infty. In fact, the ran­dom vari­able \tau has a light (expo­nen­tial) tail.
  2. If S_t has pos­i­tive drift, then \mathbb{P}[\tau=\infty]>0.
  3. If S_t has zero drift, then \tau<\infty a.s. but \mathbb{E}[\tau]=\infty. That is, the ran­dom vari­able \tau has a heavy tail.

This “dou­ble jump” in the behav­ior of the hit­ting prob­a­bil­i­ties of a ran­dom walk is directly anal­o­gous to the behav­ior of the con­nected com­po­nents of an Erdös-Rényi graph, and this was indeed the basic idea behind the proofs given in the pre­vi­ous lec­tures. Of course, it remains a bit of a mir­a­cle that the ran­dom walk approx­i­ma­tion of the explo­ration process, which only holds for small times, is suf­fi­ciently pow­er­ful that it describes so com­pletely the behav­ior of the ran­dom graph.

The crit­i­cal case

In the sub­crit­i­cal case, the size of the largest com­po­nent is of order \log n because the hit­ting time of a ran­dom walk with neg­a­tive drift has an expo­nen­tial tail: that is, we proved

    \[\mathbb{P}[\tau\ge k] \le e^{-\alpha k}    \quad\Longrightarrow\quad    \mathbb{P}\Big[\max_v|C_v| \ge \gamma\log n\Big] \le    n\,\mathbb{P}[\tau\ge\gamma\log n] \le n^{1-\alpha\gamma}\]

which goes to zero as n\to\infty for \gamma>1/\alpha.

Sim­i­larly, we would expect that we can obtain the size of the largest com­po­nent in the crit­i­cal case if we under­stand the heavy tail behav­ior of the hit­ting time of a ran­dom walk with zero drift. This is in fact the case. Indeed, when there is zero drift, one can show that

    \[P[\tau\ge k] \lesssim \frac{1}{\sqrt{k}}.\]

The crude union bound argu­ment used above now does not give the cor­rect answer, but an only slightly bet­ter argu­ment is needed. Indeed, note that

    \begin{eqnarray*}     \mathbb{P}\Big[\max_v|C_v| \ge \gamma n^{2/3}\Big] &\le&     \mathbb{P}[|\{v:|C_v|\ge \gamma n^{2/3}\}|\ge \gamma n^{2/3}] \\     &\le&     \frac{\mathbb{E}[|\{v:|C_v|\ge \gamma n^{2/3}\}|]}{\gamma n^{2/3}} \\     &=&     \frac{n\,\mathbb{P}[\tau\ge \gamma n^{2/3}]}{\gamma n^{2/3}}     \lesssim \frac{1}{\gamma^{3/2}}. \end{eqnarray*}

There­fore, n^{-2/3}\max_v|C_v|=O(1). With some fur­ther work, a cor­re­spond­ing lower bound can also be proved. See the paper by Nach­mias and Peres or the notes by van der Hof­s­tad for the details.

It turns out that in the crit­i­cal case n^{-2/3}\max_v|C_v| is not only bounded in prob­a­bil­ity, but in fact con­verges weakly to some lim­it­ing dis­tri­b­u­tion. This dis­tri­b­u­tion, and much more, is beau­ti­fully described by Aldous in terms of Brown­ian excur­sions. This is an inter­est­ing exam­ple of the appli­ca­tion of sto­chas­tic analy­sis to dis­crete prob­a­bil­ity; unfor­tu­nately, we do not have the time to cover it.

In a dif­fer­ent direc­tion, it turns out that var­i­ous addi­tional phase tran­si­tions appear when we con­sider a finer scal­ing, for exam­ple, in the “crit­i­cal win­dow” c=1\pm c' n^{-1/3}. For an overview of the var­i­ous tran­si­tions, see, for exam­ple, sec­tion 11.1 in Alon and Spencer.

Con­nec­tiv­ity threshold

Rather than con­sid­er­ing the size of the largest com­po­nent, one could ask when the entire Erdös-Rényi graph is con­nected. Note that when 1<c<\infty, the con­stant \rho in the size (1-\rho)n of the giant com­po­nent is always strictly pos­i­tive, so the graph is not con­nected. There­fore, in order for the entire graph to be con­nected, we must let c\to\infty (that is, the edge prob­a­bil­ity p must be super­lin­ear). It turns out that the appro­pri­ate scal­ing for this ques­tion is c\sim \log n, and another phase tran­si­tion arises here.

The­o­rem. Let p=\frac{a\log n}{n}. If a>1, then the Erdös-Rényi graph G(n,p) is con­nected with prob­a­bil­ity tend­ing to 1 as n\to\infty. If a<1, the graph is con­nected with prob­a­bil­ity tend­ing to 0 as n\to\infty.

To get some intu­ition, con­sider the prob­a­bil­ity that a ver­tex v is iso­lated (that is, dis­con­nected from every other vertex):

    \[\mathbb{P}[v\text{ is isolated}] =    \mathbb{P}[\eta_{vw}=0\text{ for all }w\ne v] =    \bigg(1-\frac{c}{n}\bigg)^{n-1}\sim e^{-c}.\]

Thus for c=a\log n, we have

    \[\mathbb{E}[|\{v:v\text{ is isolated}\}|] \sim    n \,e^{-a\log n} = n^{1-a}.\]

In par­tic­u­lar, if a<1, then there must exist an iso­lated ver­tex with pos­i­tive prob­a­bil­ity as n\to\infty, in which case the graph is not con­nected (in fact, it is not hard to show that the vari­ance of the num­ber of iso­lated com­po­nents is of the same order as its mean, so that the prob­a­bil­ity that the graph is con­nected tends to zero). Some­what mirac­u­lously, it turns out that when there are no iso­lated ver­tices, then the graph must already be con­nected, so that we do indeed obtain the sharp tran­si­tion described in the The­o­rem above. For a proof of this fact (by a clever com­bi­na­to­r­ial argu­ment) see, for exam­ple, the lec­ture notes by van der Hof­s­tad. Alter­na­tively, one can use a ran­dom walk argu­ment entirely in the spirit of the proofs in the pre­vi­ous lec­tures to prove that the ran­dom graph is con­nected for a>1: by run­ning simul­ta­ne­ous explo­ration processes from dif­fer­ent ver­tices as we did in the proof of the super­crit­i­cal case, one can show that all con­nected com­po­nents must inter­sect when a>1 and thus the entire graph must be con­nected. See sec­tion 2.8 in Dur­rett for such an argument.

28. April 2013 by Ramon van Handel
Categories: Random graphs | Leave a comment

Lecture 6. Giant component (3)

Let us begin with a brief recap from the pre­vi­ous lec­ture. We con­sider the Erdös-Rényi ran­dom graph G(n,\frac{c}{n}) in the super­crit­i­cal case c>1. Recall that C_v denotes the con­nected com­po­nent of the graph that con­tains the ver­tex v. Our goal is to prove the exis­tence of the giant com­po­nent with size \max_v|C_v|\sim (1-\rho)n, while the remain­ing com­po­nents have size \lesssim\log n.

Fix \beta>0 suf­fi­ciently large (to be cho­sen in the proof), and define the set

    \[K=\{v :  |C_v| > \beta \log n\}\]

of ver­tices con­tained in “large” com­po­nents. The proof con­sists of two parts:

  • Part 1: \mathbb{P}[C_v = C_{v'}~\forall v,v' \in K] \to 1.
  • Part 2: \frac{|K|}{n} \to 1-\rho in probability.

Part 1 states that all the suf­fi­ciently large com­po­nents must inter­sect, form­ing the giant com­po­nent. Part 2 counts the num­ber of ver­tices in the giant com­po­nent. Part 2 was proved in the pre­vi­ous lec­ture. The goal of this lec­ture is to prove Part 1, which com­pletes the proof of the giant component.

Overview

As in the pre­vi­ous lec­tures, the cen­tral idea in the study of the giant com­po­nent is the explo­ration process (U_t,A_t,R_t), where

    \[|C_v| = \tau := \inf\{ t  :  |A_t|=0\}.\]

We have seen that |A_t| \approx S_t, where S_t is a ran­dom walk with increments

    \[X_t = S_t -S_{t-1} \sim \text{Binomial}(n,\tfrac{c}{n})-1, 	\qquad |A_0| = S_0 =1.\]

When c>1, we have \mathbb{E}[X_t] = c-1 > 0. Thus |A_t| is approx­i­mately a ran­dom walk with pos­i­tive drift. The intu­itive idea behind the proof of Part 1 is as fol­lows. Ini­tially, the ran­dom walk can hit 0 rapidly, in which case the com­po­nent is small. How­ever, if the ran­dom walk drifts away from zero, then with high prob­a­bil­ity it will never hit zero, in which case the com­po­nent must keep grow­ing until the ran­dom walk approx­i­ma­tion is no longer accu­rate. Thus there do not exist any com­po­nents of inter­me­di­ate size: each com­po­nent is either very small (|C_v|\le \beta\log n) or very large (we will show |C_v|\ge n^{2/3}, but the pre­cise expo­nent is not important).

We now want to argue that any pair of large com­po­nents must nec­es­sar­ily inter­sect. Con­sider two dis­joint sets I and J of ver­tices of size |I|,|J|\ge n^{2/3}. As each edge is present in the graph with prob­a­bil­ity c/n, the prob­a­bil­ity that there is no edge between I and J is

    \[\mathbb{P}[\eta_{ij}=0\mbox{ for all }i\in I,~j\in J]= 	\bigg(1-\frac{c}{n}\bigg)^{|I|\,|J|} \le 	\bigg(1-\frac{c}{n}\bigg)^{n^{4/3}} \le 	e^{-cn^{1/3}}.\]

We there­fore expect that any pair of large com­po­nents must inter­sect with high prob­a­bil­ity. The prob­lem with this argu­ment is that we assumed that the sets I and J are non­ran­dom, while the ran­dom sets C_v them­selves depend on the edge struc­ture of the ran­dom graph (so the events \{C_v=I,C_{v'}=J\} and \{\mbox{no edges between }I,J\} are highly cor­re­lated). To actu­ally imple­ment this idea, we there­fore need a lit­tle bit more sophis­ti­cated approach.

To make the proof work, we revisit more care­fully our ear­lier ran­dom walk argu­ment. The process |A_t|\approx S_t has pos­i­tive drift as \mathbb{E}[S_t] = S_0 + (c-1)t. Thus the process |A_t|-(c-1)t/2 is still approx­i­mately a ran­dom walk with pos­i­tive drift! Apply­ing the above intu­ition, either |A_t| dies rapidly (the com­po­nent is small), or |A_t| grows lin­early in t as is illus­trated in the fol­low­ing figure:

Rendered by QuickLaTeX.com

This means that the explo­ration process for a com­po­nent of size >\beta\log n will not only grow large (|A_{n^{2/3}}|>0) with high prob­a­bil­ity, but that the explo­ration process will also pos­sess a large num­ber of active ver­tices (|A_{n^{2/3}}|\gtrsim n^{2/3}). To prove that all large com­po­nents inter­sect, we will run dif­fer­ent explo­ration processes simul­ta­ne­ously start­ing from dif­fer­ent ver­tices. We will show that if two of these processes reach a large num­ber of active ver­tices then there must be an edge between them with high prob­a­bil­ity, and thus the cor­re­spond­ing com­po­nents must coin­cide. This resolves the depen­dence prob­lem in our naive argu­ment, as the edges between the sets of active ver­tices have not yet been explored and are there­fore inde­pen­dent of the his­tory of the explo­ration process.

The com­po­nent size dichotomy

We now begin the proof in earnest. We will first show the dichotomy between large and small com­po­nents: either the com­po­nent size is \le\beta\log n, or the num­ber of active ver­tices |A_t| grows lin­early up to time n^{2/3}. To be pre­cise, we con­sider the fol­low­ing event:

    \[B := \Big\{\text{either} ~  	|C_v| \le \beta \log n , 	~ \text{or} ~  	|A_t| \ge \Big(\frac{c-1}{2}\Big) t  	~ \text{for all} ~ \beta \log n \le t  	\le n^{2/3}\Big\}.\]

Our goal is to show that \mathbb{P}[B] is large.

Define the stop­ping time

    \[\sigma := \inf\Big\{t \ge \beta \log n  : 	|A_t| < \Big(\frac{c-1}{2}\Big)t \Big\}.\]

We can write

    \[B = \{ \tau \le \beta \log n ~ \text{or} ~ \sigma > n^{2/3} \}.\]

Now sup­pose \tau >\beta \log n and \sigma =t. Then \tau \ge t, as explo­ration process is alive at time \beta \log n and stays alive until time t. We can there­fore write

    \[\mathbb{P}[B^c] = \sum_{s=\beta \log n}^{n^{2/3}}  	\mathbb{P}[\tau>\beta \log n ~ \text{and} ~ \sigma = s] \le 	\sum_{s=\beta \log n}^{n^{2/3}}  	\mathbb{P}\Big[|A_s|  	< \Big(\frac{c-1}{2}\Big)s 	~ \text{and} ~ 	s\le\tau\Big].\]

To bound the prob­a­bil­i­ties inside the sum, we com­pare |A_s| to a suit­able ran­dom walk.

The ran­dom walk argument

To bound the prob­a­bil­ity that |A_s| < (c-1)s/2, we must intro­duce a com­par­i­son ran­dom walk that lies beneath |A_t|. We use the same con­struc­tion as was used in the pre­vi­ous lec­ture. Let

    \begin{equation*} 	\bar S_{t+1}=\left\{ 	\begin{array}{ll} 	\bar S_t -1 +  	\sum_{w \in U_t} \eta_{wi_t} +  	\sum_{w=1}^{n-(\frac{c+1}{2})n^{2/3} - |U_t|}  	\tilde \eta _w^t &\quad\text{if }  	|U_t|<n-(\frac{c+1}{2}) n^{2/3},\\ 	\bar S_t -1 +  	\sum_{w \in \bar U_t} \eta_{wi_t} & \quad   	\text{if } 	|U_t|\ge n-(\frac{c+1}{2}) n^{2/3} . 	\end{array}  	\right. \end{equation*}

where \bar S_0=1, \tilde \eta_w^t are i.i.d. \text{Bernoulli}(\frac{c}{n}) ran­dom vari­ables inde­pen­dent of \eta_{ij}, i_t=\min A_t (the same i_t used in the explo­ration process), and \bar U_t is the set of the first n-(\frac{c+1}{2})n^{2/3} com­po­nents of U_t (if t \ge \tau, then A_t=\varnothing and thus i_t is unde­fined; then we sim­ply add n-(\frac{c+1}{2})n^{2/3} vari­ables \tilde\eta_w^t).

As in the pre­vi­ous lec­ture, we have:

  • \bar S_t is a ran­dom walk with \text{Binomial}(n-(\frac{c+1}{2})n^{2/3},\frac{c}{n})-1 increments.
  • \bar S_t \le |A_t| when­ever t \le \tau and |U_t| \ge n-(\frac{c+1}{2})n^{2/3}.

Now sup­pose that s\le n^{2/3} and |A_s|<(\frac{c-1}{2})s. Then

    \[|U_s| = n - |A_s| - |R_s| \ge 	n - (\tfrac{c+1}{2})s \ge n - (\tfrac{c+1}{2})n^{2/3}.\]

We there­fore obtain for s\le n^{2/3}

    \[\mathbb{P}\Big[|A_s|  	< \Big(\frac{c-1}{2}\Big)s 	~ \text{and} ~ 	s\le\tau\Big] \le 	\mathbb{P}\Big[\bar S_s < \Big(\frac{c-1}{2}\Big)s\Big].\]

Thus com­put­ing \mathbb{P}[B^c] reduces to com­pute the tail prob­a­bil­ity of a ran­dom walk (or, in less fancy terms, a sum of i.i.d. ran­dom vari­ables). That is some­thing we know how to do.

Lemma (Cher­noff bound). Let X\sim \text{Binomial}(n,p). Then

    \[\mathbb{P}[X\le np-t] \le e^{-t^2/2np}.\]

Proof. Let \gamma>0. Then

    \begin{eqnarray*} 	\mathbb{P}[X\le np-t] &=& 	\mathbb{P}[e^{-\gamma X} \ge e^{-\gamma np + \gamma t}] \\ 	&\le& e^{\gamma np - \gamma t} \, \mathbb{E}[e^{-\gamma X}]\\ 	&=& e^{\gamma np - \gamma t} (1- (1-e^{-\gamma})p)^n \\ 	&\le& e^{\{\gamma-(1-e^{-\gamma})\}np - \gamma t}\\ 	&\le& e^{\gamma^2 np/2 - \gamma t}. \end{eqnarray*}

The result fol­lows by opti­miz­ing over \gamma>0. \quad\square

Note that \bar S_s \sim 1 - s + \text{Binomial}(\{n-(\frac{c+1}{2})n^{2/3}\}s,\frac{c}{n}). We there­fore have by the Cher­noff bound

    \[\mathbb{P}\Big[\bar S_s < \Big(\frac{c-1}{2}\Big)s\Big] \le 	\mathbb{P}\Big[ 	\text{Binomial}(\{n-(\tfrac{c+1}{2})n^{2/3}\}s,\tfrac{c}{n}) 	\le \Big(\frac{c+1}{2}\Big)s\Big] \le 	e^{-(c-1-o(1))^2 s/8c}\]

for all s (here o(1) depends only on n and c). In par­tic­u­lar, we have

    \[\mathbb{P}\Big[\bar S_s < \Big(\frac{c-1}{2}\Big)s\Big] \le 	n^{-\beta(c-1)^2/9c} 	\quad\text{for all }s\ge\beta\log n\]

pro­vided n is suf­fi­ciently large. Thus we can estimate

    \[\mathbb{P}[B^c] \le 	\sum_{s=\beta \log n}^{n^{2/3}}  	\mathbb{P}\Big[\bar S_s < \Big(\frac{c-1}{2}\Big)s\Big]  	\le 	n^{2/3-\beta(c-1)^2/9c},\]

which goes to zero as n\to\infty pro­vided that \beta is cho­sen suf­fi­ciently large. In par­tic­u­lar, the com­po­nent size dichotomy fol­lows: choos­ing any \beta>15c/(c-1)^2, we obtain

    \[\mathbb{P}[|C_v|\le\beta\log n 	\text{ or }|C_v|\ge n^{2/3}\text{ for all }v] \ge 	1-n\mathbb{P}[B^c] \ge  	1-n^{5/3-\beta(c-1)^2/9c}\xrightarrow{n\to\infty}0.\]

Remark: Unlike in the proof of Part 2 in the pre­vi­ous lec­ture, here we do need to choose \beta suf­fi­ciently large for the proof to work. If \beta is too small, then the ran­dom walk \bar S_t can­not move suf­fi­ciently far away from zero to ensure that it will never return. In par­tic­u­lar, even in the super­crit­i­cal case, the sec­ond largest com­po­nent has size of order \log n.

Large com­po­nents must intersect

To com­plete the proof, it remains to show that all large com­po­nents must inter­sect. To do this, we will run sev­eral explo­ration processes at once start­ing from dif­fer­ent ver­tices. If the sets of active ver­tices of two of these processes grow large, then there must be an edge between them with high prob­a­bil­ity, and thus the cor­re­spond­ing com­po­nents inter­sect. Let us make this argu­ment precise.

In the fol­low­ing, we denote by (U_t^v,A_t^v,R_t^v) the explo­ration process started at A_0=\{v\}. For each such process, we define the cor­re­spond­ing set B^v that we have inves­ti­gated above:

    \[B_v := \Big\{\text{either} ~  	|C_v| \le \beta \log n , 	~ \text{or} ~  	|A_t^v| \ge \Big(\frac{c-1}{2}\Big) t  	~ \text{for all} ~ \beta \log n \le t  	\le n^{2/3}\Big\}.\]

We have shown above that, pro­vided \beta>15c/(c-1)^2, we have

    \[\mathbb{P}\bigg[\bigcap_v B_v\bigg] 	\ge 1 - \sum_v \mathbb{P}[B_v^c] \ge 	1-o(1).\]

We can there­fore estimate

    \begin{eqnarray*} 	&& 	\mathbb{P}[\exists\, v,v' \in K \text{ such that } C_v \neq C_{v'}] \\ 	&& \mbox{}= 	\mathbb{P}\big[\exists\, v,v' \in K \text{ such that } C_v \neq C_{v'}, 	\text{ and } 	|A^v_{n^{2/3}}| \ge (\tfrac{c-1}{2})n^{2/3} 	~\forall\, v \in K\big] + o(1) \phantom{\sum}\\ 	&& \mbox{}\le 	\sum_{v,v'} 	\mathbb{P}\big[C_v \neq C_{v'} \text{ and } 	 |A^v_{n^{2/3}}| \wedge |A_{n^{2/3}}^{v'}| \ge  	(\tfrac{c-1}{2})n^{2/3}\big]+o(1). \end{eqnarray*}

Now note that by time t, the explo­ration process (U_t^v,A_t^v,R_t^v) has only explored edges \eta_{ij} where i\in R_t^v (or j\in R_t^v), and sim­i­larly for (U_t^{v'},A_t^{v'},R_t^{v'}). It fol­lows that

    \[\text{The conditional law of } 	\{\eta_{ij} : i,j \not\in R_t^v\cup R_t^{v'}\} 	\text{ given } 	(A_t^v,R_t^v,A_t^{v'},R_t^{v'}) 	\text{ is i.i.d.\ Binomial}(\tfrac{c}{n}).\]

In par­tic­u­lar, if I^v,J^v,I^{v'},J^{v'} are dis­joint sub­sets of ver­tices, then

    \[\mathbb{P}[\text{no edge between } I^v, I^{v'}  	| A_t^v =I^v, R_t^v=J^v,A_t^{v'}=I^{v'},R_t^{v'}=J^{v'}]  	= \bigg(1-\frac cn \bigg)^{|I^v|\, |I^{v'}|}.\]

On the other hand, C_v \ne C_{v'} implies that R_t^v,A_t^v,R_t^{v'},A_t^{v'} must be dis­joint at every time t. Thus if C_v \ne C_{v'}, there can be no edges between ver­tices in A_t^v and A_t^{v'} at any time t (if such an edge exists, then the ver­tices con­nected by this edge will even­tu­ally be explored by both explo­ration processes, and then the sets of removed ver­tices will no longer be dis­joint). Therefore,

    \begin{eqnarray*} 	&&  	\mathbb{P}\big[C_v \neq C_{v'} \text{ and } 	 |A^v_{n^{2/3}}| \wedge |A_{n^{2/3}}^{v'}| \ge  	(\tfrac{c-1}{2})n^{2/3}\big] \\ 	&& \mbox{} \le 	\mathbb{P}\big[ 	\text{no edge between }A^v_{n^{2/3}},A^{v'}_{n^{2/3}},\\ 	&& \phantom{\mbox{} \le \mathbb{P}\big[} 	A^v_{n^{2/3}},R^v_{n^{2/3}}, 	A^{v'}_{n^{2/3}},R^{v'}_{n^{2/3}}\text{ are disjoint}, ~ 	|A^v_{n^{2/3}}| \wedge |A_{n^{2/3}}^{v'}| \ge  	(\tfrac{c-1}{2})n^{2/3}\big] \\ 	&& \mbox{}\le 	\bigg(1-\frac{c}{n} \bigg)^{(c-1)^2n^{4/3}/4} 	\le e^{-c(c-1)^2n^{1/3}/4}. \end{eqnarray*}

Thus we finally obtain

    \[\mathbb{P}[C_v = C_{v'}~\forall v,v' \in K] \ge 	1- n^2e^{-c(c-1)^2n^{1/3}/4} - o(1) 	\xrightarrow{n\to\infty}1,\]

and the proof of the giant com­po­nent the­o­rem is complete.

Many thanks to Quentin Berthet for scrib­ing this lecture!

27. April 2013 by Ramon van Handel
Categories: Random graphs | Leave a comment

Lecture 5. Giant component (2)

Con­sider the Erdös-Rényi graph model G(n,\frac{c}{n}), and denote as usual by C_v the con­nected com­po­nent of the graph that con­tains the ver­tex v. In the last lec­ture, we focused mostly on the sub­crit­i­cal case c < 1, where we showed that \max_v|C_v|\lesssim\log n. Today we will begin devel­op­ing the super­crit­i­cal case c > 1, where \max_v|C_v| \sim (1-\rho)n for a suit­able con­stant 0<\rho<1. In par­tic­u­lar, our aim for this and next lec­ture is to prove the fol­low­ing theorem.

The­o­rem. Let c>1. Then

    \[\frac{\max_v|C_v|}{n}\xrightarrow{n\to\infty} 1 - \rho 	\quad\mbox{in probability},\]

where \rho is the small­est pos­i­tive solu­tion of the equa­tion \rho = e^{c(\rho-1)}. More­over, there is a \beta>0 such that all but one of the com­po­nents have size \leq \beta\log n with prob­a­bil­ity tend­ing to 1 as n\to\infty.

Begin­ning of the proof. Define the set

    \[K = \{ v: |C_v| > \beta \log n\}.\]

The proof of the The­o­rem con­sists of two main parts:

  • Part 1: \mathbb{P}[C_v = C_{v'}~ \forall v,v' \in K] \xrightarrow{n \to \infty} 1.
  • Part 2: \frac{|K|}{n} \xrightarrow{n \to \infty} 1-\rho in probability.

Part 1 states that all “large” com­po­nents of the graph must inter­sect, form­ing one giant com­po­nent. Some intu­ition for why this is the case was given at the end of the pre­vi­ous lec­ture. Part 2 com­putes the size of this giant com­po­nent. In this lec­ture, we will con­cen­trate on prov­ing Part 2, and we will find out where the mys­te­ri­ous con­stant \rho comes from. In the next lec­ture, we will prove Part 1, and we will develop a detailed under­stand­ing of why all large com­po­nents must intersect.

Before we pro­ceed, let us com­plete the proof of the The­o­rem assum­ing Parts 1 and 2 have been proved. First, note that with prob­a­bil­ity tend­ing to one, the set K is itself a con­nected com­po­nent. Indeed, if v\in K and v'\not\in K then v,v' must lie in dis­joint con­nected com­po­nents by the def­i­n­i­tion of K. On the other hand, with prob­a­bil­ity tend­ing to one, all v,v'\in K must lie in the same con­nected com­po­nent by Part 1. There­fore, with prob­a­bil­ity tend­ing to one, the set K forms a sin­gle con­nected com­po­nent of the graph. By Part 2, the size of this com­po­nent is \sim (1-\rho)n, while by the def­i­n­i­tion of K, all other com­po­nents have size \le \beta\log n. This com­pletes the proof. \quad\square

The remain­der of this lec­ture is devoted to prov­ing Part 2 above. We will first prove that the claim holds on aver­age, and then prove con­cen­tra­tion around the aver­age. More pre­cisely, we will show:

  1. \mathbb{E}\big[\frac{|K|}{n}\big] \xrightarrow{n\to\infty} 1-\rho.
  2. \mathrm{Var}\big[\frac{|K|}{n}\big] \xrightarrow{n\to\infty}0.

Together, these two claims evi­dently prove Part 2.

Mean size of the giant component

We begin by writ­ing out the mean size of the giant component:

    \[\mathbb{E}\bigg[\frac{|K|}{n}\bigg] =  	\mathbb{E}\Bigg[ 	\frac{1}{n} \sum_{v \in [n]}  	\mathbf{1}_{|C_v|>\beta\log n} 	\Bigg] =  	\frac{1}{n} \sum_{v \in [n]}  	\mathbb{P}[\left|C_v\right| > \beta \log n ] = 	\mathbb{P}[\left|C_v\right| > \beta \log n ],\]

where we note that \mathbb{P}[\left|C_v\right| > \beta \log n ] does not depend on the ver­tex v by the sym­me­try of the Erdös-Rényi model. There­fore, to prove con­ver­gence of the mean size of the giant com­po­nent, it suf­fices to prove that

    \[\mathbb{P}[\left|C_v\right| \le \beta \log n ] 	\xrightarrow{n\to\infty}\rho.\]

This is what we will now set out to accomplish.

In the pre­vi­ous lec­ture we defined explo­ration process (U_t, A_t, R_t). We showed that

    \[|C_v| = \tau := \inf \{t: |A_t| = 0\}\]

and that for t<\tau

    \[|A_{t+1}| = |A_t| - 1 + \sum_{w \in U_t} \eta_{wi_t}, 	\qquad |A_0|=1,\]

where i_t \in A_t is an arbi­trary nonan­tic­i­pat­ing choice, say, i_t = \min A_t (recall that (\eta_{ij})_{i,j\in[n]} denotes the adja­cency matrix of the ran­dom graph). As \eta_{ij} are i.i.d. \mathrm{Bernoulli}(\frac{c}{n}) and as edges ema­nat­ing from the set of unex­plored ver­tices U_t have not yet appeared in pre­vi­ous steps, the process |A_t| is “almost” a ran­dom walk: it fails to be a ran­dom walk as we only add |U_t| Bernoulli vari­ables in each iter­a­tion, rather than a con­stant num­ber. In the last lec­ture, we noted that we can esti­mate |A_t| from above by a gen­uine ran­dom walk S_t by adding some fic­ti­tious ver­tices. To be pre­cise, we define

    \[S_{t+1}=S_t - 1 + \sum_{w\in U_t} \eta_{wi_t} + 	\sum_{w=1}^{n-|U_t|}\tilde\eta_w^t,\qquad S_0=1,\]

where \tilde\eta_w^t are i.i.d. \mathrm{Bernoulli}(\frac{c}{n}) inde­pen­dent of the \eta_{ij} (if t\ge\tau, then A_t=\varnothing and thus i_t is unde­fined; in this case, we sim­ply add all n vari­ables \tilde\eta_w^t). In the present lec­ture, we also need to bound |A_t| from below. To this end, we intro­duce another process \bar S_t as follows:

    \[\bar S_{t+1}=\left\{ 	\begin{array}{ll} 	\bar S_t - 1 + \sum_{w\in U_t} \eta_{wi_t} + 		\sum_{w=1}^{n-\beta\log n-|U_t|}\tilde\eta_w^t 	& \mbox{if }|U_t|< n-\beta\log n,\\ 	\bar S_t - 1 + \sum_{w\in \bar U_t} \eta_{wi_t} 	& \mbox{if }|U_t|\ge n-\beta\log n, 	\end{array}\right. 	\qquad \bar S_0=1,\]

where \bar U_t is the set con­sist­ing of the first n-\beta\log n ele­ments of U_t in increas­ing order of the ver­tices (if t\ge\tau, we add n-\beta\log n vari­ables \tilde\eta_w^t). The idea behind these processes is that S_t is engi­neered, by includ­ing “fic­ti­tious” ver­tices, to always add n i.i.d. Bernoulli vari­ables in every iter­a­tion, while \bar S_t is engi­neered, by includ­ing “fic­ti­tious” ver­tices when |U_t| is small and omit­ting ver­tices when |U_t| is large, to always add n-\beta\log n i.i.d. Bernoulli vari­ables in every iter­a­tion. The fol­low­ing facts are immediate:

  • S_t is a ran­dom walk with i.i.d. \mathrm{Binomial}(n,\frac{c}{n})-1 increments.
  • \bar S_t is a ran­dom walk with i.i.d. \mathrm{Binomial}(n-\beta\log n,\frac{c}{n})-1 increments.
  • S_t\ge |A_t| for all t\le\tau.
  • \bar S_t\le |A_t| for all t\le\tau on the event \{|C_v|\le\beta\log n\}.

To see the last prop­erty, note that the explo­ration process can only explore as many ver­tices as are present in the con­nected com­po­nent C_v, so that |U_t|\ge n-|C_v| for all t; there­fore, in this sit­u­a­tion only the sec­ond pos­si­bil­ity in the def­i­n­i­tion of \bar S_t occurs, and it is obvi­ous by con­struc­tion that then \bar S_t\le |A_t| (nonethe­less, the first pos­si­bil­ity in the def­i­n­i­tion must be included to ensure that \bar S_t is a ran­dom walk).

We now define the hit­ting times

    \[T = \inf\{t:S_t=0\},\qquad 	\bar T = \inf\{t:\bar S_t=0\}.\]

Then we evi­dently have

    \[\mathbb{P}[T \le \beta \log n] \le  	\mathbb{P}[|C_v| \le \beta \log n] \le  	\mathbb{P}[\bar T \le \beta \log n].\]

(Note how we clev­erly chose the ran­dom walk \bar S_t pre­cisely so that \bar T\le |C_v| when­ever |C_v|\le \beta\log n). We have there­fore reduced the prob­lem of com­put­ing \mathbb{P}[|C_v| \le \beta \log n] to com­put­ing the hit­ting prob­a­bil­i­ties of ran­dom walks. Now we are in busi­ness, as this is some­thing we know how to do using martingales!

The hit­ting time computation

Let us take a moment to gather some intu­ition. The ran­dom walks S_t and \bar S_t have incre­ments dis­trib­uted as \mathrm{Binomial}(n,\frac{c}{n})-1 and \mathrm{Binomial}(n-\beta\log n,\frac{c}{n})-1, respec­tively. As n\to\infty, both incre­ment dis­tri­b­u­tions con­verge to a \mathrm{Poisson}(c)-1 dis­tri­b­u­tion, so we expect that \mathbb{P}[|C_v|\le\beta\log n]\sim\mathbb{P}[T_0\le\beta\log n] where T_0 is the first hit­ting time of the Pois­son ran­dom walk. On the other hand, as \mathbb{P}[T_0\le\beta\log n] \to \mathbb{P}[T_0<\infty], we expect that \mathbb{P}[|C_v|\le\beta\log n]\to \mathbb{P}[T_0<\infty]. The prob­lem then reduces to com­put­ing the prob­a­bil­ity that a Pois­son ran­dom walk ever hits the ori­gin. This com­pu­ta­tion can be done explic­itly, and this is pre­cisely where the mys­te­ri­ous con­stant \rho=\mathbb{P}[T_0<\infty] comes from!

We now pro­ceed to make this intu­ition pre­cise. First, we show that the prob­a­bil­ity \mathbb{P}[T<\beta\log n] can indeed be replaced by \mathbb{P}[T<\infty], as one might expect.

Lemma. \mathbb{P}[T \le \beta \log n] = \mathbb{P}[T < \infty] - o(1) as n\to\infty.

Proof. We need to show that

    \[\mathbb{P} [\beta \log n < T < \infty] \xrightarrow{n \to \infty} 0.\]

Note that as S_T=0 when T<\infty,

    \[\mathbb{P}[k \le T < \infty] = \sum_{t=k}^{\infty}  	\mathbb{P}[T=t] \le \sum_{t=k}^{\infty} \mathbb{P}[S_t = 0].\]

We can evi­dently write

    \[\mathbb{P}[S_t = 0] =  	\mathbb{E}[\mathbf{1}_{S_t=0} e^{\gamma S_t} ]  	\le  	\mathbb{E} [e^{\gamma S_t}] =  	e^{\gamma} \mathbb{E}[e^{\gamma X_1}]^t\]

where S_t - S_{t-1} =: X_t \sim \mathrm{Binomial}(n,\frac{c}{n})-1 and

    \[\mathbb{E} [e^{\gamma X_1}] = e^{-\gamma}  	(1+\tfrac{c}{n}(e^{\gamma}-1))^n  	\le e^{c(e^{\gamma}-1)-\gamma}.\]

Choos­ing \gamma = -\log c, we obtain \mathbb{E} [e^{\gamma X_1}] \le e^{1-c + \log c} < 1 for c \ne 1. There­fore,

    \[\mathbb{P} [\beta\log n < T < \infty] \le  	\frac{1}{c} \sum_{t=\beta\log n}^{\infty}  	e^{(1-c + \log c)t} 	\xrightarrow{n\to\infty}0.\]

This com­pletes the proof. \qquad\square

By the above Lemma, and a triv­ial upper bound, we obtain

    \[\mathbb{P} [T< \infty] - o(1) \le  	\mathbb{P}[|C_v| \le \beta \log n] \le \mathbb{P}[\bar T < \infty].\]

To com­plete com­pu­ta­tion of the mean size of the giant com­po­nent, it there­fore remains to show that \mathbb{P} [T<\infty] and \mathbb{P}[\bar T<\infty] con­verge to \rho. In fact, we can com­pute these quan­ti­ties exactly.

Lemma. Let c>1, \frac{c}{n} < 1. Then

    \[\mathbb{P} [T<\infty] = \rho_n\]

where \rho_n is the small­est pos­i­tive solu­tion of \rho_n = (1+\frac{c}{n}(\rho_n-1))^n.

Proof. Recall the mar­tin­gale M_t used in last lecture:

    \[M_t = e^{\gamma S_t - \phi(\gamma)t}, 	\qquad 	\phi(\gamma)=\log \mathbb{E}[e^{\gamma X_1}] = 	\log\big[e^{-\gamma}(1+\tfrac{c}{n}(e^{\gamma}-1))^n\big].\]

Sup­pose that \gamma < 0 and \phi(\gamma) > 0. Then

    \[\mathbb{E} [e^{-\phi(\gamma)T}] =  	\mathbb{E} \Big[\lim_{k \to \infty} M_{k \wedge T}\Big]  	= \lim_{k \to \infty} \mathbb{E} [M_{k \wedge T}] = M_0 = e^{\gamma}.\]

The first equal­ity holds since if T < \infty then S_T = 0 and M_{k \wedge T} \to M_T = e^{-\phi(\gamma)T}, while if T = \infty then S_{k} \ge 0 and M_k \to 0 = e^{-\phi(\gamma)T}. The sec­ond equal­ity holds by dom­i­nated con­ver­gence since 0 \le M_{k \wedge T} \le 1, and the third equal­ity is by the optional stop­ping theorem.

Now sup­pose we can find \gamma_n < 0 such that \phi(\gamma) \downarrow 0 as \gamma \uparrow \gamma_n. Then we have

    \[\rho_n := 	e^{\gamma_n} = \lim_{\gamma \uparrow \gamma_n}  	\mathbb{E}[e^{-\phi(\gamma)T}] = 	\mathbb{E} \Big[\lim_{\gamma \uparrow \gamma_n} e^{-\phi(\gamma)T}\Big]  	= \mathbb{P}[T < \infty]\]

by dom­i­nated con­ver­gence. Thus, evi­dently, it suf­fices to find \gamma_n with the req­ui­site prop­er­ties. Now note that as \phi(\gamma_n)=0, \gamma_n<0, and \phi(\gamma)>0 for \gamma<\gamma_n, we evi­dently must have

    \[\rho_n = (1+\tfrac{c}{n}(\rho_n-1))^n,\qquad 	\rho_n < 1,\qquad 	\rho < (1+\tfrac{c}{n}(\rho-1))^n \mbox{ for }\rho<\rho_n.\]

We can find such \rho_n by inspect­ing the fol­low­ing illustration:

Rendered by QuickLaTeX.com

Evi­dently the req­ui­site assump­tions are sat­is­fied when \rho_n is the small­est root of the equa­tion \rho_n=(1+\tfrac{c}{n}(\rho_n-1))^n (but not for the larger root at 1!) \quad\square

Remark. Note that the super­crit­i­cal case c>1 is essen­tial here. If c\le 1 then the equa­tion for \rho_n has no solu­tions <1, and the argu­ment in the proof does not work. In fact, when c\le 1, we have \mathbb{P}[T<\infty]=1.

By an imme­di­ate adap­ta­tion of the proof of the pre­vi­ous lemma, we obtain

    \[\mathbb{P}[\bar T < \infty] = \bar \rho_n\]

where \bar\rho_n is the small­est pos­i­tive solu­tion of \bar \rho_n = (1+\frac{c}{n}(\bar \rho_n-1))^{n-\beta \log n}. Let­ting n\to\infty, we see that

    \[\lim_{n \to \infty} \mathbb{P}[|C_v|\le\beta\log n] =  	\lim_{n \to \infty} \mathbb{P}[T<\infty] =  	\lim_{n \to \infty} \mathbb{P}[\bar T<\infty] = \rho,\]

where \rho is the small­est solu­tion of the equa­tion \rho=e^{c(\rho-1)} (which is pre­cisely the prob­a­bil­ity that the Pois­son ran­dom walk hits zero, by the iden­ti­cal proof to the lemma above). We have there­fore proved

    \[\mathbb{E}\bigg[\frac{|K|}{n}\bigg] \xrightarrow{n\to\infty} 1-\rho.\]

Vari­ance of the giant com­po­nent size

To com­plete the proof of Part 2 of the giant com­po­nent the­o­rem, it remains to show that

    \[\mathrm{Var}\bigg[\frac{|K|}{n}\bigg] = 	\mathrm{Var}\bigg[1-\frac{|K|}{n}\bigg]  	\xrightarrow{n\to\infty} 0.\]

To this end, let us consider

    \[\mathbb{E}\bigg[\bigg(1-\frac{|K|}{n}\bigg)^2\bigg] = 	\mathbb{E}\bigg[\bigg(\frac{1}{n} \sum_{v \in [n]}  		\mathbf{1}_{|C_v| \le \beta \log n}\bigg)^2\bigg] =  	\frac{1}{n^2} \sum_{v,w \in [n]}  	\mathbb{P}[|C_v| \le \beta \log n, |C_w| \le \beta \log n].\]

To esti­mate the terms in this sum, we con­di­tion on one of the components:

    \[\begin{array}{lcl} 	\mathbb{P}[|C_v| \le \beta \log n, |C_w| \le \beta \log n] 	&=& 	\mathbb{E}[ 	\,\mathbb{P}[|C_v| \le \beta \log n|C_w]\, 	\mathbf{1}_{|C_w| \le \beta \log n}\,] \\ 	&=& 	\sum_{I \subseteq [n], |I| \le \beta\log(n)}  	\mathbb{P}[C_w = I]\, \mathbb{P}[|C_v| \le \beta\log n|C_w = I]. 	\end{array}\]

To pro­ceed, note that the event \{C_w=I\} can be writ­ten as

    \[\{C_w=I\} = \{(\eta_{ij})_{i,j\in I}\mbox{ defines a connected  	subgraph and }\eta_{ij}=0\mbox{ when }i\in I,~j\not\in I\}.\]

In par­tic­u­lar, the event \{C_w=I\} is inde­pen­dent of the edges \eta_{ij} for i,j\not\in I. There­fore, for v\not\in I, the con­di­tional law of C_v given C_w=I coin­cides with the (uncon­di­tional) law of C_v^{[n]\backslash I}, the con­ncted com­po­nent con­tain­ing v in the induced sub­graph on the ver­tices [n]\backslash I:

    \[\mathbb{P}[|C_v| \le \beta\log n|C_w = I] = 	\mathbb{P}[|C_v^{[n]\backslash I}|\le \beta\log n]\qquad 	\mbox{for }v\not\in I.\]

As this quan­tity only depends on |I| by the sym­me­try of the Erdös-Rényi model, we can evi­dently write

    \[\mathbb{P}[|C_v| \le \beta\log n|C_w = I] = 	\mathbb{P}[|C_1^{[n-|I|]}|\le \beta\log n] \le 	\mathbb{P}[|C_1^{[n-\beta\log n]}|\le \beta\log n]\]

for v\not\in I, |I|\le\beta\log n. In par­tic­u­lar, we obtain

    \[\sum_{v,w\in[n]} 	\mathbb{P}[|C_v| \le \beta \log n, |C_w| \le \beta \log n] 	\le 	\mathbb{E}[n-|K|]\,\{ 	\beta\log n + n\, \mathbb{P}[|C_1^{[n-\beta\log n]}|\le \beta\log n]\}.\]

Now note that, by its def­i­n­i­tion, C_1^{[k]} is dis­trib­uted pre­cisely as the com­po­nent con­tain­ing ver­tex 1 in the G(k,\frac{c}{n}) ran­dom graph model. We can there­fore show, repeat­ing exactly the proof of the mean size of the giant com­po­nent above, that

    \[\mathbb{P}[|C_1^{[n-\beta\log n]}|\le \beta\log n] = 	\rho + o(1) =  	\mathbf{E}\bigg[1-\frac{|K|}{n}\bigg] + o(1).\]

We have there­fore shown that

    \[\mathbb{E}\bigg[\bigg(1-\frac{|K|}{n}\bigg)^2\bigg] = 	\frac{1}{n^2}\sum_{v,w\in[n]}  	\mathbb{P}[|C_v| \le \beta \log n, |C_w| \le \beta \log n] 	\le 	\mathbf{E}\bigg[1-\frac{|K|}{n}\bigg]^2+o(1),\]

which evi­dently implies

    \[\mathrm{Var}\bigg[1-\frac{|K|}{n}\bigg] \le o(1).\]

This is what we set out to prove.

Remark. It should be noted that the proof of Part 2 did not depend on the value of \beta, or even on the \beta\log n rate, in the def­i­n­i­tion of the set K: any sequence that grows sub­lin­early to infin­ity would have given the same result. This sug­gests that all but a van­ish­ing frac­tion of ver­tices are con­tained in con­nected com­po­nents of order \sim 1 or \sim n. We find out only in the next lec­ture why the rate \beta\log n (for \beta suf­fi­ciently large!) is impor­tant: only suf­fi­ciently large con­nected com­po­nents are guar­an­teed to inter­sect, while there might (and do) exist com­po­nents of order \sim\log n that are dis­joint from the giant com­po­nent. If we do not exclude the lat­ter, we will not be able to prove Part 1.

Many thanks to Weichen Wang for scrib­ing this lecture!

17. April 2013 by Ramon van Handel
Categories: Random graphs | Leave a comment

Lecture 4. Giant component (1)

Con­sider the Erdös-Rényi graph model G(n,p). In pre­vi­ous lec­tures, we focused on the “high com­plex­ity regime”, i.e., as n goes to infin­ity, p is fixed. We dis­cussed top­ics such as clique num­bers and chro­matic num­bers. From now on, we shall con­sider the “low com­plex­ity regime”, where as n goes to infin­ity, p=\frac{c}{n} for a fixed con­stant c>0. As before, let \eta=(\eta_{i,j})_{i,j\in[n]} be the adja­cency matrix of G(n, \frac{c}{n}). Then \{\eta_{ij}: i,j\in[n], i<j\} are i.i.d. Bernoulli ran­dom vari­ables with suc­cess prob­a­bil­ity p=\frac{c}{n}.

The­o­rem 1 Let C_v be the con­nected com­po­nent of G(n, \frac{c}{n}) that con­tains v\in [n].

  1. If c<1, then \max_v|C_v|= O(\log n) in probability.
  2. If c>1, then \max_v|C_v|\sim (1-\rho) n in prob­a­bil­ity, for some 0<\rho<1.
  3. If c=1, then \max_v|C_v|\sim n^{2/3} in distribution.

In the fol­low­ing lec­tures, we will aim to prove at least parts 1 and 2.

The explo­ration process

How to study |C_v|? We will explore C_v by start­ing an “explo­ration process” at v that moves around C_v until all its sites have been vis­ited. This walk will be con­structed so that it hits each site once. So, the time it takes to explore all of C_v is exactly |C_v|. As a con­se­quence, study­ing |C_v| reduces to study­ing a hit­ting time of a cer­tain ran­dom walk, and to the lat­ter we can apply mar­tin­gale theory.

At each time t=0,1,2,\ldots, we main­tain three sets of vertices:

    \begin{eqnarray*} 	R_t &=& \{\text{removed sites}\},\\ 	A_t &=& \{\text{active sites}\},\\  	U_t &=& \{\text{unexplored sites}\}. \end{eqnarray*}

Below is an illus­tra­tion of how these sets are updated on a sim­ple example.

  • At t=0, ini­tial­ize A_0=\{v\}, U_0=[n]\backslash \{v\} and R_0=\varnothing. Namely, only v is active, all the ver­tices other than v are unex­plored, and no ver­tices have been removed.

    Rendered by QuickLaTeX.com

  • At t=1, update A_1=A_0\backslash \{v\}\cup \{\text{neighbors of } v\}, U_1=U_0\backslash A_1 and R_1=\{v\}. Namely, all neigh­bors of v are moved from the unex­plored set to the active set, and v itself is removed.

    Rendered by QuickLaTeX.com

  • At t=2, pick some x\in A_1 and update A_2=A_1\backslash \{x\}\cup \{\text{unexplored neighbors of } x\}, U_2=U_1\backslash A_2 and R_2=R_1\cup \{x\}. Namely, all unex­plored neigh­bors of x are moved into the active set, and x itself is removed.

    Rendered by QuickLaTeX.com

  • At time t, we pick some ver­tex x from the cur­rent active set A_{t-1}, acti­vate all unex­plored neigh­bors of x and remove x itself.

This is a sort of local search along the con­nected com­po­nent C_v: much like play­ing a game of Minesweeper! At each t, the choice of x\in A_{t-1} can be made arbi­trar­ily (e.g., select­ing the ver­tex with the small­est index or ran­domly select­ing a ver­tex in A_{t-1}). The only require­ment is that it is nonan­tic­i­pat­ing (only depend­ing on the edges vis­ited up to time t). For exam­ple, we can­not pick the ver­tex in A_{t-1} which has the largest num­ber of unex­plored neigh­bors, as this choice relies on unex­plored edges.

A for­mal descrip­tion of the “explo­ration process”:

  • Ini­tial­ize A_0=\{v\}, U_0=[n]\backslash \{v\} and R_0=\varnothing.
  • For t\geq 0, we set

        \begin{eqnarray*} 	R_{t+1} &=& R_t\cup \{i_t\},\\ 	A_{t+1} &=& A_t\backslash \{i_t\}\cup \{w\in U_t: \eta_{wi_t}=1\},\\ 	U_{t+1} &=& U_t\backslash \{w\in U_t: \eta_{wi_t}=1\}, \end{eqnarray*}

    where i_t\in A_t is a nonan­tic­i­pat­ing but oth­er­wise arbi­trary choice.

This process stops when there are no more active ver­tices. It hits each ver­tex in C_v once and only once. At each time t, we remove one ver­tex in C_v. So the stop­ping time is exactly equal to the size of C_v:

    \[ \tau\equiv \inf\{t: |A_t|=0 \} = |C_v|. \]

So, we only need to study the stop­ping time \tau.

Recall that \eta_{ij} indi­cates whether there is an edge between ver­tices i and j, and \eta_{ij}\overset{\mathrm{i.i.d.}}{\sim} \text{Bernoulli}(\frac{c}{n}). By construction,

    \[ |A_{t+1}| = |A_t| - 1 + \sum_{w\in U_t} \eta_{wi_t}. \]

Now, let’s do a thought exper­i­ment (wrong, but intu­itive). Let’s for­get for the moment that some sites were pre­vi­ously vis­ited, and assume that in each step all neigh­bors of i_t are unvis­ited still (note that when n is really large and t is rel­a­tively small, this assump­tion makes sense). Then |A_{t+1}|-|A_t|+1 is the sum of n inde­pen­dent \text{Bernoulli}(\frac{c}{n}) vari­ables, which has a \text{Binomial}(n, \frac{c}{n}) dis­tri­b­u­tion. This bino­mial vari­able is inde­pen­dent of the past because it only depends on unex­plored edges; in addi­tion, its dis­tri­b­u­tion does not depend on |A_t|. There­fore, |A_t| would be a ran­dom walk with incre­ment dis­tri­b­u­tion \text{Binomial}(n, \frac{c}{n})-1\approx\text{Poisson}(c)-1. Then, study­ing |C_v| boils down to study­ing first time a Pois­son ran­dom walk hits zero! Of course, we can­not really ignore pre­vi­ously vis­ited sites, but this rough intu­ition nonethe­less cap­tures the right idea as n\to\infty and will serve as a guid­ing prin­ci­ple for the proof.

A com­par­i­son ran­dom walk

The rea­son that |A_t| is not a ran­dom walk is that there are only |U_t| edges (not n) to explore at time t. We can arti­fi­cially cre­ate a ran­dom walk by adding n-|U_t| “fic­ti­tious” points at each time t as follows.

Let \tilde{\eta}_{v}^t be i.i.d. \text{Bernoulli}(\frac{c}{n}) for t\geq 0, v\in[n], which are inde­pen­dent of (\eta_{ij}). Define

    \[ 	S_0 = 1, \qquad S_{t+1} = S_t - 1 +  	\sum_{w\in U_t}  	\eta_{wi_t} + \sum_{w\in [n]\backslash  	U_t}\tilde{\eta}^t_{w}. \]

(When t\ge\tau, then A_t=\varnothing and thus i_t is unde­fined; in this case, we sim­ply add all n vari­ables \tilde\eta^t_w.)
Note that \sum_{w\in U_t} \eta_{wi_t} is the sum of edges from A_t to U_t. Since we have not explored U_t yet, those edges are inde­pen­dent of all edges explored up to time t (here we use that i_t is nonan­tic­i­pat­ing). We there­fore see that (S_t) is indeed a ran­dom walk with increment

    \[ S_{t}- S_{t-1} \sim \text{Binomial}(n, \tfrac{c}{n})-1. \]

More­over, since all \tilde{\eta}_{w}^t are nonnegative,

    \[ S_{t+1} - S_t \geq |A_{t+1}| - |A_t| \]

as long as t<\tau. It fol­lows that |A_t| is dom­i­nated by the gen­uine ran­dom walk S_t, that is,

    \[|A_t| \le S_t \quad\mbox{for all }t\le \tau.\]

We can now obtain bounds on |C_v| by ana­lyz­ing hit­ting times of the ran­dom walk S_t.

The sub­crit­i­cal case c<1

Define the first time the com­par­i­son walk hits zero as

    \[T\equiv\inf\{t: S_t=0\}.\]

Since |A_t|\leq S_t for t\le\tau, it is obvi­ous that

    \[|C_v|=\tau\leq T.\]

Now we study T. The intu­ition is that as \mathbb{E}[S_{t+1}-S_t]=c-1, (S_t) is a ran­dom walk with neg­a­tive drift in the sub­crit­i­cal case c<1. Thus \mathbb{P}[T<\infty]=1, and in fact the hit­ting time T has nice tails!

Lemma 2 Let c<1 and \alpha=c-1-\log c>0. Then for any pos­i­tive inte­ger k,

    \[ \mathbb{P}[T\geq k]\leq \frac{1}{c} e^{-\alpha k}. \]

We will prove this lemma below. Using the lemma, we imme­di­ately obtain:

Corol­lary 3 If c<1, then for any a>\frac{1}{\alpha}=\frac{1}{c-1-\log c}

    \[ \mathbb{P}\big[\max_v|C_v|\geq a\log n\big] \xrightarrow{n\to\infty} 0.  \]

Proof. Apply­ing the Lemma 2 and the union bound,

    \begin{eqnarray*} \mathbb{P}\big[\max_v|C_v|\geq a\log n\big] &\leq & \sum_{v\in[n]} \mathbb{P}[|C_v|\geq a\log n]\\ &\leq & \sum_{v\in[n]} \mathbb{P}[T\geq a\log n] \\ &\leq & \frac{n}{c} n^{-a\alpha} \to 0. \qquad\square \end{eqnarray*}

This corol­lary proves part 1 of The­o­rem 1. In fact, it turns out that the con­stant \frac{1}{\alpha} is tight: by using the sec­ond moment method, one can prove a match­ing lower bound on \max_v|C_v| (see, for exam­ple, the lec­ture notes by van der Hof­s­tad), which implies that in fact \max_v|C_v|\sim \frac{1}{\alpha}\log n in prob­a­bil­ity. The proof is not much more dif­fi­cult, but we pre­fer to move on to the super­crit­i­cal case.

Remark. It might seem some­what sur­pris­ing that the result we obtain is so sharp, con­sid­er­ing that we have blindly replaced |A_t| by the larger quan­tity S_t. How­ever, in going from |A_t| to S_t we do not lose as much as one might think at first sight. When n is large and t is rel­a­tively small, the excess term \sum_{w\in[n]\backslash U_t}\tilde{\eta}^t_w in the def­i­n­i­tion of S_t is zero with high prob­a­bil­ity, as most ver­tices are unex­plored and the Bernoulli suc­cess prob­a­bil­ity \frac{c}{n} of the \tilde{\eta}^t_w is very small. With a bit of work, one can show that S_t and |A_t| will actu­ally stick together for times t\lesssim\log n with prob­a­bil­ity going to one as n\to\infty. Thus, in the sub­crit­i­cal case where the ran­dom walk only lives for \sim\log n time steps, noth­ing is lost in going from |A_t| to S_t, and our rough intu­ition that |A_t| should behave like a ran­dom walk as n\to\infty is vindicated.

To wrap up the sub­crit­i­cal case, it remains to prove the lemma.

Proof of Lemma 2. By the Markov inequality,

    \[ \mathbb{P}[T\geq k] = \mathbb{P}[e^{\alpha T}\geq e^{\alpha k}] \leq e^{-\alpha k} \mathbb{E}[e^{\alpha T}].  \]

It remains to bound \mathbb{E}[e^{\alpha T}]\le\frac{1}{c}, which is a stan­dard exer­cise in mar­tin­gale theory.

Recall that

    \[S_t = 1+\sum_{k=1}^t X_k,\]

where X_k are i.i.d. Define the moment gen­er­at­ing func­tion \phi(\beta)= \log\mathbb{E}[e^{\beta X_k}], and let

    \[ M_t \equiv e^{\beta S_t-\phi(\beta)t}, \qquad \text{for }t\geq 0.  \]

Since e^{\beta S_t}= e^{\beta}\prod_{k=1}^t e^{\beta X_k} and X_t is inde­pen­dent of M_0,M_1,\cdots,M_{t-1},

    \[ 	\mathbb{E}[M_t|M_0,\cdots, M_{t-1}] = M_{t-1}\, 	\mathbb{E}[\tfrac{M_t}{M_{t-1}}|M_0,\cdots, M_{t-1}]  	= M_{t-1}\, \mathbb{E}[e^{\beta X_t-\phi(\beta)}] = M_{t-1}, \]

where we have used \mathbb{E}[e^{\beta X_k - \phi(\beta)}]=1. So (M_t) is a martingale.

In the case \beta>0 and \phi(\beta)<0,

    \[\mathbb{E}[e^{-\phi(\beta)T}] = \mathbb{E}\big[\lim_{n\to\infty} M_{T\wedge n}\big] \leq\liminf_{n\to\infty} \mathbb{E}[M_{T\wedge n}]= M_0 = e^{\beta}.\]

The inequal­ity is by Fatou’s lemma and the sec­ond equal­ity is by the optional stop­ping the­o­rem. To see the first equal­ity, note that if T<\infty, then S_{T\wedge n}\to S_T=0 and T\wedge n\to T as n\to\infty, while if T=\infty, then S_{T\wedge n}>0 for all n and T\wedge n\to\infty. Thus e^{-\phi(\beta)T}=\lim_{n\to\infty}e^{\beta S_{T\wedge n}-\phi(\beta)(T\wedge n)}=\lim_{n\to\infty}M_{T\wedge n}.

Next, we com­pute \phi(\beta). Recall that (X_k+1)\sim \text{Binomial}(n, \frac{c}{n}). It has the same dis­tri­b­u­tion as the sum of n i.i.d. \text{Bernoulli}(\frac{c}{n}) vari­ables. For Y\sim\text{Bernoulli}(p), we have \mathbb{E}[e^{\beta Y}]= 1+(e^{\beta}-1)p. There­fore,

    \begin{eqnarray*} -\phi(\beta) &=& - \log \mathbb{E}[e^{\beta X_t}]\\ &=& -\log\big( e^{-\beta} (1+(e^{\beta}-1)\tfrac{c}{n})^n \big)\\ &=& \beta - n\log\big(1+(e^{\beta}-1)\tfrac{c}{n}\big)\\ &\geq& \beta - c(e^{\beta}-1), \end{eqnarray*}

where the last inequal­ity is because -\log(1+x)\geq -x for any x. Now, by set­ting \beta=-\log c, we obtain that -\phi(\beta)\geq c-1-\log c=\alpha. Thus we have shown \mathbb{E}[e^{\alpha T}]\le\mathbb{E}[e^{-\phi(\beta)T}]\le e^\beta=\frac{1}{c}. \quad\square

The super­crit­i­cal case c>1

The goal of the fol­low­ing lec­tures is to prove part 2 of The­o­rem 1. More pre­cisely, we will prove:

The­o­rem 4 Let c>1. Then

    \[ \frac{\max_v|C_v|}{n}\xrightarrow{n\to\infty} 1 - \rho \]

in prob­a­bil­ity, where \rho is the small­est pos­i­tive solu­tion of the equa­tion \rho = e^{c(\rho-1)}. More­over, there is \beta>0 such that all but one of the com­po­nents have size \leq \beta\log n, with prob­a­bil­ity tend­ing to 1 as n\to\infty.

This the­o­rem says that with prob­a­bil­ity tend­ing to 1, there is a unique giant com­po­nent whose size is (1-\rho)n, and all other com­po­nents are small with size \leq \beta\log n.

Here we pro­vide some vague intu­ition for this the­o­rem. When c>1, the ran­dom walk (S_t) sat­is­fies \mathbb{E}[S_t-S_{t-1}]=c-1>0, i.e., (S_t) has pos­i­tive drift. Then \mathbb{P}[T<\infty]<1! In fact, the fur­ther away it starts from 0, the smaller the prob­a­bil­ity it will ever hit 0. Con­sider the two situations:

  • S_t dies quickly: this implies that the com­po­nent is small.
  • S_t lives long: then it must live very long, as once it gets far away from 0, the prob­a­bil­ity of return­ing is very small. This implies that the com­po­nent must be very large (if we pre­tend that S_t=|A_t|).

Of course, S_t is not |A_t| (obvi­ously |A_t| even­tu­ally hits 0). But the intu­ition explains that there can­not be com­po­nents of inter­me­di­ate size: given any ver­tex v, either |C_v| is small (\lesssim \log n), or it must get very large (\gtrsim n^{2/3}, say). In fact, we will find that all com­po­nents of size \geq \beta\log n must grow all the way to \gtrsim n^{2/3}. How­ever, any pair of large com­po­nents must inter­sect with high prob­a­bil­ity, as there are many poten­tial edges between them! There­fore, all ver­tices v with |C_v|\geq \beta\log n should be in the same giant com­po­nent. We then show that the num­ber of such ver­tices is (1-\rho)n with high probability.

Many thanks to Tracy Ke for scrib­ing this lecture!

06. April 2013 by Ramon van Handel
Categories: Random graphs | Leave a comment

Next lecture: April 4

Just a reminder that next week (March 21) is Spring Break, while the week after (March 28) there will be no lec­ture due to the ORFE/PACM col­lo­quium of Elchanan Mos­sel.

The next Sto­chas­tic Analy­sis Sem­i­nar lec­ture will be on April 4. We will start fresh with a new topic: the study of the giant com­po­nent of an Erdös-Rényi graph.

15. March 2013 by Ramon van Handel
Categories: Stochastic analysis seminar | Leave a comment

Lecture 3. Chromatic number

Let {C} be set of inte­gers that rep­re­sent col­ors. A ver­tex col­or­ing of a graph {G=([n], E)} is an assign­ment {c\,:\,\{1, \ldots, n\} \rightarrow C} of a color {c(i) \in C} to each ver­tex {i \in [n]}. Fur­ther­more a ver­tex col­or­ing {c} is said to be proper if {c(i)\neq c(j)} for all {(i,j) \in E}. The chro­matic num­ber {\chi(G)} of a graph {G} is the small­est num­ber {|C|} of col­ors needed for a proper ver­tex col­or­ing to exist. In this lec­ture, any col­or­ing is a proper ver­tex col­or­ing. All logs are in base {2}.

The­o­rem [Bol­lobas, 1988]
The chro­matic num­ber {\chi(G)} of a ran­dom graph {G\sim G(n,\frac12)} satisfies

\displaystyle \chi(G) =\frac{n}{2\log n}(1+o_P(1))

1. Basic properties

In this sec­tion, we begin by review­ing some basic prop­er­ties of the chro­matic num­ber, and in par­tic­u­lar how it relates to inde­pen­dent sets. Recall that an inde­pen­dent set {S \subset [n]} of a graph {G=([n], E)} is a set of ver­tices such that {i,j \in S \Rightarrow (i,j) \notin E}. Inde­pen­dent sets are some­times called sta­ble sets. Denote by {\bar G=([n], \bar E)} the com­ple­ment graph of {G} where {(i,j) \in \bar E} iff {(i,j) \notin E}. If {S} is an inde­pen­dent set of {G}, then it is a clique in {\bar G}. The largest inde­pen­dent set in {G} is called inde­pen­dence num­ber of {G} and denoted by {\alpha(G)}.

Propo­si­tion For any graph {G=([n],E)} with inde­pen­dence num­ber {\alpha(G)}, clique num­ber {\omega(G)} and chro­matic num­ber {\chi(G)} the fol­low­ing holds

  1. {1\le \chi(G) \le n}
  2. {\chi(G) \ge \omega(G)}
  3. {n \le \chi(G)\alpha(G)}

Proof: Trivial. \Box

2. Chro­matic, clique and inde­pen­dence numbers

Our goal is to study the chro­matic num­ber of a ran­dom graph {G \sim G(n,\frac12)}. It actu­ally builds upon the clique num­ber that we stud­ied dur­ing last lec­ture. Indeed, we will use the fol­low­ing obser­va­tion: if {G \sim G(n,\frac12)} then so does the com­ple­ment graph {\bar G} of {G}. It implies that the inde­pen­dence num­ber {\alpha(G)} has the same dis­tri­b­u­tion as the clique num­ber {\omega(G)}. We know from last lec­ture that {\omega(G)=(2\log n)(1+o(1))}. There­fore, it fol­lows from the above propo­si­tion that

\displaystyle \chi(G) \ge \frac{n}{\alpha(G)}= \frac{n}{2\log n}(1+o(1))\,.

Our main the­o­rem sug­gests that this sim­ple bound is, in fact, tight. Prov­ing this requires a stronger lower bound on clique of a ran­dom graph.

Define {m=\lfloor n/\ln^2 n\rfloor} and let {V \subset [n]} be a sub­set of {|V|=m} ver­tices and observe that induced sub­graph {G_V=(V, E \cap [m]^2)} restricted to ver­tices in {V} has dis­tri­b­u­tion {G(m,\frac12)}.

Define

\displaystyle \bar \alpha_m=\min_{\substack{V\subset [n]\\ |V|=m}} \alpha(G_{V})

That is: every {m} ver­tex sub­graph con­tains an inde­pen­dent set of size {\bar \alpha_m}.

Con­sider now the fol­low­ing col­or­ing pro­ce­dure to pro­duce a proper col­or­ing. Pull out inde­pen­dent sets of size {\bar \alpha_m} one by one and give each a dis­tinct color. Accord­ing to the def­i­n­i­tion of {\bar \alpha_m}, as long as there are at least {m} ver­tices, this is pos­si­ble. If there are less than {m} ver­tices left, give each a dis­tinct color.

The above algo­rithm gives one pos­si­ble way of col­or­ing graph {G} and thus the out­put num­ber is an upper bound on the chro­matic num­ber {\chi(G)}. Let us now com­pute this upper bound. By def­i­n­i­tion of {\bar \alpha_m}, we have

\displaystyle \chi(G)\le \Big\lceil \frac{n-m}{\bar \alpha_m}\Big\rceil + m \le \frac{n}{\bar \alpha_m} +m\,. \ \ \ \ \ (1)

There­fore, to find an upper bound on {\chi(G)}, we need to find a lower bound on {\bar \alpha_m} for an appro­pri­ately cho­sen {m}. To that end, observe that

\displaystyle \begin{array}{rcl} \mathbb{P}(\bar \alpha_m<k)&\le& \sum_{\substack{V\subset [n]\\ |V|=m}} \mathbb{P}(\alpha(G_{V})<k)\\ &\le& {n \choose m} \max_{\substack{V\subset [n]\\ |V|=m}}\mathbb{P}(\omega(G_{V})<k)\,. \end{array}

There­fore, we need to prove a bound on {\mathbb{P}(\omega(G_{V})<k)}, uni­formly in {V}. This can be done using con­cen­tra­tion prop­er­ties of a suit­able martingale.

3. Azuma-Hoeffding inequality

Mar­tin­gales have use­ful con­cen­tra­tion prop­er­ties. Specif­i­cally the fol­low­ing lemma holds.

Lemma [Azuma-Hoeffding inequal­ity.]
Let {(\Omega, \mathcal{F}, \{\mathcal{F}_i\}_{i \ge 0}, \mathbb{P})} be a fil­tered space and let {X} be a ran­dom vari­able on {(\Omega, \mathcal{F}, \mathbb{P})}. Assume that the mar­tin­gale {X_i=\mathbb{E}[X|\mathcal{F}_i], i\ge 0} is such that for all {i \ge 1},

\displaystyle |X_{i}-X_{i-1}|\le 1\,, \qquad \text{a.s.}

Then for any pos­i­tive inte­ger {N} and any {t>0}, it holds,

\displaystyle \mathbb{P}(X_N-X_0\ge t) \le \exp\Big(-\frac{t^2}{2N}\Big)

and

\displaystyle \mathbb{P}(X_N-X_0\le -t) \le \exp\Big(-\frac{t^2}{2N}\Big)

Proof: We assume the fol­low­ing inequal­ity due to Hoeffd­ing. For any ran­dom vari­able {Z} such that {|Z| \le 1} a.s. and any {s>0}, it holds

\displaystyle \mathbb{E}[e^{sZ}]\le e^{\frac{s^2}{2}}\,.

Now, using a Cher­noff bound, we get for any {s>0},

\displaystyle \begin{array}{rcl} \mathbb{P}(X_N-X_0>t)&=&\mathbb{P}\Big(\sum_{i=1}^N\{X_i-X_{i-1}\}>t\Big)\\ &\le& e^{-st}\mathbb{E}\exp\Big(s\sum_{i=1}^N\{X_i-X_{i-1}\}\Big)\\ &=&e^{-st}\mathbb{E}\prod_{i=1}^N\exp\Big(s\{X_i-X_{i-1}\}\Big)\\ &=&e^{-st}\mathbb{E}\prod_{i=1}^N\mathbb{E}\Big[\exp\Big(s\{X_i-X_{i-1}\}\Big)\Big|\mathcal{F}_{i-1}\Big]\\ \end{array}

where {\mathcal{F}_{i}=\sigma(X_1, \ldots, X_i)}. The above two dis­plays yield that

\displaystyle \mathbb{P}(X_N-X_0>t)\le \exp\Big(\inf_{s>0}\Big\{\frac{Ns^2}{2} -st\Big\}\Big)=\exp\Big(-\frac{t^2}{2N}\Big)\,.

The sec­ond part of the proof fol­lows by apply­ing the same argu­ment to the mar­tin­gale {(-X_i)_{i\ge 0}}. \Box

Con­sider the lex­i­co­graphic order­ing of {[n]^2} and let {E_1, \ldots, E_N \in \{0,1\}} be {N={n \choose 2}} iid Bernoulli ran­dom vari­ables with para­me­ter {1/2} indi­cat­ing the pres­ence of an edge in a graph {G\sim G(n,\frac12)}. Define the {\sigma}–alge­bra {\mathcal{F}_i=\sigma(E_1, \ldots E_i), i=1, \ldots, N}. We are going to apply the Azuma-Hoeffding inequal­ity to the (Doob) mar­tin­gale {Y_i=\mathbb{E}[Y|\mathcal{F}_i]\,, \quad i=1, \ldots, N} where {Y} is max­i­mal num­ber of edge-disjoint {k}–cliques in {G}.

Note first that {|Y_i-Y_{i-1}|\le 1} a.s., since adding/removing one edge can add/remove at most one edge dis­joint {k}–clique. More­over, this inequal­ity would not hold if we had cho­sen {Y} to be the num­ber cliques of size {k} (not nec­es­sar­ily dis­joint). Indeed, adding or remov­ing an edge can cre­ate or destroy many over­lap­ping cliques.

It fol­lows that,

\displaystyle \begin{array}{rcl} \mathbb{P}[\omega(G)<k]&=&\mathbb{P}[Y=0]\\ &=&\mathbb{P}[Y-\mathbb{E}[Y]=-\mathbb{E}[Y]]\\ &=&\mathbb{P}[Y_N-Y_0=-\mathbb{E}[Y]]\\ \end{array}

There­fore, apply­ing the Azuma-Hoeffding inequal­ity, we get

\displaystyle \mathbb{P}[\omega(G)<k]\le e^{-\frac{(\mathbb{E}[Y])^2}{2N}}\,. \ \ \ \ \ (2)

It remains to show that {\mathbb{E}[Y]} is suf­fi­ciently large. Note first that {Y \ge X}, where is {Y} is the num­ber of {k}–cliques that do not share an edge with any other {k}–clique of the graph.

4. A strong lower bound on the clique number

Fix pos­i­tive inte­gers {m} and

\displaystyle k=(2-\varepsilon+o(1))\log m

and let {X} denote the num­ber of {k} cliques in {G \sim G(m,\frac12)} that do not share an edge with any other {k}–clique of the graph. Then

\displaystyle \mathbb{E}[X]\ge m^{(2-\varepsilon)a+o(1)}

for all {a\in (0,1)}. Here the asymp­totic nota­tion nota­tions {o(1)} are as {m \rightarrow \infty}.

Proof: Define {M=\left\lceil 2^ak2^{\frac{k-1}{2}}\right\rceil\ge 4k} (for {m} large enough) for some {a \in (1,2)} to be cho­sen later.

More­over, note that for {m} large enough,

\displaystyle M\le 8m^{1-\frac{\varepsilon}2 +o(1)}\le m

so that {X \ge X'} where {X'} is the num­ber of {k} cliques in {G \sim G(M,\frac12)} that do not share an edge with any other {k}–clique of the graph {G}. Let {G \sim G(M, \frac12)}.

For any {S \in [M]} define the indicator:

\displaystyle I_S=1\{S \text{ is a clique of size} \ k \ \text{in} \ G\}.

More­over for any set {S}, let {Z(S)} denote the num­ber of {k}–cliques in {G}, other than {S} itself, that share at least two ver­tices with {S}. Observe that

\displaystyle \begin{array}{rcl} \mathbb{E}[X']&=&\mathbb{E}\sum_{\substack{S \subset [M] \\ |S| =k}}I_S1\{Z(S)=0\}\\ &=&{M\choose k}\mathbb{P}(I=1, Z=0)\qquad \text{where } I=I_{[k]}\,, \ Z=Z([k])\\ &=&{M \choose k}\Big(\frac12\Big)^{k \choose 2}\mathbb{P}(Z=0|I=1)\,, \end{array}

We bound {\mathbb{P}(Z=0|I=1)} as follows

\displaystyle \begin{array}{rcl} \mathbb{P}(Z=0|I=1)&=&1-\mathbb{P}(Z\ge 1|I=1)\\ &\ge& 1-\mathbb{E}(Z|I=1)\\ &=&1-\sum_{\substack{S \subset [M], |S|=k\\ |S\cap [k]| \ge 2}}\mathbb{P}(I_S=1)\\ &=&1-\sum_{s=2}^{k-1}\underbrace{{k\choose s}{M-k \choose k-s}\Big(\frac12\Big)^{{k\choose 2}-{s\choose 2}}}_{F_s}\\ \end{array}

The fol­low­ing bound holds for all {2\le s \le k/2},

\displaystyle \begin{array}{rcl} \frac{F_s}{F_2}&=&\frac{(M-2k+2)!}{(M-2k+s)!}\Big[ \frac{(k-2)!}{(k-s)!} \Big]^2\frac2{s!}2^{\frac{(s+1)(s-2)}{2}}\\ &\le& \frac2{s!}\Big(\frac{(k-2)^22^{\frac{s+1}2}}{M-2k+2} \Big)^{s-2}\\ &\le&\frac1{(s-2)!}\Big(\frac{2k^22^{\frac{k/2+1}{2}}}{M} \Big)^{s-2}\qquad \text{for} \ M\ge 4k\\ &\le&\frac1{(s-2)!}(2^{2+a}k2^{-\frac{k}4})^{s-2} \end{array}

It yields

\displaystyle \begin{array}{rcl} \sum_{2\le s \le k/2}\frac{F_s}{F_2}&\le & \exp\Big(2^{2+a}k2^{-\frac{k}4}\Big)=1+o(1)\\ \end{array}

More­over, for {s> k/2}, {M \ge k}, it holds

\displaystyle \begin{array}{rcl} {M-k \choose k-s}&\le& {M-k \choose s}\frac{s!(M-k-s)!}{(k-s)!(M-2k+s)!}\\ &\le& {M-k \choose s}\Big(\frac{s}{M-k+s}\Big)^k\\ &\le & {M-k \choose s}\Big(\frac{k}{M-k/2}\Big)^{k}\\ \end{array}

Hence,

\displaystyle \begin{array}{rcl} F_s&=& {k\choose k-s}{M-k \choose k-s}\Big(\frac12\Big)^{{k\choose 2}-{s\choose 2}}\\ &\le& F_{k-s}\Big(\frac{k}{M-k/2}\Big)^{k}\Big(\frac12\Big)^{{k-s\choose 2}-{s\choose 2}}\\ &\le & F_{k-s}\Big(\frac{k2^{\frac{k-1}2}}{M-k/2}\Big)^{k}\\ &\le & F_{k-s}\Big(\frac{2^{-a}M}{M-k/2}\Big)^{k}\\ &\le &F_{k-s}o(1)\,. \end{array}

There­fore,

\displaystyle \sum_{k/2\le s \le k-1}\frac{F_s}{F_2}\le \sum_{2\le s \le k/2}\frac{F_s}{F_2}o(1)=o(1)\\

Thus,

\displaystyle \mathbb{P}(Z=0|I=1)\ge 1-F_2(1+o(1))\,.

More­over,

\displaystyle \begin{array}{rcl} F_2&=&2{k \choose 2}{M-k \choose k-2} \frac{1}{2^{k \choose 2}}\\ &=& k^2\Big(\frac{M-k}{k-2}\Big)^{k-2}2^{-\frac{k(k-1)}{2}}(1+o(1))\\ &=& \frac{k^2}{M^2}\Big(\frac{M}{k2^{\frac{k-1}{2}}}\Big)^{k}(1+o(1))\\ &=& \frac{k^22^{ak}}{M^2}(1+o(1))\\ &\le&\frac{2}{2^{2a}}\Big(\frac{2^a}{2}\Big)^k(1+o(1))=o(1)\,, \end{array}

since {2^a<2}.

To con­clude the proof, observe that by Stir­ling, since {k=o(n)},

\displaystyle \begin{array}{rcl} {M \choose k}\Big(\frac12\Big)^{k \choose 2}&\ge&\Big(\frac{M}{k2^{\frac{k-1}{2}}}\Big)^k(1+o(1))\\ &=&2^{ak}(1+o(1))=m^{(2-\varepsilon) a+o(1)}\,. \end{array}

\Box

Let {G \sim G(m, \frac12)} and observe that

\displaystyle (2-\varepsilon+o(1))\log m=(2-\varepsilon+o(1))\log n

We can there­fore apply the pre­vi­ous propo­si­tion together with~(2), to get

\displaystyle \mathbb{P}[\omega(G)<(2-\varepsilon+o(1))\log n]\le \exp\Big(-\frac{m^{2(2-\varepsilon)a+o(1)}}{n(n-1)}\Big)  

5. Proof of the main theorem

Since

\displaystyle {n \choose m}\le 2^n=2^{m^{1+o(1)}}\,,

we get for {k=(2-\varepsilon+o(1))\log n},

\displaystyle \begin{array}{rcl} \mathbb{P}(\bar \alpha_m<k)&\le &{n \choose m} \exp\Big(-\frac{m^{2(2-\varepsilon)a+o(1)}}{n(n-1)}\Big)\\ &\le& \exp\Big(m^{1+o(1)}(\ln 2)-\frac{m^{2(2-\varepsilon)a+o(1)}}{n(n-1)}\Big)=o(1)\,. \end{array}

Thus {\bar \alpha_m \ge (2\log n)(1+o_P(1))}. Together with~(1), it yields

\displaystyle \begin{array}{rcl} \chi(G)&\le&\frac{n}{k}(1+o_P(1))+m \\ &=&\frac{n}{2\log n}(1+o_P(1))+o\Big(\frac{n}{\log n}\Big)\\ &=&\frac{n}{2\log n}(1+o_P(1)) \end{array}

Which com­pletes the proof of our theorem.

Lec­ture and scrib­ing by Philippe Rigol­let

14. March 2013 by Philippe
Categories: Random graphs | Leave a comment

Lecture 2. Clique number

In this lec­ture all logs are in base {2}. We will prove the following.

The­o­rem For the cen­tered and nor­mal­ized clique number

\displaystyle \frac{\omega\left(G(n, \frac12)\right) - 2 \log n}{\log \log n}\xrightarrow{n\to\infty}-2 \quad\text{in probability.}

That is, as n\to\infty, the clique num­ber of the Erdös-Rényi graph G(n,\frac12) is \omega\left(G(n, \frac12)\right)=2\log n-(2+o(1))\log\log n.

Proof: The proof is divided into two parts. First, we show that the clique num­ber can­not be too big using a union bound. Then we show that the clique num­ber can­not be too small using the sec­ond moment method. In the fol­low­ing, G_n denotes an Erdös-Rényi graph G(n,\frac12).

For the first part one has

\displaystyle \begin{array}{rcl} {\mathbb P}(\omega(G_n) \geq k) & = & {\mathbb P}( \exists S \subset [n], \; \text{s.t.} \; |S|=k \; \text{and} \; S \; \text{is a clique in} \; G_n) \\ & \leq & \sum_{S \subset [n], |S|=k} {\mathbb P}(S \; \text{is a clique in} \; G_n) \notag \\ & = & {n \choose k} \left(\frac12\right)^{{k \choose 2}} \\ & \leq & \left(\frac{e n}{k} \right)^k 2^{- \frac{k(k-1)}{2}} \\ & = & 2^{k \left( \log n - \log k + \log e- \frac{k-1}{2}\right)}. \end{array}

Thus, choos­ing k=2\log n - (2 - \epsilon) \log \log n, we obtain

{\mathbb P}(\omega(G_n) \geq 2\log n - (2 - \epsilon) \log \log n) \le n^{ - \frac{\epsilon}{2} \log \log n + \frac{1}{2} + \log e} \xrightarrow{n \rightarrow \infty} 0

for every {\epsilon >0}, where we used that k\ge\log n for large n.

For the sec­ond part it is use­ful to intro­duce some nota­tion. Define

\displaystyle {X_S = 1\{S \; \text{is a clique in} \; G_n\}},\qquad {Y_k = \sum_{S \subset [n], |S|=k} X_S}.

In par­tic­u­lar, {\omega(G_n) < k} iff {Y_k=0}. Thus we want to show that for {k=2\log n - (2 + \epsilon) \log \log n} one has {{\mathbb P}(Y_k = 0) \to 0}. Using the triv­ial obser­va­tion that x=0 implies (x-y)^2=y^2, we get

\displaystyle {\mathbb P}(Y_k=0) \leq {\mathbb P}((Y_k - {\mathbb E} Y_k)^2 = ({\mathbb E} Y_k)^2) \leq \frac{\text{Var}(Y_k)}{({\mathbb E} Y_k)^2}

where we have used Markov’s inequal­ity. Note that by lin­ear­ity of the expec­ta­tion, {\mathbb E} Y_k = {n \choose k} \left(\frac12\right)^{k\choose 2}. Fur­ther­more, we can write

\displaystyle \text{Var}(Y_k) = \sum_{S \subset [n], |S|=k} \text{Var}(X_S) + \sum_{S, T \subset [n], |S|=|T|=k, S \neq T} \{ {\mathbb E} X_S X_T - ({\mathbb E} X_S) ({\mathbb E} X_T) \}.

As {X_S} are boolean ran­dom vari­ables we have {\text{Var}(X_S) \leq {\mathbb E} X_S} and thus

\displaystyle \frac{\sum_{S \subset [n], |S|=k} \text{Var}(X_S)}{\left( {\mathbb E} Y_k \right)^2} \leq \frac{\sum_{S \subset [n], |S|=k} {\mathbb E} X_S}{\left( {\mathbb E} Y_k \right)^2} = \frac{1}{{n \choose k} \left(\frac12\right)^{k\choose 2}},

which tends to {0} for {k=2\log n - (2 + \epsilon) \log \log n} (see the first part of the proof and use the inequal­ity {{n \choose k} \geq \left(\frac{n}{k}\right)^k}). Thus it remains to show that the fol­low­ing quan­tity tends to {0}:

\displaystyle \frac{\sum_{S, T \subset [n], |S|=|T|=k, S \neq T} \{{\mathbb E} X_S X_T - ({\mathbb E} X_S) ({\mathbb E} X_T)\}}{\left({\mathbb E} Y_k\right)^2} .

First note that, by the inde­pen­dence of the edges, for {S, T} with {|S \cap T| \leq 1} we have that {X_S} and {X_T} are inde­pen­dent, so that in the numer­a­tor of the above quan­tity one can restrict to {S, T} with {|S \cap T| \geq 2}. Now by an ele­men­tary rea­son­ing we have (with {S_0} being an arbi­trary sub­set of {k} vertices)

\displaystyle \begin{array}{rcl} && \sum_{S, T \subset [n], |S|=|T|=k, S \neq T, |S \cap T| \geq 2} {\mathbb E} X_S X_T \\ && = \sum_{S, T \subset [n], |S|=|T|=k, S \neq T, |S \cap T| \geq 2} {\mathbb P}(X_S = 1 \; \text{and} \; X_T = 1) \\ && = \sum_{S, T \subset [n], |S|=|T|=k, S \neq T, |S \cap T| \geq 2} {\mathbb P}(X_S = 1) {\mathbb P}(X_T =1 | X_S = 1) \\ && = \left( \sum_{T \subset [n], |T|=k, T \neq S_0, |S_0 \cap T| \geq 2} {\mathbb P}(X_T = 1 | X_{S_0} = 1) \right) \left(\sum_{S \subset [n], |S|=k} {\mathbb P}(X_S=1) \right) \\ && = \left( \sum_{T \subset [n], |T|=k, T \neq S_0, |S_0 \cap T| \geq 2} {\mathbb P}(X_T = 1 | X_{S_0} = 1) \right) \left({n \choose k} {\mathbb E} X_S \right) . \end{array}

Thus we are now left with prov­ing that the fol­low­ing quan­tity goes to {0}:

\displaystyle \frac{\sum_{T \subset [n], |T|=k, T \neq S_0, |S_0 \cap T| \geq 2} {\mathbb P}(X_T = 1 | X_{S_0} = 1)}{{n \choose k} \left(\frac12 \right)^{- {k \choose 2}}} . \ \ \ \ \ (1)

Clearly one has

\displaystyle {\mathbb P}(X_T = 1 | X_{S_0} = 1) = \left( \frac12 \right)^{{k \choose 2} - {|T \cap S_0| \choose 2}} ,

which shows that (1) can be rewrit­ten as

\displaystyle \sum_{s=2}^{k-1} \frac{{k \choose s} {n-k \choose k-s}}{{n \choose k}} 2^{{s \choose 2}} . \ \ \ \ \ (2)

Now note that since

\displaystyle {n-1 \choose k-1} = \frac{k}{n} {n \choose k} ,

one has

\displaystyle {n-k \choose k-s} \leq {n-s \choose k-s} = \left( \prod_{\alpha=1}^s \frac{k-\alpha}{n-\alpha} \right) {n \choose k} \leq \left(\frac{k}{n}\right)^s {n \choose k} .

Using { {k \choose s} \leq \left(\frac{e k}{s}\right)^s} one obtains that (2) is bounded from above by

\displaystyle \sum_{s=2}^{k-1} \left(\frac{k}{s}\right)^s \left(\frac{k}{n}\right)^s 2^{{s \choose 2}} = \sum_{s=2}^{k-1} 2^{s \left(2 \log k - \log n + \frac{s}{2} - \log s + \log e - \frac12\right)} .

As \frac{s}{2}-\log s is a con­vex func­tion and as {2 \leq s \leq k}, one has {\frac{s}{2} - \log s \leq \max(0, \frac{k}{2} - \log k)}. Thus for {k} large enough the expo­nent in the above dis­play is bounded from above by {\log k - \log n + \frac{k}{2} + c}, and for {k=2\log n - (2 + \epsilon) \log \log n} this lat­ter is bounded by {- \frac{\epsilon}{2} \log \log n + c}. Thus we proved that (2) is bounded from above by

\displaystyle \sum_{s=2}^{k-1} 2^{ - s (\frac{\epsilon}{2} \log \log n - c) } ,

which tends to {0} as {n} tends to infin­ity, con­clud­ing the proof. \Box

07. March 2013 by Sebastien Bubeck
Categories: Random graphs | Leave a comment

Time change for Spring 2013

Wel­come to the sto­chas­tic analy­sis sem­i­nar blog! We are exper­i­ment­ing with this blog for the first time in the Spring 2013 semes­ter. The inten­tion is to post notes from the lec­tures here: we are grate­ful to sev­eral grad­u­ate stu­dents who have gra­ciously vol­un­teered to serve as scribes. In addi­tion, any announce­ments or orga­ni­za­tional mat­ters will be posted to this blog. Infor­ma­tion on pre­vi­ous semes­ters can be found here.

Time change: In order to avoid con­flicts with a num­ber of sem­i­nars, the sto­chas­tic analy­sis sem­i­nar will be held on Thurs­days, 4:50–6:20, Bend­heim Cen­ter 103 start­ing this week. That is, we start and end 20 min­utes later than orig­i­nally announced for the remain­der of this semester.

05. March 2013 by Ramon van Handel
Categories: Stochastic analysis seminar | Leave a comment

← Older posts