See discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/305483738
Joint large deviation result for empirical measures of the coloured random
geometric graphs
Article  in  SpringerPlus · December 2016
DOI: 10.1186/s40064-016-2718-z
CITATIONS READS
2 57
1 author:
Kwabena Doku-Amponsah
University of Ghana
41 PUBLICATIONS   59 CITATIONS   
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
Joint Large deviation Principles for d-regular random graphs View project
All content following this page was uploaded by Kwabena Doku-Amponsah on 23 July 2016.
The user has requested enhancement of the downloaded file.
Doku‑Amponsah  SpringerPlus  (2016) 5:1140 
DOI 10.1186/s40064‑016‑2718‑z
RESEARCH Open Access
Joint large deviation result for empirical 
measures of the coloured random  
geometric graphs
Kwabena Doku‑Amponsah* 
*Correspondence:  
kdoku‑amponsah@ug.edu.gh Abstract 
Statistics Department, We prove joint large deviation principle for the  empirical pair measure and empirical 
University of Ghana,  
Box LG 115, Legon, Ghana locality measure of the near intermediate coloured random geometric graph models on 
n points picked uniformly in a d‑dimensional torus of a unit circumference. From this 
result we obtain large deviation principles for the number of edges per vertex, the degree 
distribution and the proportion of isolated vertices  for the near intermediate random 
geometric graph models.
Keywords: Random geometric graph, Erdős–Rényi graph, Coloured random 
geometric graph, Typed graph, Joint large deviation principle, Empirical pair measure, 
Empirical measure, Degree distribution, Entropy, Relative entropy, Isolated vertices
Mathematics Subject Classification: 60F10, 05C80, 68P30
Background
In this article we study the coloured geometric random graph CGRG, where n points or 
vertices or nodes are picked uniformly at random in [0, 1]d , colours or spins are assigned 
independently from a finite alphabet  and any two points with colours a1, a2 ∈  dis-
tance at most rn(a1, a2) apart are connected. This random graph models, which has the 
geometric random graph (see Penrose 2003) as special case, has been suggested by Can-
nings and Penman (2003) as a possible extension to the coloured random graph studied 
in Biggins and Penman (2009), Doku-Amponsah and Mörters (2010), Doku-Amponsah 
(2006), Bordenave and Caputo (2015), Mukherjee (2014) and Doku-Amponsah (2014).
The connectivity radius rn plays similar role as the connection probability pn in the 
coloured random graph model. Several large deviation results about the coloured ran-
dom graphs and hence Erdős–Rényi graph have been established recently. See O’Connell 
(1998), Biggins and Penman (2009), Doku-Amponsah and Mörters (2010), Doku-
Amponsah (2006, 2014), Bordenave and Caputo (2015), Mukherjee (2014).
Until recently few or no large deviation result about the CGRG have been found. 
Doku-Amponsah (2015) proved joint large deviation principle for empirical pair meas-
ure and the empirical locality measure of the CGRG, where n points are uniformly cho-
sen in [0, 1]d, colours or spins are assigned by drawing without replacement from the 
© 2016 The Author(s). This article is distributed under the terms of the Creative Commons Attribution 4.0 International License 
(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, 
provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and 
indicate if changes were made.
Doku‑Amponsah S pringerPlus ( 2016)5 :1140 Page 2 of 16
pool of, say, nνn(a1) colours, and nωn(a1, a2) edges, a1, a2 ∈ , are randomly inserted 
among the points for some colour law νn: � → [0, 1] and edge law ωn: � ×� → [0,∞).
This article presents a full joint large deviation principle (LDP) for the empirical pair 
measure and the empirical locality measure of the CGRG. Refer to (Doku-Amponsah 
and Mörters) for similar result for the coloured random graphs. From this large devia-
tion results we obtain LDPs for graph quantities such as number of edges per vertex, the 
degree distribution and the proportion of isolated vertices of geometric random graphs in 
the intermediate case. Our results are similar to those in O’Connell (1998), Biggins and 
Penman (2009), Doku-Amponsah and Mörters (2010), Doku-Amponsah (2006, (2014), 
Bordenave and Caputo (2015) and Mukherjee (2014) for the Erdö–Renyi graph except 
that the rate functions of the LDPs in our current setting is bigger as a result of the effect 
of the geometric in the model.
As a first step in the proof of our main result, we obtain a joint LDP for the empiri-
cal colour measure and empirical pair measures for the CGRG, see Theorem  4, by 
the exponential change-of-measure techniques and coupling argument. See example 
Doku-Amponsah and Mörters (2010) or Doku-Amponsah (2006). In the next step, we 
use Biggins (2004,  Theorem  5(b)) to mix Theorem  4 and the result (Doku-Amponsah 
2015,  Theorem  2.1) to obtain the full joint LDP for empirical pair measure and the 
empirical locality measure of CGRG model. Refer to Doku-Amponsah and Mörters 
(2010) or Doku-Amponsah (2006) for further illustration of this method.
Our main motivation for studying this model are in two folds.
Independence testing  Consider CGRG which is a model for Wireless Sensor Network 
as a very big dataset comprising the typed sites and the bonds between sites. One inter-
esting question to ask is how many bits will be required to code the n sites and the bonds 
between sites with high probability? Then, an asymptotic equipartition property (AEP) 
for the WSN will answer this question and our LDP for the empirical measures of the 
CGRG will play a crucial in the prove of the AEP. Further, we can test whether a received 
codeword yn of WSN is jointly typical with a candidate sent codeword xn of WSN. The 
probability that two independent sequences (xn, yn) (xn being a codeword other than 
what was sent when yn was received) actually appear as dependent is bounded asymptot-
ically as 2−nI , where the AEP is used to obtain the bound. See Doku-Amponsah (2016) 
for more on this application.
Hypothesis testing  One of the standard problems in statistics is to decide between two 
alternative explanations for the data are observed. For example, a transmitter will send 
an information on the WSN bits by bits in communication systems. There are two pos-
sible cases for each transmission: one is that bit 0 of WSN data is sent (noted as event 
H0 ) and the other is that bit 1 of WSN data is sent (noted as event H1). In the receiver 
side, the bit y is to be received as either 0 or 1. Based on the y bit of WSN data received, 
we can make a hypothesis whether the event H0 happens (bit 0 was sent at the transmit-
ter) or the event H1 happens (i.e. bit 1 was sent at the transmitter). Of course, we may 
make mis-judgement, such as we decode that bit 0 was sent but actually bit 1 was sent. 
We need to make the probability of error in hypothesis testing as low as possible and the 
LDPs for CGRG models can help us specify the probability of error.
Doku‑Amponsah  SpringerPlus  (2016) 5:1140 Page 3 of 16
In the remainder of the paper we state and prove our LDP results. In “Statement of 
the results” section we state our LDPs, Theorem 1, Corollary 2, Corollary 3, Theorem 4, 
and Corollary 5. In “Proof of Theorem 1” section we present the proof of Theorem 4. 
In “Proof of Theorem  1” section we combine Theorem  2.1 and Doku-Amponsah 
(2014,  Theorem  2.1) to obtain the Theorem  1, using the setup and result of Biggins 
(2004) to ‘mix’ the LDPs. The paper concludes with the proofs of Corollaries 2, 3 and 5 
which are given in “Proof of Corollaries 2, 3, and 5” section.
Statement of the results
The joint LDP for empirical pair measure and empirical locality measure of CGRG
In this subsection we shall look at a more general model of random geometric graphs, 
the CGCG in which the connectivity radius depends on the type or colour or symbol or 
spin of the nodes. The empirical pair measure and the empirical locality measure are our 
main object of study.
Given a probability measure ν on  and a function rn: � ×� → (0, 1] we may define 
the randomly coloured random geometric graph or simply coloured random geometric 
graph G with n vertices as follows: Pick vertices x1, . . . , xn at random independently 
according to the uniform distribution on [0, 1]d , d ∈ N. Assign to each vertex xj colour 
σ(xj) independently according to the colour law ν. Given the colours, we join any two 
vertices xi, xj,(i �= j) by an edge independently of everything else, if
[ ]
�xi − xj� ≤ rn σ(xi), σ(xj) .
In this article we shall refer to rn(a, b), for a, b ∈  as a connection radius, and always 
consider
G = (((σ (xi), σ(xj)): i, j = 1, 2, 3, . . . , n),E)
under the joint law of graph and colour. We interpret G as coloured GRG with verti-
ces x1, . . . , xn chosen at random uniformly and independently from the vertices space 
[0, 1]2. For the purposes of this study we restrict ourselves to the near intermediate cases. 
i.e. the connection radius rn satisfies the condition nrdn (a, b) → Cd(a, b) for all a, b ∈ , 
where C 2d : � → [0,∞) is a symmetric function, which is not identically equal to zero.
For any finite or countable set  we denote by P(�) the space of probability measures, 
and by P̃(�) the space of finite measures on , both endowed with the weak topology. 
By convention we write N = {0, 1, 2, . . .}.
We associate with any coloured graph G a probability measure, the empirical colour 
measure L1 ∈ P(�), by
n
1 ∑
L1G(a) := δσ(xj)(a), for a1 ∈ �,n
j=1
and a symmetric finite measure, the empirical pair measure L2X ∈ P̃∗(�2), by
∑
L2
1
G(a, b) := [δ
2
(σ (xi),σ(xj)) + δ((σ (xj),σ(xi))](a, b), for (a, b) ∈ � .n
(i,j)∈E
Doku‑Amponsah S pringerPlus  (2016)5 :1140 Page 4 of 16
Note that the total mass the empirical pair measure is 2|E| / n. Finally we define a further 
probability measure, the empirical neighbourhood measure MG ∈ P(� × N), by
n
1 ∑
MG(a, ℓ) := δ(σ(xi),L(xj))(a, ℓ), for (a, ℓ) ∈ � × N,n
j=1
while L(x ) = (lxjj (b), b ∈ �) and lxj (b) is the number of vertices of colour b connected 
to vertex xj.
For any η ∈ P(� × �N )we denote by η1 the -marginal of η and for every 
(b, a) ∈ � ×�, let η2 be the law of the pair (a,  l(b)) under the measure η. Define the 
measure (finite), �η(·, ℓ), l(·)� ∈ P̃(� ×�) by
∑
H2(η)(b, a) := η2(a, l(b))l(b), for a, b ∈ �
l(b)∈N
and write H1(η) = η1. We define the function H: P(� × �N ) → P(�)× P̃(� ×�) by 
H(η) = (H1(η),H2(η)) and note that H(MG) = (L1G ,L2G). Observe that H1 is a continu-
ous function but H2 is discontinuous in the weak topology. In particular, in the summa-
∑
tion l(b)∈ η2(a, l(b))l(b) the function l(b) may be unbounded and so the functional N
η → H2(η) would not be continuous in the weak topology. We call a pair of measures 
(ω, η) ∈ P̃(� ×�)× P(� × �N ) sub-consistent if
H2(η)(b, a) ≤ ω(b, a), for all a, b ∈ �, (1)
and consistent if equality holds in (1). For a measure ω ∈ P̃ (�2∗ ) and a measure 
ρ ∈ P(�), we recall from (Doku-Amponsah and Mörters 2010) the rate function
H1(ω � ρ) := H(ω �Cdρ ⊗ ρ)+ �Cdρ ⊗ ρ� − �ω�,
where the measure Cdρ ⊗ ρ ∈ P̃(� ×�) is defined by Cdρ ⊗ ρ(a, b) = Cd(a, b)ρ(a)ρ(b) 
for a, b ∈ . It is not hard to see that H1(ω � ρ) ≥ 0 and equality holds if and only if 
ω = Cdρ ⊗ ρ.
For every (ω, η) ∈ P̃∗(� ×�)× P(� × N) define a probability measure 
(ω,η)
Qpoi  on 
 × N by
( )
∏
(ω,η) −
ω(a,b) 1 ω(a, b ℓ(b))
Qpoi (a, ℓ) := η1(a) e
η1(a) , for a ∈ �, ℓ ∈ N.
ℓ(b)! η1(a)
b∈�
We assume d ∈ N and write

πd/2

� � if d ≥ 2
(d+2)
�(d) = Ŵ 2

1 if d = 1,
where Ŵ is the gamma function. We now state the principal theorem in this section the 
LDP for the empirical pair measure and the empirical locality measure.
Theorem  1 Suppose that G is a CRGG with colour law ν and connection radii 
rn:  × → [0, 1] satisfying nrdn (a, b) → Cd(a, b) for some symmetric function 
Doku‑Amponsah  SpringerPlus  (2016)5 :1140 Page 5 of 16
C: � ×� → [0,∞) not identical to zero. Then, as n → ∞, the pair (L2G , MG) satisfies an 
LDP in P̃∗(� ×�)× P(� × N) with good rate function
{
(ω,η)
H(η �Q )+H(η � ν)+ 1H (ω�η ) if (ω, η) consistent and η = ω ,
J (ω, η) = poi 1 2 2 1 1 2
∞ otherwise.
H2(ω�η1) = H1(ω � η1)− �ω� log�(d)+ (�(d)− 1)�Cdη1 ⊗ η1�.
Remark 1 Note that the first three terms of the rate function is the same as the rate 
function of Doku-Amponsah and Mörters (2010, Theorem 2.1). Additionally, the extra 
term 12 (−�ω� log�(d)+ (�(d)− 1)�Cdη1 ⊗ η1�) is positive and is as a result of the 
geometric [0, 1]d we have incorporated in the model. Moreover, on typical CGRG we 
have, η1 = ν, ω = �(d)C η1 ⊗ η1 and
ℓ(b)
∏
η(a, ℓ) = ν(a) e−�(d)C (a,b)ν(b)
(�(d)Cd(a, b)ν(b))d , for all (a, ℓ) ∈ � × N.
ℓ(b)!
b∈�
Hence, for some ε we P{|MG − η� ≥ ε} → 0 as n → ∞.
We write
c 1
1(δ) := (�(d)− 1) − �δ� log�(d)
2 2
Corollary 2 Suppose D is the degree distribution of the random graph G(n, rn), where 
the connectivity radius rn ∈ (0, 1] satisfies nrdn → c ∈ (0,∞). Then, as n → ∞, D satisfies 
an LDP in the space P(N ∪ {0}) with good rate function
{
[ ( ) ]
H d � q + 1 � � log �δ�( ) 1 c
2(δ) =
�δ� 2 δ c − 2 �δ� + 2 + 1(δ), if �δ� < ∞,
∞ if �δ� = ∞, (2)
∑
where qk is a poisson distribution with parameter k,  and �δ� := ∞m=0mδ(m).
This rate function 2 compares very well with the rate function of Doku-Amponsah 
and Mörters (2010, Corollary 2.2) with the extra term 1 accounting for the geometric 
effect on the CGRG model.
Next we give a similar result as in O’Connell (1998), the LDP for the proportion of iso-
lated vertices of the RGG.
[ ( ) ]
1 (�(d)− 1)c(1− y))
ξ1(y) = (�(d)− 1)cy(1− y/2)+ (1− y) log −
�(d) 2
Corollary 3 Suppose D is the degree distribution of the random graph G(n, rn), where 
the connectivity radius rn ∈ (0, 1] satisfies nrdn → c ∈ (0,∞). Then, as n → ∞, the pro-
portion of isolated vertices, D(0) satisfies an LDP in [0, 1] with good rate function
[ ]
( c ) (a− c(1− y))2
ξ2(y) = y log y+ cy(1− y/2)− (1− y) log − + ξ1(y),
a 2c(1− y)
where a = a(y, d) is the unique positive solution of 1− e−a = �(d)ca (1− y).
Doku‑Amponsah S pringerPlus ( 2016) 5:1140 Page 6 of 16
From Corollary 3 we deduce that on a typical random geometric graphs the number of 
isolated vertices will grow like ne−�(d)c. Thus, as n → ∞, the number of isolated verti-
ces in the geometric random graphs converges to ne−�(d)c in probability. Again, the rate 
function ξ2 above compares very well with the result of O’Connell (1998) with the extra 
term ξ1 accounting for the influence of the geometric plane [0, 1]d on the model.
The joint LDP for the empirical colour measure and empirical pair measure of CGRG
Theorem  4 Suppose that G is a CGRG with colour law ν and connection radii 
r 2n:  → [0, 1] satisfying nrdn (a, b) → Cd(a, b) for some symmetric function 
Cd : �
2 → [0,∞) not identical to zero. Then, as n → ∞, the pair (L1X ,L
2
X ) satisfies an 
LDP in P(�)× P̃∗(�2) with good rate function
1
I(η1,ω) = H(η1 � ν)+ H2(ω � η1), (3)
2
where the measure Cη1 ⊗ η1 ∈ P̃∗(� ×�) is defined by Cη1 ⊗ η1(a, b) =
Cd(a, b)η1(a)η1(b) for a, b ∈ .
Further, we state a corollary of Theorem 4 below.
Corollary 5 Suppose that Gis a CGRG graph with colour law ν and connection 
radii rn: 2 → [0, 1] satisfying nrdn (a, b) → Cd(a, b) for some symmetric function 
Cd : �
2 → [0,∞) not identical to zero. Then, as n → ∞, the number of edges per vertex 
|E| / n of Gsatisfies an LDP in [0,∞) with good rate function
{ }
ζ(x) = x log x − x + inf ψ(y)− x log(y)+ y ,
y>0
where ψ(y) = inf H(η1 � ν) over all probability vectors η1 with 1 T2�(d)η1 Cη1 = y.
Remark 2 By taking Cd(a, b) = c one will obtain ψ(y) = 0 for y = �(d)2 c, and ψ(y) = ∞ 
otherwise, which establishes that |E| / n obeys an LDP in [0,∞) with good rate function
{ ( ) }
1 1
ζ(x) = x log x − x + inf ψ(y)− x log y + y ,
y>0 2 2
where �(d)c = y.
Proof of Theorem 4
Change‑of‑measure
For any two points U1 and U2 uniformly and independently chosen from the space [0, 1]d 
write
F(t) := P{�U1 − U2� ≤ t}.
Further, given a function f̃ :  → R and a symmetric function g̃ : 2 → R, we define the 
constant U  by
f̃
∑
U = log ef̃ (a)ν(a),
f̃
a∈�
Doku‑Amponsah  SpringerPlus ( 2016)5 :1140 Page 7 of 16
and the function h̃ : 2n → R by
[ ]
( )−n
h̃n(a, b) = log 1− F(rn(a, b))+ F(rn(a, b))e
g̃(a,b) , (4)
for a, b ∈ . We use f̃  and g̃ to define (for sufficiently large n) a new coloured random 
graph as follows:
•  To the n points x1, x2, . . . , xn picked independently and uniformly in [0, 1]d we assign 
colours from  independently and identically according to the colour law ν̃ defined 
by 
f̃ (a)−B
ν̃(a) = e f̃ ν(a).
•  Given any two points xu, xv , with xu carrying colour a and xv carrying colour b, we 
connect vertex xu to vertex xv with probability 
F(r (a, b))eg̃(a,b)n
F(r̃n(a, b)) = .
1− F(rn(a, b))+ F(r g̃(a,b)n(a, b))e
We denote the transformed law by P̃. We observe that ν̃ is a probability meas-
ure and that P̃ is absolutely continuous with respect to P as, for any coloured graph 
G = ((σ (xj): j = 1, 2, 3, . . . , n),E),
dP̃ ∏ ν̃(σ (x )) ∏u F(r̃n(σ (xu), σ(xv))) ∏ 1− F(r̃n(σ (xu), σ(xv)))
(G) =
dP ν(σ (xu)) F(rn(σ (xu), σ(xv))) 1− F(rn(σ (xu), σ(xv)))u∈V (u,v)∈E (u,v)�∈E
∏ ν̃(σ (xu)) ∏ F(r̃n(σ (xu), σ(xv))) n− nF(rn(σ (xu), σ(xv)))
= ×
ν(σ (x )) F(r (σ (x ), σ(x ))) n− nF(r̃ (σ (x ), σ(x )))
u∈V u n u v n u v(u,v)∈E
∏ n− nF(r̃n(σ (xu), σ(xv)))
×
n− nF(rn(σ (xu), σ(xv)))
(u,v)∈E
∏ ∏ ∏
f̃ (σ (x ))−U
= e u f̃ eg̃(σ (xu),σ(xv))
1
e n h̃n(σ (xu),σ(xv))
u∈V (u,v)∈E (u,v)∈E
( 〈 〉 〈 〉 〈 〉)
〈 〉
1 1 1 1= exp n LG , f̃ − U + n L
2 , g̃ + n L1 1 1
f̃ G G
⊗ LG , h̃n − L�, h̃n ,2 2 2
(5)
where
1 1
∑
L� = δ(σ(xu),σ(x )).n u
u∈V
∑ ∑
We write �g ,ω� := a,b∈� g(a, b)ω(a, b) for ω ∈ P̃(�2), and �f , ρ� := a∈� f (a)ρ(a) for 
ρ ∈ P(�), and note that
F(rn(a, b)) = �(d)r
d
n (a, b), for all a, b ∈ �
2.
Doku‑Amponsah  SpringerPlus ( 2016)5 :1140 Page 8 of 16
i.e. the volume of a d-dimensional (hyper)sphere with radius r(a,  b) satisfying 
nrdn (a, b) → Cd(a, b).
The following lemmas will be useful in the proofs of main Lemmas.
Lemma 1 (Euler’s lemma) If nrdn (a, b) → Cd(a, b) for every a, b ∈ , then
lim [1+ αF(rn(a, b))]
n = eα�(d)Cd(a,b), for all a, b ∈ � and α ∈ R. (6)
n→∞
Proof Observe that, for any ε > 0 and for large n we have
[ ] [ ]
α�(d n n)Cd(a, b)− ε α�(d)Cd(a, b)+ ε
1+ ≤ [1+ αF(rn(a, b))]
n ≤ 1+ ,
n n
by the point-wise convergence. Hence by the sandwich theorem and Euler’s formula we 
get (6).  
We write
{ }
P(n)(ω) := 1P LG = ω .
Lemma 2 The family of measures (Pn: n ∈ N) is exponentially tight on P(�).
Proof We use coupling argument, see the proof of Doku-Amponsah and Mörters 
(2010, Lemma 5.1) to show that, for every θ > 0, there exists N ∈ N such that
1
lim sup P{|E| > nN } ≤ −θ .
n→∞ n
To begin, let c(d) > maxa,b∈� Cd(a, b) > 0 and nrdn (c) → c(d). Using similar coupling 
arguments as in see the proof of Doku-Amponsah and Mörters (2010, Lemma 5.1), we 
can define, for all sufficiently large n,  a coloured random graph X̃ with vertices x1, . . . , xn 
chosen uniformly from the vertices space [0, 1]d , colour law η and connectivity probabil-
{ }
ity pn = P �xi − xj� ≤ rn(c) = �(d)rdn , for all i �= j such that any edge present in G is 
also present in X̃ . Let |Ẽ| be the number of edges of X̃ . Using the binomial formula and 
Euler’s formula, we have that
n(n−1)
2 ( )
[ ]
∑
−nl |Ẽ| −nl k n(n− 1)/2 k n(n−1)/2−k
P{|Ẽ| ≥ nl} ≤ e E e = e e (pn) (1− pn)
k
k=0
= e−nl(1− p + ep )n(n−1)/2 ≤ e−nlenc�(d)(e−1+o(1))n n ,
where we used npn = �(d)nrdn → �(d)c in the last step. Now given θ > 0 choose 
N ∈ N such that N > θ +�(d)c(e − 1) and observe that, for sufficiently large n, 
P{|E| ≥ nN } ≤ P{|Ẽ| ≥ nN } ≤ e−nθ ,
which implies the statement.  
Proof of the upper bound in Theorem 4
We denote by C1 the space of functions on  and by C2 the space of symmetric functions 
on 2, and define
Doku‑Amponsah S pringerPlus ( 2016) 5:1140 Page 9 of 16
�
�
� �
Î(η1,ω) = sup f (a)− Uf η1(a)
f ∈C1 a∈�
g∈C2

1 � �(d ) �
+ g(a, b)ω(a, b)+ (1− eg(a,b))Cd(a, b)η1(a)η1(b)
2 2 
a,b∈� a,b∈�
for (η1,ω) ∈ P(�)× P 2∗(� )
Lemma 3 For each closed set G ⊂ P(�)× P̃∗(�2), we have
1 {( ) }
lim sup log L1 ,L2P G G ∈ F ≤ − inf Î(η1,ω).
n→∞ n (η1,ω)∈F
Proof First let f̃ ∈ C1 and g̃ ∈ C2 be arbitrary. Define β̃: �2 → R by
β̃(a, b) = �(d)(1− eg̃(a,b))Cd(a, b).
Observe that, by Lemma 1, β̃(a, b) = limn→∞ h̃n(a, b) for all a, b ∈ , recalling the defi-
nition of h̃n from (4). Hence, by (5), for sufficiently large n,
{ }
∫
1 1 2 1 1 1
emaxa∈� |β̃(a,a)| �
1L1 , h̃n� n�L ,f̃−U �+n� L ,g̃�+n� L ⊗L ,h̃ �≥ e 2 � dP̃ = E e G f̃ 2 G 2 G G n ,
∑
where L1� =
1
n u∈V δ(σ(xu),σ(xu)) and therefore,
〈 〉 〈 〉 〈 〉
{ }
1 n 1 ,f̃−U +n 1 1L L2 ,g̃ +n 1 1
lim sup logE e G f̃ 2 G 2
L ⊗L ,h̃
G G n ≤ 0. (7)
n→∞ n
Given ε > 0 let Îε(η1,ω) = min{Î(η1,ω), ε−1} − ε. Suppose that (η1,ω) ∈ G and observe 
that Î(η1,ω) > Îε(η1,ω). We now fix f̃ ∈ C1 and g̃ ∈ C2 such that
1 1
�f̃ −U , η1� + �g̃ ,ω� + �β̃ , η1 ⊗ η1� ≥ Îε(η1,ω).f̃ 2 2
As  is finite, there exist open neighbourhoods B1η  and B2ω of η1 1,ω such that
{ }
1 1
inf �f̃ − U , η1� + �g̃ , ω̃� + �β̃ , ηf̃ 1 ⊗ η1� ≥ Îε(η1,ω)− ε.
η̃ 11∈Bη 2 21
ω̃∈B2ω
Using Chebyshev’s inequality and (7) we have that
1 {( ) }
lim sup log L1P G ,L
2
G ∈ B
1
η × B
2
n 1 ωn→∞
〈 〉 〈 〉 〈 〉
{ }
1 n L1 ,f̃−U +n 1L2 ,g̃ +n 1L1 ⊗L1 ,h̃n
≤ lim sup logE e G f̃ 2 G 2 G G − Î (8)ε(η1,ω)+ ε
n→∞ n
≤ −Îε(η1,ω)+ ε.
Now we use Lemma 2 with θ = ε−1, to choose N (ε) ∈ N such that
1
lim sup log P{|E| > nN (ε)} ≤ −ε−1. (9)
n→∞ n
Doku‑Amponsah  SpringerPlus  (2016) 5:1140 Page 10 of 16
For this N (ε), define the set KN (ε) by
{ }
K 2N (ε) = (η1,ω) ∈ P(�)× P̃∗(� ): �ω� ≤ 2N (ε) ,
and recall that �L2G� = 2|E|/n. The set KN (ε) ∩ F  is compact and therefore may be cov-
ered by finitely many sets B1 2η × B , r = 1, . . . ,m with (η ,ω ) ∈ F  for r = 1, . . . ,m. 1,r ωr 1,r r
Consequently,
m
{( ) } {( ) } {( ) }
∑
L1 ,L2 ∈ F ≤ L1 ,L2 1P G G P G G ∈ Bη × B
2
ω + L
1
P G ,L
2
G �∈ K .1,r r N (ε)
r=1
We may now use (8) and (9) to obtain, for all sufficiently small ε > 0,
( )
1 {( ) } 1 {( ) }m
lim sup log 1 2P LG ,LG ∈ F ≤ max lim sup log
1
P LG ,
2 1 2 −1
LG ∈ Bη × B1,r ω ∨ (−ε)r
n→∞ n r=1 n→∞ n
( )
≤ − inf −1Îε(η1,ω)+ ε ∨ (−ε) .
(η1,ω)∈G
Taking ε ↓ 0 we get the desired statement.  
Next, we express the rate function in term of relative entropies, see for example 
Dembo and Zeitouni (1998, 2.15), and consequently show that it is a good rate function. 
Recall the definition of the function I from Theorem 4.
Lemma 4 (i) Î(η1,ω) = I(η1,ω), for any (η1,ω) ∈ P(�)× P̃∗(�2),
(ii) I is a good rate function and
(iii) H2(ω � η1) ≥ 0 with equality if and only if ω = �(d)Cdη1 ⊗ η1.
Proof (i) Suppose that ω �≪ �(d)Cdη1 ⊗ η1. Then, there exists a0, b0 ∈  with 
Cη1 ⊗ η1(a0, b0) = 0 and ω(a 20, b0) > 0. Define ĝ :  → R by
[ ]
ĝ(a, b) = log K (1(a ,b )(a, b)+ 1 (a, b))+ 1 , for a, b ∈ � and K > 0.0 0 (b0,a0)
For this choice of ĝ and f = 0 we have
∑ ∑ ∑
( ) 1 �(d)
f (a)−Uf η1(a)+ ĝ(a, b)ω(a, b)+ (1− e
ĝ(a,b))Cd(a, b)η1(a)η1(b)
2 2
a∈� a,b∈� a,b∈�
�(d)
≥ log(K + 1)ω(a0, b0) → ∞, for K ↑ ∞.
2
Now suppose that ω ≪ Cη1 ⊗ η1. We have
� � � �
� �
Î(η1,ω) = sup f (a)− log e
f (a)ν(a) η1(a)
f ∈C1 a∈� a∈�
�(d) �
+ Cd(a, b)η1(a)η1(b)
2
a,b∈�
 
1  � �
+ sup g(a, b)ω(a, b)−�(d) eg(a,b)Cd(a, b)η1(a)η1(b) .
2 g∈C  2 a,b∈� a,b∈�
By the variational characterization of relative entropy, the first term equals H(η1 ‖ ν). By 
the substitution g Cdη1⊗ηh = �(d)e 1 the last term equals
ω
Doku‑Amponsah S pringerPlus ( 2016) 5:1140 Page 11 of 16
[ ( ) ]
∑ ω(a, b)
sup log h(a, b) − h(a, b) ω(a, b)
h∈C �(d)Cd(a, b)η1(a)η1(b)2
h≥0 a,b∈�
( )
∑ ∑
( ) ω(a, b)
= sup log h(a, b)− h(a, b) ω(a, b)+ log ω(a, b)
h∈C �(d)Cd(a, b)η1(a)η1(b)2
≥0 a,b∈� a,b∈�h
= −�ω� +H(ω ��(d)Cdη1 ⊗ η1),
where we have used supx>0 log x − x = −1 in the last step. This yields that 
Î(η1,ω) = I(η1,ω).
(ii) Recall from (3) and the definition of H2 that I(η1,ω) = H(ω � ν)+ 
1
H(ω ��(d)Cdη1 ⊗ η +
�(d)
1) �Cdη1 ⊗ η1� −
1 �ω� . All summands are continuous in 
2 2 2
η1,ω and thus I is a rate function. Moreover, for all α < ∞, the level sets {I ≤ α} are con-
tained in the bounded set {(η1,ω) ∈ P(�)× P̃ (�2∗ ): H2(ω � η1) ≤ α} and are therefore 
compact. Consequently, I is a good rate function.
(iii) Consider the nonnegative function ξ(x) = x log x − x + 1, for x > 0, ξ(0) = 1, 
which has its only root in x = 1. Note that
{ ∫
ξ ◦ g d(�(d)Cdω ⊗ ω) if g :=
dω ≥ 0 exists,
H (ω � η ) = d(�(d)C2 1 dη1⊗η1) (10)
∞ otherwise.
( )
Hence H2(ω � η1) ≥ 0, and if ω = �(d)Cdη1 ⊗ η1, then ξ dω = ξ(1) = 0d  (�(d)Cdη1⊗η1)
and so H2(�(d)Cdη1 ⊗ η1 �ω) = 0. Conversely, if H2(ω �ω) = 0, then ω(a, b) > 0 
implies Cdη1 ⊗ η1(a, b) > 0, which then implies ξ ◦ g(a, b) = 0 and further g(a, b) = 1. 
Hence ω = �(d)Cdη1 ⊗ η1, which completes the proof of (iii).  
Proof of the lower bound in Theorem 4
We obtain the lower bound of Theorem 4 from the upper bound as follows:
Lemma 5 For every open set O ⊂ P(�)× P̃ (�2∗ ), we have
1 {( ) }
lim inf log P L1 ,L2 ∈ O ≥ − inf I(η1,ω).
n→∞ n G G (η1,ω)∈O
Proof Suppose (η1,ω) ∈ O, with ω ≪ �(d)Cdη1 ⊗ η1. Define f̃ω: � → R by
{
log η1(a) , if η (a) > 0,
f̃ω(a) = ν(a)
1
0, otherwise.
and g̃ω: �2 → R by
{
log ω(a,b), , if ω(a, b) > 0,g̃ �(d)C (a b)η (a)η (b)ω(a, b) = d 1 1
0, otherwise.
In addition, we let β̃ω(a, b) = �(d)Cd(a, b)(1− eg̃ω(a,b)) and note that 
β̃ω(a, b) = limn→∞ h̃ω,n(a, b), for all a, b ∈  where
[ ]
( )−n
h̃ω,n(a, b) = log 1− F(rn(a, b))+ F(rn(a, b))e
g̃ω(a,b) .
Doku‑Amponsah  SpringerPlus ( 2016)5 :1140 Page 12 of 16
Choose B1η ,B2ω open neighbourhoods of η1,ω, such that B1 ,×B2η ω ⊂ O and for all 1 1
(ω̃, ω̃) ∈ B1 2η × B1 ω
1 1 1 1
�f̃ω, η1� + �g̃ω,ω� + �β̃ω, η1 ⊗ η1� − ε ≤ �f̃ω, η̃1� + �g̃ω, ω̃� + �β̃ω, η̃1 ⊗ η̃1�.
2 2 2 2
We now use P̃, the probability measure obtained by transforming P using the functions 
f̃ω, g̃ω. Note that the colour law in the transformed measure is now η1, and the connec-
tivity radii r̃n(a, b) satisfy
n r̃dn (a, b) → ω(a, b)/(η1(a)η1(b)) =: C̃d(a, b), as n → ∞.
Using (5), we obtain
� �
�� � �
1 , 2
dP
P LG LG ∈ O ≥ Ẽ (G)
� �
1 � �
L1 ,L2 ∈B1 ×B2dP̃ G G η1 ω
 
 
� � �
= e−f̃ω(σ(xu)) e−g̃ω(σ(xu),σ(xv))
1
e− n h̃ω,n(σ (xu),σ(xv))Ẽ � �1 � �
L1 ,L2 ∈B1η ×B
2
 G G 1 ω u∈V (u,v)∈E (u,v)∈E
� � � �
� �
� � � �
−n L1 ,f̃ −n 1 L2 ,g̃ −n 1 L1 ⊗L1 ,g̃ + 1 L1
G ω 2 G ω 2 G G ω 2 � ,h̃ω,n= Ẽ e × � �1 � �
L1 ,L2 ∈B1 2
G G η ×B1 ω
� �
1 1 �� � �
≥ exp −n�f̃ 1 2 1 2ω ,ω� − n �g̃ω ,ω� − n �β̃ω , η1 ⊗ η1� +m− nε × P̃ L
2 2 G
,LG ∈ Bη × B1 ω ,
where m := 0 ∧mina∈� β̃(a, a). Therefore, by (6), we have
1 {( ) }
lim inf log 1 2P LG ,L ∈ On→∞ n G
1 1 1 {( ) }
≥ −�f̃ω ,ω� − �g̃ω ,ω� − �β̃ω , η1 ⊗ η1� − ε + lim inf log
1 , 2P̃ LG LG ∈ B
1 2
2 2 n→∞ n η
× B
1 ω
.
The result follows once we prove that
1 {( ) }
lim inf log L1 ,L2 ∈ B1P̃ × B2 = 0. (11)
n→∞ n G G η1 ω
We use the upper bound (but now with the law P replaced by P̃) to prove (11). Then we 
obtain
1 {( ) ( )c}
lim sup log 1 2 1 2P̃ LG ,LG ∈ Bη × Bω ≤ − inf Ĩ(ρ̃, ω̃),
n→∞ n (ρ̃,ω̃)∈F̃
where F̃ = (B1 × B2 )cη ω  and Ĩ(ρ̃, ω̃) := H(ω̃ �ω)+
1
2H2(ω̃ � ρ̃). It therefore suffices to 1
show that the infimum is positive. Suppose for contradiction that there exists a sequence 
(ρ̃n, ω̃n) ∈ F̃  with Ĩ(ρ̃n, ω̃n) ↓ 0. Then, because Ĩ is a good rate function and its level sets 
are compact, and by lower semi-continuity of the mapping (ρ̃, ω̃) �→ Ĩ(ρ̃, ω̃) , we can con-
struct a limit point (ρ̃, ω̃) ∈ F̃  with Ĩ(ρ̃, ω̃) = 0. By Lemma 4 this implies H(ρ̃ � η1) = 0 
and H2(ω̃ � η1) = 0, hence ρ̃ = η1, and ω̃ = C̃dη1 ⊗ η1 = ω contradicting (ρ̃, ω̃) ∈ F̃ .  
Proof of Theorem 1
For any n ∈ N we define
Doku‑Amponsah S pringerPlus ( 2016) 5:1140 Page 13 of 16
{ }
Pn(�) := ρ ∈ P(�): nρ(a) ∈ N for all a ∈ � ,
{ }
n
P̃n(� ×�) := ω ∈ P̃∗(� ×�): ω(a, b) ∈ N for all a, b ∈ � .
1+ 1{a = b}
We denote by �n := Pn(�)× P̃n(� ×�) and � := P(�)× P̃∗(� ×�). With
∣
(n) { }
P(ρ ,ω )(ηn) := P MG = η
∣
n H(MG) = (ρn,ωn) ,n n
{( ) }
P(n)(ρn,ωn) := P L
1
G ,L
2
G = (ρn,ωn)
the joint distribution of L1G ,L2G and MG is the mixture of 
(n)
P(ρ ,ω ) with P
(n)(ρn,ωn) n n
defined as
n (n)dP̃ (ρn,ωn, ηn) := dP (η ) dP
(n)
(ρ ,ω ) n (ρn,ωn). (12)n n
Biggins (2004, Theorem 5(b)) gives criteria for the validity of large deviation principles 
for the mixtures and for the goodness of the rate function if individual large deviation 
principles are known. The following three lemmas ensure validity of these conditions.
We recall from Lemma 6 that the family of measures (Pn: n ∈ N) is exponentially tight 
on 
Lemma 6 (Doku-Amponsah and Mörters 2010) The family of measures (P̃n: n ∈ N) is 
exponentially tight on �× P(� × N).
Define the function
J̃ : �× P(� × N) → [0,∞], J̃ ((η1,ω), η) = J̃(η1,ω)(η),
where
{
( )
(ω,η)
H η �Qpoi if (ω, η) is consistent and η1 = ωJ̃(η1,ω)(η) =
2 (13)
∞ otherwise.
Lemma 7 (Doku-Amponsah and Mörters 2010) J̃  is lower semi-continuous.
By Biggins (2004,  Theorem  5(b)) the two previous lemmas and the large deviation 
principles we have established Theorem 2.2 and Doku-Amponsah (2015, Theorem 2.1) 
ensure that under (P̃n) the random variables (ρn,ωn, ηn) satisfy a large deviation princi-
ple on P(�)× P̃∗(� ×�)× P(� × N) with good rate function
{ ( )
1 (ω,η)
H(η1 � ν)+ H2(ω ��)+H η �Q , if (ω, η) is consistent and η1 = ω2,
Ĵ (η 2 poi1,ω, η) =
∞, otherwise.
By projection onto the last two components we obtain the large deviation prin-
ciple as stated in Theorem  1 from the contraction principle, see e.g. Dembo et  al. 
(2005, Theorem 4.2.1).
Doku‑Amponsah  SpringerPlus ( 2016) 5:1140 Page 14 of 16
Proof of Corollaries 2, 3, and 5
We derive the theorems from Theorem  1 by applying the contraction principle, see 
e.g.  Dembo and Zeitouni (1998,  Theorem  4.2.1). In fact Theorem  1 and the contrac-
tion principle imply a large deviation principle for D. It just remains to simplify the rate 
functions.
Proof of Theorem 2
Note that, in the case of an uncoloured RGG graphs, the function C degenerates to a 
constant  c, L2G = |E|/n ∈ [0,∞) and MG = D ∈ P(N ∪ {0}). Theorem  1 and the con-
traction principle imply a large deviation principle for D with good rate function
2(δ) = inf{J (x, δ): x ≥ 0}
{ }
1 1 1 1
= inf H(δ � qx)+ x log x − x log�(d)c + �(d)c − x: �δ� ≤ x ,
2 2 2 2
which is to be understood as infinity if 〈d〉 is infinite. We denote by x (δ) the expression 
inside the infimum. For any ε > 0, we have
�δ�+ε �δ�
2 (δ)− 2 (δ)
( )
ε �δ� − ε �δ� ε �δ� ε �δ� − ε −ε ε �δ�
= + log + log ≥ + + log > 0,
2 2 �δ� + ε 2 �(d)c 2 2 �δ� 2 �(d)c
so that the minimum is attained at x = �(d)�δ�.
Proof of Corollary 3
Corollary 3 follows from Theorem 2 and the contraction principle applied to the con-
tinuous linear map G: P(N ∪ {0}) → [0, 1] defined by G(δ) = δ(0). Thus, Theorem  2 
implies the large deviation principle for G(D) = W  with the good rate function 
ξ2(y) = inf{2(δ): δ(0) = y, �δ� < ∞}. We recall the definition of x2 and observe that 
ξ2(y) can be expressed as
{ }
∞
1 b2 ∑ δ(k)
ξ2(y) = inf inf c + y log y+ + δ(k) log − b(1− y) .
b≥0 d∈P(N∪{0}) 2 2�(d)c qb(k)
δ(0)=y,�(d)c�δ�=b2 k=1
Now, using Jensen’s inequality, we have that
∞
∑ δ(k) (1− y)
δ(k) log ≥ (1− y) log ,
q k −b (14)b( ) (1− e )
k=1
with equality if (1−y)δ(k) = −b qb(k), for all k ∈ N. Therefore, we have the inequality(1−e )
inf{2(δ): δ(0)
{ }
1 b2 (1− y)
= y, �δ� < ∞} ≥ inf c + y log y+ + (1− y) log − b(1− y): b ≥ 0 .
2 2�(d)c (1− e−b)
Let y ∈ [0, 1]. Then, the equation a(1− e−a) = �(d)c(1− y) has a unique 
positive solution. Elementary calculus shows that the global minimum of 
Doku‑Amponsah S pringerPlus  (2016) 5:1140 Page 15 of 16
2 (1−y)
b �→ 12�(d)c + y log y+
b
2 + (1− y) log −b − b(1− y) on (0,∞) is attained at �(d)c (1−e )
the value b = a, where a is the positive solution of our equation. We obtain the form of ξ 
in Corollary 3 by observing that
( )
a(y)2 + (�(d)c)2 − 2�(d)ca(y) 1− y �(d)cy( ) 1 ( )2
= 2− y + a(y)−�(d)c(1− y) .
2�(d)c 2 2�(d)c
Proof of Corollary 5
We define the continuous linear map W : P(�)× P̃ (�2∗ ) → [0,∞) by W (η1,ω) = 12�ω�, 
and infer from Theorem 4 and the contraction principle that W (L1G ,L2G) = |E|/n satis-
fies a large deviation principle in [0,∞) with the good rate function
{ }
ζ(y) = inf I(η1,ω): W (η1,ω) = y .
To obtain the form of the rate in the corollary, the infimum is reformulated as uncon-
strained optimization problem (by normalising ω)
{ }
�(d)
inf H(η1 � ν)+ yH(ω ��(d)Cη1 ⊗ η1)+ y log 2y+ �Cω ⊗ ω� − y . (15)
ω∈P∗(�2) 2
η1∈P(�)
By Jensen’s inequality H(ω ��(d)Cη1 ⊗ η1) ≥ − log ��(d)Cη1 ⊗ η1�, with equality if 
= Cηω 1⊗η1�C ⊗ � , and hence, by symmetry of C we haveη1 η1
{ }
�(d)
min H(η1 � ν)+ yH(ω ��(d)Cη1 ⊗ η1)+ y log 2y+ �Cη1 ⊗ η1� − y
ω∈P (�2∗ ) 2
�(d)
= H(η1 � ν)− y log ��(d)Cη1 ⊗ η1� + y log 2y+ �Cη1 ⊗ η1� − y.
2
The form given in Corollary 5 follows by defining
1 ∑
y = �(d) Cd(a, b)η1(a)η1(b).
2
a,b∈�
Conclusion
In this work, we have proved joint large deviation principle for the empirical pair meas-
ure and empirical locality measure of the near intermediate CGRG models. From this 
result we have obtained asymptotic results about useful graph quantities such as num-
ber of edges per vertex, the degree distribution and the proportion of isolated vertices 
for the near intermediate CGRG models. The rate functions of all these large deviation 
principles compared very well with the rate functions of the results for coloured random 
graph models by Doku-Amponsah and Mörters (2010), with some extra terms account-
ing for the geometric effect in the CGRG models. An important future research direc-
tion is to formulate and prove an Asymptotic Equipartition Property for Networked 
Data Structures Modelled as the CGRG, and then a possible Coding or Approximate 
Pattern Matching Algorithms for such Networks. One could also investigate the Statisti-
cal Mechanics on the CGRG.
Doku‑Amponsah  SpringerPlus  (2016)5 :1140 Page 16 of 16
Acknowledgements
This extension has been mentioned in the author’s PhD Thesis at University of Bath.
Competing interests
The author declares that he has no competing interests.
Received: 16 January 2016   Accepted: 29 June 2016
References
Biggins JD (2004) Large deviations for mixtures. Electron Commun Probab 9:60–71
Bordenave C, Caputo P (2015) Large deviations of empirical neighbor‑hood distribution in sparse random graphs. Probab 
Theory Relat Fields 163(1–2):149–222
Biggins JD, Penman DB (2009) Large deviations in randomly coloured random graphs. Electron Comm Probab 
14:290–301
Cannings C, Penman DB (2003) Handbook of statistics 21. Stochastic processes: modelling and simulation. In: Shanbhag 
DN, Rao CR (eds) Models of random graphs and their applications. Elsevier, Amsterdam, pp 51–91
Dembo A, Zeitouni O (1998) Large deviations techniques and applications. Springer, New York
Dembo A, Mörters P, Sheffield S (2005) Large deviations of Markov chains indexed by random trees. Ann Inst Henri 
Poincaré Probab Stat 41:971–996
Doku‑Amponsah K (2006) Large deviations and basic information theory for hierarchical and networked data structures. 
PhD thesis, Bath
Doku‑Amponsah K (2014) Exponential approximation, method of types for empirical neighbourhood measures of 
random graphs by random allocation. Int J Stat Probab 3(2):110–120
Doku‑Amponsah K (2015) Large deviation result for the empirical locality measure of typed random geometric graphs. 
Int J Stat Prob 4(1):94–99
Doku‑Amponsah K (2016) Large deviation, Basic information theory for wireless sensor networks. Presented at College of 
Basic and Applied Sciences, Science Development Platform 2016, University of Ghana. arxiv:1512.08050
Doku‑Amponsah K, Mörters P (2010) Large deviation principles for empirical measures of coloured random graphs. Ann 
Appl Probab 20(6):1989–2021. doi:10.1214/09‑AAP47
Mukherjee S (2014) Large deviation for the empirical degree distribution of an Erdos–Renyi graph. arxiv:1310.4160
O’Connell N (1998) Some large deviation results for sparse random graphs. Probab Theory Relat Fields 110:277–285
Penrose MD (2003) Random geometric graphs. Oxford University Press, Oxford
View publication stats