almost exact recovery is impossible when the signal-to-noise ratio is below the threshold

Created: 2025-02-10 · Last Edited: 2025-05-13 · Edits: 3
Theorem

Suppose we have a stochastic block model problem where the adjacency matrix satisfies $A_{ij} \sim \mathrm{Bernoulli}(P_{ij})$ with $P = YBY^T$, where $Y \in \{0,1\}^{n \times k}$ is the community membership matrix and $$B_{c_1 c_2} = \begin{cases} p & \text{if } c_1 = c_2 \\ q & \text{otherwise.} \end{cases}$$

If the signal-to-noise ratio $\mathrm{SNR} = \frac{(p-q)^2}{2(p+q)} < \frac{1}{n}$, then almost exact recovery is impossible.

Even with unlimited time and computational resources, no algorithm can recover the true communities from a graph drawn with these adjacency probabilities. This gives us the information-theoretic threshold.
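The model above can be sketched in code. Here is a minimal sampler (sizes and probabilities are hypothetical, not from the lecture) that draws $A_{ij} \sim \mathrm{Bernoulli}(P_{ij})$ with $P = YBY^T$:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: n nodes, k communities.
n, k = 100, 2
p, q = 0.8, 0.2        # within- / between-community edge probabilities

labels = rng.integers(0, k, size=n)             # community of each node
Y = np.eye(k)[labels]                           # n x k membership matrix
B = q * np.ones((k, k)) + (p - q) * np.eye(k)   # p on diagonal, q off
P = Y @ B @ Y.T                                 # expected adjacency matrix

A = (rng.random((n, n)) < P).astype(int)        # Bernoulli(P_ij) draws
A = np.triu(A, 1)                               # keep upper triangle only
A = A + A.T                                     # symmetrize, no self-loops
```

By construction, $P_{ij} = p$ whenever nodes $i$ and $j$ share a community and $P_{ij} = q$ otherwise, which matches the definition of $B$ above.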

Proof

See the proofs in Massoulié (2014) and Mossel (2014). Aside: the two proofs are interesting to compare, since they use different methods from different domains. See also Abbe's survey papers.

Example: Sparse Graphs

Let $p = \frac{a}{n}$, $q = \frac{b}{n}$, and $\frac{|E|}{n^2} \to 0$ as $n \to \infty$ (the edge density vanishes).
Then $$SNR = \frac{\left( \frac{a}{n} - \frac{b}{n} \right)^2}{2\frac{(a+b)}{n}} = \frac{1}{2n} \frac{(a-b)^2}{(a+b)}$$
i.e., in sparse graphs it is not difficult to identify the information-theoretic threshold, since the signal-to-noise ratio is easy to calculate.
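As a quick sanity check (the values of $a$, $b$, $n$ here are hypothetical, not from the lecture), the sparse-regime SNR can be computed directly and compared against the $\frac{1}{n}$ threshold:

```python
def snr(a, b, n):
    """SNR = (p - q)^2 / (2 (p + q)) with p = a/n, q = b/n."""
    p, q = a / n, b / n
    return (p - q) ** 2 / (2 * (p + q))

n = 10_000

# Below threshold: (a - b)^2 / (2 (a + b)) = 4/12 < 1, so SNR < 1/n
# and almost exact recovery is impossible.
assert snr(4, 2, n) < 1 / n

# Above threshold: (a - b)^2 / (2 (a + b)) = 64/24 > 1, so SNR > 1/n.
assert snr(10, 2, n) > 1 / n

# Matches the closed form from the note: SNR = (a - b)^2 / (2 n (a + b)).
assert abs(snr(10, 2, n) - (10 - 2) ** 2 / (2 * n * (10 + 2))) < 1e-15
```

The comparison `SNR < 1/n` is equivalent to $(a-b)^2 < 2(a+b)$, which is why the threshold depends only on the constants $a$ and $b$ in the sparse regime.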

Mentions

File
information theoretic threshold
2025-02-10 graphs lecture 6
2025-02-12 graphs lecture 7