GNNs inherit stability from their layers

[[concept]]
Theorem

Let $\Phi(S, h)$ be an $L$-layer GNN built from graph filters $h$, and let $\tilde{S}$ be a perturbed version of the graph $S$, with perturbations measured modulo permutations.

(1) If $\tilde{S} = S + \epsilon S$ (a dilation of the graph) and all filters $h$ are integral Lipschitz, then

$$\|\Phi(S,h) - \Phi(\tilde{S},h)\|_p \le LC\epsilon + O(\epsilon^2)$$

(stable to dilation/scaling)

(2) If $P^T \tilde{S} P = S + \tilde{E}$ for some permutation matrix $P$ and all filters $h$ are Lipschitz, then

$$\|\Phi(S,h) - \Phi(\tilde{S},h)\|_p \le LC(1 + \delta\sqrt{n})\,\epsilon + O(\epsilon^2)$$

(stable to additive perturbations; here $n$ is the number of nodes and $\delta$ measures the eigenvector misalignment between $S$ and the perturbation $\tilde{E}$)

(3) If $P^T \tilde{S} P = S + \tilde{E} S + S \tilde{E}$ for some permutation matrix $P$ and all filters $h$ are integral Lipschitz, then

$$\|\Phi(S,h) - \Phi(\tilde{S},h)\|_p \le 2LC(1 + \delta\sqrt{n})\,\epsilon + O(\epsilon^2)$$

(stable to relative perturbations)

These are the same bounds the individual filters satisfy: GNNs are as stable as their constituent filters, while the pointwise nonlinearities make them more discriminative, which is why GNNs perform better than their constituent filters in practice.
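Before the proof, a minimal numerical sketch of case (1): an $L$-layer GNN on a random symmetric shift operator, with a random polynomial filter rescaled to be non-amplifying and $\tanh$ as the normalized-Lipschitz nonlinearity. All sizes, seeds, and filter taps here are illustrative choices, not from the lecture; the point is that the output difference under a dilation $\tilde{S} = (1+\epsilon)S$ scales linearly in $\epsilon$ (roughly constant ratio), as the theorem predicts.

```python
import numpy as np

rng = np.random.default_rng(0)
n, L = 20, 3

# Random symmetric shift operator, normalized so ||S|| = 1
A = rng.random((n, n))
S = (A + A.T) / 2
S /= np.linalg.norm(S, 2)

def graph_filter(S, h):
    """Polynomial graph filter H(S) = sum_k h_k S^k."""
    return sum(hk * np.linalg.matrix_power(S, k) for k, hk in enumerate(h))

# Filter taps, rescaled to be non-amplifying: ||H(S)|| <= 1
h = rng.standard_normal(4)
h /= np.linalg.norm(graph_filter(S, h), 2)

def gnn(S, x, h, L=L):
    """L-layer stack of graph perceptrons; tanh is normalized Lipschitz."""
    for _ in range(L):
        x = np.tanh(graph_filter(S, h) @ x)
    return x

x = rng.standard_normal(n)
x /= np.linalg.norm(x)  # normalized input: ||x|| <= 1

# Output difference should scale linearly in eps (roughly constant ratio)
for eps in (1e-1, 1e-2, 1e-3):
    d = np.linalg.norm(gnn(S, x, h) - gnn((1 + eps) * S, x, h))
    print(f"eps={eps:.0e}  ||Phi(S,h) - Phi(S~,h)|| = {d:.3e}  ratio = {d/eps:.3f}")
```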

Proof

We begin with some non-restrictive additional assumptions:

  1. $\|x\| \le 1$: normalized input at all layers (easy to achieve with non-amplifying filters $h$, i.e. $\|H(S)\| \le 1$).
  2. The activation function/nonlinearity $\sigma$ is normalized Lipschitz, i.e. has a Lipschitz constant of 1: $|\sigma(a) - \sigma(b)| \le |a - b|$.
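For instance (standard examples, not specific to these notes), ReLU and $\tanh$ both satisfy the normalized Lipschitz condition:

$$|\max(0,a) - \max(0,b)| \le |a - b|, \qquad |\tanh(a) - \tanh(b)| \le |a - b|.$$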

Let $\|\tilde{E}\| = \epsilon$ for any of the three perturbation types, and let the filters $h$ be stable to $\tilde{E}$ with

$$\|H(\tilde{S}) - H(S)\|_p \le C\epsilon$$

where $C$ is the filter stability constant (absorbing the $(1+\delta\sqrt{n})$ and factor-of-2 terms of the corresponding perturbation type).
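As a quick sanity check of this assumption (again an illustrative sketch, not the lecture's code): for an additive perturbation $\tilde{S} = S + \epsilon E$ with $\|E\| = 1$, the filter difference $\|H(\tilde{S}) - H(S)\|$ should be linear in $\epsilon$, with the ratio approximating the stability constant $C$.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 20

# Random symmetric shift operator, normalized so ||S|| = 1
A = rng.random((n, n))
S = (A + A.T) / 2
S /= np.linalg.norm(S, 2)

# Random symmetric perturbation direction with ||E|| = 1
E = rng.random((n, n))
E = (E + E.T) / 2
E /= np.linalg.norm(E, 2)

h = rng.standard_normal(4)  # illustrative filter taps

def graph_filter(S, h):
    """Polynomial graph filter H(S) = sum_k h_k S^k."""
    return sum(hk * np.linalg.matrix_power(S, k) for k, hk in enumerate(h))

# ||H(S~) - H(S)|| grows linearly in eps; the ratio estimates C
for eps in (1e-1, 1e-2, 1e-3):
    diff = np.linalg.norm(graph_filter(S + eps * E, h) - graph_filter(S, h), 2)
    print(f"eps={eps:.0e}  ||H(S~) - H(S)|| = {diff:.3e}  ratio = {diff/eps:.3f}")
```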

Each layer $\ell = 1, \dots, L$ is a graph perceptron with filter $H$, i.e. $x_\ell = \sigma(H(S)\,x_{\ell-1})$. Then, note that

$$
\begin{aligned}
\|\tilde{x}_\ell - x_\ell\| &= \|\sigma(H(\tilde{S})\,\tilde{x}_{\ell-1}) - \sigma(H(S)\,x_{\ell-1})\| \\
&\le \|H(\tilde{S})\,\tilde{x}_{\ell-1} - H(S)\,x_{\ell-1}\| && \text{(since $\sigma$ has Lipschitz constant 1)} \\
&= \|H(\tilde{S})\,\tilde{x}_{\ell-1} - H(\tilde{S})\,x_{\ell-1} + H(\tilde{S})\,x_{\ell-1} - H(S)\,x_{\ell-1}\| \\
&= \|H(\tilde{S})\,[\tilde{x}_{\ell-1} - x_{\ell-1}] + [H(\tilde{S}) - H(S)]\,x_{\ell-1}\| \\
&\le \|H(\tilde{S})\|\,\|\tilde{x}_{\ell-1} - x_{\ell-1}\| + \|H(\tilde{S}) - H(S)\|\,\|x_{\ell-1}\| && \text{(triangle inequality)} \\
&\le \|\tilde{x}_{\ell-1} - x_{\ell-1}\| + C\epsilon && \text{(since $\|H(\tilde{S})\| \le 1$ and $\|x_{\ell-1}\| \le 1$)}
\end{aligned}
$$

We can apply the same reasoning to get analogous expressions for $\|\tilde{x}_{\ell-1} - x_{\ell-1}\|$, $\|\tilde{x}_{\ell-2} - x_{\ell-2}\|$, and so on, yielding the final expression with the $LC$ factor.
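Written out, the recursion telescopes; since the input itself is not perturbed, $\tilde{x}_0 = x_0 = x$, so

$$\|\tilde{x}_L - x_L\| \le \|\tilde{x}_{L-1} - x_{L-1}\| + C\epsilon \le \|\tilde{x}_{L-2} - x_{L-2}\| + 2C\epsilon \le \cdots \le \|\tilde{x}_0 - x_0\| + LC\epsilon = LC\epsilon,$$

which is the claimed bound (with the appropriate per-type constant folded into $C$).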

Mentions

- 2025-03-10 graphs lecture 13
- 2025-03-24 graphs lecture 14