2025-04-07 lecture 18

[[lecture-data]]
Summary

Last Time

3. Graphon Signal Processing

Recall that

  1. graph convolutions converge in the spectral domain
  2. given (Gn,Xn)(W,X) where (W,X) is bandlimited, GFT of x^n converges to WFT X^ (bandlimited convergent graph signals converge in the fourier domain)

For bandlimited graphon signals, we should be able to show convergence in the node domain!

Convergence of bandlimited signals in the node domain

Theorem

Let (Gn,Xn)(W,X) with cbandlimited (W,X). Let H(Gn)=k=0K1hk(Ann)k be a sequence of graph convolution that share coefficients with graphon convolution TH=k=0K1hkTW(k). Then

(Gn,Yn)(W,Y)

Where Yn and Y are the graph and graphon convolution outputs respectively.

Recall that our definition of a convergent sequence of graph signals compares the induced graphon signal and the limiting graphon signal to see if they converse in an L2 way.

theorem basically says

Proof

Y^nY follows X^nX^ conbined with H^(Gn)T^H(W). Since X is bandlimited, so is Y because both graph convolutions and graphon convolutions act pointwise in the spectral domain. Therefore, the iGFT converges to the iWFT.

See graph convolutions of bandlimited signals converge to the graphon convolution

Our first result for convergence for graphon signals is restrictive though, since we require that {as||the signal is bandlimited||restriction} and {ha||graphons have infinite-dimensional spectra||why restrictive}.

Can we get a better result? Can we generalize to arbitrary convergent graph signals?

set-up for solution

To show convergence of filters for general signals (not only bandlimited), we instead restrict {as||the filters||what restrict} to {ha||be Lipschitz||restriction}.

Recall the definition for lipschitz graph filters.

lipschitz filter

Let H be a convolutional graph filter and h(λ) its spectral representation. H is a lipschitz filter if h(λ) is Lipschitz on some interval TR. ie, that there exists some cR such that
|h(λ)h(λ)|c|λλ|λ,λT
This uses the idea that once the filter is fixed, we can think of the representation as sampling along a "spectrum polynomial". Typically, T=[λmin,λmax]

We define our analogue for graphon filters in the same way:

Lipschitz graphon filters

Let TH be a graphon filter. TH is a Lipschitz graphon filter if it has Lipschitz spectral response. ie, it

|h(λ1)h(λ2)|L|λ1λ2|λ1,λ2[1,1]

This upper bounds the magnitude of the first derivative on the interval [1,1]

Note

In this statement, our h are arbitrary analytic functions. However, in practice, these will be polynomials

  • On the real line, polynomials are not Lipschitz. However, in a bounded interval, we can always find a Lipschitz constant.

We consider general analytic filters h(λ) with Lipschitz constant L (both for generality and to avoid dependence on the polynomial degree k which gives us better/tighter bounds)

see Lipschitz graphon filter

Improved/more general version

Aside: this is a LONG proof (longest in the class probably). The theorem is basically the same as the one above, but without the bandlimited assumption.

Theorem

Let (Gn,Xn)(W,X).

Let H(Gn)=k=0K1hk(Ann)k be a sequence of graph convolution that share coefficients with graphon convolution TH=k=0K1hkTW(k) where h(λ) is a Lipschitz graphon filter. Then

(Gn,Yn)(W,Y)

Where Yn and Y are the graph and graphon convolution outputs respectively.

Proof

Note

We want to show the outputs on the graph convolutions converge in the "induced graphon signal" sense to the outputs of the graphon convolution

WLOG, we assume that |h(λ)|1λ[1,1]. We have

Y=jZ{0}h(λj)X^jφj and Yn=jZ{0}h(λj(n))[X^n]jφj(n)
note on notation

λj,Xj,φj are the eigenvalues, graphon signal, and eigenfunctions of the graphon respectively. λj(n),[X^n]j,φj(n) are the eigenvalues, graph signals, and eigenvectors of the graphs in the sequence of convergent graphs and convergent graph signals.

Define an partitioning index set C such that

{jC={i s.t. |λi|c}jC={i s.t. |λi|<c} where c=1|h0|L(2||X||ϵ1+1)

Where we fix some ϵ>0, h0=h(0), and L is the Lipschitz constant. We can choose ϵ such that c=kk[1,1]. Further, note that this creates a set of bandlimited components (C) and non-bandlimited components (Cc) of the signal.

Then, by the triangle inequality, we have

||YYn||=||jZ{0}h(λj)X^jφjjZ{0}h(λj(n))[X^n]jφj(n)||||jCh(λjX^jφj)jCh(λj(n))[X^n]jφj(n)||(1)+||jCh(λjX^jφj)jCh(λj(n))[X^n]jφj(n)||(2)

And we can bound (1) and (2) individually.

It is very easy to bound (1), since this is equivalent to filter applications to a bandlimited X with bandwith c. Thus, (1) vanishes as a direct applications of the previous result (bandlimited convergent graph signals converge in the fourier domain) to this expression.

For (2), we note that

h0Lch(λj)h0+LcjC

Then we can write

(2)=||jCh(λjX^jφj)jCh(λj(n))[X^n]jφj(n)||(h0+Lc)||jCh(λjX^jφj)jCh(λj(n))[X^n]jφj(n)||h0||jCX^jφj[X^n]jφj(n)||(3)+Lc||jCX^jφj||()Lc||X||+Lc||jC[X^n]jφj(n)||(4)

We can see () follows since jCX^jφj is a portion of graphon signal X=jZ{0}X^jφj. And so, we can upper bound the norm of the portion of the signal by the norm of the entire signal.

Now, we need to bound (3) and (4).

Note that we can write

()jC[X^n]jφj(n)=XnjC[X^n]jφj(n)

We can do this because the φ form an orthonormal basis for both the graph and the graphon (see the appropriate spectral theorem).

For (3), by writing both the induced graphon signal and the limiting graphon signal in terms of the identity (), we can get

||jCX^jφj[X^n]jφj(n)||=||(XjCX^jφj)(XnjC[X^n]jφj(n))||||XXn||XnX+||jCX^jφj[X^n]jφj(n)||converges (bandlimited signal)<ϵ

Where ϵ is the same one we fixed above. Since both RHS terms converge (the first is from our initial conditions, the second one is again a direct application of bandlimited convergence), we are done for (3).

Finally, for (4),

||jC[X^n]jφj(n)||=||XnjC[X^n]jφj(n)+XX||||XnX||XnX+||jC[X^n]jφj(n)X^jφj||converges (bandlimited)+||jCX^jφj||||X||<ϵ+||X||

Thus we get that

(2)=||jCh(λjX^jφj)jCh(λj(n))[X^n]jφj(n)||Lc||X||+h0ϵ+Lc(ϵ+||X||)<ϵ~

for all n>Nϵ

see lipschitz graph convolutions of graph signals converge to lipschitz graphon filters

Summary

For node-domain convergence, we first saw that graph convolutions of bandlimited signals converge to the graphon convolution. But since graphons have infinite-dimensional spectra, this can be quite strict and therefore not super useful. Instead, we replaced the bandlimited assumption with a Lipschitz continuity requirement.

It was difficult to show convergence for the GFT for spectral components associated with eigenvalues close to 0 (see conjecture 1 from Lecture 17). This is why we showed instead that bandlimited convergent graph signals converge in the fourier domain.

Here, the same thing happens because graphon shift operator eigenvalues accumulate at 0. The Lipschitz continuity addresses this by ensuring all spectral components near 0 are amplified in an increasingly "similar" way. We can see this by looking at how we defined c in the proof:

c=1|h0|L(2||X||ϵ1+1)

convergence-discriminability tradeoff

In HW3:

Transferability:
If we want to make an error of at most ϵ ...

Transferability of Graph Convolutions and GNNs

The asymptotic convergence of the lipschitz graph filters reveals an important tradeoff between convergence/transferability and discriminability. We can compare this to the stability-discriminability tradeoff for Lipschitz filters.

c-band cardinality

c-band cardinality of a graphon W are the number of graphon eigenvalues with magnitude greater than c

nc(W)=|{λi:|λi|>c}|

Note that this quantity is finite since the graphon shift operator eigenvalues accumulate only at 0.

see c band cardinality

c-eigenvalue margin

The c-eigenvalue margin of graphons W and W is the minimum difference between and eigenvalue of W outside of the c band and an eigenvalue of W:

δc(W,W)=minij{|λi(TW)λj(TW)|:|λi(TW)|c}

see c eigenvalue margin

Non-asymptotic convergence of graph convolutions

Theorem

Let (Gn,Xn)(W,X) and let graph and graphon convolutions be such that

H(Gn)=k=0hk(Ann)k and TH=k=0hkTWk

Further, assume

  1. ||X||1 (WLOG)
  2. |h(λ)|<1 and h(λ) is LLipschitz in [1,c][c,1] and Lipschitz in (c,c)

Then we have

||YnY||(L+πncδc)||TWTWn||+(Lc+2)||XXn||+2c
Note

note that this bound depends on

We already know that these two things converge (we've already discussed the (slow) convergence rate for Wn in induced graph sequence converges if and only if the induced graphon sequence converges). We can bring the rates for both quantities to O(1n) by introducing more restrictive requirements.

Here, the convergence-discriminability tradeoff is explicit. We can see that larger L and smaller c (more discriminative filters) lead to a higher error bound.

The last term 2c does not vanish. We can think of this as the "approximation error" that comes from the fact that the eigenvalues of the graphs converge non-uniformly (recall this conjecture).

In the finite sample regime, unless =0, there is always leftover "nontransferable energy" corresponding to the spectral components with |λi|<c which do not converge

see convergence bound for graph convolutions

Review

#flashcards/math/dsg

Even though graph convolutions converge in the spectral domain, this is not enough for our purposes. Why is that?
-?-
graph convolutions and graphon convolutions operate in the node domain (not the spectral domain)

What does the lipschitz condition achieve in lipschitz graph convolutions of graph signals converge to lipschitz graphon filters?
-?-
it ensures all spectral components near 0 are amplified in an increasingly "similar" way.

Summary

For node-domain convergence, we first saw that {ass||graph convolutions of bandlimited signals converge to the graphon convolution}. But since graphons have {has||infinite-dimensional spectra}, this can be quite strict and therefore not super useful. Instead, we replaced the {ass||bandlimited assumption} with {hha||a Lipschitz continuity requirement}.

Created 2025-04-09 Last Modified 2025-05-30