2025-04-07 lecture 18

[[lecture-data]]
Summary

Last Time

3. [[Graphon Signal Processing]]

Recall that

  1. graph convolutions converge in the spectral domain
  2. given (Gn,Xn)(W,X) where (W,X) is bandlimited, GFT of x^n converges to WFT X^ ([[bandlimited convergent graph signals converge in the fourier domain]])

For bandlimited graphon signals, we should be able to show convergence in the node domain!

Convergence of bandlimited signals in the node domain

Theorem

Let (Gn,Xn)(W,X) with cbandlimited (W,X). Let H(Gn)=k=0K1hk(Ann)k be a sequence of [[graph convolution]] that share coefficients with [[graphon convolution]] TH=k=0K1hkTW(k). Then

(Gn,Yn)(W,Y)

Where Yn and Y are the graph and [[graphon convolution]] outputs respectively.

Recall that our definition of a [[convergent sequence of graph signals]] compares the [[induced graphon signal]] and the limiting [[graphon signal]] to see if they converse in an L2 way.

theorem basically says

Proof

Y^nY follows X^nX^ conbined with H^(Gn)T^H(W). Since X is bandlimited, so is Y because both graph convolutions and iGFT converges to the iWFT.

See [[graph convolutions of bandlimited signals converge to the graphon convolution]]

Our first result for convergence for graphon signals is restrictive though, since we require that {as||the signal is bandlimited||restriction} and {ha||graphons have infinite-dimensional spectra||why restrictive}.

Can we get a better result? Can we generalize to arbitrary convergent graph signals?

set-up for solution

To show convergence of filters for general signals (not only bandlimited), we instead restrict {as||the filters||what restrict} to {ha||be Lipschitz||restriction}.

Recall the definition for lipschitz graph filters.

lipschitz filter

Let H be a convolutional graph filter and h(λ) its spectral representation. H is a lipschitz filter if h(λ) is Lipschitz on some interval TR. ie, that there exists some cR such that
|h(λ)h(λ)|c|λλ|λ,λT
This uses the idea that once the filter is fixed, we can think of the representation as sampling along a "spectrum polynomial". Typically, T=[λmin,λmax]

We define our analogue for graphon filters in the same way:

Lipschitz graphon filters

Let TH be a graphon filter. TH is a Lipschitz graphon filter if it has Lipschitz spectral response. ie, it

|h(λ1)h(λ2)|L|λ1λ2|λ1,λ2[1,1]

This upper bounds the magnitude of the first derivative on the interval [1,1]

Note

In this statement, our h are arbitrary analytic functions. However, in practice, these will be polynomials

We consider general analytic filters h(λ) with Lipschitz constant L (both for generality and to avoid dependence on the polynomial degree k which gives us better/tighter bounds)

see [[Lipschitz graphon filter]]

Improved/more general version

Aside: this is a LONG proof (longest in the class probably). The theorem is basically the same as the one above, but without the bandlimited assumption.

Theorem

Let (Gn,Xn)(W,X).

Let H(Gn)=k=0K1hk(Ann)k be a sequence of [[graph convolution]] that share coefficients with [[graphon convolution]] TH=k=0K1hkTW(k) where h(λ) is a [[Lipschitz graphon filter]]. Then

(Gn,Yn)(W,Y)

Where Yn and Y are the graph and [[graphon convolution]] outputs respectively.

Proof

Note

We want to show the outputs on the graph convolutions converge in the "[[induced graphon signal]]" sense to the outputs of the [[graphon convolution]]

  • ie, want to show ||YnY||20 where Yn is the [[induced graphon signal]] for Yn.

WLOG, we assume that |h(λ)|1λ[1,1]. We have

Y=jZ{0}h(λj)X^jφj and Yn=jZ{0}h(λj(n))[X^n]jφj(n)
note on notation

λj,Xj,φj are the eigenvalues, eigenfunctions of the eigenvectors of the graphs in the sequence of convergent graphs and convergent graph signals.

Define an partitioning index set C such that

{jC={i s.t. |λi|c}jC={i s.t. |λi|<c} where c=1|h0|L(2||X||ϵ1+1)

Where we fix some ϵ>0, h0=h(0), and L is the Lipschitz constant. We can choose ϵ such that c=kk[1,1]. Further, note that this creates a set of bandlimited components (C) and non-bandlimited components (Cc) of the signal.

Then, by the [[triangle inequality]], we have

||YYn||=||jZ{0}h(λj)X^jφjjZ{0}h(λj(n))[X^n]jφj(n)||||jCh(λjX^jφj)jCh(λj(n))[X^n]jφj(n)||(1)+||jCh(λjX^jφj)jCh(λj(n))[X^n]jφj(n)||(2)

And we can bound (1) and (2) individually.

It is very easy to bound (1), since this is equivalent to filter applications to a bandlimited X with bandwith c. Thus, (1) vanishes as a direct applications of the previous result ([[bandlimited convergent graph signals converge in the fourier domain]]) to this expression.

For (2), we note that

h0Lch(λj)h0+LcjC

Then we can write

(2)=||jCh(λjX^jφj)jCh(λj(n))[X^n]jφj(n)||(h0+Lc)||jCh(λjX^jφj)jCh(λj(n))[X^n]jφj(n)||h0||jCX^jφj[X^n]jφj(n)||(3)+Lc||jCX^jφj||()Lc||X||+Lc||jC[X^n]jφj(n)||(4)

We can see () follows since jCX^jφj is a portion of graphon signal X=jZ{0}X^jφj. And so, we can upper bound the norm of the portion of the signal by the norm of the entire signal.

Now, we need to bound (3) and (4).

Note that we can write

()jC[X^n]jφj(n)=XnjC[X^n]jφj(n)

We can do this because the φ form an orthonormal spectral theorem).

For (3), by writing both the [[induced graphon signal]] and the limiting [[graphon signal]] in terms of the identity (), we can get

||jCX^jφj[X^n]jφj(n)||=||(XjCX^jφj)(XnjC[X^n]jφj(n))||||XXn||XnX+||jCX^jφj[X^n]jφj(n)||converges (bandlimited signal)<ϵ

Where ϵ is the same one we fixed above. Since both RHS terms converge (the first is from our initial conditions, the second one is again a direct application of bandlimited convergence), we are done for (3).

Finally, for (4),

||jC[X^n]jφj(n)||=||XnjC[X^n]jφj(n)+XX||||XnX||XnX+||jC[X^n]jφj(n)X^jφj||converges (bandlimited)+||jCX^jφj||||X||<ϵ+||X||

Thus we get that

(2)=||jCh(λjX^jφj)jCh(λj(n))[X^n]jφj(n)||Lc||X||+h0ϵ+Lc(ϵ+||X||)<ϵ~

for all n>Nϵ

see [[lipschitz graph convolutions of graph signals converge to lipschitz graphon filters]]

Summary

For node-domain convergence, we first saw that graphons have infinite-dimensional spectra, this can be quite strict and therefore not super useful. Instead, we replaced the bandlimited assumption with a Lipschitz continuity requirement.

It was difficult to show convergence for the GFT for spectral components associated with eigenvalues close to 0 (see conjecture 1 from Lecture 17). This is why we showed instead that [[bandlimited convergent graph signals converge in the fourier domain]].

Here, the same thing happens because accumulate at 0. The Lipschitz continuity addresses this by ensuring all spectral components near 0 are amplified in an increasingly "similar" way. We can see this by looking at how we defined c in the proof:

c=1|h0|L(2||X||ϵ1+1)

convergence-discriminability tradeoff

In HW3:

Transferability:
If we want to make an error of at most ϵ ...

Transferability of Graph Convolutions and GNNs

The asymptotic convergence of the lipschitz graph filters reveals an important tradeoff between convergence/transferability and discriminability. We can compare this to the [[stability-discriminability tradeoff for Lipschitz filters]].

c-band cardinality

c-band cardinality of a graphon eigenvalues with magnitude greater than c

nc(W)=|{λi:|λi|>c}|

Note that this quantity is finite since the [[graphon shift operator eigenvalues]] accumulate only at 0.

see [[c band cardinality]]

c-eigenvalue margin

The c-eigenvalue margin of graphons W and W is the minimum difference between and [[eigenvalue]] of W outside of the c band and an eigenvalue of W:

δc(W,W)=minij{|λi(TW)λj(TW)|:|λi(TW)|c}

see [[c eigenvalue margin]]

Non-asymptotic convergence of graph convolutions

Theorem

Let (Gn,Xn)(W,X) and let graph and graphon convolutions be such that

H(Gn)=k=0hk(Ann)k and TH=k=0hkTWk

Further, assume

  1. ||X||1 (WLOG)
  2. |h(λ)|<1 and h(λ) is LLipschitz in [1,c][c,1] and Lipschitz in (c,c)

Then we have

||YnY||(L+πncδc)||TWTWn||+(Lc+2)||XXn||+2c
Note

note that this bound depends on

We already know that these two things converge (we've already discussed the (slow) convergence rate for Wn in induced [[graph sequence converges if and only if the induced graphon sequence converges]]). We can bring the rates for both quantities to O(1n) by introducing more restrictive requirements.

Here, the convergence-discriminability tradeoff is explicit. We can see that larger L and smaller c (more discriminative filters) lead to a higher error bound.

The last term 2c does not vanish. We can think of this as the "approximation error" that comes from the fact that the eigenvalues of the graphs converge non-uniformly (recall this conjecture).

In the finite sample regime, unless =0, there is always leftover "nontransferable energy" corresponding to the spectral components with |λi|<c which do not converge

see [[convergence bound for graph convolutions]]

Review

#flashcards/math/dsg

Even though graph convolutions converge in the spectral domain, this is not enough for our purposes. Why is that?
-?-
graph convolutions and graphon convolutions operate in the node domain (not the spectral domain)

What does the lipschitz condition achieve in [[lipschitz graph convolutions of graph signals converge to lipschitz graphon filters]]?
-?-
it ensures all spectral components near 0 are amplified in an increasingly "similar" way.

[!summary]
For node-domain convergence, we first saw that {ass||graphons have {has||infinite-dimensional spectra}, this can be quite strict and therefore not super useful. Instead, we replaced the {ass||bandlimited assumption} with {hha||a Lipschitz continuity requirement}.
const { dateTime } = await cJS()

return function View() {
	const file = dc.useCurrentFile();
	return <p class="dv-modified">Created {dateTime.getCreated(file)}     ֍     Last Modified {dateTime.getLastMod(file)}</p>
}