given where is bandlimited, GFT of converges to WFT ([[bandlimited convergent graph signals converge in the fourier domain]])
For bandlimited graphon signals, we should be able to show convergence in the node domain!
Convergence of bandlimited signals in the node domain
Theorem
Let with bandlimited. Let be a sequence of [[graph convolution]] that share coefficients with [[graphon convolution]] . Then
Where and are the graph and [[graphon convolution]] outputs respectively.
Recall that our definition of a [[convergent sequence of graph signals]] compares the [[induced graphon signal]] and the limiting [[graphon signal]] to see if they converse in an way.
theorem basically says
if we have a sequence that converges in this way and we have these graphon and graph convolutions, the outputs of those convolutions also converge in
See [[graph convolutions of bandlimited signals converge to the graphon convolution]]
Our first result for convergence for graphon signals is restrictive though, since we require that {as||the signal is bandlimited||restriction} and {ha||graphons have infinite-dimensional spectra||why restrictive}.
Can we get a better result? Can we generalize to arbitrary convergent graph signals?
set-up for solution
To show convergence of filters for general signals (not only bandlimited), we instead restrict {as||the filters||what restrict} to {ha||be Lipschitz||restriction}.
This upper bounds the magnitude of the first derivative on the interval
Note
In this statement, our are arbitrary analytic functions. However, in practice, these will be polynomials
On the real line, polynomials are not Lipschitz. However, in a bounded interval, we can always find a Lipschitz constant.
We consider general analytic filters with Lipschitz constant (both for generality and to avoid dependence on the polynomial degree which gives us better/tighter bounds)
see [[Lipschitz graphon filter]]
Improved/more general version
Aside: this is a LONG proof (longest in the class probably). The theorem is basically the same as the one above, but without the bandlimited assumption.
Theorem
Let .
Let be a sequence of [[graph convolution]] that share coefficients with [[graphon convolution]] where is a [[Lipschitz graphon filter]]. Then
Where and are the graph and [[graphon convolution]] outputs respectively.
Proof
Note
We want to show the outputs on the graph convolutions converge in the "[[induced graphon signal]]" sense to the outputs of the [[graphon convolution]]
ie, want to show where is the [[induced graphon signal]] for .
Where we fix some , , and is the Lipschitz constant. We can choose such that . Further, note that this creates a set of bandlimited components () and non-bandlimited components () of the signal.
Then, by the [[triangle inequality]], we have
And we can bound and individually.
It is very easy to bound , since this is equivalent to filter applications to a bandlimited with bandwith . Thus, vanishes as a direct applications of the previous result ([[bandlimited convergent graph signals converge in the fourier domain]]) to this expression.
For , we note that
Illustration
We don't care about the spectral intervals and since they are covered by . Since is -lipschitz, the maximum value the function can change between and is .
Then we can write
We can see follows since is a portion of graphon signal . And so, we can upper bound the norm of the portion of the signal by the norm of the entire signal.
For , by writing both the [[induced graphon signal]] and the limiting [[graphon signal]] in terms of the identity , we can get
Where is the same one we fixed above. Since both RHS terms converge (the first is from our initial conditions, the second one is again a direct application of bandlimited convergence), we are done for .
Finally, for ,
Thus we get that
for all
see [[lipschitz graph convolutions of graph signals converge to lipschitz graphon filters]]
Summary
For node-domain convergence, we first saw that graphons have infinite-dimensional spectra, this can be quite strict and therefore not super useful. Instead, we replaced the bandlimited assumption with a Lipschitz continuity requirement.
It was difficult to show convergence for the GFT for spectral components associated with eigenvalues close to (see conjecture 1 from Lecture 17). This is why we showed instead that [[bandlimited convergent graph signals converge in the fourier domain]].
Here, the same thing happens because accumulate at 0. The Lipschitz continuity addresses this by ensuring all spectral components near 0 are amplified in an increasingly "similar" way. We can see this by looking at how we defined in the proof:
If we fix , in order to have , we need progressively smaller Lipschitz constant . ie, we need flatter and flatter functions
If we want to get smaller (ie, we want the region where the spectral components cannot be discriminated to get smaller), we need a larger .
convergence-discriminability tradeoff
In HW3:
Train a GNN on a subsample of the graph and as we increase the size, the convergence gets better and better
explanation: convolution converges asymptotically
Transferability:
If we want to make an error of at most ...
how do we pick the size of the subgraph to train on?
Can we see a relationship between the size of the graph and the Lipschitz constant ?
Transferability of Graph Convolutions and GNNs
The asymptotic convergence of the lipschitz graph filters reveals an important tradeoff between convergence/transferability and discriminability. We can compare this to the [[stability-discriminability tradeoff for Lipschitz filters]].
the norm difference between the [[graphon]] and the [[induced graphon]]
the the norm difference between the [[graphon signal]] and the [[induced graphon signal]]
We already know that these two things converge (we've already discussed the (slow) convergence rate for in induced [[graph sequence converges if and only if the induced graphon sequence converges]]). We can bring the rates for both quantities to by introducing more restrictive requirements.
Convergence with appropriate node labelling means approximation improves with as expected
The bound grows with
if we want better discriminability, then we need a lower . A lower means
a higher [[c band cardinality]]
a lower [[c eigenvalue margin]]
a higher , and thus a worse convergence bound
ie, the bound is large when the filter is most discriminative
Here, the convergence-discriminability tradeoff is explicit. We can see that larger and smaller (more discriminative filters) lead to a higher error bound.
The last term does not vanish. We can think of this as the "approximation error" that comes from the fact that the eigenvalues of the graphs converge non-uniformly (recall this conjecture).
In the finite sample regime, unless , there is always leftover "nontransferable energy" corresponding to the spectral components with which do not converge
What does the lipschitz condition achieve in [[lipschitz graph convolutions of graph signals converge to lipschitz graphon filters]]?
-?-
it ensures all spectral components near 0 are amplified in an increasingly "similar" way.
[!summary]
For node-domain convergence, we first saw that {ass||graphons have {has||infinite-dimensional spectra}, this can be quite strict and therefore not super useful. Instead, we replaced the {ass||bandlimited assumption} with {hha||a Lipschitz continuity requirement}.