Data
2025-01-27
See Lecture 2.pdf
Today
Housekeeping
Scribing sign-up sheet, choose whichever lecture you want by Wednesday
A few lectures cancelled
Last time
- graphs
- graph signals/data with graphs
- network diffusion process/ the way we can process data
- general definition of graph diffusion with graph shift operators
the most common operators were the adjacency matrix and the graph laplacian
Today
- interpretation of the graph laplacian
- total variation energy of graph signals
- graph frequencies and oscillation modes
- the graph fourier transform
- graph convolution (maybe)
1. Graph Signals and Graph Signal Processing
1.1 Interpretation of the (left)/directed graph laplacian
We begin with an example
Example
Consider a 5-cycle digraph with some signal . We can think of this as a periodic signal sampled at 5 discrete time intervals.
- For an -cycle digraph, we can think of the graph as a periodic signal sampled at discrete time intervals
Recall the definition of the adjacency matrix:
Adjacency Matrix
the adjacency matrix for the 5-cycle digraph is The (left) laplacian is
w & 0 & 0 & 0 & -w \ -w & w & 0 & 0 & 0 \ 0 & -w & w & 0 & 0 \ 0 & 0 & -w & w & 0 \ 0 & 0 & 0 & -w & w \end{bmatrix}$$
Let . Then is sampled with period . Let . Then
\frac{1}{\Delta t}(x_{i}-x_{i-1}), ;;;z \leq i \leq 5 \ \frac{1}{\Delta t}(x_{1}-x_{5}),;;;;;;i = 5 \end{cases}$$
remind us of?
What does this definition of
derivative of a function (!!!)
This looks like the
Interpretation of the (left) Graph Laplacian
The graph laplacian generalizes differentiation to arbitrary graphs.
1.2 Interpretation for the normalized symmetric laplacian
Similar to the graph laplacian, we can build an interpretation for the normalized symmetric laplacian.
Symmetric Laplacian
The symmetric adjacency matrix is created by converting a directed graph into an undirected graph. The symmetric laplacian is calculated as the laplacian using the symmetrized adjacency matrix
(see symmetric laplacian)
We will see that the (normalized) symmetric laplacian generalizes the idea of total variation energy for graph signals.
Total Variation Energy
The total variation energy of discrete-time signals is defined as
This is a good proxy for the signal frequency (and in fact can be used to estimate it)
The example on the left with longer period has lower TV. The example on the right with shorter period has higher
High TV → high frequency Low TV → low frequency
see also total variation distance
(see total variation energy)
We first start with our interpretation of signals on digraphs as discrete time periodic data.
Claim
The total variation energy of graph signal is the (euclidean) inner product with laplacian kernel. ie. Let be an -cycle digraph with (ie unweighted graph) and let be a graph signal. Let be the symmetric laplacian of . Then
Proof
x^TLx &= x^T(Lx) \\ &= \langle x, Lx \rangle \end{aligned}$$ For a di-graph on $n$ nodes, this becomes $$(x_{1}-x_{n})^2+(x_{2}-x_{1})^2+(x_{3}-x_{2})^2+\dots+(x_{n}-x_{n-1})^2 = \sum_{i}(x_{i} - x_{i-1})^2 = TV(x)$$ >[!example] Example (using our 5-cycle digraph) >![[Attachments/2025-01-27_graph2.jpg]]
Interpretation of the Laplacian Inner Product
generalizes the notion of total variation energy (and thus of signal frequency) to arbitrary graphs.
General method for getting our interpretations:
- Recognizing that can represent time signals as cycle graphs →
- Dy using the definitions of notions of differentiation and TVE/inner product, we can recover the classical definitions in this setting
- We then carry these interpretations to arbitrary graphs.
Question
What if the signal frequency is higher than the number of nodes graph?
Answer
This is addressed in signal processing by the nyquist-shannon theorem. The gist is that we need the sampling period to be small enough to meaningfully capture the signal. And, for periodic signals, we can calculate this explicitly.
1.3 Building to the graph fourier transform 😀
Recall for (now arbitrary) signal on graph we have . Suppose
Question
What are the lowest and highest values can take?
be the eigenvalues of , ordered by increasing value.
Let
- Note that and is positive semidefinite and this is the minimum value for
- The highest value can take is
Show the symmetrized
Important
The symmetric laplacian/TVE gives us a range of values for the frequency of the signal of the graph.
- The eigenvalues are the graph’s canonical frequencies
- the eigenvectors are the oscillation modes of those frequencies.
Tip
From now on, we will assume that the laplacian is the symmetrized version.
Example
Now, let’s consider the 4 digraph. Then
-1 & 2 & -1 & 0 \\ 0 & -1 & 2 & -1 \\ -1 & 0 & -1 & 2 \end{bmatrix}$$ ![[Attachments/2025-01-27_graph3.jpg]]
Since is symmetric, it is always diagonalizable (see why). As we saw above, and its corresponding eigenvector is . Thus, the eigenvector will be some multiple of
For the example above,
--- start-multi-column
number of columns: 2
Shadow: off
Border: on
Column Spacing: 0We can visualize these vectors as (discrete) time signals, where each of the nodes of the graphs correspond to the sampled locations.
What do we notice?
- As the eigenvalue increases, the frequency of the signal increases
- has frequency
--- end-column ---

--- end-multi-column
An arbitrary graph is an algebraic extension of a cyclic graph. We can think of a general graph as a discrete sampling of some underlying data manifold.
Notes
Since and is orthogonal, its columns/the eigenvectors (or oscillation modes) form a basis for
- ie, we can represent any graph signal in this basis
More generally, this is true not just for the laplacian (which is symmetric and thus always diagonalizable), but for any diagonalizable graph shift operator .
Example
- adjacencies of undirected graph
- adjacencies of directed graphs, random walk matrix adjacencies and general graph laplacians with distinct eigenvalues
(recall that a matrix is diagonalizable if it has all distinct eigenvalues)
Graph Fourier Transform
Given a diagonalizable graph shift operator and a signal , the projection of onto is called the graph fourier transform of . ie, and (where is the th column of )
(see graph fourier transform)
Example
Back to our 5 cycle example. Let’s define a diagonalizable operator as follows:
- The eigenvalues are the complex exponentials
- The eigenvectors are given by the fourier basis, ie is given by
The GFT here is then And for general cyclical graphs on nodes: And this is precisely the definition of the discrete Fourier transform(!!) ie, we can recover the classical definition for the (discrete) Fourier transform using our definition for the graph version.
Main Takeaway
The graph fourier transform generalizes the Fourier Transform in euclidian space into graph signal space.
Notes
In general, the interpretation of the adjacency eigenvalues as graph frequencies is not as clean as it is for the laplacian.
- The eigenvectors of and do not generally match, and there is no alternative definition in terms of .
In practice, it is generally true that low-magnitude adjacency eigenvalues correspond to high graph frequencies and vice versa.
When considering the normalized adjacency matrix and the normalized laplacian, it is easy to see that the eigenvectors are the same.
- But in this case, the adjacency eigenvalues can NOT be interpreted as graph frequencies.
Consider a 5-cycle digraph with some signal
The example on the left with longer period has lower TV. The example on the right with shorter period has higher