[DSP] W02 - Vector Spaces
contents
discrete signal vectors
- a generic discrete signal:
- x[n]=…,1.23,−0.73,0.89,0.17,−1.15,−0.26,…
- an ordered sequence of numbers
- four classes of signals
- finite length
- infinite length
- periodic
- finite support
- digital signal processing:
- signal analysis
- signal synthesis
- number sets:
- N: natural numbers [1,∞)
- whole numbers [0,∞)
- Z: integers
- Q: rational numbers
- i.e. numbers with terminating or recurring decimal expansions
- P: irrational numbers
- non-terminating, non-recurring decimal expansion
- e.g. π, √2
- R: real numbers (everything on the number line)
- includes rational and irrational numbers
- C: complex numbers
- includes real and imaginary numbers
fig: number sets
vector space framework:
- justification for dsp application:
- common framework for various signal classes
- inclusive of continuous-time signals
- easy explanation of Fourier Transform
- easy explanation of sampling and interpolation
- useful in approximation and compression
- fundamental to communication system design
- paradigms for vector space application to discrete signals
- object oriented programming
- the instantiated object can have unique property values
- but for a given object class, the properties and methods are the same
- lego
- various unit blocks, same and different
- units assembled in different ways to build different complex structures
- similarly, a complex signal is broken down into a combination of basis vectors for analysis
- key takeaways
- vector spaces are general objects
- vector spaces are defined by their properties
- once signal properties satisfy vector space conditions
- vector space tools can be applied to signals
vector spaces
- some vector spaces
- R2: 2D space
- R3: 3D space
- RN: N real numbers
- CN: N complex numbers
- ℓ2(Z):
- square-summable infinite sequences
- L2([a,b]):
- square-integrable functions over interval [a,b]
- vector spaces can be diverse
- some vector spaces can be represented graphically
- helps visualize the signal for analysis insights
- R2: x = [x_0, x_1]^T
- R3: x = [x_0, x_1, x_2]^T
- both can be visualized in a cartesian system
fig: vector in 2D space
- L2([a,b]): e.g. x = x(t), t∈[−1,1]
- function vector space
- can be represented as a sine wave along time
fig: L2 function vector
- others cannot be represented graphically:
- RN for N>3
- CN for N>1
vector space axioms
- informally, a vector space:
- has vectors in it
- has scalars in it, like C
- scalars must be able to scale vectors
- vector summation must work
- formally, ∀ x,y,z∈V, and α,β∈C (V: vector space)
- x+y=y+x
- commutative addition
- (x+y)+z=x+(y+z)
- associative addition
- α(x+y)=αx+αy
- distributive scalar multiplication over vector addition
- (α+β)x=αx+βx
- distributive scalar multiplication over scalar addition
- α(βx)=(αβ)x
- associative scalar multiplication
- ∃ 0∈V | x+0=0+x=x
- null vector, 0, exists
- addition of 0 and another vector x returns x
- summation with null vector is commutative
- ∀ x∈V ∃ (−x) | x+(−x)=0
- an inverse vector exists in the vector space such that adding the vector to its inverse yields the null vector
- examples:
- RN: vector of N real numbers
- two vectors from this space look like:
- x=[x_0, x_1, x_2, …, x_{N−1}]^T
- y=[y_0, y_1, y_2, …, y_{N−1}]^T
- the above mentioned rules apply to these vectors and can be verified (see the sketch after this list)
- L2[−1,1]
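These axioms can be checked numerically for concrete vectors; a minimal numpy sketch (the vector length, values, and scalars below are illustrative choices, not from the notes):

```python
import numpy as np

rng = np.random.default_rng(0)
N = 5
x, y, z = rng.standard_normal((3, N))   # three random vectors in R^N
a, b = 2.0 + 1.0j, -0.5 + 0.3j           # two (complex) scalars

# commutative and associative addition
assert np.allclose(x + y, y + x)
assert np.allclose((x + y) + z, x + (y + z))

# distributivity and associativity of scalar multiplication
assert np.allclose(a * (x + y), a * x + a * y)
assert np.allclose((a + b) * x, a * x + b * x)
assert np.allclose(a * (b * x), (a * b) * x)

# null vector and additive inverse
zero = np.zeros(N)
assert np.allclose(x + zero, x)
assert np.allclose(x + (-x), zero)
print("all vector space axioms hold numerically")
```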
inner product
- aka dot product: measure of similarity of two vectors
- ⟨⋅,⋅⟩:V×V→C
- takes two vectors and outputs a scalar which indicates how similar the two vectors are
- inner product axioms
- ⟨x+y,z⟩=⟨x,z⟩+⟨y,z⟩
- distributive over vector addition
- ⟨x,y⟩=⟨y,x⟩∗
- commutative with conjugation
- ⟨x,αy⟩=α⟨x,y⟩
- distributive with scalar multiplication
- when scalar scales the second operand
- ⟨αx,y⟩=α∗⟨x,y⟩
- distributive with scalar multiplication
- the scalar is conjugated if it scales the first operand
- ⟨x,x⟩≥0
- the self inner product is real and non-negative (∈ R)
- ⟨x,x⟩=0⇔x=0
- if self inner product is 0, then the vector is the null vector
- if ⟨x,y⟩=0 and x,y≠0,
- then x and y are orthogonal
- inner product is computed differently for different vector spaces
- in R2 vector space:
- ⟨x,y⟩ = x_0 y_0 + x_1 y_1 = ∥x∥ ∥y∥ cos α
- where α: angle between x and y
- when two vectors are orthogonal to each other
- α=90∘, so cos90∘=0, so ⟨x,y⟩=0
- in L2[−1,1] vector space:
- ⟨x,y⟩ = ∫_{−1}^{1} x(t) y(t) dt
- e.g. for x(t) = sin(πt), the squared norm is ⟨x,x⟩ = ∥x∥^2 = ∫_{−1}^{1} sin^2(πt) dt
- the inner product of a symmetric and an anti-symmetric function is 0
- i.e. they are orthogonal to each other: neither can be expressed as a scaled version (or component) of the other
- example 1:
- x=sin(πt) - anti-symmetric
- y=1−|t| - symmetric
- ⟨x,y⟩ = ∫_{−1}^{1} sin(πt) (1−|t|) dt = 0
fig: inner product of a symmetric and an anti-symmetric function
- example 2:
- x=sin(4πt)
- y=sin(5πt)
- ⟨x,y⟩ = ∫_{−1}^{1} sin(4πt) sin(5πt) dt = 0
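Both orthogonality examples are easy to verify numerically; a minimal sketch using scipy quadrature (the helper name `inner` is illustrative):

```python
import numpy as np
from scipy.integrate import quad

def inner(f, g, a=-1.0, b=1.0):
    """Inner product on L2([a, b]): integral of f(t) * g(t) over [a, b]."""
    val, _ = quad(lambda t: f(t) * g(t), a, b)
    return val

# example 1: anti-symmetric sin(pi t) vs symmetric 1 - |t|
print(inner(lambda t: np.sin(np.pi * t), lambda t: 1 - abs(t)))                  # ~0

# example 2: sin(4 pi t) vs sin(5 pi t)
print(inner(lambda t: np.sin(4 * np.pi * t), lambda t: np.sin(5 * np.pi * t)))   # ~0

# R^2 inner product: x0*y0 + x1*y1 = |x||y| cos(angle)
x, y = np.array([1.0, 0.0]), np.array([0.0, 2.0])
print(np.dot(x, y))   # 0 -> orthogonal vectors
```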
norm and distance
- norm of a vector:
- inner product of a vector with itself
- gives the square of the norm (length) of the vector
- ⟨x,x⟩ = x_0^2 + x_1^2 = ∥x∥^2
- distance between two vectors:
- the norm of the difference of the two vectors
- the distance between orthogonal vectors is not zero
- in R2, norm is the distance between the vector end points
- x−y is the difference vector; ∥x−y∥ is its length
- ∥x−y∥ = √((x_0−y_0)^2 + (x_1−y_1)^2)
- connects the end points of the vectors x and y
- see triangle rule of vector addition, and pythagorean theorem
- in L2[−1,1], the squared norm of the difference gives the mean-squared error:
- ∥x−y∥^2 = ∫_{−1}^{1} |x(t)−y(t)|^2 dt
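A small numerical illustration of both distance notions; the vectors and functions below are arbitrary illustrative choices:

```python
import numpy as np
from scipy.integrate import quad

# distance in R^2: norm of the difference vector
x, y = np.array([2.0, 1.0]), np.array([1.0, 0.0])
print(np.linalg.norm(x - y))   # sqrt((2-1)^2 + (1-0)^2) = sqrt(2)

# distance in L2([-1, 1]): norm of the difference of two functions
f = lambda t: np.sin(np.pi * t)
g = lambda t: t
sq_err, _ = quad(lambda t: (f(t) - g(t)) ** 2, -1, 1)
print(np.sqrt(sq_err))         # L2 distance between sin(pi t) and t on [-1, 1]
```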
signal spaces
completeness
- consider an infinite sequence of vectors in a vector space
- if it converges to a limit within the vector space
- then said vector space is “complete”
- a complete inner-product space is called a Hilbert space
- the limiting operation depends on the space, so the definition of convergence varies from one space to another
- some limiting operations may therefore fail, i.e. converge to a point outside the vector space
- such vector spaces are not said to be complete
common signal spaces
- while vector spaces can be applied to signal processing
- not all vector spaces can be used for all signals
- different signal classes are managed in different spaces
- CN: vector space of complex N-tuples
- valid signal space for finite length signals
- vector notation: x=[x_0, x_1, …, x_{N−1}]^T
- where x_0, x_1, …, x_{N−1} are complex numbers
- also valid for periodic signals
- vector notation: x̃
- all operations are well defined and intuitive
- inner product: ⟨x,y⟩ = ∑_{n=0}^{N−1} x*[n] y[n]
- well defined for all finite-length vectors in CN
- extending this inner product to infinite-length signals, the sum can diverge (explode)
- so CN is inappropriate for infinite-length signal analysis
- ℓ2(Z): vector space of square-summable sequences
- requirement for sequences to be square-summable:
- ∑_n |x[n]|^2 < ∞
- the sum of the squares of the elements of the sequence is finite
- all sequences that live in this space have finite energy
- “well-behaved” infinite-length signals live in ℓ2(Z)
- vector notation: x=[…, x_{−2}, x_{−1}, x_0, x_1, …]^T
- many other interesting infinite-length signals do not live in ℓ2(Z)
- examples:
- x[n]=1
- x[n]=cos(ωn)
- these have to be dealt with case-by-case
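A quick numerical check of which sequences are square-summable; the truncation lengths are arbitrary and only meant to show whether the partial energies converge:

```python
import numpy as np

def partial_energy(x, N):
    """Energy of the first N samples of a sequence given as a function of n."""
    n = np.arange(N)
    return np.sum(np.abs(x(n)) ** 2)

for N in (10, 1000, 100000):
    # x[n] = (1/2)^n is square-summable: partial energies converge (to 4/3)
    # x[n] = 1 and x[n] = cos(0.1 n) are not: partial energies keep growing with N
    print(N,
          partial_energy(lambda n: 0.5 ** n, N),
          partial_energy(lambda n: np.ones_like(n, dtype=float), N),
          partial_energy(lambda n: np.cos(0.1 * n), N))
```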
basis
- a basis is a building block of a vector space
- a vector space has sets of such basis vectors; each such set is called a basis
- like the lego unit blocks
- any element in a vector space can be
- built with a combination of these bases
- decomposed into a linear combination of these bases
- the basis of a space is a family of vectors which are least like each other, i.e. linearly independent
- but they all belong to the same space
- as a linear combination, the basis vectors capture all the information within their vector space
- fourier transform is simply a change of basis
vector families
- w(k): family of vectors
- k: index of the basis vector in the family
- canonical R2 basis: ek
- e(0)=[1 0]; e(1)=[0 1]
- this family of basis vectors is denoted by ek
- any vector in R2 can be expressed as a linear combination of e(k)
- [x0 x1]=x0[1 0]+x1[0 1]
- x=x0e(0)+x1e(1)
- graphical example:
- [2 1]=2[1 0]+1[0 1]
- x=2e(0)+1e(1)
fig: linear combination of canonical e(k) in R2
- non-canonical R2 basis example: vk
- v(0)=[1 0]; v(1)=[1 1]
- any vector can be expressed as a linear combination of these vectors in R2
- the coefficients of the bases will be different compared to the canonical bases
- graphical example:
- [2 1]=αv(0)+βv(1)
- [2 1]=α[1 0]+β[1 1]
- by rule of parallelogram vector addition
- α=1; β=1
fig: linear combination of non-canonical vk in R2
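The coefficients in the non-canonical basis can equivalently be found by solving a small linear system; a sketch for the example above (the matrix and vector values come from that example, the code structure is illustrative):

```python
import numpy as np

# columns are the non-canonical basis vectors v(0) = [1, 0], v(1) = [1, 1]
V = np.array([[1.0, 1.0],
              [0.0, 1.0]])
x = np.array([2.0, 1.0])

# solve V @ [alpha, beta] = x for the expansion coefficients
alpha, beta = np.linalg.solve(V, x)
print(alpha, beta)   # 1.0 1.0, matching the parallelogram construction
```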
- only vectors which are linearly independent can be the basis vectors of a space
- bases of infinite-dimensional spaces:
- some restrictions (e.g. convergence of the expansion) have to be imposed to obtain valid bases in infinite dimensions
- x = ∑_{k=0}^{∞} α_k w(k)
- a canonical basis of ℓ2(Z):
- e(k)[n] = δ[n−k], k∈Z
- i.e. the sequence that is 1 at the k-th position and 0 everywhere else
- function vector spaces:
- basis expansion for functions: f(t) = ∑_k α_k h(k)(t)
- the fourier basis for functions over the interval [−1,1]:
- 1/√2, cos πt, sin πt, cos 2πt, sin 2πt, cos 3πt, sin 3πt, …
- any square-integrable function on [−1,1] can be represented as a linear combination of the fourier basis vectors
- e.g. a square wave can be expressed as a sum of sines
- formally, in a vector space H,
- a set of K vectors from H, W = {w(k)}, k=0,1,…,K−1, is a basis for H if:
- ∀ x∈H: x = ∑_{k=0}^{K−1} α_k w(k), α_k∈C
- the coefficients α_k are unique
- this implies linear independence of the basis vectors
- ∑_{k=0}^{K−1} α_k w(k) = 0 ⇔ α_k = 0, k=0,1,…,K−1
orthonormal basis
- the orthogonal bases are the most important
- of all possible bases for a vector space
- orthogonal basis: ⟨w(k),w(n)⟩=0 for k≠n
- vectors of an orthogonal basis are mutually orthogonal
- their inner product with each other is zero
- in some spaces, the orthogonal bases are also orthonormal
- i.e. they are unit norm
- their length ∥w(k)∥=1
- the inner product of any two vectors of an orthonormal basis is the kronecker delta of the difference of their indices
- ⟨w(k),w(n)⟩=δ[n−k]
- the gram-schmidt algorithm can be used to orthonormalize any basis (any linearly independent family of vectors)
- obtaining the expansion coefficients α_k for an arbitrary basis can be involved and challenging
- x = ∑_{k=0}^{K−1} α_k w(k)
- x: a vector as the linear combination of K basis vectors w(k),
- with corresponding coefficients αk
- however, they are easy to obtain with an orthonormal basis (see the sketch below)
- α_k = ⟨w(k), x⟩
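A minimal sketch of this in R3 with an illustrative orthonormal basis (the rotation angle used to build it is arbitrary):

```python
import numpy as np

# an orthonormal basis of R^3 (rows): canonical e(0) plus two rotated vectors
W = np.array([[1.0, 0.0, 0.0],
              [0.0, np.cos(0.3), np.sin(0.3)],
              [0.0, -np.sin(0.3), np.cos(0.3)]])
assert np.allclose(W @ W.T, np.eye(3))   # <w(k), w(n)> = delta[n - k]

x = np.array([1.0, 2.0, -1.0])
alpha = W @ x                            # alpha_k = <w(k), x>
x_rebuilt = W.T @ alpha                  # x = sum_k alpha_k w(k)
assert np.allclose(x_rebuilt, x)
print(alpha)
```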
change of basis
- x = ∑_{k=0}^{K−1} α_k w(k) = ∑_{k=0}^{K−1} β_k v(k)
- v(k) is the target basis, w(k) is the original basis
- if v(k) is orthonormal:
- β_h = ⟨v(h), x⟩
- = ⟨v(h), ∑_{k=0}^{K−1} α_k w(k)⟩
- = ∑_{k=0}^{K−1} α_k ⟨v(h), w(k)⟩
- = ∑_{k=0}^{K−1} α_k c_{hk}
- in matrix form: β = C α, where C is the K×K matrix with entries c_{hk} = ⟨v(h), w(k)⟩
- this forms the core of the discrete fourier transform algorithm for finite length signals
- can be applied to elementary rotations of basis vectors in the euclidean plane
- the same vector has different coefficients in the original and the rotated bases
- the rotation (change-of-basis) matrix has entries given by the inner products of the target basis vectors with the original basis vectors (see the sketch after this list)
- the rotation matrix applied to a vector in the original bases yields the coefficients of the same vector in the rotated bases
- the matrix multiplication of the rotation matrix with its inverse yields the identity matrix
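A sketch of such a change of basis as a plane rotation; the 30° angle and the test vector are arbitrary illustrative values:

```python
import numpy as np

theta = np.pi / 6                      # rotate the canonical basis by 30 degrees
# target basis vectors v(0), v(1) as rows
V = np.array([[np.cos(theta), np.sin(theta)],
              [-np.sin(theta), np.cos(theta)]])
W = np.eye(2)                          # original (canonical) basis e(0), e(1) as rows

# change-of-basis matrix: c_hk = <v(h), w(k)>
C = V @ W.T

alpha = np.array([2.0, 1.0])           # coefficients of x in the original basis
beta = C @ alpha                       # coefficients of the same x in the rotated basis

# same vector, reconstructed from either basis
assert np.allclose(W.T @ alpha, V.T @ beta)
# the rotation matrix times its inverse is the identity
assert np.allclose(C @ np.linalg.inv(C), np.eye(2))
print(beta)
```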
subspaces
- subspaces can be applied to signal approximation and compression
- with vector x∈V and subspace S⊆V
- approximate x with x̂∈S by
- taking the projection of the vector x onto S
- due to the adaptation of vector space paradigm for signal processing
- this geometric intuition for approximation can be extended to arbitrarily complex vector spaces
vector subspace
- a subspace is a subset of vectors of a vector space closed under addition and scalar multiplication
- classic example:
- R2⊂R3
- in-plane vector addition and scalar multiplication operations do not result in vectors outside the plane
- R2 uses only 2 of the 3 orthonormal basis vectors of R3
- the subspace concept can be extended to other vector spaces
- L2[−1,1]: function vector space
- subspace: set of symmetric functions in L2[−1,1]
- sums (and scalar multiples) of symmetric functions are again symmetric, so the subset is closed
- subspaces have their own bases
- often chosen as a subset of the parent space’s basis vectors
least square approximations
- {s(k)}, k=0,1,…,K−1: an orthonormal basis for S
- orthogonal projection:
- x̂ = ∑_{k=0}^{K−1} ⟨s(k), x⟩ s(k)
- the orthogonal projection is the “best” approximation of x over S
- because of two of its properties
- it has minimum-norm error:
- x̂ = arg min_{y∈S} ∥x−y∥
- the orthogonal projection minimizes the error between the original vector and the approximated vector
- this error is orthogonal to the approximation:
- ⟨x−x̂, x̂⟩ = 0
- the error and the basis vectors of the subspace are maximally different
- they are uncorrelated
- the basis vectors cannot capture any more information in the error
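A small numerical illustration of both properties, projecting a vector of R3 onto a 2-D subspace (the subspace and vector are arbitrary illustrative choices):

```python
import numpy as np

# orthonormal basis of a 2-D subspace S of R^3 (rows s(0), s(1))
S = np.array([[1.0, 0.0, 0.0],
              [0.0, 1.0, 0.0]])
x = np.array([2.0, -1.0, 3.0])

# orthogonal projection: x_hat = sum_k <s(k), x> s(k)
x_hat = S.T @ (S @ x)

# property 1: the error is orthogonal to the approximation
err = x - x_hat
print(np.dot(err, x_hat))                                 # ~0

# property 2: minimum-norm error among (random) competitors in S
rng = np.random.default_rng(1)
competitors = (S.T @ rng.standard_normal((2, 1000))).T    # random vectors in S
print(np.linalg.norm(err) <= np.linalg.norm(x - competitors, axis=1).min())  # True
```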
- example: polynomial approximation
- approximating from vector space L2[−1,1] to PN[−1,1]
- i.e. from the vector space of square-integrable functions to a subspace of polynomials of degree at most N−1
- generic element of subspace PN[−1,1] has form
- p = a_0 + a_1 t + … + a_{N−1} t^{N−1}
- a naive, self-evident basis for this subspace:
- s(k) = t^k, k=0,1,…,N−1
- not orthonormal, however
approximation with Legendre polynomials
- example goal:
- approximate x=sint∈L2[−1,1] to P3[−1,1]
- P3[−1,1]: polynomials of degree at most 2
- build orthonormal basis from naive basis
- use Gram-Schmidt orthonormalization procedure for naive bases:
- s(k)→u(k)
- s(k): original naive bases
- u(k): orthonormalized naive bases
- this algorithm takes one vector at a time from the original set and incrementally produces an orthonormal set
- p(k) = s(k) − ∑_{n=0}^{k−1} ⟨u(n), s(k)⟩ u(n)
- the first naive basis vector is simply normalized
- project the second naive basis vector onto the normalized first basis vector
- then subtract this projection from the second basis vector to get p(1), which is then normalized
- this removes the first normalized basis vector’s component from the second naive basis vector
- u(k) = p(k) / ∥p(k)∥
- normalize the extracted vector
- this process yields:
- u(0) = √(1/2)
- u(1) = √(3/2) t
- u(2) = √(5/8) (3t^2 − 1)
- and so on
- these are known (up to normalization) as the Legendre polynomials
- they can be computed up to arbitrary degree,
- for this example, up to degree 2
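The same Gram-Schmidt steps can be run numerically on the naive monomial basis {1, t, t^2}, with the L2([−1,1]) inner product discretized on a grid; the grid size and helper names below are illustrative:

```python
import numpy as np

t = np.linspace(-1, 1, 20001)
inner = lambda f, g: np.trapz(f * g, t)       # discretized L2([-1, 1]) inner product

naive = [np.ones_like(t), t, t ** 2]          # s(0), s(1), s(2) sampled on the grid
ortho = []
for s in naive:
    p = s.copy()
    for u in ortho:                           # subtract projections onto earlier u(n)
        p -= inner(u, s) * u
    ortho.append(p / np.sqrt(inner(p, p)))    # normalize

# compare against the closed forms sqrt(1/2), sqrt(3/2) t, sqrt(5/8) (3t^2 - 1)
print(np.allclose(ortho[1], np.sqrt(3 / 2) * t, atol=1e-3))
print(np.allclose(ortho[2], np.sqrt(5 / 8) * (3 * t ** 2 - 1), atol=1e-3))
```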
- project x onto the orthonormal basis
- simply take the inner product of the original vector x with each legendre polynomial, i.e. with each vector of the orthonormal basis of the P3[−1,1] subspace
- α_k = ⟨u(k), x⟩ = ∫_{−1}^{1} u(k)(t) sin t dt
- α_0 = ⟨√(1/2), sin t⟩ = 0
- α_1 = ⟨√(3/2) t, sin t⟩ ≈ 0.7377
- α_2 = ⟨√(5/8)(3t^2−1), sin t⟩ = 0
- compute approximation error
- so using the orthogonal projection
- sin t → α_1 u(1) ≈ 0.9035 t
- only one basis vector contributes a non-zero coefficient to this approximation:
- √(3/2) t
- compare error to taylor’s expansion approximation
- well known expansion, easy to compute but not optimal over interval
- taylor’s approximation: sin t ≈ t
- in both cases, the approximation is a straight line, but the slopes are slightly different (≈ 10% off)
- the taylor’s expansion is a local approximation around 0,
- the legendre polynomials method minimizes the global mean-squared-error between the approximation and the original vector
- the legendre approximation has a larger error around 0
- however, the total energy of its error over the interval is lower than that of the taylor expansion
- error norm:
- legendre polynomial based approximation:
- ∥sin t − α_1 u(1)∥ ≈ 0.0337
- taylor series based approximation:
- ∥sin t − t∥ ≈ 0.0857
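The projection coefficient and the two error norms quoted above can be reproduced numerically; a sketch using scipy quadrature (helper names are illustrative):

```python
import numpy as np
from scipy.integrate import quad

inner = lambda f, g: quad(lambda t: f(t) * g(t), -1, 1)[0]   # L2([-1, 1]) inner product
norm = lambda f: np.sqrt(inner(f, f))

u1 = lambda t: np.sqrt(3 / 2) * t                  # the only basis vector with a non-zero coefficient
alpha1 = inner(u1, np.sin)                         # ~0.7377
print(alpha1, alpha1 * np.sqrt(3 / 2))             # slope of the projection, ~0.9035

# error norms: orthogonal projection vs Taylor approximation sin t ~ t
print(norm(lambda t: np.sin(t) - alpha1 * u1(t)))  # ~0.0337
print(norm(lambda t: np.sin(t) - t))               # ~0.0857
```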
haar spaces
- haar spaces are matrix spaces
- note: matrices can be reshaped for vector operations
- encodes matrix information in a hierarchical way
- finds application in image compression and transmission
- it has two kinds of basis matrices
- the first one encodes the broad information
- the rest encode the details, which get finer by the basis index
- each basis matrix has positive and negative values in some symmetric pattern
- the basis matrix will implicitly compute the difference between image areas
- low-index basis matrices take differences between large areas
- high-index matrices take differences in smaller, localized areas
- this is a more robust way of encoding images for transmission over channels prone to losses
- if images are transmitted as simple matrices, they are prone to being chopped if communication loss occurs during transmission
- haar encoding transmits coefficients not pixel by pixel but hierarchically in the level of detail
- so if communication loss occurs, the broad idea of the image is still conveyed
- while continued transmission will push up the detail level
- approximation of matrices in haar space is an example of progressive encoding
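As a rough illustration of the idea, a single-level 2-D Haar decomposition splits an image into one coarse average band plus detail (difference) bands; this is a generic sketch, not necessarily the exact basis matrices used in the lecture:

```python
import numpy as np

def haar2d_level(img):
    """One level of the 2-D Haar transform on an image with even dimensions."""
    a = img[0::2, 0::2]               # top-left pixel of each 2x2 block
    b = img[0::2, 1::2]               # top-right
    c = img[1::2, 0::2]               # bottom-left
    d = img[1::2, 1::2]               # bottom-right
    ll = (a + b + c + d) / 2          # broad information: local averages
    lh = (a - b + c - d) / 2          # horizontal detail (left minus right)
    hl = (a + b - c - d) / 2          # vertical detail (top minus bottom)
    hh = (a - b - c + d) / 2          # diagonal detail
    return ll, lh, hl, hh

img = np.arange(16, dtype=float).reshape(4, 4)
ll, lh, hl, hh = haar2d_level(img)
# transmitting ll first conveys a coarse version of the image;
# the detail sub-bands then progressively refine it
print(ll)
```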