Linear transformations and matrices

Linear Transformation:

Transformation - another word for function.
Linear Transformation - takes a vector and transforms (moves) that vector into another vector.
Linear Transformation - 2 properties:
- All lines remain lines, without getting curved
- The origin stays fixed
As a consequence, the resultant grid lines stay parallel and evenly spaced.
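A minimal sketch of how these properties can be spot-checked in code. The maps `linear_map` and `shift` and the sample vectors are made up for illustration; the check only tests a few inputs, so it can expose a non-linear map but cannot prove linearity in general.

```python
def linear_map(v):
    """Hypothetical linear map: i-hat lands at (2, 1), j-hat lands at (-1, 3)."""
    x, y = v
    return (2 * x - y, x + 3 * y)


def shift(v):
    """Not linear: translating every vector by (1, 1) moves the origin."""
    x, y = v
    return (x + 1, y + 1)


def add(u, w):
    return (u[0] + w[0], u[1] + w[1])


def scale(c, v):
    return (c * v[0], c * v[1])


def looks_linear(f, u, w, c):
    """Check origin-fixing, additivity and scaling on the given samples only."""
    return (
        f((0, 0)) == (0, 0)
        and f(add(u, w)) == add(f(u), f(w))
        and f(scale(c, u)) == scale(c, f(u))
    )


print(looks_linear(linear_map, (1, 2), (-3, 4), 5))  # True
print(looks_linear(shift, (1, 2), (-3, 4), 5))       # False
```

Additivity and scaling together are what keep the grid lines parallel and evenly spaced.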
If you apply a linear transformation to a set of vectors and you know where $\hat{i}$ and $\hat{j}$ land, you can find where any other vector lands. It will be the same linear combination of the transformed $\hat{i}$ and $\hat{j}$ as it was before the transformation.

In $v = a \hat{i} + b \hat{j}$, the coefficients $a$ and $b$ always stay the same; only $\hat{i}$ and $\hat{j}$ move.

Multiplying a vector by a matrix is just applying a transformation

$$\begin{bmatrix} x \\ y \end{bmatrix} \to x \times \hat{i} + y \times \hat{j}$$

This will always be true in a linear transformation. So once you know the new co-ordinates of $\hat{i}$ and $\hat{j}$, you can find where any other vector lands, as long as you know that vector's co-ordinates before the transformation.

Example:

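You have a vector $\begin{bmatrix} -1 \\ 2 \end{bmatrix}$, which is $-1 \times \hat{i} + 2 \times \hat{j}$. Suppose that after some transformation $\hat{i}$ lands at $\begin{bmatrix} 1 \\ -2 \end{bmatrix}$ and $\hat{j}$ lands at $\begin{bmatrix} 3 \\ 0 \end{bmatrix}$. Then $v$ is still $-1 \times \hat{i} + 2 \times \hat{j}$, now using the transformed $\hat{i}$ and $\hat{j}$:

$$v = -1 \times \begin{bmatrix} 1 \\ -2 \end{bmatrix} + 2 \times \begin{bmatrix} 3 \\ 0 \end{bmatrix} = \begin{bmatrix} -1 \times 1 \\ -1 \times -2 \end{bmatrix} + \begin{bmatrix} 2 \times 3 \\ 2 \times 0 \end{bmatrix} = \begin{bmatrix} -1 \\ 2 \end{bmatrix} + \begin{bmatrix} 6 \\ 0 \end{bmatrix} = \begin{bmatrix} -1 + 6 \\ 2 + 0 \end{bmatrix} = \begin{bmatrix} 5 \\ 2 \end{bmatrix}$$

So the original $v$ has moved from co-ordinates (-1, 2) to (5, 2) after the transformation.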
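The same arithmetic can be reproduced with a few lines of Python. This is only a sketch; the function name `apply_transformation` and its parameter names are my own, not from any library.

```python
def apply_transformation(v, i_hat_lands, j_hat_lands):
    """Return where v lands, given where i-hat and j-hat land.

    v = (x, y) is read as x * i-hat + y * j-hat, and the same x and y
    are reused with the transformed basis vectors.
    """
    x, y = v
    return (
        x * i_hat_lands[0] + y * j_hat_lands[0],
        x * i_hat_lands[1] + y * j_hat_lands[1],
    )


result = apply_transformation((-1, 2), i_hat_lands=(1, -2), j_hat_lands=(3, 0))
print(result)  # (5, 2)
```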

A 2-dimensional linear transformation is completely described by just 4 numbers: the 2 co-ordinates of where $\hat{i}$ lands and the 2 co-ordinates of where $\hat{j}$ lands.

These 4 numbers are packed into a 2 x 2 matrix.

The columns denote where the vectors $\hat{i}$ and $\hat{j}$ land.
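To make the column convention concrete, here is a small sketch that packs the two landing spots from the example into a 2 x 2 matrix (stored as a list of rows) and multiplies it by a vector. The helper names `matrix_from_columns` and `matvec` are hypothetical.

```python
def matrix_from_columns(i_hat_lands, j_hat_lands):
    """Build a 2x2 matrix (list of rows) whose columns are the landing spots."""
    return [
        [i_hat_lands[0], j_hat_lands[0]],
        [i_hat_lands[1], j_hat_lands[1]],
    ]


def matvec(m, v):
    """Matrix-vector product for a 2x2 matrix and a 2D vector."""
    return [
        m[0][0] * v[0] + m[0][1] * v[1],
        m[1][0] * v[0] + m[1][1] * v[1],
    ]


m = matrix_from_columns((1, -2), (3, 0))
print(m)                   # [[1, 3], [-2, 0]]
print(matvec(m, [1, 0]))   # [1, -2]  -> first column, where i-hat lands
print(matvec(m, [0, 1]))   # [3, 0]   -> second column, where j-hat lands
print(matvec(m, [-1, 2]))  # [5, 2]
```

Feeding in $\hat{i}$ and $\hat{j}$ themselves simply reads the columns back out.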

If the vectors where $\hat{i}$ and $\hat{j}$ land are linearly dependent, then the entire 2D space is squished onto a single line, the line on which those 2 vectors sit.
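A quick sketch of this collapse, assuming a made-up matrix whose second column is exactly twice the first. Every output ends up on the line $y = 2x$, and the determinant (a standard test for linearly dependent columns in 2D) is zero.

```python
def matvec(m, v):
    """Matrix-vector product for a 2x2 matrix and a 2D vector."""
    return [
        m[0][0] * v[0] + m[0][1] * v[1],
        m[1][0] * v[0] + m[1][1] * v[1],
    ]


def det2(m):
    """Determinant of a 2x2 matrix; zero when the columns are linearly dependent."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]


# Hypothetical matrix: i-hat lands at (1, 2), j-hat lands at (2, 4) = 2 * (1, 2).
m = [[1, 2],
     [2, 4]]

inputs = [(1, 0), (0, 1), (3, -1), (-2, 5)]
outputs = [matvec(m, v) for v in inputs]

print(det2(m))   # 0
print(outputs)   # [[1, 2], [2, 4], [1, 2], [8, 16]]

# Every output satisfies y == 2 * x, so all of them sit on one line.
all_on_line = all(out[1] == 2 * out[0] for out in outputs)
print(all_on_line)  # True
```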

A matrix is a way to describe a transformation, and matrix-vector multiplication is the way to find out what that transformation does to a given vector.

How does matrix multiplication relate to Neural Networks:

When we perform matrix multiplication on a token's embedding, we are applying a linear transformation to it; this is how the context it reflects gets carried over to the other tokens.
And when the columns of the matrix are linearly dependent, every resulting vector lands on a single line, so the output has only one dimension.
For example, take 2 tokens, a first name and a second name. Both are nouns, so they might point along the same direction and have linearly dependent vectors. If so, a linear transformation keeps them on the same line, so the result would still sit in the "noun" direction.
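A toy sketch of that intuition. The 2D embeddings and the weight matrix `W` below are invented numbers, not taken from any real model; the point is only that a linear transformation sends linearly dependent vectors to linearly dependent vectors.

```python
def matvec(m, v):
    """Matrix-vector product for a 2x2 matrix and a 2D vector."""
    return [
        m[0][0] * v[0] + m[0][1] * v[1],
        m[1][0] * v[0] + m[1][1] * v[1],
    ]


def cross2(u, w):
    """2D cross product; zero exactly when u and w are linearly dependent."""
    return u[0] * w[1] - u[1] * w[0]


# Made-up embeddings: last_name is 2 * first_name, so they are dependent.
embeddings = {
    "first_name": (1.0, 2.0),
    "last_name": (2.0, 4.0),
}

# Made-up weight matrix standing in for one linear layer.
W = [[0.5, -1.0],
     [1.5, 2.0]]

transformed = {token: matvec(W, e) for token, e in embeddings.items()}
print(transformed)
# {'first_name': [-1.5, 5.5], 'last_name': [-3.0, 11.0]}

# The transformed vectors are still multiples of each other.
print(cross2(transformed["first_name"], transformed["last_name"]))  # 0.0
```

Real embeddings have hundreds or thousands of dimensions and are rarely exactly dependent, so this is an idealised picture.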
