Part IV: Advanced Topics | Chapter 1

Tensors & Multilinear Algebra

Extending linear algebra beyond matrices to higher-order data structures

Historical Context

Tensor calculus was developed by Gregorio Ricci-Curbastro and Tullio Levi-Civita in the 1890s for differential geometry. Einstein adopted their notation for general relativity (1915), establishing the Einstein summation convention. The modern algebraic treatment via tensor products of vector spaces emerged from the work of Whitney, Bourbaki, and others in the mid-20th century.

Today, tensors are central to physics (stress, inertia, electromagnetic field), computer science (deep learning weight tensors), data science (multiway data analysis), and quantum information (entanglement). Tensor decompositions (CP, Tucker) generalize matrix factorization to higher-order data structures.

4.1 Multilinear Maps and Tensor Products

Definition: Tensor Product

The tensor product $V \otimes W$ of vector spaces $V$ and $W$ is a vector space with the universal property: every bilinear map $\phi: V \times W \to Z$ factors uniquely as $\phi = \tilde{\phi} \circ \otimes$, where $\otimes: V \times W \to V \otimes W$ is bilinear. If $\dim V = m, \dim W = n$, then $\dim(V \otimes W) = mn$.

A tensor of type $(r, s)$ on a vector space $V$ is an element of $V^{\otimes r} \otimes (V^*)^{\otimes s}$, i.e. a multilinear map that takes $r$ covectors and $s$ vectors to produce a scalar. Matrices are $(1,1)$-tensors; the metric tensor in Riemannian geometry is a $(0,2)$-tensor.
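The dimension count $\dim(V \otimes W) = mn$ can be checked concretely: for coordinate vectors, the tensor product is the outer product. A minimal sketch (the specific vectors are arbitrary illustrations):

```python
import numpy as np

# v in R^3, w in R^2: their tensor product v ⊗ w lives in R^(3*2) = R^6
v = np.array([1.0, 2.0, 3.0])
w = np.array([4.0, 5.0])

T = np.outer(v, w)      # v ⊗ w, stored as a 3 x 2 array
print(T.shape)          # (3, 2); flattening gives a vector with 6 = 3*2 components

# Bilinearity in the first argument: (a*v) ⊗ w = a*(v ⊗ w)
assert np.allclose(np.outer(2 * v, w), 2 * T)
# and additivity: (v1 + v2) ⊗ w = v1 ⊗ w + v2 ⊗ w
v2 = np.array([0.5, -1.0, 2.0])
assert np.allclose(np.outer(v + v2, w), T + np.outer(v2, w))
```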

4.2 Einstein Summation Convention

In the Einstein convention, repeated indices imply summation: $A^i{}_j B^j{}_k$ means $\sum_j A^i{}_j B^j{}_k$. This compact notation handles operations like:

  • Contraction: $T^i{}_{ij} = \sum_j T^i{}_{ij}$ (trace-like operation)
  • Raising/lowering indices: $v^i = g^{ij}v_j$ using the metric tensor
  • Covariant derivative: $\nabla_i T^j{}_k = \partial_i T^j{}_k + \Gamma^j{}_{il}T^l{}_k - \Gamma^l{}_{ik}T^j{}_l$

NumPy's np.einsum function directly implements Einstein summation, enabling efficient computation of arbitrary tensor contractions.
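Each of the operations above maps directly onto an einsum subscript string. A short sketch (the Euclidean metric is an assumed example; a curved metric would simply be a different $g$):

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((3, 3))
B = rng.standard_normal((3, 3))

# A^i_j B^j_k: repeated index j is summed -> matrix product
C = np.einsum('ij,jk->ik', A, B)
assert np.allclose(C, A @ B)

# Contraction T^i_i: the trace-like operation
assert np.isclose(np.einsum('ii->', A), np.trace(A))

# Raising/lowering an index with a metric g: v_i = g_ij v^j
g = np.eye(3)                      # Euclidean metric, assumed for illustration
v = np.array([1.0, 2.0, 3.0])
v_lower = np.einsum('ij,j->i', g, v)
assert np.allclose(v_lower, v)     # the Euclidean metric leaves components unchanged
```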

4.3 Symmetric and Antisymmetric Tensors

Any tensor can be decomposed into symmetric and antisymmetric parts. The wedge product $\alpha \wedge \beta = \alpha \otimes \beta - \beta \otimes \alpha$ generates the exterior algebra $\Lambda(V)$, which governs differential forms, determinants, and cross products.

In $\mathbb{R}^3$, the wedge product of two vectors is equivalent to the cross product. In general, $\Lambda^k(V)$ has dimension $\binom{n}{k}$, and the top exterior power $\Lambda^n(V)$ is 1-dimensional, explaining why the determinant is unique up to scale.
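Both claims are easy to verify numerically: a 2-tensor splits as $T = \tfrac{1}{2}(T + T^\top) + \tfrac{1}{2}(T - T^\top)$, and in $\mathbb{R}^3$ the antisymmetric matrix $u \otimes v - v \otimes u$ carries exactly the components of $u \times v$. A sketch with arbitrary random inputs:

```python
import numpy as np

rng = np.random.default_rng(0)
T = rng.standard_normal((3, 3))

S = 0.5 * (T + T.T)        # symmetric part
A = 0.5 * (T - T.T)        # antisymmetric part
assert np.allclose(S + A, T)
assert np.allclose(A, -A.T)

# Wedge product of two vectors in R^3, as the antisymmetric matrix u ⊗ v - v ⊗ u
u, v = rng.standard_normal(3), rng.standard_normal(3)
W = np.outer(u, v) - np.outer(v, u)

# Its three independent entries are the components of the cross product
cross = np.array([W[1, 2], W[2, 0], W[0, 1]])
assert np.allclose(cross, np.cross(u, v))
```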

4.4 Tensor Decompositions

The CP decomposition (CANDECOMP/PARAFAC) expresses a tensor as a sum of rank-1 tensors: $\mathcal{T} = \sum_{r=1}^R a_r \otimes b_r \otimes c_r$. The minimal $R$ is the tensor rank; unlike matrix rank, tensor rank is NP-hard to compute in general.
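Building a tensor from CP factors is a single einsum over the shared rank index. A sketch (the shapes and $R = 2$ are arbitrary illustrative choices; fitting CP factors to a given tensor requires an iterative algorithm such as alternating least squares, not shown here):

```python
import numpy as np

rng = np.random.default_rng(0)
R = 2                                   # number of rank-1 terms (illustrative)

# Factor vectors a_r, b_r, c_r stored as columns, for a 4 x 3 x 2 tensor
A = rng.standard_normal((4, R))
B = rng.standard_normal((3, R))
C = rng.standard_normal((2, R))

# T = sum_r a_r ⊗ b_r ⊗ c_r, written as one einsum over the shared index r
T = np.einsum('ir,jr,kr->ijk', A, B, C)
assert T.shape == (4, 3, 2)

# Equivalent explicit sum of rank-1 outer products
T2 = sum(np.multiply.outer(np.multiply.outer(A[:, r], B[:, r]), C[:, r])
         for r in range(R))
assert np.allclose(T, T2)
```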

The Tucker decomposition $\mathcal{T} = \mathcal{G} \times_1 A \times_2 B \times_3 C$ generalizes PCA to higher order, with a core tensor $\mathcal{G}$ and factor matrices. The higher-order SVD (HOSVD) computes an exact Tucker decomposition; truncating its factor matrices yields a low-rank approximation analogous to the truncated SVD.
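The HOSVD can be sketched in a few lines: take the left singular vectors of each mode unfolding as factor matrices, then contract them against the tensor to get the core. A minimal sketch for a third-order tensor (untruncated, so the reconstruction is exact):

```python
import numpy as np

def unfold(T, mode):
    """Mode-n unfolding: move axis `mode` to the front, flatten the rest."""
    return np.moveaxis(T, mode, 0).reshape(T.shape[mode], -1)

rng = np.random.default_rng(0)
T = rng.standard_normal((4, 3, 2))

# Factor matrices: left singular vectors of each unfolding
U = [np.linalg.svd(unfold(T, n), full_matrices=False)[0] for n in range(3)]

# Core tensor G = T x_1 U1^T x_2 U2^T x_3 U3^T
G = np.einsum('ijk,ia,jb,kc->abc', T, U[0], U[1], U[2])

# Reconstruction G x_1 U1 x_2 U2 x_3 U3; exact because nothing was truncated
T_rec = np.einsum('abc,ia,jb,kc->ijk', G, U[0], U[1], U[2])
assert np.allclose(T, T_rec)
```

Truncating each `U[n]` to its leading columns (and shrinking `G` accordingly) gives the truncated Tucker approximation mentioned above.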

4.5 Applications

  • Continuum mechanics: The stress tensor $\sigma_{ij}$ and strain tensor $\epsilon_{ij}$ are symmetric $(0,2)$-tensors
  • General relativity: The Riemann curvature tensor $R^i{}_{jkl}$ encodes spacetime geometry
  • Deep learning: Weight tensors in convolutional networks, attention mechanisms
  • Quantum information: Entangled states are tensors that cannot be factored
  • Chemometrics: Fluorescence excitation-emission matrices, multiway factor analysis
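As a concrete instance of the first application: because the stress tensor is symmetric, it has real eigenvalues (the principal stresses) and orthogonal eigenvectors (the principal directions). A sketch with a hypothetical stress state (the numerical values are invented for illustration):

```python
import numpy as np

# Hypothetical plane-stress-like state, units MPa; symmetric (0,2)-tensor
sigma = np.array([[50.0,  30.0,  0.0],
                  [30.0, -20.0,  0.0],
                  [ 0.0,   0.0, 10.0]])
assert np.allclose(sigma, sigma.T)

# Principal stresses = eigenvalues; principal directions = eigenvectors.
# eigh exploits symmetry and guarantees real results.
vals, vecs = np.linalg.eigh(sigma)
principal = np.sort(vals)[::-1]        # largest first
print(principal)

# The trace (first stress invariant) is basis-independent
assert np.isclose(vals.sum(), np.trace(sigma))
```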

Computational Laboratory

This simulation demonstrates tensor products, Einstein summation via np.einsum, symmetric/antisymmetric decomposition, CP tensor decomposition, and stress tensor analysis.
