Module 8: Land Use & Land Cover Classification

Mapping land use and land cover (LULC) from satellite imagery is one of the most impactful applications of Earth observation. This module covers supervised classification with Random Forest, accuracy assessment using confusion matrices and the Kappa statistic, and change detection for monitoring urban expansion, deforestation, and agricultural conversion.

1. Supervised Classification & Accuracy Metrics

Supervised classification assigns each pixel to a predefined land cover class based on its spectral, textural, and contextual features. The classifier learns the mapping from labeled training samples. The confusion matrix (error matrix) is the standard tool for accuracy assessment, from which we derive:

$$OA = \frac{\sum_i N_{ii}}{N_{total}}, \quad \kappa = \frac{OA - P_e}{1 - P_e}$$

where $N_{ii}$ are the diagonal elements of the confusion matrix (correctly classified pixels), $N_{total}$ is the total number of validation pixels, and $P_e$ is the expected agreement by chance:

$$P_e = \frac{1}{N_{total}^2} \sum_i N_{i+} \cdot N_{+i}$$

where $N_{i+}$ and $N_{+i}$ are the row and column marginal totals, respectively. Kappa values above 0.80 indicate strong agreement, 0.60–0.80 substantial agreement, and below 0.40 poor agreement. Per-class metrics include:

Producer's Accuracy (Recall)

The probability that a reference pixel of class $i$ is correctly classified: $PA_i = N_{ii} / N_{+i}$. Measures omission error ($1 - PA_i$).

User's Accuracy (Precision)

The probability that a pixel classified as class $i$ actually belongs to that class: $UA_i = N_{ii} / N_{i+}$. Measures commission error ($1 - UA_i$).

Sample Size Requirements

A common rule of thumb is 50–100 validation samples per class for reliable accuracy assessment. The samples should be independent of the training data (spatially separated) and collected using a probability-based design (stratified random sampling is recommended by Olofsson et al., 2014) to enable unbiased area estimation and confidence intervals.

2. Feature Stack for Random Forest Classification

A rich feature stack improves classification accuracy by providing the classifier with diverse spectral, index, and textural information. For Sentinel-2 based LULC classification, a typical 14-feature stack includes:

14-Feature Stack

Spectral Bands (10)

B2 (Blue, 490 nm) — 10 m
B3 (Green, 560 nm) — 10 m
B4 (Red, 665 nm) — 10 m
B5 (Red Edge 1, 705 nm) — 20 m
B6 (Red Edge 2, 740 nm) — 20 m
B7 (Red Edge 3, 783 nm) — 20 m
B8 (NIR, 842 nm) — 10 m
B8A (NIR narrow, 865 nm) — 20 m
B11 (SWIR 1, 1610 nm) — 20 m
B12 (SWIR 2, 2190 nm) — 20 m

Spectral Indices (4)

NDVI = (B8 − B4) / (B8 + B4) — vegetation vigor
NDWI = (B3 − B8) / (B3 + B8) — water bodies
NDBI = (B11 − B8) / (B11 + B8) — built-up areas
NBR = (B8A − B12) / (B8A + B12) — burn scars

Optional Extras

GLCM texture (contrast, homogeneity)
DEM-derived slope and aspect
SAR backscatter (VV, VH)
Multi-temporal composites

Random Forest Advantages for LULC

●Handles high-dimensional feature spaces without feature selection (ensemble of decorrelated trees).
●Provides built-in feature importance rankings via mean decrease in impurity (Gini importance) or permutation importance.
●Robust to noisy features and outliers; does not require feature normalization.
●Out-of-bag (OOB) error provides an unbiased estimate of generalization accuracy without a separate validation set.

Hyperparameter Tuning

Key parameters: n_estimators (100–500 trees typically sufficient), max_depth (None for full trees or limit to prevent overfitting), min_samples_leaf (5–10 for pixel-based classification), and max_features($\sqrt{n_{features}}$ is the default for classification).

3. Random Forest Classification Pipeline

This pipeline generates a synthetic multi-class dataset simulating a Sentinel-2 feature stack, trains a Random Forest classifier, computes the full confusion matrix with OA and Kappa, and visualizes feature importances and the classified map.

Random Forest LULC Classification with Accuracy Assessment

Python

script.py193 lines

import numpy as np
import matplotlib
matplotlib.use('Agg')
import matplotlib.pyplot as plt
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import confusion_matrix, classification_report

np.random.seed(42)

# ─── Define land cover classes ───
classes = {
    0: ('Water', '#3b82f6'),
    1: ('Forest', '#166534'),
    2: ('Cropland', '#a3e635'),
    3: ('Urban', '#94a3b8'),
    4: ('Bare Soil', '#d97706'),
    5: ('Wetland', '#06b6d4'),
}
n_classes = len(classes)
class_names = [classes[i][0] for i in range(n_classes)]
class_colors = [classes[i][1] for i in range(n_classes)]

# ─── Feature names (14-feature stack) ───
feature_names = ['B2', 'B3', 'B4', 'B5', 'B6', 'B7', 'B8', 'B8A',
                 'B11', 'B12', 'NDVI', 'NDWI', 'NDBI', 'NBR']
n_features = len(feature_names)

# ─── Generate synthetic training data ───
samples_per_class = 500
n_total = samples_per_class * n_classes

# Spectral profiles per class (mean reflectance for each of 10 S-2 bands)
profiles = {
    0: [0.06, 0.04, 0.03, 0.02, 0.02, 0.01, 0.01, 0.01, 0.00, 0.00],  # Water
    1: [0.02, 0.05, 0.03, 0.07, 0.22, 0.38, 0.45, 0.44, 0.16, 0.07],  # Forest
    2: [0.05, 0.08, 0.06, 0.10, 0.18, 0.28, 0.35, 0.34, 0.20, 0.12],  # Cropland
    3: [0.09, 0.11, 0.13, 0.14, 0.15, 0.16, 0.17, 0.17, 0.21, 0.23],  # Urban
    4: [0.10, 0.15, 0.20, 0.22, 0.24, 0.25, 0.27, 0.27, 0.30, 0.28],  # Bare Soil
    5: [0.04, 0.05, 0.04, 0.06, 0.15, 0.25, 0.30, 0.29, 0.12, 0.06],  # Wetland
}

X_all = []
y_all = []
for cls in range(n_classes):
    prof = np.array(profiles[cls])
    noise_scale = 0.015 + 0.005 * cls  # vary noise by class
    bands = np.tile(prof, (samples_per_class, 1)) + np.random.normal(0, noise_scale, (samples_per_class, 10))
    bands = np.clip(bands, 0, 1)
    # Compute indices
    ndvi = (bands[:, 6] - bands[:, 2]) / (bands[:, 6] + bands[:, 2] + 1e-10)
    ndwi = (bands[:, 1] - bands[:, 6]) / (bands[:, 1] + bands[:, 6] + 1e-10)
    ndbi = (bands[:, 8] - bands[:, 6]) / (bands[:, 8] + bands[:, 6] + 1e-10)
    nbr = (bands[:, 7] - bands[:, 9]) / (bands[:, 7] + bands[:, 9] + 1e-10)
    features = np.column_stack([bands, ndvi, ndwi, ndbi, nbr])
    X_all.append(features)
    y_all.extend([cls] * samples_per_class)

X = np.vstack(X_all)
y = np.array(y_all)

# ─── Train/test split ───
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42, stratify=y)

# ─── Train Random Forest ───
rf = RandomForestClassifier(n_estimators=200, max_depth=15, min_samples_leaf=5,
                            max_features='sqrt', random_state=42, n_jobs=-1)
rf.fit(X_train, y_train)

# ─── Predictions and accuracy ───
y_pred = rf.predict(X_test)
cm = confusion_matrix(y_test, y_pred)

# Overall Accuracy
OA = np.trace(cm) / cm.sum()

# Kappa
N = cm.sum()
row_sums = cm.sum(axis=1)
col_sums = cm.sum(axis=0)
Pe = np.sum(row_sums * col_sums) / (N ** 2)
kappa = (OA - Pe) / (1 - Pe)

print(f"Overall Accuracy: {OA:.4f} ({OA*100:.1f}%)")
print(f"Kappa Coefficient: {kappa:.4f}")
print(f"OOB Score: {rf.oob_score:.4f}" if hasattr(rf, 'oob_score_') else "")
print(f"\nConfusion Matrix:")
print(cm)
print(f"\nPer-class metrics:")
report = classification_report(y_test, y_pred, target_names=class_names)
print(report)

# ─── Generate synthetic classified map ───
map_size = 150
classified_map = np.zeros((map_size, map_size), dtype=int)
# Create spatial patterns
for i in range(map_size):
    for j in range(map_size):
        x, y_coord = i / map_size, j / map_size
        if (x - 0.3)**2 + (y_coord - 0.7)**2 < 0.02:
            classified_map[i, j] = 0  # lake
        elif x < 0.35 and y_coord < 0.4:
            classified_map[i, j] = 1  # forest
        elif x > 0.6 and y_coord > 0.55:
            classified_map[i, j] = 3  # urban
        elif 0.3 < x < 0.7 and 0.2 < y_coord < 0.6:
            classified_map[i, j] = 2  # cropland
        elif x > 0.8:
            classified_map[i, j] = 4  # bare soil
        elif y_coord > 0.85:
            classified_map[i, j] = 5  # wetland
        else:
            classified_map[i, j] = np.random.choice([1, 2])

# Add some noise to make it realistic
noise_mask = np.random.random((map_size, map_size)) < 0.05
classified_map[noise_mask] = np.random.randint(0, n_classes, np.sum(noise_mask))

# ─── 4-Panel Visualization ───
fig, axes = plt.subplots(2, 2, figsize=(14, 11))

# Panel 1: Confusion Matrix
ax = axes[0, 0]
im = ax.imshow(cm, interpolation='nearest', cmap='Blues')
ax.set_title(f'Confusion Matrix (OA={OA:.1%}, K={kappa:.3f})', fontsize=12, fontweight='bold', color='white')
ax.set_xticks(range(n_classes))
ax.set_yticks(range(n_classes))
ax.set_xticklabels(class_names, rotation=45, ha='right', fontsize=9)
ax.set_yticklabels(class_names, fontsize=9)
ax.set_xlabel('Predicted', fontsize=11)
ax.set_ylabel('Reference', fontsize=11)
for i in range(n_classes):
    for j in range(n_classes):
        color = 'white' if cm[i, j] > cm.max() / 2 else 'black'
        ax.text(j, i, str(cm[i, j]), ha='center', va='center', color=color, fontsize=10)
plt.colorbar(im, ax=ax, shrink=0.8)

# Panel 2: Feature Importances
ax2 = axes[0, 1]
importances = rf.feature_importances_
indices = np.argsort(importances)[::-1]
colors_imp = ['#38bdf8' if i < 10 else '#f59e0b' for i in indices]
ax2.barh(range(n_features), importances[indices], color=colors_imp, alpha=0.85)
ax2.set_yticks(range(n_features))
ax2.set_yticklabels([feature_names[i] for i in indices], fontsize=9)
ax2.set_xlabel('Importance (Gini)', fontsize=11)
ax2.set_title('Feature Importances', fontsize=12, fontweight='bold', color='white')
ax2.grid(True, alpha=0.3, axis='x')
ax2.invert_yaxis()
ax2.set_facecolor('#0a0a0a')

# Panel 3: Classified Map
ax3 = axes[1, 0]
from matplotlib.colors import ListedColormap
cmap_lulc = ListedColormap(class_colors)
im3 = ax3.imshow(classified_map, cmap=cmap_lulc, vmin=-0.5, vmax=n_classes-0.5)
ax3.set_title('Classified Land Cover Map', fontsize=12, fontweight='bold', color='white')
ax3.set_xlabel('Column (pixels)', fontsize=11)
ax3.set_ylabel('Row (pixels)', fontsize=11)
# Legend
from matplotlib.patches import Patch
legend_elements = [Patch(facecolor=classes[i][1], label=classes[i][0]) for i in range(n_classes)]
ax3.legend(handles=legend_elements, loc='lower right', fontsize=8, framealpha=0.8)

# Panel 4: Per-class accuracy
ax4 = axes[1, 1]
producers_acc = np.diag(cm) / cm.sum(axis=0)
users_acc = np.diag(cm) / cm.sum(axis=1)
x_pos = np.arange(n_classes)
width = 0.35
bars1 = ax4.bar(x_pos - width/2, producers_acc * 100, width, label="Producer's (Recall)",
               color='#38bdf8', alpha=0.85)
bars2 = ax4.bar(x_pos + width/2, users_acc * 100, width, label="User's (Precision)",
               color='#f59e0b', alpha=0.85)
ax4.set_xticks(x_pos)
ax4.set_xticklabels(class_names, rotation=45, ha='right', fontsize=9)
ax4.set_ylabel('Accuracy (%)', fontsize=11)
ax4.set_title('Per-Class Accuracy', fontsize=12, fontweight='bold', color='white')
ax4.legend(fontsize=9)
ax4.grid(True, alpha=0.3, axis='y')
ax4.set_ylim(80, 102)
ax4.set_facecolor('#0a0a0a')

for ax in axes.flat:
    ax.tick_params(colors='white')
    for spine in ax.spines.values():
        spine.set_color('#334155')

plt.tight_layout()
plt.savefig('output.png', dpi=130, bbox_inches='tight', facecolor='#0a0a0a')
plt.close()
print("\n4-panel classification results saved.")

Click Run to execute the Python code

Code will be executed with Python 3 on the server

4. Land Cover Change Detection

Change detection identifies pixels that transition between land cover classes over time. The two main approaches are:

Post-Classification Comparison

Classify two dates independently, then compute the transition matrix. Simple but errors in either classification propagate. The change matrix entry $T_{ij}$gives the area that transitioned from class $i$ to class $j$. Total change for class $i$: $\Delta A_i = \sum_j T_{ij} - \sum_j T_{ji}$.

Spectral Change Detection

Compute pixel-wise differences in spectral values or indices between dates. Change Vector Analysis (CVA) uses the magnitude and direction of the spectral change vector: $|\Delta\vec{\rho}| = \sqrt{\sum_b (\rho_{b,t2} - \rho_{b,t1})^2}$. Pixels exceeding a magnitude threshold are flagged as changed.

Area-Adjusted Accuracy (Olofsson et al., 2014)

Standard accuracy metrics assume equal sampling probability across classes. When class proportions are unequal (e.g., 5% urban, 60% forest), area-adjusted estimators correct for sampling bias:

$$\hat{A}_i = A_{total} \sum_j W_j \frac{n_{ji}}{n_{j\cdot}}$$

where $W_j$ is the area proportion of mapped class $j$,$n_{ji}$ is the number of validation samples mapped as class $j$but reference class $i$, and $n_{j\cdot}$ is the total validation samples in mapped class $j$. This provides unbiased area estimates with confidence intervals.

BFAST & Continuous Change Detection

For dense time series (e.g., Sentinel-2 every 5 days), algorithms like BFAST (Breaks For Additive Seasonal and Trend) and CCDC (Continuous Change Detection and Classification) fit harmonic models to the time series and detect breakpoints where the observed values deviate significantly from the model. This enables near-real-time deforestation alerts and gradual change detection.

5. GEE Dynamic World: Urban Expansion (Illustrative)

Google's Dynamic World is a near-real-time LULC product at 10 m resolution, generated for every Sentinel-2 scene using a deep learning model. The following GEE script demonstrates how to detect urban expansion between two periods using Dynamic World classifications.

// ── GEE Dynamic World Urban Expansion Detection ──
// Illustrative code for Google Earth Engine Code Editor

// 1. Define region of interest
var city = ee.Geometry.Point([77.2, 28.6]); // Delhi
var region = city.buffer(30000); // 30km radius

// 2. Load Dynamic World for two periods
var dw_early = ee.ImageCollection('GOOGLE/DYNAMICWORLD/V1')
  .filterBounds(region)
  .filter(ee.Filter.date('2018-01-01', '2018-12-31'))
  .select('label');

var dw_late = ee.ImageCollection('GOOGLE/DYNAMICWORLD/V1')
  .filterBounds(region)
  .filter(ee.Filter.date('2023-01-01', '2023-12-31'))
  .select('label');

// 3. Compute mode (most frequent class) for each period
var mode_early = dw_early.reduce(ee.Reducer.mode()).rename('label');
var mode_late = dw_late.reduce(ee.Reducer.mode()).rename('label');

// Dynamic World classes: 0=water, 1=trees, 2=grass, 3=flooded_veg,
// 4=crops, 5=shrub, 6=built, 7=bare, 8=snow_ice

// 4. Detect urban expansion (non-urban -> urban)
var built_class = 6;
var was_not_built = mode_early.neq(built_class);
var is_built = mode_late.eq(built_class);
var urban_expansion = was_not_built.and(is_built);

// 5. Compute areas by previous land cover
var previous_lulc = mode_early.updateMask(urban_expansion);
var area_by_class = ee.Image.pixelArea()
  .addBands(previous_lulc)
  .reduceRegion({
    reducer: ee.Reducer.sum().group({
      groupField: 1, groupName: 'class'
    }),
    geometry: region,
    scale: 10,
    maxPixels: 1e10
  });
print('Urban expansion by previous class:', area_by_class);

// 6. Compute total urban expansion area
var expansion_area = urban_expansion.selfMask()
  .multiply(ee.Image.pixelArea()).divide(1e6)
  .reduceRegion({
    reducer: ee.Reducer.sum(),
    geometry: region,
    scale: 10,
    maxPixels: 1e10
  });
print('Total urban expansion (km2):', expansion_area);

// 7. Visualization
var dwVis = {
  min: 0, max: 8,
  palette: ['419bdf','397d49','88b053','7a87c6','e49635',
            'dfc35a','c4281b','a59b8f','b39fe1']
};

Map.centerObject(region, 11);
Map.addLayer(mode_early.clip(region), dwVis, 'LULC 2018');
Map.addLayer(mode_late.clip(region), dwVis, 'LULC 2023');
Map.addLayer(urban_expansion.selfMask().clip(region),
  {palette: ['ff0000']}, 'Urban Expansion 2018-2023');

// 8. Transition matrix (sampled)
var transition = mode_early.addBands(mode_late)
  .reduceRegion({
    reducer: ee.Reducer.frequencyHistogram(),
    geometry: region,
    scale: 30,
    maxPixels: 1e9
  });
print('Transition frequencies:', transition);

Pipeline Explanation

●Steps 1–2: Load Dynamic World classifications for 2018 and 2023 over the Delhi metropolitan area.
●Step 3: Compute the mode (most frequent class) per pixel across all scenes in each year, reducing temporal noise.
●Step 4: Boolean change detection: pixels that were not built-up in 2018 but are built-up in 2023.
●Steps 5–6: Compute expansion area in km², broken down by what land cover was replaced (cropland, trees, etc.).

6. Global LULC Products

Several global land cover products are available, each with different characteristics:

ESA WorldCover (10 m)

Based on Sentinel-1 and Sentinel-2 data. 11 classes. Global coverage for 2020 and 2021. OA ~75%. The highest resolution global product available.

Google Dynamic World (10 m)

Near-real-time, per-Sentinel-2-scene classification. 9 classes with probability maps. Deep learning model. Available from 2015 to present. Enables temporal analysis.

Copernicus Global Land Cover (100 m)

Discrete and fractional cover maps. 23 classes. Annual products from 2015. Provides fractional cover of trees, shrubs, herbaceous, and bare soil per pixel.

MODIS MCD12Q1 (500 m)

Annual land cover from 2001 to present. Multiple classification schemes (IGBP, UMD, LAI/fPAR). The longest consistent moderate-resolution record available.

7. Key Takeaways

✓Random Forest classification with a 14-feature Sentinel-2 stack (10 bands + NDVI + NDWI + NDBI + NBR) routinely achieves OA > 85% for 6–10 class LULC maps.

✓The Kappa coefficient accounts for chance agreement and provides a more conservative accuracy measure than OA alone.

✓Feature importance from Random Forest reveals which bands and indices contribute most to class separability, guiding feature engineering.

✓Change detection via post-classification comparison or spectral change vectors enables monitoring of urbanization, deforestation, and agricultural expansion.

✓Google Dynamic World provides near-real-time 10 m LULC classification for every Sentinel-2 scene, enabling planetary-scale monitoring.

Share:X Reddit LinkedIn