10.5 Future Challenges
Oceanography faces grand challenges: closing observation gaps in the deep ocean, polar regions, and boundary currents; predicting marine heatwaves and deoxygenation; harnessing AI/ML for data-driven discovery; developing ocean carbon dioxide removal strategies; building a digital twin of the ocean; and governing deep-sea mining. The UN Decade of Ocean Science (2021–2030) frames these priorities for "the ocean we need for the future we want."
UN Decade of Ocean Science (2021–2030)
The UN Ocean Decade aims to generate transformative ocean science to support sustainable development. Seven desired outcomes define "The Ocean We Want": a clean ocean; a healthy and resilient ocean; a productive ocean; a predicted ocean; a safe ocean; an accessible ocean; and an inspiring and engaging ocean.
Marine Heatwaves & Deoxygenation
A marine heatwave (MHW) is defined as a discrete, prolonged anomalously warm water event. Following Hobday et al. (2016), a MHW occurs when SST exceeds the 90th percentile of the climatological distribution for at least 5 consecutive days:
$$\text{MHW if } T(t) > T_{90}(d) \text{ for } \geq 5 \text{ consecutive days}$$
$T_{90}(d)$ = 90th percentile threshold for day-of-year $d$ from 30-year climatology
MHW Intensity Metrics
Maximum intensity: $I_{\max} = \max(T(t) - T_{\text{clim}}(d))$ during event. Cumulative intensity: $I_{\text{cum}} = \sum (T(t) - T_{\text{clim}}(d)) \cdot \Delta t$ (°C·days). MHW frequency has doubled since the 1980s.
Ocean Deoxygenation
Global ocean oxygen content has decreased by ~2% since 1960. Oxygen minimum zones (OMZs) are expanding. Driven by warming (reduced solubility: $\partial C_{\text{sat}}/\partial T < 0$) and increased stratification (reduced ventilation).
AI/ML Revolution in Ocean Science
Artificial intelligence and machine learning are transforming oceanography across scales. Physics-informed neural networks (PINNs) encode governing equations as soft constraints in the loss function:
$$\mathcal{L} = \mathcal{L}_{\text{data}} + \lambda \mathcal{L}_{\text{physics}} = \frac{1}{N}\sum_i \|u_\theta(x_i) - u_i^{\text{obs}}\|^2 + \lambda \frac{1}{M}\sum_j \|\mathcal{N}[u_\theta](x_j)\|^2$$
$\mathcal{N}$ = differential operator from governing PDE, $\lambda$ = physics weight, $u_\theta$ = neural network prediction
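A minimal numerical sketch of this composite loss, using a toy ODE $du/dx - u = 0$ in place of an ocean PDE: the "network" $u_\theta$ is stood in for by a deliberately imperfect candidate function, and the physics residual $\mathcal{N}[u_\theta]$ is evaluated by finite differences at collocation points. All functions and values here are illustrative, not from any specific PINN library.

```python
import numpy as np

# Toy governing equation: N[u] = du/dx - u = 0 on [0, 1]; exact solution u = exp(x).
# u_theta stands in for a neural-network prediction; it is a deliberately
# imperfect candidate (truncated Taylor series) so the physics residual is nonzero.
def u_theta(x):
    return 1.0 + x + 0.5 * x**2

def pinn_loss(x_data, u_obs, x_colloc, lam=1.0, h=1e-5):
    # Data misfit: (1/N) sum ||u_theta(x_i) - u_i_obs||^2
    l_data = np.mean((u_theta(x_data) - u_obs) ** 2)
    # Physics residual N[u] = du/dx - u, via central finite differences
    du_dx = (u_theta(x_colloc + h) - u_theta(x_colloc - h)) / (2 * h)
    l_phys = np.mean((du_dx - u_theta(x_colloc)) ** 2)
    return l_data + lam * l_phys

x_data = np.linspace(0, 1, 5)
u_obs = np.exp(x_data)               # "observations" sampled from the exact solution
x_colloc = np.linspace(0, 1, 50)     # collocation points for the physics term
print(f"composite loss: {pinn_loss(x_data, u_obs, x_colloc, lam=1.0):.4e}")
```

In a real PINN the derivative in $\mathcal{N}[u_\theta]$ comes from automatic differentiation and the loss is minimised over the network weights $\theta$; here only the loss evaluation is shown.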
Digital Twin of the Ocean
Real-time, data-assimilating ocean models coupled with ML emulators. Copernicus Digital Twin Ocean and NOAA initiatives. Enables rapid scenario testing.
Exascale Ocean Modeling
Next-generation models at ~1 km global resolution. GPU-accelerated. Explicitly resolve mesoscale eddies and boundary currents worldwide.
Data-Driven Discovery
Unsupervised learning reveals new ocean regimes. Causal inference identifies drivers of variability. Foundation models trained on petabytes of ocean data.
Observing System Gaps
Deep ocean (>2000 m): ~10% observed. Under-ice Arctic/Antarctic. Western boundary currents. Biogeochemistry. Deep Argo and autonomous platforms are expanding coverage.
Ocean Carbon Dioxide Removal (CDR)
Achieving net-zero emissions will likely require active CO₂ removal from the ocean-atmosphere system. Ocean-based CDR approaches exploit the ocean's vast capacity as a carbon reservoir:
Ocean Alkalinity Enhancement (OAE)
Adding alkaline minerals (olivine, lime) to the ocean increases $[\text{CO}_3^{2-}]$, shifting the carbonate equilibrium to absorb more atmospheric CO₂. The net reaction for lime: $\text{CaO} + 2\,\text{CO}_2 + \text{H}_2\text{O} \rightarrow \text{Ca}^{2+} + 2\,\text{HCO}_3^-$
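The stoichiometry sets an upper bound on uptake: the balanced reaction $\text{CaO} + 2\,\text{CO}_2 + \text{H}_2\text{O} \rightarrow \text{Ca}^{2+} + 2\,\text{HCO}_3^-$ consumes two moles of CO₂ per mole of CaO. A quick back-of-envelope check (theoretical maximum only; realised ocean uptake is lower once the carbonate system re-equilibrates):

```python
# Theoretical CO2 uptake per tonne of quicklime, from the balanced reaction
# CaO + 2 CO2 + H2O -> Ca(2+) + 2 HCO3(-): two moles of CO2 per mole of CaO.
M_CAO = 56.08   # g/mol, molar mass of CaO
M_CO2 = 44.01   # g/mol, molar mass of CO2

uptake = 2 * M_CO2 / M_CAO   # tonnes CO2 absorbed per tonne CaO dissolved
print(f"theoretical uptake: {uptake:.2f} t CO2 per t CaO")
```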
Macroalgae (Kelp) Farming & Sinking
Growing kelp in open-ocean farms and sinking it to the deep ocean for long-term carbon sequestration. Kelp productivity: $\sim 1\text{--}3$ kg C/m²/yr. Challenges: measurement, reporting, and verification (MRV); ecological impacts; permanence of sequestration.
Direct Ocean Capture (DOC)
Electrochemical processes that extract dissolved CO₂ from seawater, enabling the ocean to absorb more from the atmosphere. Energy-intensive but potentially scalable.
Derivation: Climate Projection Uncertainty Quantification
Step 1: Sources of Uncertainty in Climate Projections
Total uncertainty in a climate projection $Y(t)$ (e.g., SST in 2100) comes from three independent sources: scenario (forcing) uncertainty, model (structural) uncertainty, and internal (natural) variability:
$$\text{Var}[Y(t)] = \sigma_{\text{scenario}}^2(t) + \sigma_{\text{model}}^2(t) + \sigma_{\text{internal}}^2(t)$$
Step 2: ANOVA Decomposition (Hawkins & Sutton)
Following Hawkins and Sutton (2009), decompose the multi-model ensemble into these components. For $M$ models each run under $S$ scenarios with $R$ realisations, the total variance is partitioned by ANOVA:
$$Y_{msr}(t) = \mu(t) + \alpha_s(t) + \beta_m(t) + \epsilon_{msr}(t)$$
Step 3: Estimate Each Variance Component
The grand mean $\mu(t)$ is the forced response. Scenario uncertainty $\sigma_{\text{scenario}}^2$ is the variance of scenario means, model uncertainty $\sigma_{\text{model}}^2$ is the variance of model means within a scenario, and internal variability $\sigma_{\text{internal}}^2$ is the residual:
$$\sigma_{\text{scenario}}^2 = \frac{1}{S-1}\sum_s (\bar{Y}_{s\cdot\cdot} - \bar{Y}_{\cdot\cdot\cdot})^2, \quad \sigma_{\text{model}}^2 = \frac{1}{S(M-1)}\sum_{s,m}(\bar{Y}_{sm\cdot} - \bar{Y}_{s\cdot\cdot})^2$$
Step 4: Time Dependence of Uncertainty Fractions
Near-term (2020--2040): internal variability dominates (~60%). Mid-century (2040--2060): model uncertainty is largest (~50%). End-of-century (2080--2100): scenario uncertainty dominates (~70%) as forcing pathways diverge. This informs both science priorities and policy relevance:
$$f_{\text{source}}(t) = \frac{\sigma_{\text{source}}^2(t)}{\sigma_{\text{total}}^2(t)} \times 100\%$$
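The ANOVA partition above can be checked on a synthetic ensemble. The sketch below builds an $S \times M \times R$ array with prescribed scenario effects, model effects, and internal noise, then recovers the three variance components and their fractions; the effect sizes are arbitrary illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(0)
S, M, R = 3, 10, 5                      # scenarios, models, realisations
alpha = np.array([0.5, 1.0, 2.0])       # prescribed scenario (forcing) effects
beta = rng.normal(0, 0.4, M)            # prescribed model (structural) effects
Y = (alpha[:, None, None] + beta[None, :, None]
     + rng.normal(0, 0.3, (S, M, R)))   # + internal variability

ybar_s = Y.mean(axis=(1, 2))            # scenario means  Ybar_s..
ybar_sm = Y.mean(axis=2)                # scenario-model means  Ybar_sm.
var_scen = np.sum((ybar_s - Y.mean()) ** 2) / (S - 1)
var_model = np.sum((ybar_sm - ybar_s[:, None]) ** 2) / (S * (M - 1))
var_int = np.sum((Y - ybar_sm[..., None]) ** 2) / (S * M * (R - 1))
total = var_scen + var_model + var_int
for name, v in [("scenario", var_scen), ("model", var_model), ("internal", var_int)]:
    print(f"{name:9s}: {100 * v / total:5.1f}%")
```

With large prescribed scenario spread, the scenario fraction dominates, mirroring the end-of-century regime described in Step 4.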
Derivation: Multi-Model Ensemble Weighting Methods
Step 1: Simple Multi-Model Mean (Democracy)
The simplest ensemble approach gives equal weight to each model. The multi-model mean (MMM) and its uncertainty are:
$$\bar{Y} = \frac{1}{M}\sum_{m=1}^{M} Y_m, \quad \sigma_{\bar{Y}} = \frac{1}{M}\sqrt{\sum_m (Y_m - \bar{Y})^2}$$
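As a quick numerical illustration of the MMM formulas (the projection values below are hypothetical, not from any CMIP ensemble):

```python
import numpy as np

# Hypothetical end-of-century warming projections (degC) from M = 5 models
Y = np.array([2.1, 2.8, 3.5, 2.4, 3.0])
mmm = Y.mean()                                      # multi-model mean
sigma = np.sqrt(np.sum((Y - mmm) ** 2)) / len(Y)    # spread of the mean, as in the text
print(f"MMM = {mmm:.2f} degC, sigma = {sigma:.2f} degC")
```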
Step 2: Performance-Based Weighting
Models can be weighted by their skill in reproducing historical observations. Using a distance metric $D_m$ between model $m$ and observations, weights are assigned inversely proportional to squared distance:
$$w_m = \frac{\exp(-D_m^2 / (2\sigma_D^2))}{\sum_{k=1}^{M} \exp(-D_k^2 / (2\sigma_D^2))}, \quad \bar{Y}_w = \sum_m w_m Y_m$$
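The Gaussian-kernel weights are a softmax over (negative, scaled) squared distances, so they automatically normalise to one. A sketch with hypothetical distances; $\sigma_D$ is a tuning choice that controls how sharply skill is rewarded:

```python
import numpy as np

Y = np.array([2.1, 2.8, 3.5, 2.4, 3.0])    # model projections (hypothetical)
D = np.array([0.2, 0.5, 1.1, 0.3, 0.8])    # distance of each model to observations
sigma_D = 0.5                               # shape parameter (a tuning choice)

w = np.exp(-D**2 / (2 * sigma_D**2))
w /= w.sum()                                # normalise so the weights sum to 1
print("weights:", np.round(w, 3))
print(f"weighted mean: {np.sum(w * Y):.2f} degC")
```

Smaller $\sigma_D$ concentrates weight on the best-performing model; larger $\sigma_D$ recovers model democracy.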
Step 3: Independence Weighting (ClimWIP)
Many CMIP models share code and development history, violating independence assumptions. The Climate model Weighting by Independence and Performance (ClimWIP) method down-weights similar models using an inter-model distance $S_{mk}$:
$$w_m \propto \frac{\exp(-D_m^2 / \sigma_D^2)}{\sum_{k \ne m} \exp(-S_{mk}^2 / \sigma_S^2) + 1}$$
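A sketch of the ClimWIP numerator and denominator with a hypothetical inter-model distance matrix in which models 0 and 3 are near-duplicates ("siblings"), so both are down-weighted relative to their raw performance:

```python
import numpy as np

D = np.array([0.2, 0.5, 1.1, 0.3, 0.8])   # performance distances to observations
# Symmetric inter-model distances S_mk (hypothetical); models 0 and 3 are siblings
S = np.array([[0.0, 1.0, 1.2, 0.1, 0.9],
              [1.0, 0.0, 0.8, 1.0, 0.7],
              [1.2, 0.8, 0.0, 1.1, 0.6],
              [0.1, 1.0, 1.1, 0.0, 0.9],
              [0.9, 0.7, 0.6, 0.9, 0.0]])
sigma_D, sigma_S = 0.5, 0.5

perf = np.exp(-D**2 / sigma_D**2)          # performance numerator
# Independence denominator: 1 + sum of similarity kernels to the other models
indep = 1.0 + np.array([np.sum(np.exp(-S[m, np.arange(len(D)) != m]**2 / sigma_S**2))
                        for m in range(len(D))])
w = perf / indep
w /= w.sum()
print("ClimWIP weights:", np.round(w, 3))
```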
Step 4: Bayesian Model Averaging (BMA)
BMA treats each model as a hypothesis and computes posterior weights from the likelihood of the observations given each model, multiplied by a prior weight:
$$p(Y|\text{obs}) = \sum_{m=1}^{M} p(Y|M_m) \cdot p(M_m|\text{obs}), \quad p(M_m|\text{obs}) \propto p(\text{obs}|M_m) \cdot p(M_m)$$
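A minimal BMA sketch under simplifying assumptions: a Gaussian likelihood for a single observed historical metric, a uniform prior $p(M_m) = 1/M$, and hypothetical numbers throughout (real BMA implementations fit the likelihood width and often use many metrics):

```python
import numpy as np

obs = 0.65                                   # observed historical trend (hypothetical)
hist = np.array([0.55, 0.70, 0.90, 0.60])    # each model's simulated historical trend
proj = np.array([2.1, 2.8, 3.5, 2.4])        # each model's future projection (degC)
sigma_obs = 0.1                              # assumed likelihood width

# Gaussian likelihood p(obs | M_m); uniform prior cancels in the normalisation
like = np.exp(-(obs - hist) ** 2 / (2 * sigma_obs ** 2))
post = like / like.sum()                     # posterior model weights p(M_m | obs)
print("posterior weights:", np.round(post, 3))
print(f"BMA projection: {np.sum(post * proj):.2f} degC")
```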
Step 5: Ensemble Spread vs. Skill
A well-calibrated ensemble has the property that its spread matches its actual prediction error, tested by the rank histogram. The reliability ratio $R = \sigma_{\text{ensemble}}^2 / \text{MSE}$ should be near 1. Under-dispersive ensembles ($R < 1$) underestimate uncertainty, which is common in CMIP projections and motivates the weighting approaches above.
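The reliability ratio can be demonstrated on a synthetic under-dispersive ensemble: members scatter less than the true forecast error because a shared error component is not sampled by the spread. All distributions below are illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(1)
truth = rng.normal(0, 1.0, 500)              # verifying "observations"
ens = (truth[:, None]
       + rng.normal(0, 0.8, (500, 1))        # shared error the members do not sample
       + rng.normal(0, 0.5, (500, 20)))      # member-to-member spread

spread2 = ens.var(axis=1, ddof=1).mean()               # mean ensemble variance
mse = ((ens.mean(axis=1) - truth) ** 2).mean()         # MSE of the ensemble mean
R = spread2 / mse
print(f"reliability ratio R = {R:.2f}  (R < 1 => under-dispersive)")
```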
Python: Marine Heatwave Detection, O₂ Trends & Simple PINN
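A self-contained sketch of the marine heatwave part of this exercise, applying the Hobday et al. (2016) criterion from Section 10.5 to a synthetic SST record (seasonal cycle plus red-noise anomalies); the dataset and noise parameters are invented for illustration.

```python
import numpy as np

rng = np.random.default_rng(42)
NYEARS, NDAYS = 30, 365
doy = np.arange(NDAYS)
# Synthetic daily SST: seasonal cycle + AR(1) "red noise" anomalies
seas = 15 + 5 * np.sin(2 * np.pi * doy / 365)
anom = np.zeros(NYEARS * NDAYS)
for t in range(1, anom.size):
    anom[t] = 0.9 * anom[t - 1] + rng.normal(0, 0.3)
sst = np.tile(seas, NYEARS) + anom

# Day-of-year climatology and 90th-percentile threshold T90(d)
by_doy = sst.reshape(NYEARS, NDAYS)
clim = by_doy.mean(axis=0)
t90 = np.percentile(by_doy, 90, axis=0)

# Detect events: T(t) > T90(d) for >= 5 consecutive days
# (a run still open at the end of the series is ignored in this sketch)
exceed = sst > np.tile(t90, NYEARS)
events, start = [], None
for t, hot in enumerate(exceed):
    if hot and start is None:
        start = t
    elif not hot and start is not None:
        if t - start >= 5:
            events.append((start, t))
        start = None

clim_full = np.tile(clim, NYEARS)
for s, e in events[:3]:
    inten = sst[s:e] - clim_full[s:e]     # anomaly relative to climatology
    print(f"MHW days {s}-{e - 1}: duration {e - s} d, "
          f"I_max {inten.max():.2f} degC, I_cum {inten.sum():.2f} degC d")
print(f"total events: {len(events)}")
```

The strong autocorrelation of the anomalies makes exceedances cluster into multi-day runs, which is why marine heatwaves are common even though only 10% of days exceed the threshold by construction.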
Fortran: Climate Projection Bias Correction & Downscaling
This program implements quantile mapping bias correction for ocean climate model output. The model CDF is mapped to the observed CDF to correct systematic biases in SST projections while preserving the climate change signal:
$$T_{\text{corrected}} = F_{\text{obs}}^{-1}\!\left(F_{\text{model}}(T_{\text{raw}})\right)$$
$F$ = cumulative distribution function; subscripts denote observed vs model distributions
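The quantile-mapping step itself is language-agnostic; the Fortran program is not reproduced here, but the mapping $T_{\text{corrected}} = F_{\text{obs}}^{-1}(F_{\text{model}}(T_{\text{raw}}))$ can be sketched with empirical CDFs in Python. The SST distributions below are synthetic (a model that runs cold with too little variance).

```python
import numpy as np

rng = np.random.default_rng(7)
# Hypothetical historical SST samples: the model is cold-biased and under-dispersive
obs_hist = rng.normal(18.0, 1.5, 3000)
mod_hist = rng.normal(16.5, 1.0, 3000)
mod_fut = rng.normal(18.5, 1.0, 3000)      # raw projection: ~2 degC of warming

def quantile_map(x, mod_ref, obs_ref):
    """T_corr = F_obs^{-1}(F_mod(x)) via empirical CDFs."""
    # F_mod(x): fraction of the model reference period at or below each value
    ranks = np.searchsorted(np.sort(mod_ref), x) / len(mod_ref)
    # F_obs^{-1}: read that quantile off the observed distribution
    return np.quantile(obs_ref, np.clip(ranks, 0.0, 1.0))

corrected = quantile_map(mod_fut, mod_hist, obs_hist)
print(f"raw future mean:        {mod_fut.mean():.2f} degC")
print(f"corrected future mean:  {corrected.mean():.2f} degC")
print(f"model warming signal:   {mod_fut.mean() - mod_hist.mean():.2f} degC")
```

Note one known caveat of simple quantile mapping, visible here: because the observed variance exceeds the model's, the mapping stretches the warming signal as well as the bias, which is why trend-preserving variants exist.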