logit link functionlog link functionSpatial Dependence:
Observations collected at nearby locations are often more similar than those farther apart.
Ignoring Spatial Structure:
Applications:
Goal:
Account for spatial correlation to improve prediction and inference.


Cressie’s book classifies spatial statistics according to data format:
Gelfand’s book classifies spatial statistics according to spatial variation:
Geostatistical data: River blindness in Cameroon
Lattice data: COPD emergency admission
Point pattern: Primary biliary cirrhosis data
\[ Y_i = \beta_0 + \underbrace{\beta_1 d_1(s_i) + \beta_2 d_2(s_i)}_{\text{explained}} + \underbrace{S(s_i) + Z_j}_{\text{unexplained}}, \]
where
\[ r(u; \rho, \nu) = \frac{1}{2^{\nu-1}\Gamma(\nu)} \left(\frac{u}{\rho}\right)^\nu K_\nu\!\left(\frac{u}{\rho}\right), \]
\(K_\nu(\cdot)\): modified Bessel function of order \(\nu\)
Interpretation
Special cases
Often sufficient to choose \(\nu \in \{0.5, 1.5, 2.5\}\)
\[
r(u; \rho, \nu)
= \frac{1}{2^{\nu-1}\Gamma(\nu)}
\left(\frac{u}{\rho}\right)^\nu
K_\nu\!\left(\frac{u}{\rho}\right),
\] 
Exponential correlation function: \(r(u; \rho, 0.5) = \exp\{-u/\rho\}\)
Regression residuals
\[
\hat{r}_i = Y_i - \hat{Y}_i
\] from a GLM
Each \(\hat{r}_i\) estimates
\[
S(s_i) + Z_i
\]
Empirical variogram \[ \hat{V}(u) = \frac{1}{2|N(u)|}\sum_{(i,j)\in N(u)}(\hat{r}_i-\hat{r}_j)^2 \]
where
\[
N(u)=\{(i,j): \lVert s_i-s_j\rVert=u\}
\]


Multivariate Gaussian distribution
\[
Y \sim \text{MVN}(D\beta, \sigma^2 R + \tau^2 I)
\]
Fitting process
So far, we assumed: \[ Y \sim \text{MVN}(D\beta, \sigma^2 R + \tau^2 I) \]
However, many geostatistical datasets are:
A Gaussian likelihood is no longer appropriate
Model-based geostatistics distinguishes between:
Observation model (likelihood): \[ Y_i \mid \eta_i \sim \pi(y_i \mid \eta_i) \]
Latent process model: \[ \eta_i = X_i \beta + S(s_i) + Z_i \]
Key idea:
Gaussian \[ Y_i \mid \eta_i \sim N(\eta_i, \sigma^2) \] Continuous measurements (e.g. pollution levels)
Binomial \[ Y_i \mid \eta_i \sim \text{Binomial}(n_i, p_i), \qquad \text{logit}(p_i) = \eta_i \] Prevalence survey data
Poisson \[ Y_i \mid \eta_i \sim \text{Poisson}(\lambda_i), \qquad \log(\lambda_i) = \eta_i \] Disease counts, event data
The linear predictor enters the likelihood via a link:
\[ g(\mu_i) = \eta_i = X_i\beta + S(s_i) + Z_i \]
Examples:
This gives a generalized linear mixed model (GLMM) structure
For non-Gaussian likelihoods:
The marginal likelihood requires: \[ L(\theta) = \int \pi(y \mid S, \theta)\pi(S \mid \theta)\,dS \]
High-dimensional integral over latent field \(x\)
No closed-form solution
Consequences:
Limitations:
We want:
This motivates Integrated Nested Laplace Approximation (INLA)
Hierarchical models setup:
This class of models is known as Latent Gaussian Models
INLA is designed specifically for this class
INLA is designed for latent Gaussian models (LGMs):
\[ y_i \mid \eta_i, \theta \sim \pi(y_i \mid \eta_i, \theta), \qquad \eta = A S \]
where
Examples of \(S\):
Why this matters:
A Gaussian Markov Random Field (GMRF) is a Gaussian random vector with conditional independence structure.
Let
\[
\mathbf{S} = (S_1,\dots,S_n)^\top \sim \mathcal{N}(0, Q^{-1})
\]
where:
For a GMRF:
\[ Q_{ij} = 0 \quad \Longleftrightarrow \quad S_i \;\perp\!\!\!\perp\; S_j \mid \mathbf{S}_{-ij} \]
Interpretation:
Goal: \[ \pi(\theta \mid y) \propto \frac{\pi(y \mid S, \theta)\pi(S \mid \theta)\pi(\theta)} {\pi(S \mid y, \theta)} \]
Key point:
Conditional on \(\theta\), INLA approximates: \[ \pi(S \mid \theta, y) \]
Approximation:
This step exploits:
Final goal: \[ \pi(S_i \mid y), \qquad \pi(\theta_j \mid y) \]
INLA computes: \[ \pi(S_i \mid y) = \int \pi(S_i \mid \theta, y)\,\pi(\theta \mid y)\,d\theta \]
Integration performed numerically
Output:
Lindgren, Rue, and Lindström (2011) provides the link between the two.
Key point:
In INLA, a GMRF is used as a computational approximation to a continuous GRF
Key ideas:
Benefits:
In geostatistics, we often model spatial dependence using Gaussian random fields (GRFs) with Matérn covariance
For observations at locations \(s_1, \dots, s_n\):
This becomes infeasible for large spatial datasets
A spatial Gaussian random field is defined as
\[ S(s) \sim \text{GRF}(0, C(\cdot)) \]
with covariance function: \[ \text{Cov}\{S(s), S(s')\} = C(\|s - s'\|) \]
A key result (Whittle, 1954):
A Matérn GRF is the solution to the SPDE: \[ (\kappa^2 - \Delta)^{\alpha/2} S(s) = \mathcal{W}(s), \]
where:
We often describe a Matérn GRF using:
INLA/SPDE uses a different (equivalent) parameterisation:
Key relationships (common INLA convention):
Interpretation:
In INLA output you will often see hyperparameters related to:
Even if the underlying SPDE uses \((\kappa,\tau)\), INLA/inlabru often provides summaries in the more interpretable scale:
In inla.spde2.pcmatern() you specify:
The spatial domain is discretised using a triangular mesh
The continuous field is approximated as: \[ S(s) \approx \sum_{k=1}^K w_k \psi_k(s) \] where:
\(\psi_k(s)\) are basis functions
\(w_k\) are random weights
Local support of basis functions ⇒ sparsity
The SPDE discretisation yields \[ \mathbf{w} \sim \mathcal{N}(0, Q^{-1}) \]
Precision matrix \(Q\) is sparse
This defines a Gaussian Markov Random Field (GMRF)
Key consequence:
INLA is well suited for:
INLA is less suitable for:
INLA and inlabru R packagesWhat is inlabru?
An R package built on top of INLA
Designed for spatial and spatio-temporal modelling using
Integrated Nested Laplace Approximation (INLA)
Provides an intuitive interface for:
Install INLA: https://www.r-inla.org/download-install
Examples and documentation: https://inlabru-org.github.io/inlabru/
Key features
sf, sp, and raster