Kernel Matrix Formation
Physical Representation in Data Space
The kernel matrix (K) represents the covariance structure between all pairs of data points. Each element K[i][j] is the covariance between points i and j, determined by their distance and the kernel parameters; since covariance is symmetric, K[i][j] = K[j][i]. This creates a smoothness constraint on the space of possible functions.
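As a concrete sketch of this step, the snippet below builds a kernel matrix for a few 1-D points using the squared-exponential (RBF) kernel. The kernel choice and the parameter names length_scale and variance are illustrative assumptions, not something the text above specifies.

```python
import numpy as np

def rbf_kernel(X1, X2, length_scale=1.0, variance=1.0):
    # Squared-exponential (RBF) kernel matrix between two 1-D point sets.
    # sq_dists[i, j] is the squared distance between X1[i] and X2[j].
    sq_dists = (X1[:, None] - X2[None, :]) ** 2
    return variance * np.exp(-0.5 * sq_dists / length_scale**2)

X = np.array([0.0, 1.0, 3.0])  # hypothetical 1-D training inputs
K = rbf_kernel(X, X)           # 3x3 covariance matrix

# K is symmetric, has the prior variance (1.0) on the diagonal,
# and its off-diagonal entries shrink as points move apart:
# nearby points (0.0 and 1.0) covary more than distant ones (0.0 and 3.0).
```

Shrinking length_scale makes the off-diagonal entries decay faster, i.e. a rougher smoothness constraint.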
Posterior Mean Calculation
The kernel matrix defines the covariance structure between all training points. It encodes how much each point influences others based on their distance.
The inverse matrix (K⁻¹) is the precision matrix. It determines how much each observation should contribute to the final prediction, after accounting for the correlations between training points.
The observation vector contains the actual function values at the training points. These are the values we want to interpolate between.
K⁻¹y gives the weight attached to each observation. It re-expresses the data in the basis of kernel functions centered at the training points, determining how much each point contributes to predictions.
The posterior mean at the test points is μ* = K*K⁻¹y, where K* is the covariance between the test points and the training points.
The posterior mean is a weighted combination of kernel functions centered at each data point. The weights (K⁻¹y) determine how much each point contributes to the prediction at any location.
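The steps above can be sketched end to end in a few lines. This is a minimal illustration, assuming an RBF kernel, noise-free observations of sin(x), and a small diagonal jitter for numerical stability; none of these choices come from the text itself.

```python
import numpy as np

def rbf_kernel(X1, X2, length_scale=1.0):
    # Squared-exponential kernel between two 1-D point sets (assumed form).
    return np.exp(-0.5 * (X1[:, None] - X2[None, :]) ** 2 / length_scale**2)

X = np.array([0.0, 1.0, 2.0])   # training inputs
y = np.sin(X)                   # observed function values to interpolate

# Kernel matrix K with a tiny jitter so the solve is well-conditioned.
K = rbf_kernel(X, X) + 1e-8 * np.eye(len(X))

# The weights K^{-1} y: solve the linear system rather than forming K^{-1}.
alpha = np.linalg.solve(K, y)

# Posterior mean mu* = K* (K^{-1} y) at a grid of test points.
X_test = np.linspace(0.0, 2.0, 5)
K_star = rbf_kernel(X_test, X)  # covariance between test and training points
mu = K_star @ alpha

# At test points that coincide with training inputs, mu reproduces the
# observed y values (up to the jitter), as interpolation requires.
```

Using np.linalg.solve instead of explicitly inverting K is the standard numerically stable way to compute the K⁻¹y weights.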
How Gaussian Processes Work
A Gaussian Process is a powerful non-parametric method that defines a distribution over functions. It's completely specified by its mean function and covariance (kernel) function. As you add data points, the GP updates its posterior distribution to reflect both the observed data and its uncertainty in regions without data.
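The full posterior, mean plus uncertainty, can be sketched as below. The data values and test locations are made up for illustration; the point is that the predictive standard deviation collapses near observed data and returns to the prior value far from it.

```python
import numpy as np

def rbf_kernel(X1, X2, length_scale=1.0):
    # Assumed squared-exponential kernel with unit prior variance.
    return np.exp(-0.5 * (X1[:, None] - X2[None, :]) ** 2 / length_scale**2)

X = np.array([-1.0, 0.5, 2.0])   # hypothetical training inputs
y = np.array([0.2, -0.3, 0.8])   # hypothetical observations
X_test = np.array([0.5, 5.0])    # one point on the data, one far away

K = rbf_kernel(X, X) + 1e-8 * np.eye(len(X))  # jitter for stability
K_star = rbf_kernel(X_test, X)
K_ss = rbf_kernel(X_test, X_test)

# Posterior mean: mu* = K* K^{-1} y
alpha = np.linalg.solve(K, y)
mu = K_star @ alpha

# Posterior covariance: K** - K* K^{-1} K*^T
cov = K_ss - K_star @ np.linalg.solve(K, K_star.T)
std = np.sqrt(np.diag(cov))

# std is near zero at x = 0.5 (an observed point) and close to the
# prior standard deviation of 1 at x = 5.0, far from all data.
```

This is the behavior described above: the posterior reflects the observed data where it exists and reverts to the prior's uncertainty where it does not.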