What if computation isn’t something we impose on matter, but something matter already does?
A rock falling computes its own trajectory. The differential equation is F = ma, and the rock solves it in real time, with perfect precision, at no energy cost beyond what gravity supplies. Water ripples solve wave equations. A soap film minimizes surface area—an optimization problem that would take your laptop seconds to approximate. The rock doesn’t know Newton. The soap film has never heard of Lagrange multipliers. The question isn’t whether physical systems compute. They obviously do. The question is whether we can read the answer.
In the early 2000s, Herbert Jaeger and Wolfgang Maass independently arrived at the same insight: you don’t need to train the whole network. Feed a signal into a dynamical system—a reservoir of interconnected nodes. Each node responds differently: different coupling strengths, different timescales, different nonlinearities. The input signal gets scattered into a high-dimensional space, like white light hitting a prism and fanning into a spectrum. Then attach a simple linear readout—a weighted sum—to combine these scattered responses into whatever answer you need.
The reservoir never learns. Its weights are random, fixed at initialization, never updated. Only the readout learns. And the readout is trivially simple: a linear combination, solvable in one step by least squares. No backpropagation. No gradient descent. No vanishing gradients. The complex temporal processing happens in the reservoir’s dynamics. The learning happens in the readout’s arithmetic.
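A minimal sketch makes the division of labor concrete. Everything in it is illustrative rather than canonical: the reservoir size, the 0.9 scaling factor, and the recall-the-input-five-steps-earlier task are all arbitrary choices, not part of any standard recipe.

```python
import numpy as np

rng = np.random.default_rng(0)
n_res, n_steps = 100, 1000          # hypothetical sizes

# Reservoir weights: random, fixed at initialization, never updated.
W_in = rng.uniform(-0.5, 0.5, size=n_res)
W = rng.normal(size=(n_res, n_res))
W *= 0.9 / max(abs(np.linalg.eigvals(W)))   # keep the echoes stable (more on this below)

def run_reservoir(u):
    """Scatter a 1-D input signal into an n_res-dimensional trajectory."""
    x = np.zeros(n_res)
    states = np.empty((len(u), n_res))
    for t, u_t in enumerate(u):
        # Each node mixes the new input with echoes of the previous state.
        x = np.tanh(W_in * u_t + W @ x)
        states[t] = x
    return states

u = np.sin(np.linspace(0, 20 * np.pi, n_steps))   # example input signal
X = run_reservoir(u)                              # shape (n_steps, n_res)

# The readout: one linear least-squares fit. No backpropagation.
y_target = np.roll(u, 5)     # task: reproduce the input from 5 steps earlier
w_out, *_ = np.linalg.lstsq(X, y_target, rcond=None)
y_pred = X @ w_out
```

The only object that ever learns here is `w_out`, and it comes out of a single least-squares call.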
This is the echo state network. And the visual makes the logic clear:
Watch the nodes respond. Some track the input closely. Others lag behind, smoothing over fast transients. A few invert. Several ring at their own frequencies, triggered by the input but singing their own song. The reservoir is not reproducing the input. It is decomposing it—spreading one signal across many temporal bases, each capturing a different feature of the input’s history.
Softness is memory. An octopus arm bending stores information about past forces in its current shape. The viscoelastic compliance of muscle tissue means the arm’s configuration at any moment encodes a weighted integral of every force that has acted on it—recent forces weighted heavily, older forces fading exponentially. This is exactly what a reservoir does. The arm is the reservoir.
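That “weighted integral” is easy to make concrete. A one-parameter leaky trace (a toy scalar stand-in for viscoelastic tissue, with a hypothetical damping constant `lam`, not real biomechanics) shows how recent inputs dominate and old ones fade:

```python
import numpy as np

def fading_memory(forces, lam=0.9):
    """Leaky trace: s_t = sum over k of lam**(t - k) * f_k.

    The newest force enters at full weight; a force applied k steps
    ago has faded to lam**k of its original contribution.
    """
    s, trace = 0.0, []
    for f in forces:
        s = lam * s + f      # decay the old echoes, add the new force
        trace.append(s)
    return np.array(trace)

# A single impulse decays exponentially: 1.0, 0.9, 0.81, 0.729, ...
print(fading_memory([1.0, 0.0, 0.0, 0.0]))
```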
A dead trout swims. Place a trout carcass in a vortex street—a Kármán wake behind a cylinder—and it undulates upstream. No neural control. No muscle activation. The body’s passive viscoelasticity transforms the periodic water flow into locomotion. The fish’s body is a reservoir that computes the appropriate swimming gait from the incoming flow, without any brain to read the output.
Viscosity is the echo state property. It determines how long the medium remembers. Too little damping and the reservoir is chaotic—small differences in input produce wildly different responses, and the system forgets nothing, which means it distinguishes everything, which means it generalizes from nothing. Too much damping and the reservoir is dead—every input produces the same flat response; the system remembers nothing. The sweet spot—the edge of chaos—is where the spectral radius of the reservoir’s weight matrix approaches one from below. Where echoes persist long enough to be useful but fade fast enough to stay separable.
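In practice, tuning this knob means rescaling a random weight matrix to a target spectral radius. A sketch, with target values chosen purely for illustration (where the edge actually sits depends on the nonlinearity and the input statistics):

```python
import numpy as np

def scale_to_spectral_radius(W, rho):
    """Rescale W so its largest eigenvalue magnitude equals rho."""
    return W * (rho / max(abs(np.linalg.eigvals(W))))

rng = np.random.default_rng(1)
W_raw = rng.normal(size=(100, 100))

W_dead    = scale_to_spectral_radius(W_raw, 0.1)    # echoes die almost instantly
W_edge    = scale_to_spectral_radius(W_raw, 0.95)   # echoes persist, then fade
W_chaotic = scale_to_spectral_radius(W_raw, 1.5)    # echoes grow; states diverge
```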
And here is the key insight: the reservoir already contains the answer. You just need to find the right linear combination.
Try it yourself. Drag the weight sliders, watch the output lurch toward the target, overshoot, oscillate. You are doing by hand what gradient descent does iteratively: searching the weight space for the combination that minimizes error. Now press solve. The least-squares solution finds the optimum in one step—not iteratively, not approximately, but exactly. The normal equation: w = (XᵀX)⁻¹Xᵀy. One matrix inversion. Done.
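For the skeptical, that one step is a few lines of NumPy. The states and target below are random stand-ins for the reservoir trajectory and task above; the second variant records the practical caveat that you solve the system rather than form the inverse explicitly:

```python
import numpy as np

rng = np.random.default_rng(2)
X = rng.normal(size=(1000, 100))   # stand-in for collected reservoir states
y = rng.normal(size=1000)          # stand-in for the target signal

# The normal equation, literally as written: w = (XᵀX)⁻¹Xᵀy.
w = np.linalg.inv(X.T @ X) @ X.T @ y

# In practice, solve the linear system instead of forming the inverse:
# cheaper, numerically safer, and it yields the same optimum.
w_solved = np.linalg.solve(X.T @ X, X.T @ y)
assert np.allclose(w, w_solved)
```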
The error drops. The output snaps onto the target. The reservoir was already doing the hard work—the nonlinear temporal expansion. The readout just had to find the projection.
This is why active media matter. Equilibrium systems can’t compute well. Detailed balance—the requirement that every microscopic process runs equally in both directions—prevents directed information flow. In equilibrium, correlations decay monotonically. There are no oscillations, no sustained echoes, no temporal memory beyond exponential decay. A system at thermal equilibrium is the ultimate overdamped reservoir: it forgets everything instantly.
Energy input changes everything. A driven system can amplify nonlinearly, sustain oscillations, maintain multiple timescales, self-organize into functional architectures. A chemical reaction network with a million molecular species solved XOR—not because anyone designed it to, but because the energy-driven nonequilibrium dynamics naturally generated the necessary nonlinear separability. The reaction computed the answer. Nobody told it how.
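The separability claim can be sanity-checked without any chemistry. In the sketch below, a toy under loud assumptions and not the reaction network itself, a fixed random tanh expansion plays the role of the driven nonlinear medium: XOR, famously not linearly separable in its raw input space, becomes solvable by a plain linear readout after the expansion.

```python
import numpy as np

rng = np.random.default_rng(3)

# XOR is not linearly separable in its raw 2-D input space.
U = np.array([[0., 0.], [0., 1.], [1., 0.], [1., 1.]])
y = np.array([0., 1., 1., 0.])

# A fixed random nonlinear expansion stands in for the driven dynamics.
W_in = rng.normal(size=(2, 50))
b = rng.normal(size=50)
X = np.tanh(U @ W_in + b)          # 4 inputs scattered into 50 dimensions

# A plain linear readout now suffices.
w, *_ = np.linalg.lstsq(X, y, rcond=None)
print(np.round(X @ w, 6))          # recovers [0, 1, 1, 0] to numerical precision
```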
This is the deeper lesson. We spend enormous effort engineering computation: designing logic gates, optimizing neural architectures, tuning hyperparameters. But the universe has been computing since long before we arrived to watch. Every nonlinear dynamical system is a reservoir. Every dissipative structure is a readout waiting for someone to set the weights. The medium was never just carrying the message. It was never just shaping the message. It was computing the message—performing transformations that no linear system could replicate, generating outputs that no passive medium could produce.
The question was never whether matter can think. The question is whether we can learn to listen.