Diagnosing the Wait: A Bayesian breakdown of NHS cancer wait times.
News outlets constantly report on missed targets, but rarely explain what is driving them. That is why I engineered a Bayesian Hierarchical model using four years of NHS open data to isolate the exact drivers of these delays—quantifying precisely where resources will make the biggest impact.
Why a Bayesian model — and not a black box
The 62-day standard is simple: following an urgent referral, treatment should begin within 62 days. When it doesn’t, that’s a breach—and behind each one is a real person waiting longer than they should for an answer.
The published figures tell you a breach happened. They don’t tell you why, or where to act. I could have thrown the data at a black-box predictor, but prediction isn’t the job here—understanding is. A Bayesian model gives the answer and tells you how much to trust it, which in healthcare matters as much as the answer itself.
What the structure reveals
For anyone deciding where to put scarce time and money, the model points to three concrete places to look — and one more that proves it can be trusted.
Which cancer types are structurally deteriorating, and where are we gaining ground?
Some cancer types are simply more difficult to treat inside the 62-day standard than others, and a single national average buries that detail.
This model gives every cancer type its own honest baseline, separating real structural drift from normal monthly noise.
It then translates those trends into hard projections. The result is a projected range for how many patients will wait too long.
157 trusts. Which are genuine outliers — and which are turning it around?
Judging every hospital against a flat national target hides their actual operational reality.
Instead, this model plots all 157 trusts on two axes — their true baseline and their structural trajectory — separating genuine systemic outliers from trusts simply having a run of bad luck.
Filter by cancer type, and it exposes exactly where ostensibly “safe” hospitals are quietly failing a specific one.
“What’s the risk for my cancer type, at my hospital — and how many patients is that?”
This view translates a complex statistical landscape into an on-demand operational answer.
It makes the model’s full posterior queryable, instantly returning a hospital’s true structural breach probability with all seasonal noise and data anomalies stripped out.
By projecting these rates into hard patient volumes and 95% credible intervals, it hands budget holders the full best-case and worst-case range they need to allocate resources — without relying on false-precision historical averages.
What this model delivers
Diagnosis, not just prediction
Not “what will happen” but “why it happens.” Each cancer’s baseline risk, each trust’s deviation from the average, and each trend — separated, quantified, and ranked. The starting point for any sensible plan.
Translated for decision-makers
Statistical coefficients become breach probabilities; trends become projected patient numbers; rankings filter by region and cancer type. The output speaks the language of the people who hold the budget — not the people who write the code.
Honest about what it doesn’t know
Every estimate carries its uncertainty. Thin-evidence trusts are flagged, not ranked on noise, and a calibration check proves the risk scores can be trusted. In healthcare, false confidence is the dangerous kind.
All of the above only counts if the model holds up. Here’s the proof it does.
Have a data challenge that requires this level of clarity?
I build quantitative systems for organisations that act on their data — explained clearly enough for the boardroom and rigorously enough for the analysts.
Get in touch