A Bayesian network is the right structure for causal reasoning under uncertainty. It is not the right structure for everything.

The case for Bayesian networks is strong precisely because the conditions under which they are not appropriate are well-defined — and honest about scope.

Dialog: We tried Bayesian networks a few years ago. It didn't work. — answered: tell us what failed. Most past efforts that didn't work weren't doing what current SCMs do — correlational networks dressed up as causal ones, or learning structure from data alone, or answering Rung 1 questions and calling them Rung 3.

The strength of a Bayesian network is that it makes the causal structure explicit and auditable. The corresponding weakness: if the structure is wrong, the model is precisely wrong. A BN with an incorrect graph makes specific, confident predictions in the wrong direction — which may be more dangerous than a vaguer model that simply admits uncertainty.

There are domains where the causal structure is genuinely contested or unknowable from available evidence. Macroeconomic systems with extensive feedback loops and non-stationarity are a hard case. Social systems where behavior adapts to the model itself — Lucas critique dynamics — are another. In these cases, a BN imposes a causal architecture that may be no more defensible than a competing architecture, and the appearance of rigor can be misleading.

The minimum bar

A BN is appropriate when subject-matter experts can construct a causal graph that they will defend under challenge — not because it is certainly correct, but because it is more plausible than the alternatives. If no such graph can be constructed, a BN is premature. Start with the causal question, not the modeling tool.

Bayesian networks are particularly valuable when data is scarce, when expert knowledge is rich, and when the goal is causal inference — estimating what would happen under an intervention, not just predicting what will happen next given the current regime. For purely predictive tasks in stable, data-rich environments, gradient-boosted trees or deep learning models will often achieve better predictive accuracy with less modeling effort.

The decision criterion: is the model expected to answer “what will happen if we do X?” or only “what is the probability of Y given what we currently observe?” If the latter, and if the distribution is expected to remain stable, the causal overhead of a BN may not be justified.

The caveat: predictive goals drift into causal goals faster than practitioners expect. A model built to predict default probability is quickly repurposed to ask which interventions would reduce default rates. At that point, the predictive model fails and the causal model is required. Building for causality from the start is often the cheaper option in the long run.

Expert-encoded Bayesian networks require a workshop to construct, validation against data or domain expertise, and ongoing governance to update when the structure changes. For systems where the causal structure is relatively stable — operational risk, insurance underwriting, supply chain dependencies — this governance cost is justified by the model’s longevity.

For systems in rapid flux — a new product line in its first year, a market disruption unfolding in real time, an organization undergoing structural change — the causal graph may be obsolete before it has been fully specified. In these cases, a more flexible tool that explicitly represents model uncertainty — or a simpler model that is updated frequently — may be more appropriate.

Dynamic Bayesian Networks partially address this by modeling temporal evolution, but they require the transition structure to be specified, which is itself a modeling commitment. The deeper point: BNs are well-suited to stable causal structures with well-understood mechanisms. They are less well-suited to domains where the mechanism itself is under construction.

Exact inference — computing the posterior probability of every node given evidence — uses the junction tree algorithm, which has complexity exponential in the treewidth of the graph. For typical enterprise risk models with moderate numbers of nodes and sparse connections, this is fast enough. For large, densely connected networks, it becomes a constraint.

Approximate inference algorithms — loopy belief propagation, variational methods, Monte Carlo sampling — scale better but sacrifice exactness. For high-stakes risk quantification, where the precision of the inference matters, approximate methods require careful validation.

The practical implication: for high-frequency scoring applications (fraud detection at transaction speed, credit scoring for real-time lending decisions), the inference cost of a BN may be prohibitive relative to a simpler lookup table or a pre-compiled scoring model. BNs are better suited to models that run on a schedule — daily, weekly, quarterly — than to models that must respond in milliseconds to live data streams.

Use a BN when: the goal involves intervention reasoning or diagnosis; expert causal knowledge is available and defensible; data is sparse relative to the complexity of the problem; and the causal structure is stable enough to survive the governance cycle.

Use something else when: the goal is purely predictive and the distribution is stable; the causal structure is genuinely unknown; the system is changing faster than the model can be updated; or real-time inference at scale is required.

The deeper point is that these are not competing ideologies. A BN is the right structure for a specific class of problems. Recommending it for all problems would be advocacy, not expertise. The value of knowing when not to use a tool is that it makes the recommendation to use it more credible when the conditions are right.

Next Step

Knowing when not to use a tool is what makes the recommendation to use it credible. If your problem fits the four conditions — let’s confirm that together.

info@rung3.ai

Condition Use a BN Use instead
Goal is intervention reasoning or counterfactual
Goal is purely predictive, stable distribution, abundant data Gradient boosting, neural nets, logistic regression
Causal structure genuinely unknown and unknowable Randomised experiment; structure learning with expert review
System evolves faster than model can be updated Dynamic BN with fast update cycle; online learning
Real-time inference at scale required (<10ms latency) Approximate inference (loopy BP, variational); compiled decision trees

The deeper point is that these are not competing ideologies. A BN is the right structure for a specific class of problems. These conditions are not failures of the approach — they are the boundary conditions that define where the approach belongs, and where to use something else.