Causal Representation Learning from Unknown Interventions
Wendong et al., 2024

The standard assumption in causal discovery is that we know which variables were intervened on during data collection. This paper relaxes that assumption, developing identification results for causal structure even when the intervention targets are unknown — which is the realistic condition in most observational enterprise data, where the “interventions” are business decisions made without experimental control.

Why it matters: Most enterprise datasets are generated by a mix of deliberate actions and natural variation whose causal origins are ambiguous. This work extends the conditions under which structure learning produces reliable graphs — relevant to the Structure Learning question of when algorithmic discovery complements expert encoding.

NeurIPS 2024  ·  arXiv 2306.02380
Estimating Causal Effects Identifiable from a Combination of Observations and Experiments
Tikka et al., 2023

Develops algorithms for combining observational and experimental data sources when each source alone is insufficient to identify the causal effect of interest. Directly addresses the most common practical situation: an organization has historical observational data and limited experimental evidence from pilots or A/B tests, and wants to use both.

Why it matters: The identification assumptions on this site’s case studies assume either purely observational data or a clean causal graph. This work provides the formal basis for the mixed case — which is the realistic one when a risk program has both historical claims data and a handful of controlled trials.

JMLR 24(1)  ·  arXiv 2011.06327
Large Language Models as Causal Reasoners
Kıcıman et al., 2023

A systematic evaluation of LLM performance on causal reasoning tasks — pairwise causal discovery, counterfactual reasoning, and intervention prediction — across benchmarks. LLMs perform well on pairwise direction tasks (where they can draw on world knowledge) and poorly on counterfactual and interventional queries (where computation is required). The gap between the two widens as queries move up Pearl’s Ladder.

Why it matters: Provides empirical grounding for the architectural claim on this site: LLMs are useful for Rung 1 pattern recognition and graph structure suggestion; they cannot substitute for an Structural Causal Model (SCM) on Rung 2 and Rung 3 queries. The failure mode is not random — it is systematic and predictable from the Ladder.

Microsoft Research, 2023  ·  arXiv 2305.00050
Causal Discovery with Score Matching on Additive Models with Arbitrary Noise
Montagna et al., 2023

Extends score-based structure learning to additive noise models without assuming a specific noise distribution. In practice, enterprise risk data rarely follows Gaussian noise assumptions — loss distributions are heavy-tailed, claim counts are over-dispersed, credit defaults cluster. This work removes a significant restriction on when algorithmic structure learning produces valid graphs.

Why it matters: The gap between structure-learning theory and practice has long included the noise distribution assumption. This narrows it — relevant to any domain where the data-generating process is non-Gaussian, which is most risk domains.

AISTATS 2023  ·  arXiv 2304.03265
Actual Causality and Responsibility Attribution in Decentralized Partially Observable Markov Decision Processes
Triantafyllou et al., 2023

Formalizes actual causality — the “but for” question in legal and regulatory contexts — within multi-agent settings where decisions are made in parallel with partial information. Extends Halpern-Pearl actual causality to the case where responsibility is distributed across agents who cannot observe each other’s actions.

Why it matters: Directly relevant to the Insurance Attribution and Criminal Causation cases, where the legal question is exactly this: given that multiple parties acted, which ones bear responsibility for the outcome, and by how much?

AAMAS 2023  ·  arXiv 2209.08420

The papers that established the formal framework this site is built on. Not new — but the right starting point for a practitioner who wants to read the primary sources rather than summaries.

This work sits inside the broader neurosymbolic AI research tradition — the project of fusing neural and symbolic systems. The references below are the formal foundations of the symbolic side, with the causal-inference subset given primary attention.

Causality: Models, Reasoning, and Inference (2nd ed.)
Pearl, 2009

The mathematical foundation. Chapters 1–3 (the do-calculus and identification theory) and Chapter 7 (counterfactuals) are the core. The rest is important but can be read in any order once those are clear.

Causal Inference in Statistics: A Primer
Pearl, Glymour & Jewell, 2016

The accessible introduction for statisticians. Shorter and more direct than Causality. The right starting point for a quantitative practitioner who wants the formal argument without the full technical apparatus.

Bayesian Networks and Decision Graphs (2nd ed.)
Jensen & Nielsen, 2007

The applied reference for Bayesian networks as decision tools. Covers belief propagation, CPT specification, influence diagrams, and LIMIDS. The book closest to what a practitioner building the models on this site actually needs.

Elements of Causal Inference
Peters, Janzing & Schölkopf, 2017

The ML-oriented treatment. Covers structural causal models, identifiability, and structure learning from a machine learning perspective. Freely available online. Particularly useful for practitioners coming from an ML background who want to understand where the causal framework sits relative to what they already know.

Modeling Creative Abduction Bayesian Style
Feldbacher-Escamilla & Gebharter, 2018

Formalizes creative abduction — the inference that posits a new unobserved cause to explain correlated dispositions — as inference over a Bayesian network with a latent common-cause node. Provides three necessary probabilistic conditions and grounds the move in Reichenbach’s Common Cause Principle. The formal bridge between what senior experts do when they recognize a hidden driver and what a Bayesian network with a latent variable represents.

Quantitative Risk Management: Concepts, Techniques and Tools
McNeil, Frey & Embrechts, 2005

The standard graduate reference for observational-statistical QRM — copulas, extreme value theory, coherent risk measures, multivariate GARCH. The most complete account of how sophisticated the classical apparatus becomes before it meets the rung-3 wall. Read it to understand exactly what statistical machinery a modern risk function already has, and what questions that machinery still cannot answer.

How to Measure Anything: Finding the Value of Intangibles in Business
Hubbard, 2014 (3rd ed.)

The practitioner companion to the academic elicitation literature. Hubbard’s lasting contribution is operational: he turned the calibration-training finding from Tversky & Kahneman, Spetzler, and O’Hagan into a workshop module any risk function can run. Read it for the discipline that makes elicited probabilities defensible — and for the case that anything an organization cares about can be measured well enough to support a decision.

Suggestions

If you have read something that belongs here — a paper that changes how causal models should be built or queried in applied settings — send it.

info@rung3.ai