16. FCI β Fast Causal Inferenceο
Constraint-based β’ Latent-capable β’ Output: PAG
FCI is the canonical constraint-based algorithm for learning causal structure when hidden confounders and selection bias may be present. It generalizes PC from DAGs/CPDAGs to MAGs/PAGs, using conditional independence tests plus a rich set of orientation rules.
16.1. Key ideaο
Use conditional independences to recover a Partial Ancestral Graph (PAG) that represents all causal models (MAGs) compatible with the data, even when some variables are unmeasured or selection-biased.
FCI proceeds in three broad stages:
Adjacency search
Start from a complete undirected graph; remove edges when variables are independent given some conditioning set.Extra-edge removal using possible-d-separation sets
Go beyond PC by testing separations along paths that may involve latent variables, using βpossible-d-sepβ sets to identify additional edges that must be removed.Orientation rules (R0βR10)
Apply a collection of sound orientation rules (including collider orientation, discriminating paths, and visible/invisible-edge rules) to obtain a PAG with tails, arrowheads, and circles encoding ancestral constraints.
The output PAG compactly encodes all causal structures consistent with the CI relations under the assumptions of Causation, Prediction, and Search (CPS).
16.2. When to use FCIο
Use FCI when:
You cannot assume causal sufficiency (latent confounders of measured variables are plausible).
Selection bias may be present.
You want a PAG as the target representation.
You prefer a pure CI-based method with well-understood theory.
Typical scenarios:
Observational studies in epidemiology, social science, psychology.
Combining variables from different sources where unmeasured drivers are likely.
Exploratory causal discovery where βno latentsβ is clearly unrealistic.
If you are confident there are no hidden confounders, algorithms like PC, FGES, or BOSS (with DAG/CPDAG output) are usually more appropriate.
16.3. Assumptionsο
FCI relies on the standard assumptions from Causation, Prediction, and Search:
The data come from a causal DAG over measured plus latent plus selection variables.
The distribution is Markov and faithful to the corresponding MAG over the measured variables.
Errors are independent.
CI tests are sufficiently reliable in the sample limit (independence oracle in theory).
The full FCI rule set (R0βR10), as completed by Zhang (2008), is sound, arrowhead-complete, and tail-complete for PAGs. This means FCI orients every arrowhead and tail that is invariant across all MAGs in the Markov equivalence class, and orients none that are not so invariant. The resulting graph is the maximally informative PAG.
16.4. How it works (at a glance)ο
16.4.1. 1. Initial adjacency search (PC-style)ο
Start with a complete undirected graph over the measured variables.
For each pair of variables X, Y, search over conditioning sets S (starting from size 0 and increasing) such that:
If X is independent of Y given S (written: X β Y | S), remove the edge X - Y and record S as the separating set Sepset(X, Y).
This stage is essentially the Fast Adjacency Search (FAS) from PC.
Result: a sparse skeleton plus separating sets, but still possibly containing edges that can only be explained by latents or selection.
16.4.2. 2. Extra-edge removal using possible-d-sepο
PCβs skeleton is not sufficient in the presence of latents. FCI refines it via possible-d-separation:
For each remaining adjacency X - Y, compute a Possible-D-SEP set, which contains nodes that could appear on a d-separating path between X and Y in the presence of latent variables.
Test additional independences of the form X β Y | S where S is a subset of Possible-D-SEP(X, Y).
If such an S is found, remove the edge and update the separating set.
This step finds edges that PC cannot remove because the conditioning sets required to reveal independence are nonlocal when latent variables exist.
16.4.3. 3. Orientation rules (R0βR10)ο
With the refined skeleton and separating sets, FCI orients edges using a sequence of rules (R0βR10). Highlights:
Unshielded colliders (R0)
For triples X - Z - Y where X and Y are nonadjacent:
If Z is not in Sepset(X, Y), orient as:
X -> Z <- Y
Propagation rules (R1βR4)
Generalizations of Meek-style rules but adapted for PAGs:
Prevent creation of new unshielded colliders.
Avoid directed cycles.
Manage circle endpoints appropriately.
Latent/selection-specific rules (R5βR10)
These operate on discriminating paths, visible versus invisible edges, and ensure the resulting orientation is valid for some MAG compatible with the detected CI pattern.
Rules are applied iteratively until no further orientations apply.
16.4.4. 4. PAG outputο
A PAG uses three endpoint marks:
Tail (βββ) on X β? Y: X is an ancestor of Y in all MAGs in the equivalence class.
Arrowhead (β->β) on X ?-> Y: Y is not an ancestor of X in any compatible MAG.
Circle (βoβ): Orientation is not determined (both possibilities occur across compatible MAGs).
The PAG captures all invariant ancestral relations implied by the observed CIs.
16.5. How it relates to other Tetrad algorithmsο
PC: special case assuming no latent confounders or selection bias; outputs a CPDAG.
RFCI: faster but sometimes more conservative.
GFCI, BOSS-FCI, GRaSP-FCI: hybrids that use score-based initialization followed by FCI-style refinement.
FCIT: retains FCIβs PAG semantics but uses targeted, score-guided test scheduling.
16.6. Strengthsο
Handles hidden confounders and selection bias explicitly.
Produces a PAG that encodes all invariant ancestral relations.
Provably sound and arrow/tail complete with an independence oracle.
Foundational algorithm in causal discovery with latent variables.
16.7. Limitationsο
CI-test intensive; expensive for large graphs or high maximum conditioning-set sizes.
Finite-sample CI errors can propagate.
PAGs are more complex to interpret than CPDAGs.
Assumes acyclicity and faithfulness.
RFCI, GFCI, or FCIT may be preferred for larger problems.
16.8. Prior knowledgeο
FCI in Tetrad typically supports:
Required and forbidden edges.
Tiers (variables in earlier tiers cannot have later ones as ancestors).
Required edges are usually applied early so that subsequent rules propagate consequences.
16.9. Key parameters in Tetradο
Parameter (camelCase) |
Description |
|---|---|
|
Maximum conditioning-set size for adjacency search and possible-d-sep refinement. Use |
|
If |
|
Determines how colliders are oriented: |
|
Limits the length of discriminating paths considered in orientation rules. Use |
|
If |
|
If |
|
False discovery rate threshold |
|
Time-series lag parameter. If positive, applies the time-lag transform to produce lagged copies of each variable. |
|
If |
|
If |
|
If |
Note that for the collider orientation style, a choice of CONSERVATIVE yields an algorithm that has been called CFCI (Conservative FCI), whereas a choice of MAX-P yields an algorithm that has been called FCI-Max. The reference for FCI-Max is given below (Raghu et al., 2018).
16.10. Referencesο
Spirtes, Glymour, and Scheines.
Causation, Prediction, and Search. MIT Press.
Zhang, J. (2008). βOn the completeness of orientation rules for causal discovery in the presence of latent confounders and selection bias.β Artificial Intelligence, 172(16β17), 1873β1896.
Raghu, V. K., Ramsey, J. D., Morris, A., Manatakis, D. V., Sprites, P., Chrysanthis, P. K., β¦ & Benos, P. V. (2018). Comparison of strategies for scalable causal discovery of latent variable models from mixed data. International journal of data science and analytics, 6(1), 33-45.