46. SP — Sparsest Permutation
Type: Score-based CPDAG search (exact/ideal, order-based)
Output: CPDAG
Category: 📏 Score-based
SP (Sparsest Permutation) is a conceptually simple but combinatorial algorithm: it evaluates all permutations of the variables and returns the DAG with the sparsest structure consistent with the data. It is only feasible for very small graphs but serves as an important theoretical and practical reference. oai_citation:6‡arXiv
46.1. Key idea
The SP principle:
For each permutation (ordering) of the variables:
Construct the best DAG consistent with that ordering, typically by allowing each variable to take parents only from variables earlier in the order.
Use a sparsity-aware score (e.g., penalized likelihood, sparsity in the Cholesky factor).
Among all permutations, select the DAG that:
Minimizes the number of edges (or a sparsity-penalized score).
In the Gaussian case, this corresponds to the sparsest Cholesky factorization of the precision matrix.
Under appropriate conditions, SP is statistically consistent under assumptions strictly weaker than full faithfulness. oai_citation:7‡arXiv
46.2. When to use
In Tetrad, SP is primarily a small-scale, high-accuracy algorithm:
Suitable for:
Not suitable for:
Medium or large graphs (permutation space grows as
p!).Routine large-scale structure learning.
For anything beyond very small p, consider GRaSP or BOSS, both of which are SP-inspired but scalable.
46.3. How it works (at a glance)
Enumerate permutations
Loop over all permutations of the variable set.For each permutation
Construct a DAG by allowing each node to take parents among earlier variables in the order.
Select the parent set that optimizes a local score (subject to sparsity), e.g., penalized likelihood.
Measure sparsity
Count number of edges, or use an equivalent sparsity-penalized score.
In the Gaussian case, this is closely tied to the sparsity of the Cholesky factor of the inverse covariance. oai_citation:8‡arXiv
Select the winner
Choose the permutation/DAG with the sparsest structure (or best global score).
Convert the resulting DAG to its CPDAG equivalence class for reporting.
The algorithm is simple but factorial in p, which is why it is mostly used for small problems and theory.
46.4. Strengths
Conceptually clean and principled
Directly encodes the idea that the true DAG should be sparser than alternatives consistent with the data.Strong theoretical guarantees
SP is consistent under conditions weaker than classical faithfulness (e.g., sparsest Markov representation assumptions). oai_citation:9‡arXivUseful as a gold standard
For small graphs, SP can serve as a “ground truth” structure-learning method to benchmark other algorithms.
46.5. Limitations
Factorial complexity in
p
Evaluating all permutations makes SP quickly infeasible beyond roughly ~10 variables.No explicit latent-variable modeling
SP is defined for DAGs over observed variables; latent confounding must be addressed via other methods or hybrids (e.g., SP-FCI in Tetrad).Score and model assumptions
As with other score-based methods, performance depends on score choice (Gaussian vs discrete, etc.) and model adequacy.
46.6. How it relates to other Tetrad algorithms
Parent of GRaSP
GRaSP can be viewed as a greedy relaxation of SP: it explores a neighborhood of permutations rather than the full factorial space. oai_citation:10‡arXiv
Motivational ancestor for BOSS
BOSS adopts a permutation-based viewpoint (like SP) but uses Grow–Shrink Trees and heuristic pruning to scale to much larger graphs.
Hybrid latent-variable extensions
46.7. Prior knowledge support
Within Tetrad, SP is knowledge-aware:
Tier / ordering constraints restrict the set of permutations to those consistent with user-specified partial orders.
Required / forbidden edges constrain admissible parent sets under each permutation.
These constraints significantly reduce the effective search space in small problems.
46.7.1. Parameters
Parameter (camelCase) |
Description |
|---|---|
|
Integer time-lag parameter used when SP is applied to time-series data. A value of 0 (default) treats the data as i.i.d.; positive values indicate the maximum lag to consider for temporal parent–child relationships. |
46.8. Reference
Raskutti, G., & Uhler, C. (2018).
Learning directed acyclic graph models based on sparsest permutations.
Stat, 7(1), e183. oai_citation:11‡arXiv
46.9. Summary
SP is the ideal sparsest-permutation search: simple, consistent, and conceptually appealing, but factorial in dimension. In Tetrad it is mainly a small-graph, high-accuracy benchmark—a theoretical reference point that motivates scalable relaxations such as GRaSP and BOSS, and their latent-variable hybrids.