IEEE Access (Jan 2020)

Distinguish Markov Equivalence Classes from Large-Scale Linear Non-Gaussian Data

  • Guizhen Mai,
  • Yinghan Hong,
  • Pinghua Chen,
  • Kexi Chen,
  • Han Huang,
  • Gengzhong Zheng

DOI
https://doi.org/10.1109/ACCESS.2020.2965093
Journal volume & issue
Vol. 8
pp. 10924 – 10932

Abstract

Read online

In the problem of causal discovery, conditional independence (CI) tests are generally used to detect the causal relationships among observed data. Due to the curse of dimensionality and the limitation of causal direction learning based on V-structure learning, it is difficult for constraint-based methods to distinguish the actual graph from a set of Markov equivalence classes. To alleviate this problem, in this work, a novel regression-based method to test CIs over linear Non-Gaussian data is proposed. The main purpose of this proposal is to relax the CI test of x⊥y|Z to two unconditional independence tests x - f (Z) ⊥y - g (Z) + ΣH (Z) and x - f (Z) + ΣH (Z) ⊥y - g (Z), wheref and g can be estimated by linear regression independently. In addition, we further show that x -f (Z) ⊥y-g (Z)+ΣH (Z) ( or x -f (Z)+ΣH (Z) ⊥y- g (Z) ) can lead to x ← Z ( or y ← Z ). According to this regression-based method, we design a causal structure learning algorithm to learn the actual graph instead of a set of Markov equivalence classes over the observed data. Experiments indicate that our method can detect much more causal relationships than other existing methods, especially in large-scale cases.

Keywords