Distinguish Markov Equivalence Classes from Large-Scale Linear Non-Gaussian Data

Guizhen Mai; Yinghan Hong; Pinghua Chen; Kexi Chen; Han Huang; Gengzhong Zheng

doi:10.1109/ACCESS.2020.2965093

IEEE Access (Jan 2020)

Distinguish Markov Equivalence Classes from Large-Scale Linear Non-Gaussian Data

Guizhen Mai,
Yinghan Hong,
Pinghua Chen,
Kexi Chen,
Han Huang,
Gengzhong Zheng

Affiliations

Guizhen Mai: ORCiD; School of Computer Science and Technology, Guangdong University of Technology, Guangzhou, China
Yinghan Hong: ORCiD; School of Physics and Electronic Engineering, Hanshan Normal University, Chaozhou, China
Pinghua Chen: ORCiD; School of Computer Science and Technology, Guangdong University of Technology, Guangzhou, China
Kexi Chen: ORCiD; School of Automation, Guangdong University of Technology, Guangzhou, China
Han Huang: ORCiD; School of Software Engineering, South China University of Technology, Guangzhou, China
Gengzhong Zheng: ORCiD; School of Computer and Information Engineering, Hanshan Normal University, Chaozhou, China

DOI: https://doi.org/10.1109/ACCESS.2020.2965093
Journal volume & issue: Vol. 8
pp. 10924 – 10932

Abstract

Read online

In the problem of causal discovery, conditional independence (CI) tests are generally used to detect the causal relationships among observed data. Due to the curse of dimensionality and the limitation of causal direction learning based on V-structure learning, it is difficult for constraint-based methods to distinguish the actual graph from a set of Markov equivalence classes. To alleviate this problem, in this work, a novel regression-based method to test CIs over linear Non-Gaussian data is proposed. The main purpose of this proposal is to relax the CI test of x⊥y|Z to two unconditional independence tests x - f (Z) ⊥y - g (Z) + ΣH (Z) and x - f (Z) + ΣH (Z) ⊥y - g (Z), wheref and g can be estimated by linear regression independently. In addition, we further show that x -f (Z) ⊥y-g (Z)+ΣH (Z) ( or x -f (Z)+ΣH (Z) ⊥y- g (Z) ) can lead to x ← Z ( or y ← Z ). According to this regression-based method, we design a causal structure learning algorithm to learn the actual graph instead of a set of Markov equivalence classes over the observed data. Experiments indicate that our method can detect much more causal relationships than other existing methods, especially in large-scale cases.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords