Prevention and Control of Pathogens Based on Big-Data Mining and Visualization Analysis

Cui‐Xia Chen; Cui‐Xia Chen; Li‐Na Sun; Xue‐Xin Hou; Peng‐Cheng Du; Xiao‐Long Wang; Xiao‐Chen Du; Yu‐Fei Yu; Yu‐Fei Yu; Rui‐Kun Cai; Rui‐Kun Cai; Lei Yu; Lei Yu; Tian‐Jun Li; Tian‐Jun Li; Min‐Na Luo; Min‐Na Luo; Yue Shen; Yue Shen; Chao Lu; Chao Lu; Qian Li; Qian Li; Chuan Zhang; Chuan Zhang; Hua‐Fang Gao; Hua‐Fang Gao; Xu Ma; Xu Ma; Hao Lin; Zong‐Fu Cao; Zong‐Fu Cao

doi:10.3389/fmolb.2020.626595

Frontiers in Molecular Biosciences (Feb 2021)

Prevention and Control of Pathogens Based on Big-Data Mining and Visualization Analysis

Cui‐Xia Chen,
Cui‐Xia Chen,
Li‐Na Sun,
Xue‐Xin Hou,
Peng‐Cheng Du,
Xiao‐Long Wang,
Xiao‐Chen Du,
Yu‐Fei Yu,
Yu‐Fei Yu,
Rui‐Kun Cai,
Rui‐Kun Cai,
Lei Yu,
Lei Yu,
Tian‐Jun Li,
Tian‐Jun Li,
Min‐Na Luo,
Min‐Na Luo,
Yue Shen,
Yue Shen,
Chao Lu,
Chao Lu,
Qian Li,
Qian Li,
Chuan Zhang,
Chuan Zhang,
Hua‐Fang Gao,
Hua‐Fang Gao,
Xu Ma,
Xu Ma,
Hao Lin,
Zong‐Fu Cao,
Zong‐Fu Cao

Affiliations

Cui‐Xia Chen: National Research Institute for Family Planning, Beijing, China
Cui‐Xia Chen: National Center of Human Genetic Resources, Beijing, China
Li‐Na Sun: National Institute for Communicable Disease Control and Prevention, Beijing, China
Xue‐Xin Hou: National Institute for Communicable Disease Control and Prevention, Beijing, China
Peng‐Cheng Du: Bejing Ditan Hospital, Beijing, China
Xiao‐Long Wang: Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing, China
Xiao‐Chen Du: Shanghai Jiaotong University School of Medicine, Shanghai, China
Yu‐Fei Yu: National Research Institute for Family Planning, Beijing, China
Yu‐Fei Yu: National Center of Human Genetic Resources, Beijing, China
Rui‐Kun Cai: National Research Institute for Family Planning, Beijing, China
Rui‐Kun Cai: National Center of Human Genetic Resources, Beijing, China
Lei Yu: National Research Institute for Family Planning, Beijing, China
Lei Yu: National Center of Human Genetic Resources, Beijing, China
Tian‐Jun Li: National Research Institute for Family Planning, Beijing, China
Tian‐Jun Li: National Center of Human Genetic Resources, Beijing, China
Min‐Na Luo: National Research Institute for Family Planning, Beijing, China
Min‐Na Luo: National Center of Human Genetic Resources, Beijing, China
Yue Shen: National Research Institute for Family Planning, Beijing, China
Yue Shen: National Center of Human Genetic Resources, Beijing, China
Chao Lu: National Research Institute for Family Planning, Beijing, China
Chao Lu: National Center of Human Genetic Resources, Beijing, China
Qian Li: National Research Institute for Family Planning, Beijing, China
Qian Li: National Center of Human Genetic Resources, Beijing, China
Chuan Zhang: National Research Institute for Family Planning, Beijing, China
Chuan Zhang: National Center of Human Genetic Resources, Beijing, China
Hua‐Fang Gao: National Research Institute for Family Planning, Beijing, China
Hua‐Fang Gao: National Center of Human Genetic Resources, Beijing, China
Xu Ma: National Research Institute for Family Planning, Beijing, China
Xu Ma: National Center of Human Genetic Resources, Beijing, China
Hao Lin: Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
Zong‐Fu Cao: National Research Institute for Family Planning, Beijing, China
Zong‐Fu Cao: National Center of Human Genetic Resources, Beijing, China

DOI: https://doi.org/10.3389/fmolb.2020.626595
Journal volume & issue: Vol. 7

Abstract

Read online

Morbidity and mortality caused by infectious diseases rank first among all human illnesses. Many pathogenic mechanisms remain unclear, while misuse of antibiotics has led to the emergence of drug-resistant strains. Infectious diseases spread rapidly and pathogens mutate quickly, posing new threats to human health. However, with the increasing use of high-throughput screening of pathogen genomes, research based on big data mining and visualization analysis has gradually become a hot topic for studies of infectious disease prevention and control. In this paper, the framework was performed on four infectious pathogens (Fusobacterium, Streptococcus, Neisseria, and Streptococcus salivarius) through five functions: 1) genome annotation, 2) phylogeny analysis based on core genome, 3) analysis of structure differences between genomes, 4) prediction of virulence genes/factors with their pathogenic mechanisms, and 5) prediction of resistance genes/factors with their signaling pathways. The experiments were carried out from three angles: phylogeny (macro perspective), structure differences of genomes (micro perspective), and virulence and drug-resistance characteristics (prediction perspective). Therefore, the framework can not only provide evidence to support the rapid identification of new or unknown pathogens and thus plays a role in the prevention and control of infectious diseases, but also help to recommend the most appropriate strains for clinical and scientific research. This paper presented a new genome information visualization analysis process framework based on big data mining technology with the accommodation of the depth and breadth of pathogens in molecular level research.

Published in Frontiers in Molecular Biosciences

ISSN: 2296-889X (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Science: Biology (General)
Website: https://www.frontiersin.org/journals/molecular-biosciences

About the journal

Abstract

Keywords