Systematic clustering algorithm for chromatin accessibility data and its application to hematopoietic cells.

Azusa Tanaka; Yasuhiro Ishitsuka; Hiroki Ohta; Akihiro Fujimoto; Jun-Ichirou Yasunaga; Masao Matsuoka

doi:10.1371/journal.pcbi.1008422

PLoS Computational Biology (Nov 2020)

Systematic clustering algorithm for chromatin accessibility data and its application to hematopoietic cells.

Azusa Tanaka,
Yasuhiro Ishitsuka,
Hiroki Ohta,
Akihiro Fujimoto,
Jun-Ichirou Yasunaga,
Masao Matsuoka

Affiliations

Azusa Tanaka
Yasuhiro Ishitsuka
Hiroki Ohta
Akihiro Fujimoto
Jun-Ichirou Yasunaga
Masao Matsuoka

DOI: https://doi.org/10.1371/journal.pcbi.1008422
Journal volume & issue: Vol. 16, no. 11
p. e1008422

Abstract

Read online

The huge amount of data acquired by high-throughput sequencing requires data reduction for effective analysis. Here we give a clustering algorithm for genome-wide open chromatin data using a new data reduction method. This method regards the genome as a string of 1s and 0s based on a set of peaks and calculates the Hamming distances between the strings. This algorithm with the systematically optimized set of peaks enables us to quantitatively evaluate differences between samples of hematopoietic cells and classify cell types, potentially leading to a better understanding of leukemia pathogenesis.

Published in PLoS Computational Biology

ISSN: 1553-734X (Print); 1553-7358 (Online)
Publisher: Public Library of Science (PLoS)
Country of publisher: United States
LCC subjects: Science: Biology (General)
Website: https://journals.plos.org/ploscompbiol/

About the journal