SkewC: Identifying cells with skewed gene body coverage in single-cell RNA sequencing data
Imad Abugessaisa,
Akira Hasegawa,
Shuhei Noguchi,
Melissa Cardon,
Kazuhide Watanabe,
Masataka Takahashi,
Harukazu Suzuki,
Shintaro Katayama,
Juha Kere,
Takeya Kasukawa
Affiliations
Imad Abugessaisa
Laboratory for Large-Scale Biomedical Data Technology, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, 230-0045, Japan
Akira Hasegawa
Laboratory for Large-Scale Biomedical Data Technology, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, 230-0045, Japan
Shuhei Noguchi
Laboratory for Large-Scale Biomedical Data Technology, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, 230-0045, Japan
Melissa Cardon
Laboratory for Large-Scale Biomedical Data Technology, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, 230-0045, Japan
Kazuhide Watanabe
Laboratory for Cellular Function Conversion Technology, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, 230-0045, Japan
Masataka Takahashi
Laboratory for Cellular Function Conversion Technology, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, 230-0045, Japan
Harukazu Suzuki
Laboratory for Cellular Function Conversion Technology, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, 230-0045, Japan
Shintaro Katayama
Folkhälsan Research Center, Topeliuksenkatu 20, 00250 Helsinki, Finland; Department of Biosciences and Nutrition, Karolinska Institutet, 141 83 Huddinge, Sweden; Stem Cells and Metabolism Research Program, University of Helsinki, P.O. Box 4 (Yliopistonkatu 3), Helsinki, Finland
Juha Kere
Folkhälsan Research Center, Topeliuksenkatu 20, 00250 Helsinki, Finland; Department of Biosciences and Nutrition, Karolinska Institutet, 141 83 Huddinge, Sweden; Stem Cells and Metabolism Research Program, University of Helsinki, P.O. Box 4 (Yliopistonkatu 3), Helsinki, Finland; Corresponding author
Takeya Kasukawa
Laboratory for Large-Scale Biomedical Data Technology, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, 230-0045, Japan; Institute for Protein Research, Osaka University, Suita, Osaka 565-0871, Japan; Corresponding author
Summary: The analysis and interpretation of single-cell RNA sequencing (scRNA-seq) experiments are compromised by the presence of poor-quality cells. For meaningful analyses, such poor-quality cells should be excluded as they introduce noise in the data. We introduce SkewC, a quality-assessment tool, to identify skewed cells in scRNA-seq experiments. The tool’s methodology is based on the assessment of gene coverage for each cell, and its skewness as a quality measure; the gene body coverage is a unique characteristic for each protocol, and different protocols yield highly different coverage profiles. This tool is designed to avoid misclustering or false clusters by identifying, isolating, and removing cells with skewed gene body coverage profiles. SkewC is capable of processing any type of scRNA-seq dataset, regardless of the protocol. We envision SkewC as a distinctive QC method to be incorporated into scRNA-seq QC processing to preclude the possibility of scRNA-seq data misinterpretation.