Cancer Risk Score Prediction Based on a Single-Nucleotide Polymorphism Network

Bharuno Mahesworo; Arif Budiarto; Alam Ahmad Hidayat; Bens Pardamean

doi:10.4258/hir.2022.28.3.247

Healthcare Informatics Research (Jul 2022)

Cancer Risk Score Prediction Based on a Single-Nucleotide Polymorphism Network

Bharuno Mahesworo,
Arif Budiarto,
Alam Ahmad Hidayat,
Bens Pardamean

Affiliations

Bharuno Mahesworo: Department of Statistics, School of Computer Science, Bina Nusantara University, Jakarta, Indonesia
Arif Budiarto: Bioinformatics and Data Science Research Center, Bina Nusantara University, Jakarta, Indonesia
Alam Ahmad Hidayat: Bioinformatics and Data Science Research Center, Bina Nusantara University, Jakarta, Indonesia
Bens Pardamean: Bioinformatics and Data Science Research Center, Bina Nusantara University, Jakarta, Indonesia

DOI: https://doi.org/10.4258/hir.2022.28.3.247
Journal volume & issue: Vol. 28, no. 3
pp. 247 – 255

Abstract

Read online

Objectives Genome-wide association studies (GWAS) are performed to study the associations between genetic variants with respect to certain phenotypic traits such as cancer. However, the method that is commonly used in GWAS assumes that certain traits are solely affected by a single mutation. We propose a network analysis method, in which we generate association networks of single-nucleotide polymorphisms (SNPs) that can differentiate case and control groups. We hypothesize that certain phenotypic traits are attributable to mutations in groups of associated SNPs. Methods We propose a method based on a network analysis framework to study SNP-SNP interactions related to cancer incidence. We employed logistic regression to measure the significance of all SNP pairs from GWAS for the incidence of colorectal cancer and computed a cancer risk score based on the generated SNP networks. Results We demonstrated our method in a dataset from a case-control study of colorectal cancer in the South Sulawesi population. From the GWAS results, 20,094 pairs of 200 SNPs were created. We obtained one cluster containing four pairs of five SNPs that passed the filtering threshold based on their p-values. A locus on chromosome 12 (12:54410007) was found to be strongly connected to the four variants on chromosome 1. A polygenic risk score was computed from the five SNPs, and a significant difference in colorectal cancer risk was obtained between the case and control groups. Conclusions Our results demonstrate the applicability of our method to understand SNP-SNP interactions and compute risk scores for various types of cancer.

Published in Healthcare Informatics Research

ISSN: 2093-3681 (Print); 2093-369X (Online)
Publisher: The Korean Society of Medical Informatics
Country of publisher: Korea, Republic of
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: http://www.e-hir.org

About the journal

Abstract

Keywords