An analytical study on the identification of N-linked glycosylation sites using machine learning model

Muhammad Aizaz Akmal; Muhammad Awais Hassan; Shoaib Muhammad; Khaldoon S. Khurshid; Abdullah Mohamed

doi:10.7717/peerj-cs.1069

PeerJ Computer Science (Sep 2022)

An analytical study on the identification of N-linked glycosylation sites using machine learning model

Muhammad Aizaz Akmal,
Muhammad Awais Hassan,
Shoaib Muhammad,
Khaldoon S. Khurshid,
Abdullah Mohamed

Affiliations

Muhammad Aizaz Akmal: Department of Computer Science, University of Engineering and Technology, KSK, Lahore, Punjab, Pakistan
Muhammad Awais Hassan: Department of Computer Science, University of Engineering and Technology, Lahore, Punjab, Pakistan
Shoaib Muhammad: Department of Computer Science, University of Engineering and Technology, Lahore, Punjab, Pakistan
Khaldoon S. Khurshid: Department of Computer Science, University of Engineering and Technology, Lahore, Punjab, Pakistan
Abdullah Mohamed: Research Centre, Future University in Egypt, New Cairo, Egypt

DOI: https://doi.org/10.7717/peerj-cs.1069
Journal volume & issue: Vol. 8
p. e1069

Abstract

Read online Read online

N-linked is the most common type of glycosylation which plays a significant role in identifying various diseases such as type I diabetes and cancer and helps in drug development. Most of the proteins cannot perform their biological and psychological functionalities without undergoing such modification. Therefore, it is essential to identify such sites by computational techniques because of experimental limitations. This study aims to analyze and synthesize the progress to discover N-linked places using machine learning methods. It also explores the performance of currently available tools to predict such sites. Almost seventy research articles published in recognized journals of the N-linked glycosylation field have shortlisted after the rigorous filtering process. The findings of the studies have been reported based on multiple aspects: publication channel, feature set construction method, training algorithm, and performance evaluation. Moreover, a literature survey has developed a taxonomy of N-linked sequence identification. Our study focuses on the performance evaluation criteria, and the importance of N-linked glycosylation motivates us to discover resources that use computational methods instead of the experimental method due to its limitations.

Published in PeerJ Computer Science

ISSN: 2376-5992 (Online)
Publisher: PeerJ Inc.
Country of publisher: United States
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://peerj.com/computer-science/

About the journal

Abstract

Keywords