Prediction of ubiquitination sites by using the composition of k-spaced amino acid pairs.

Zhen Chen; Yong-Zi Chen; Xiao-Feng Wang; Chuan Wang; Ren-Xiang Yan; Ziding Zhang

doi:10.1371/journal.pone.0022930

PLoS ONE (Jan 2011)

Prediction of ubiquitination sites by using the composition of k-spaced amino acid pairs.

Zhen Chen,
Yong-Zi Chen,
Xiao-Feng Wang,
Chuan Wang,
Ren-Xiang Yan,
Ziding Zhang

Affiliations

Zhen Chen
Yong-Zi Chen
Xiao-Feng Wang
Chuan Wang
Ren-Xiang Yan
Ziding Zhang

DOI: https://doi.org/10.1371/journal.pone.0022930
Journal volume & issue: Vol. 6, no. 7
p. e22930

Abstract

Read online

As one of the most important reversible protein post-translation modifications, ubiquitination has been reported to be involved in lots of biological processes and closely implicated with various diseases. To fully decipher the molecular mechanisms of ubiquitination-related biological processes, an initial but crucial step is the recognition of ubiquitylated substrates and the corresponding ubiquitination sites. Here, a new bioinformatics tool named CKSAAP_UbSite was developed to predict ubiquitination sites from protein sequences. With the assistance of Support Vector Machine (SVM), the highlight of CKSAAP_UbSite is to employ the composition of k-spaced amino acid pairs surrounding a query site (i.e. any lysine in a query sequence) as input. When trained and tested in the dataset of yeast ubiquitination sites (Radivojac et al, Proteins, 2010, 78: 365-380), a 100-fold cross-validation on a 1∶1 ratio of positive and negative samples revealed that the accuracy and MCC of CKSAAP_UbSite reached 73.40% and 0.4694, respectively. The proposed CKSAAP_UbSite has also been intensively benchmarked to exhibit better performance than some existing predictors, suggesting that it can be served as a useful tool to the community. Currently, CKSAAP_UbSite is freely accessible at http://protein.cau.edu.cn/cksaap_ubsite/. Moreover, we also found that the sequence patterns around ubiquitination sites are not conserved across different species. To ensure a reasonable prediction performance, the application of the current CKSAAP_UbSite should be limited to the proteome of yeast.

Published in PLoS ONE

ISSN: 1932-6203 (Online)
Publisher: Public Library of Science (PLoS)
Country of publisher: United States
LCC subjects: Medicine; Science
Website: https://journals.plos.org/plosone/

About the journal