Screening drug-target interactions with positive-unlabeled learning

Lihong Peng; Wen Zhu; Bo Liao; Yu Duan; Min Chen; Yi Chen; Jialiang Yang

doi:10.1038/s41598-017-08079-7

Scientific Reports (Aug 2017)

Affiliations

Lihong Peng: Key Laboratory for Embedded and Network Computing of Hunan Province, College of Information Science and Engineering, Hunan University
Wen Zhu: Key Laboratory for Embedded and Network Computing of Hunan Province, College of Information Science and Engineering, Hunan University
Bo Liao: Key Laboratory for Embedded and Network Computing of Hunan Province, College of Information Science and Engineering, Hunan University
Yu Duan: Hunan University of Commerce
Min Chen: Key Laboratory for Embedded and Network Computing of Hunan Province, College of Information Science and Engineering, Hunan University
Yi Chen: College of Drug, Changsha Medical University
Jialiang Yang: Department of Genetics and Genomic Sciences, Icahn School of Medicine

DOI: https://doi.org/10.1038/s41598-017-08079-7
Journal volume & issue: Vol. 7, no. 1
pp. 1 – 17

Abstract

Identifying drug-target interaction (DTI) candidates is crucial for drug repositioning. However, usually only positive DTIs are deposited in known databases, and the resulting lack of negative samples makes it difficult for computational methods to predict novel DTIs. To overcome this dilemma, researchers usually select negative samples at random from unlabeled drug-target pairs, which introduces many false positives. In this study, a negative sample extraction method named NDTISE is first developed to screen strong negative DTI examples based on positive-unlabeled learning. A novel DTI screening framework, PUDTI, is then designed to infer new drug repositioning candidates by integrating NDTISE, the probabilities that the remaining ambiguous samples belong to the positive and negative classes, and an SVM-based optimization model. We investigated the effectiveness of NDTISE on a DTI dataset provided by NCPIS. NDTISE performs much better than random selection and slightly outperforms NCPIS. We then compared PUDTI with 6 state-of-the-art methods on 4 classes of DTI datasets from human enzymes, ion channels, GPCRs and nuclear receptors. PUDTI achieved the highest AUC among the 7 methods on all 4 datasets. Finally, we validated a few top predicted DTIs by mining independent drug databases and the literature. In conclusion, PUDTI provides an effective pre-filtering method for new drug design.
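The general idea of screening reliable negatives from unlabeled pairs, instead of sampling them at random, can be illustrated with a minimal sketch. It is not the NDTISE procedure, whose details the abstract does not give. It assumes each drug-target pair is already encoded as a numeric feature vector and uses a simple stand-in heuristic: unlabeled pairs farthest from the centroid of the known positives are treated as likely negatives. The function names, the toy data and the parameter `k` are all hypothetical.

```python
import math

def centroid(vectors):
    """Component-wise mean of a list of equal-length feature vectors."""
    n = len(vectors)
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / n for i in range(dim)]

def euclidean(a, b):
    """Euclidean distance between two feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def select_reliable_negatives(positives, unlabeled, k):
    """Return the indices of the k unlabeled pairs farthest from the positive centroid.

    Assumption: distance from the positives is a rough proxy for
    "unlikely to interact". This is an illustrative heuristic only.
    """
    center = centroid(positives)
    ranked = sorted(
        range(len(unlabeled)),
        key=lambda i: euclidean(unlabeled[i], center),
        reverse=True,
    )
    return sorted(ranked[:k])

# Toy feature vectors (hypothetical): three known DTIs and four unlabeled pairs.
positives = [[1.0, 1.0], [1.2, 0.9], [0.8, 1.1]]
unlabeled = [[1.1, 1.0], [5.0, 5.0], [0.9, 0.8], [-4.0, 3.0]]

reliable_negative_idx = select_reliable_negatives(positives, unlabeled, k=2)
print(reliable_negative_idx)
```

In a pipeline of the kind the abstract outlines, the selected pairs would serve as negative training examples, while the unlabeled pairs left over would be handled separately, for example by weighting them by their estimated class probabilities.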

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/
