Asia-Pacific Journal of Information Technology and Multimedia (Dec 2014)

FINGERPRINT-PERFORMANCE 2D IN CHEMISTRY PUBCHEM DATABASE

  • Shereena M. Arif,
  • Fatimah Zawani Abdullah,
  • Nurul Hashimah Ahamed Hassain Malim

DOI
https://doi.org/10.17576/apjitm-2014-0302-05
Journal volume & issue
Vol. 3, no. (2)
pp. 63 – 70

Abstract

Read online

The study of molecular similarity search in chemical database is increasingly widespread, especially in the area of drug discovery. Similarity search is an application in the field of Chemoinformatics to measure the similarity between the molecular structure which is known as the query and the structure of chemical compounds in the database. Similarity search is also one of the approaches in virtual screening which involves computational techniques and scoring the probabilities of activity. The main objective of this work is to determine the best fingerprint among six fingerprints selected in this study using PubChem chemicaldataset. This paper will discuss the similarity searching process conducted using 6 types of descriptors, which are ECFP4, ECFC4, FCFP4, FCFC4, SRECFC4 and SRFCFC4 on 15 activity classes of PubChem dataset using Tanimoto coefficient to calculate the similarity between the query structures and each of the database structure. The results suggest that ECFP4 performs the best to be used withTanimoto coefficient in the PubChem dataset

Keywords