Molecules (Dec 2004)

Atom, Atom-Type, and Total Linear Indices of the “Molecular Pseudograph’s Atom Adjacency Matrix”: Application to QSPR/QSAR Studies of Organic Compounds

  • Eduardo A. Castro,
  • Vicente Romero Zaldivar,
  • Francisco Torrens,
  • Juan Alberto Castillo Garit,
  • Yovani Marrero Ponce

DOI
https://doi.org/10.3390/91201100
Journal volume & issue
Vol. 9, no. 12
pp. 1100 – 1123

Abstract

Read online

In this paper we describe the application in QSPR/QSAR studies of a newgroup of molecular descriptors: atom, atom-type and total linear indices of the molecularpseudograph’s atom adjacency matrix. These novel molecular descriptors were used forthe prediction of boiling point and partition coefficient (log P), specific rate constant (logk), and antibacterial activity of 28 alkyl-alcohols and 34 derivatives of 2-furylethylenes,respectively. For this purpose two quantitative models were obtained to describe thealkyl-alcohols’ boiling points. The first one includes only two total linear indices andshowed a good behavior from a statistical point of view (R2 = 0.984, s = 3.78, F = 748.57,q2 = 0.981, and scv = 3.91). The second one includes four variables [3 global and 1 local(heteroatom) linear indices] and it showed an improvement in the description of physicalproperty (R2 = 0.9934, s = 2.48, F = 871.96, q2 = 0.990, and scv = 2.79). Later, linearmultiple regression analysis was also used to describe log P and log k of the 2-furyl-ethylenes derivatives. These models were statistically significant [(R2 = 0.984, s = 0.143, and F = 113.38) and (R2 = 0.973, s = 0.26 and F = 161.22), respectively] and showed very good stability to data variation in leave-one-out (LOO) cross-validation experiment [(q2 = 0.93.8 and scv = 0.178) and (q2 = 0.948 and scv = 0.33), respectively]. Finally, a linear discriminant model for classifying antibacterial activity of these compounds was also achieved with the use of the atom and atom-type linear indices. The global percent of good classification in training and external test set obtained was of 94.12% and 100.0%, respectively. The comparison with other approaches (connectivity indices, total and local spectral moments, quantum chemical descriptors, topographic indices and E- state/biomolecular encounter parameters) reveals a good behavior of our method. The approach described in this paper appears to be a very promising structural invariant, useful for QSPR/QSAR studies and computer-aided “rational” drug design.

Keywords