Molecules (Apr 2009)

On Two Novel Parameters for Validation of Predictive QSAR Models

  • Partha Pratim Roy,
  • Somnath Paul,
  • Indrani Mitra,
  • Kunal Roy

DOI
https://doi.org/10.3390/molecules14051660
Journal volume & issue
Vol. 14, no. 5
pp. 1660 – 1701

Abstract

Read online

Validation is a crucial aspect of quantitative structure–activity relationship (QSAR) modeling. The present paper shows that traditionally used validation parameters (leave-one-out Q2 for internal validation and predictive R2 for external validation) may be supplemented with two novel parameters rm2 and Rp2 for a stricter test of validation. The parameter rm2(overall) penalizes a model for large differences between observed and predicted values of the compounds of the whole set (considering both training and test sets) while the parameter Rp2 penalizes model R2 for large differences between determination coefficient of nonrandom model and square of mean correlation coefficient of random models in case of a randomization test. Two other variants of rm2 parameter, rm2(LOO) and rm2(test), penalize a model more strictly than Q2 and R2pred respectively. Three different data sets of moderate to large size have been used to develop multiple models in order to indicate the suitability of the novel parameters in QSAR studies. The results show that in many cases the developed models could satisfy the requirements of conventional parameters (Q2 and R2pred) but fail to achieve the required values for the novel parameters rm2 and Rp2. Moreover, these parameters also help in identifying the best models from among a set of comparable models. Thus, a test for these two parameters is suggested to be a more stringent requirement than the traditional validation parameters to decide acceptability of a predictive QSAR model, especially when a regulatory decision is involved.

Keywords