F1000Research (Feb 2014)

Matched molecular pair-based data sets for computer-aided medicinal chemistry [v2; ref status: indexed, http://f1000r.es/309]

  • Ye Hu,
  • Antonio de la Vega de León,
  • Bijun Zhang,
  • Jürgen Bajorath

DOI
https://doi.org/10.12688/f1000research.3-36.v2
Journal volume & issue
Vol. 3

Abstract

Read online

Matched molecular pairs (MMPs) are widely used in medicinal chemistry to study changes in compound properties including biological activity, which are associated with well-defined structural modifications. Herein we describe up-to-date versions of three MMP-based data sets that have originated from in-house research projects. These data sets include activity cliffs, structure-activity relationship (SAR) transfer series, and second generation MMPs based upon retrosynthetic rules. The data sets have in common that they have been derived from compounds included in the ChEMBL database (release 17) for which high-confidence activity data are available. Thus, the activity data associated with MMP-based activity cliffs, SAR transfer series, and retrosynthetic MMPs cover the entire spectrum of current pharmaceutical targets. Our data sets are made freely available to the scientific community.

Keywords