Scientific Data (Jan 2021)

A database framework for rapid screening of structure-function relationships in PFAS chemistry

  • An Su,
  • Krishna Rajan

DOI
https://doi.org/10.1038/s41597-021-00798-x
Journal volume & issue
Vol. 8, no. 1
pp. 1 – 10

Abstract

Read online

Abstract This paper describes a database framework that enables one to rapidly explore systematics in structure-function relationships associated with new and emerging PFAS chemistries. The data framework maps high dimensional information associated with the SMILES approach of encoding molecular structure with functionality data including bioactivity and physicochemical property. This ‘PFAS-Map’ is a 3-dimensional unsupervised visualization tool that can automatically classify new PFAS chemistries based on current PFAS classification criteria. We provide examples on how the PFAS-Map can be utilized, including the prediction and estimation of yet unmeasured fundamental physical properties of PFAS chemistries, uncovering hierarchical characteristics in existing classification schemes, and the fusion of data from diverse sources.