Data in Brief (Apr 2020)

Data set of intrinsically disordered proteins analysed at a local protein conformation level

  • Akhila Melarkode Vattekatte,
  • Tarun Jairaj Narwani,
  • Aline Floch,
  • Mirjana Maljković,
  • Soubika Bisoo,
  • Nicolas K. Shinada,
  • Agata Kranjc,
  • Jean-Christophe Gelly,
  • Narayanaswamy Srinivasan,
  • Nenad Mitić,
  • Alexandre G. de Brevern

Journal volume & issue
Vol. 29

Abstract

Read online

Intrinsic Disorder Proteins (IDPs) have become a hot topic since their characterisation in the 90s. The data presented in this article are related to our research entitled “A structural entropy index to analyse local conformations in Intrinsically Disordered Proteins” published in Journal of Structural Biology [1]. In this study, we quantified, for the first time, continuum from rigidity to flexibility and finally disorder. Non-disordered regions were also highlighted in the ensemble of disordered proteins. This work was done using the Protein Ensemble Database (PED), which is a useful database collecting series of protein structures considered as IDPs. The data set consists of a collection of cleaned protein files in classical pdb format that can be readily used as an input with most automatic analysis software. The accompanying data include the coding of all structural information in terms of a structural alphabet, namely Protein Blocks (PBs). An entropy index derived from PBs that allows apprehending the continuum between protein rigidity to flexibility to disorder is included, with information from secondary structure assignment, protein accessibility and prediction of disorder from the sequences. The data may be used for further structural bioinformatics studies of IDPs. It can also be used as a benchmark for evaluating disorder prediction methods. Keywords: Protein disorder, PDB, Ensembles, Entropy, Local protein conformation, Structural alphabet