SoftwareX (Jan 2016)

pyPcazip: A PCA-based toolkit for compression and analysis of molecular simulation data

  • Ardita Shkurti,
  • Ramon Goni,
  • Pau Andrio,
  • Elena Breitmoser,
  • Iain Bethune,
  • Modesto Orozco,
  • Charles A. Laughton

Journal volume & issue
Vol. 5
pp. 44 – 50

Abstract

Read online

The biomolecular simulation community is currently in need of novel and optimised software tools that can analyse and process, in reasonable timescales, the large generated amounts of molecular simulation data. In light of this, we have developed and present here pyPcazip: a suite of software tools for compression and analysis of molecular dynamics (MD) simulation data. The software is compatible with trajectory file formats generated by most contemporary MD engines such as AMBER, CHARMM, GROMACS and NAMD, and is MPI parallelised to permit the efficient processing of very large datasets. pyPcazip is a Unix based open-source software (BSD licenced) written in Python. Keywords: Data analysis, Principal component analysis, Molecular dynamics, Molecular simulation