MEMOPS: Data modelling and automatic code generation

Fogh Rasmus H.; Boucher Wayne; Ionides John M.C.; Vranken Wim F.; Stevens Tim J.; Laue Ernest D.

doi:10.1515/jib-2010-123

Journal of Integrative Bioinformatics (Dec 2010)

Affiliations

Fogh Rasmus H.: Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge, CB2 1GA, United Kingdom of Great Britain and Northern Ireland
Boucher Wayne: Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge, CB2 1GA, United Kingdom of Great Britain and Northern Ireland
Ionides John M.C.: Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge, CB2 1GA, United Kingdom of Great Britain and Northern Ireland
Vranken Wim F.: PDBe group, EMBL-EBI, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom of Great Britain and Northern Ireland
Stevens Tim J.: Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge, CB2 1GA, United Kingdom of Great Britain and Northern Ireland
Laue Ernest D.: Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge, CB2 1GA, United Kingdom of Great Britain and Northern Ireland

DOI: https://doi.org/10.1515/jib-2010-123
Journal volume & issue: Vol. 7, no. 3
pp. 112–134

Abstract

In recent years the amount of biological data has exploded to the point where much useful information can only be extracted by complex computational analyses. Such analyses are greatly facilitated by metadata standards, both for comparing data from different sources and for exchanging data in standard forms, e.g. when running processes on a distributed computing infrastructure. However, standards thrive on stability, whereas science is constantly moving, with new methods being developed and old ones modified. Maintaining both the metadata standards and all the code required to make them useful is therefore a non-trivial problem. Memops is a framework that uses an abstract definition of the metadata (described in UML) to generate internal data structures and subroutine libraries for data access (application programming interfaces, or APIs, currently in Python, C and Java) and data storage (in XML files or databases). For an individual project, these libraries obviate the need to write code for input parsing, validity checking or output. Memops also ensures that the code is always internally consistent, massively reducing the need for code reorganisation. Across a scientific domain, a Memops-supported data model makes it easier to support complex standards that can capture all the data produced in a scientific area, share them among all programs in a complex software pipeline, and carry them forward to deposition in an archive. This paper presents the principles behind the Memops generation code, along with example applications in Nuclear Magnetic Resonance (NMR) spectroscopy and structural biology.
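
The general idea of deriving an access API from an abstract model can be illustrated with a toy sketch. This is not Memops code and does not reflect its actual API or its UML input: the `MODEL` dictionary, the `Peak` class and the `make_class` helper are hypothetical names chosen for illustration, and a plain Python dictionary stands in for the abstract model. The sketch only shows how a single model description can yield a class with validity checking and XML output, so that neither has to be written by hand.

```python
import xml.etree.ElementTree as ET

# Hypothetical abstract model (a stand-in for a UML description):
# class name -> {attribute name: (expected type, required?)}.
MODEL = {
    "Peak": {
        "serial": (int, True),
        "height": (float, False),
    },
}


def make_class(class_name, attrs):
    """Build a class with checked attributes and XML output from the model."""

    def __init__(self, **values):
        # Reject objects that lack an attribute the model marks as required.
        for name, (_expected, required) in attrs.items():
            if required and name not in values:
                raise ValueError(f"{class_name}.{name} is required")
        for name, value in values.items():
            setattr(self, name, value)

    def __setattr__(self, name, value):
        # Validity checking: only modelled attributes, only modelled types.
        if name not in attrs:
            raise AttributeError(f"{class_name} has no attribute {name!r}")
        expected, _required = attrs[name]
        if not isinstance(value, expected):
            raise TypeError(f"{class_name}.{name} must be {expected.__name__}")
        object.__setattr__(self, name, value)

    def to_xml(self):
        # Storage: serialise the attributes that are set as one XML element.
        elem = ET.Element(class_name)
        for name in attrs:
            if name in self.__dict__:
                elem.set(name, str(self.__dict__[name]))
        return ET.tostring(elem, encoding="unicode")

    return type(class_name, (object,), {
        "__init__": __init__,
        "__setattr__": __setattr__,
        "to_xml": to_xml,
    })


# "Generate" the API for every class in the model.
API = {name: make_class(name, attrs) for name, attrs in MODEL.items()}
Peak = API["Peak"]

peak = Peak(serial=1, height=2.5)
print(peak.to_xml())
```

In this sketch, changing the model (for example, adding an attribute to `Peak`) changes the validation and the XML output together, which is the kind of internal consistency the abstract attributes to generated code.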

Published in Journal of Integrative Bioinformatics

ISSN: 1613-4516 (Online)
Publisher: De Gruyter
Country of publisher: Germany
LCC subjects: Technology: Chemical technology: Biotechnology
Website: https://www.degruyter.com/view/j/jib
