An informatic pipeline for the data capture and submission of quantitative proteomic data using iTRAQ<smcaps><sup>TM</sup></smcaps>

Hart Sarah R; Jones Andrew R; Swainston Neil; Siepen Jennifer A; Hermjakob Henning; Jones Philip; Hubbard Simon J

doi:10.1186/1477-5956-5-4

Proteome Science (Feb 2007)

An informatic pipeline for the data capture and submission of quantitative proteomic data using iTRAQ<smcaps><sup>TM</sup></smcaps>

Hart Sarah R,
Jones Andrew R,
Swainston Neil,
Siepen Jennifer A,
Hermjakob Henning,
Jones Philip,
Hubbard Simon J

Affiliations

Hart Sarah R
Jones Andrew R
Swainston Neil
Siepen Jennifer A
Hermjakob Henning
Jones Philip
Hubbard Simon J

DOI: https://doi.org/10.1186/1477-5956-5-4
Journal volume & issue: Vol. 5, no. 1
p. 4

Abstract

Read online

Abstract Background Proteomics continues to play a critical role in post-genomic science as continued advances in mass spectrometry and analytical chemistry support the separation and identification of increasing numbers of peptides and proteins from their characteristic mass spectra. In order to facilitate the sharing of this data, various standard formats have been, and continue to be, developed. Still not fully mature however, these are not yet able to cope with the increasing number of quantitative proteomic technologies that are being developed. Results We propose an extension to the PRIDE and mzData XML schema to accommodate the concept of multiple samples per experiment, and in addition, capture the intensities of the iTRAQTM reporter ions in the entry. A simple Java-client has been developed to capture and convert the raw data from common spectral file formats, which also uses a third-party open source tool for the generation of iTRAQTM reported intensities from Mascot output, into a valid PRIDE XML entry. Conclusion We describe an extension to the PRIDE and mzData schemas to enable the capture of quantitative data. Currently this is limited to iTRAQTM data but is readily extensible for other quantitative proteomic technologies. Furthermore, a software tool has been developed which enables conversion from various mass spectrum file formats and corresponding Mascot peptide identifications to PRIDE formatted XML. The tool represents a simple approach to preparing quantitative and qualitative data for submission to repositories such as PRIDE, which is necessary to facilitate data deposition and sharing in public domain database. The software is freely available from http://www.mcisb.org/software/PrideWizard.

Published in Proteome Science

ISSN: 1477-5956 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Science: Biology (General): Cytology
Website: https://proteomesci.biomedcentral.com/

About the journal