F1000Research (Mar 2016)

A curated transcriptome dataset collection to investigate the immunobiology of HIV infection [version 1; referees: 2 approved]

  • Jana Blazkova,
  • Sabri Boughorbel,
  • Scott Presnell,
  • Charlie Quinn,
  • Damien Chaussabel

DOI
https://doi.org/10.12688/f1000research.8204.1
Journal volume & issue
Vol. 5

Abstract

Read online

Compendia of large-scale datasets available in public repositories provide an opportunity to identify and fill current gaps in biomedical knowledge. But first, these data need to be readily accessible to research investigators for interpretation. Here, we make available a collection of transcriptome datasets relevant to HIV infection. A total of 2717 unique transcriptional profiles distributed among 34 datasets were identified, retrieved from the NCBI Gene Expression Omnibus (GEO), and loaded in a custom web application, the Gene Expression Browser (GXB), designed for interactive query and visualization of integrated large-scale data. Multiple sample groupings and rank lists were created to facilitate dataset query and interpretation via this interface. Web links to customized graphical views can be generated by users and subsequently inserted in manuscripts reporting novel findings, such as discovery notes. The tool also enables browsing of a single gene across projects, which can provide new perspectives on the role of a given molecule across biological systems. This curated dataset collection is available at: http://hiv.gxbsidra.org/dm3/geneBrowser/list.

Keywords