BMC Bioinformatics (Nov 2007)

Novel definition files for human GeneChips based on GeneAnnot

  • Ferrari Sergio,
  • Shmoish Michael,
  • Safran Marilyn,
  • Sirota Alexandra,
  • Coppe Alessandro,
  • Bortoluzzi Stefania,
  • Ferrari Francesco,
  • Lancet Doron,
  • Danieli Gian,
  • Bicciato Silvio

DOI
https://doi.org/10.1186/1471-2105-8-446
Journal volume & issue
Vol. 8, no. 1
p. 446

Abstract

Background
Improvements in genome sequence annotation have revealed discrepancies in the original probeset-to-gene assignments of Affymetrix microarrays, as well as differences between the annotations and the actual alignments of probes to transcription products. In the current generation of Affymetrix human GeneChips, most probesets include probes that match transcripts from more than one gene and probes that do not match any transcribed sequence.

Results
We developed a novel set of custom Chip Definition Files (CDFs) and the corresponding Bioconductor libraries for Affymetrix human GeneChips, based on the information contained in the GeneAnnot database. GeneAnnot-based CDFs are composed of unique custom probesets, each including only probes that match a single gene.

Conclusion
GeneAnnot-based custom CDFs allow a reliable reconstruction of expression levels and eliminate the presence of more than one probeset per gene, a redundancy that often leads to discordant expression signals for the same transcript when differential gene expression is the focus of the analysis. GeneAnnot CDFs are freely distributed and fully compliant with Affymetrix standards and with all available software for gene expression analysis. The CDF libraries are available from http://www.xlab.unimo.it/GA_CDF, along with supplementary information (CDF libraries, installation guidelines and R code, CDF statistics, and analysis results).
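
The probe-selection rule summarized in the abstract (keep only probes that match a single gene, and collect them into one custom probeset per gene) can be illustrated with a minimal Python sketch. This is not the authors' pipeline, which relies on GeneAnnot and is distributed as Bioconductor libraries with R code; the function name, the probe identifiers, and the gene labels below are hypothetical and serve only to show the filtering and grouping logic.

```python
from collections import defaultdict


def build_custom_probesets(probe_to_genes):
    """Group probes into one custom probeset per gene.

    probe_to_genes maps a probe identifier to the list of genes whose
    transcripts the probe matches (hypothetical input format).
    Probes matching no gene or more than one gene are discarded.
    """
    probesets = defaultdict(list)
    for probe_id, genes in probe_to_genes.items():
        unique_genes = set(genes)
        if len(unique_genes) == 1:  # keep gene-specific probes only
            (gene,) = unique_genes
            probesets[gene].append(probe_id)
    # Sort probe identifiers so the result is deterministic.
    return {gene: sorted(probes) for gene, probes in probesets.items()}


# Hypothetical alignment results, for illustration only.
probe_to_genes = {
    "probe_1": ["GENE_A"],
    "probe_2": ["GENE_A", "GENE_B"],  # cross-matching probe: dropped
    "probe_3": [],                    # matches no transcript: dropped
    "probe_4": ["GENE_B"],
    "probe_5": ["GENE_A"],
}

custom_probesets = build_custom_probesets(probe_to_genes)
print(custom_probesets)
```

In this toy input, each gene ends up with exactly one probeset, and the ambiguous and unmatched probes contribute to none of them, which mirrors the "one probeset per gene" property described above.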