BMC Genomics (Jul 2007)

Use of tiling array data and RNA secondary structure predictions to identify noncoding RNA genes

  • Vinther Jeppe,
  • Hedegaard Mads M,
  • Gardner Paul P,
  • Weile Christian

DOI
https://doi.org/10.1186/1471-2164-8-244
Journal volume & issue
Vol. 8, no. 1
p. 244

Abstract

Read online

Abstract Background Within the last decade a large number of noncoding RNA genes have been identified, but this may only be the tip of the iceberg. Using comparative genomics a large number of sequences that have signals concordant with conserved RNA secondary structures have been discovered in the human genome. Moreover, genome wide transcription profiling with tiling arrays indicate that the majority of the genome is transcribed. Results We have combined tiling array data with genome wide structural RNA predictions to search for novel noncoding and structural RNA genes that are expressed in the human neuroblastoma cell line SK-N-AS. Using this strategy, we identify thousands of human candidate RNA genes. To further verify the expression of these genes, we focused on candidate genes that had a stable hairpin structures or a high level of covariance. Using northern blotting, we verify the expression of 2 out of 3 of the hairpin structures and 3 out of 9 high covariance structures in SK-N-AS cells. Conclusion Our results demonstrate that many human noncoding, structured and conserved RNA genes remain to be discovered and that tissue specific tiling array data can be used in combination with computational predictions of sequences encoding structural RNAs to improve the search for such genes.