PLoS ONE (Jan 2015)

Using RNA-Seq Data to Evaluate Reference Genes Suitable for Gene Expression Studies in Soybean.

  • Aldrin Kay-Yuen Yim,
  • Johanna Wing-Hang Wong,
  • Yee-Shan Ku,
  • Hao Qin,
  • Ting-Fung Chan,
  • Hon-Ming Lam

DOI
https://doi.org/10.1371/journal.pone.0136343
Journal volume & issue
Vol. 10, no. 9
p. e0136343

Abstract

Read online

Differential gene expression profiles often provide important clues for gene functions. While reverse transcription quantitative real-time polymerase chain reaction (RT-qPCR) is an important tool, the validity of the results depends heavily on the choice of proper reference genes. In this study, we employed new and published RNA-sequencing (RNA-Seq) datasets (26 sequencing libraries in total) to evaluate reference genes reported in previous soybean studies. In silico PCR showed that 13 out of 37 previously reported primer sets have multiple targets, and 4 of them have amplicons with different sizes. Using a probabilistic approach, we identified new and improved candidate reference genes. We further performed 2 validation tests (with 26 RNA samples) on 8 commonly used reference genes and 7 newly identified candidates, using RT-qPCR. In general, the new candidate reference genes exhibited more stable expression levels under the tested experimental conditions. The three newly identified candidate reference genes Bic-C2, F-box protein2, and VPS-like gave the best overall performance, together with the commonly used ELF1b. It is expected that the proposed probabilistic model could serve as an important tool to identify stable reference genes when more soybean RNA-Seq data from different growth stages and treatments are used.