BMC Genomics (Aug 2012)

Development of a porcine (Sus scofa) embryo-specific microarray: array annotation and validation

  • Tsoi Stephen,
  • Zhou Chi,
  • Grant Jason R,
  • Pasternak J,
  • Dobrinsky John,
  • Rigault Philippe,
  • Nieminen Julie,
  • Sirard Marc-André,
  • Robert Claude,
  • Foxcroft George R,
  • Dyck Michael K

DOI
https://doi.org/10.1186/1471-2164-13-370
Journal volume & issue
Vol. 13, no. 1
p. 370

Abstract

Read online

Abstract Background The domestic pig is an important livestock species and there is strong interest in the factors that affect the development of viable embryos and offspring in this species. A limited understanding of the molecular mechanisms involved in early embryonic development has inhibited our ability to fully elucidate these factors. Next generation deep sequencing and microarray technologies are powerful tools for delineation of molecular pathways involved in the developing embryo. Results Here we present the development of a porcine-embryo-specific microarray platform created from a large expressed sequence tag (EST) analysis generated by Roche/454 next-generation sequencing of cDNAs constructed from critical stages of in vivo or in vitro porcine preimplantation embryos. Two cDNA libraries constructed from in vitro and in vivo produced preimplantation porcine embryos were normalized and sequenced using 454 Titanium pyrosequencing technology. Over one million high-quality EST sequences were obtained and used to develop the EMbryogene Porcine Version 1 (EMPV1) microarray composed of 43,795 probes. Based on an initial probe sequence annotation, the EMPV1 features 17,409 protein-coding, 473 pseudogenes, 46 retrotransposed, 2,359 non-coding RNA, 4,121 splice variants in 2,862 genes and a total of 12,324 Novel Transcript Regions (NTR). After re-annotation, the total unique genes increased from 11,961 to 16,281 and 1.9% of them belonged to a large olfactory receptor (OR) gene family. Quality control on the EMPV1 was performed and revealed an even distribution of ten clusters of spiked-in control spots and array to array (dye-swap) correlation was 0.97. Conclusions Using next-generation deep sequencing we have produced a large EST dataset to allow for the selection of probe sequences for the development of the EMPV1 microarray platform. The quality of this embryo-specific array was confirmed with a high-level of reproducibility using current Agilent microarray technology. With more than an estimated 20,000 unique genes represented on the EMPV1, this platform will provide the foundation for future research into the in vivo and in vitro factors that affect the viability of porcine embryos, as well as the effects of these factors on the live offspring that result from these embryos.