BMC Biotechnology (Aug 2017)

A bioinformatics approach for identifying transgene insertion sites using whole genome sequencing data

  • Doori Park,
  • Su-Hyun Park,
  • Yong Wook Ban,
  • Youn Shic Kim,
  • Kyoung-Cheul Park,
  • Nam-Soo Kim,
  • Ju-Kon Kim,
  • Ik-Young Choi

DOI
https://doi.org/10.1186/s12896-017-0386-x
Journal volume & issue
Vol. 17, no. 1
pp. 1 – 8

Abstract

Read online

Abstract Background Genetically modified crops (GM crops) have been developed to improve the agricultural traits of modern crop cultivars. Safety assessments of GM crops are of paramount importance in research at developmental stages and before releasing transgenic plants into the marketplace. Sequencing technology is developing rapidly, with higher output and labor efficiencies, and will eventually replace existing methods for the molecular characterization of genetically modified organisms. Methods To detect the transgenic insertion locations in the three GM rice gnomes, Illumina sequencing reads are mapped and classified to the rice genome and plasmid sequence. The both mapped reads are classified to characterize the junction site between plant and transgene sequence by sequence alignment. Results Herein, we present a next generation sequencing (NGS)-based molecular characterization method, using transgenic rice plants SNU-Bt9–5, SNU-Bt9–30, and SNU-Bt9–109. Specifically, using bioinformatics tools, we detected the precise insertion locations and copy numbers of transfer DNA, genetic rearrangements, and the absence of backbone sequences, which were equivalent to results obtained from Southern blot analyses. Conclusion NGS methods have been suggested as an effective means of characterizing and detecting transgenic insertion locations in genomes. Our results demonstrate the use of a combination of NGS technology and bioinformatics approaches that offers cost- and time-effective methods for assessing the safety of transgenic plants.

Keywords