A Transposon Story: From TE Content to TE Dynamic Invasion of <i>Drosophila</i> Genomes Using the Single-Molecule Sequencing Technology from Oxford Nanopore
Mourdas Mohamed,
Nguyet Thi-Minh Dang,
Yuki Ogyama,
Nelly Burlet,
Bruno Mugat,
Matthieu Boulesteix,
Vincent Mérel,
Philippe Veber,
Judit Salces-Ortiz,
Dany Severac,
Alain Pélisson,
Cristina Vieira,
François Sabot,
Marie Fablet,
Séverine Chambeyron
Affiliations
Mourdas Mohamed
Institute of Human Genetics, UMR9002, CNRS and Montpellier University, 34396 Montpellier, France
Nguyet Thi-Minh Dang
IRD/UM UMR DIADE, 911 avenue Agropolis BP64501, 34394 Montpellier, France
Yuki Ogyama
Institute of Human Genetics, UMR9002, CNRS and Montpellier University, 34396 Montpellier, France
Nelly Burlet
Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, 69622 Villeurbanne, France
Bruno Mugat
Institute of Human Genetics, UMR9002, CNRS and Montpellier University, 34396 Montpellier, France
Matthieu Boulesteix
Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, 69622 Villeurbanne, France
Vincent Mérel
Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, 69622 Villeurbanne, France
Philippe Veber
Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, 69622 Villeurbanne, France
Judit Salces-Ortiz
Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, 69622 Villeurbanne, France
Dany Severac
MGX-Montpellier GenomiX, c/o Institut de Génomique Fonctionnelle, CNRS, INSERM, Université de Montpellier, 34094 Montpellier, France
Alain Pélisson
Institute of Human Genetics, UMR9002, CNRS and Montpellier University, 34396 Montpellier, France
Cristina Vieira
Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, 69622 Villeurbanne, France
François Sabot
IRD/UM UMR DIADE, 911 avenue Agropolis BP64501, 34394 Montpellier, France
Marie Fablet
Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, 69622 Villeurbanne, France
Séverine Chambeyron
Institute of Human Genetics, UMR9002, CNRS and Montpellier University, 34396 Montpellier, France
Transposable elements (TEs) are the main components of genomes. However, due to their repetitive nature, they are very difficult to study using data obtained with short-read sequencing technologies. Here, we describe an efficient pipeline to accurately recover TE insertion (TEI) sites and sequences from long reads obtained by Oxford Nanopore Technology (ONT) sequencing. With this pipeline, we could precisely describe the landscapes of the most recent TEIs in wild-type strains of Drosophila melanogaster and Drosophila simulans. Their comparison suggests that this subset of TE sequences is more similar than previously thought in these two species. The chromosome assemblies obtained using this pipeline also allowed recovering piRNA cluster sequences, which was impossible using short-read sequencing. Finally, we used our pipeline to analyze ONT sequencing data from a D. melanogaster unstable line in which LTR transposition was derepressed for 73 successive generations. We could rely on single reads to identify new insertions with intact target site duplications. Moreover, the detailed analysis of TEIs in the wild-type strains and the unstable line did not support the trap model claiming that piRNA clusters are hotspots of TE insertions.