PeerJ (Sep 2024)
Identification of non-model mammal species using the MinION DNA sequencer from Oxford Nanopore
Abstract
Background The Neotropics harbors the largest species richness of the planet; however, even in well-studied groups, there are potentially hundreds of species that lack a formal description, and likewise, many already described taxa are difficult to identify using morphology. Specifically in small mammals, complex morphological diagnoses have been facilitated by the use of molecular data, particularly from mitochondrial sequences, to obtain accurate species identifications. Obtaining mitochondrial markers implies the use of PCR and specific primers, which are largely absent for non-model organisms. Oxford Nanopore Technologies (ONT) is a new alternative for sequencing the entire mitochondrial genome without the need for specific primers. Only a limited number of studies have employed exclusively ONT long-reads to assemble mitochondrial genomes, and few studies have yet evaluated the usefulness of such reads in multiple non-model organisms. Methods We implemented fieldwork to collect small mammals, including rodents, bats, and marsupials, in five localities in the northern extreme of the Cordillera Central of Colombia. DNA samples were sequenced using the MinION device and Flongle flow cells. Shotgun-sequenced data was used to reconstruct the mitochondrial genome of all the samples. In parallel, using a customized computational pipeline, species-level identifications were obtained based on sequencing raw reads (Whole Genome Sequencing). ONT-based identifications were corroborated using traditional morphological characters and phylogenetic analyses. Results A total of 24 individuals from 18 species were collected, morphologically identified, and deposited in the biological collection of Universidad EAFIT. Our different computational pipelines were able to reconstruct mitochondrial genomes from exclusively ONT reads. We obtained three new mitochondrial genomes and eight new molecular mitochondrial sequences for six species. Our species identification pipeline was able to obtain accurate species identifications for up to 75% of the individuals in as little as 5 s. Finally, our phylogenetic analyses corroborated the identifications from our automated species identification pipeline and revealed important contributions to the knowledge of the diversity of Neotropical small mammals. Discussion This study was able to evaluate different pipelines to reconstruct mitochondrial genomes from non-model organisms, using exclusively ONT reads, benchmarking these protocols on a multi-species dataset. The proposed methodology can be applied by non-expert taxonomists and has the potential to be implemented in real-time, without the need to euthanize the organisms and under field conditions. Therefore, it stands as a relevant tool to help increase the available data for non-model organisms, and the rate at which researchers can characterize life specially in highly biodiverse places as the Neotropics.
Keywords