Genome Biology (Feb 2022)

Genomic variations and epigenomic landscape of the Medaka Inbred Kiyosu-Karlsruhe (MIKK) panel

  • Adrien Leger,
  • Ian Brettell,
  • Jack Monahan,
  • Carl Barton,
  • Nadeshda Wolf,
  • Natalja Kusminski,
  • Cathrin Herder,
  • Narendar Aadepu,
  • Clara Becker,
  • Jakob Gierten,
  • Omar T. Hammouda,
  • Eva Hasel,
  • Colin Lischik,
  • Katharina Lust,
  • Natalia Sokolova,
  • Risa Suzuki,
  • Tinatini Tavhelidse,
  • Thomas Thumberger,
  • Erika Tsingos,
  • Philip Watson,
  • Bettina Welz,
  • Kiyoshi Naruse,
  • Felix Loosli,
  • Joachim Wittbrodt,
  • Ewan Birney,
  • Tomas Fitzgerald

DOI
https://doi.org/10.1186/s13059-022-02602-4
Journal volume & issue
Vol. 23, no. 1
pp. 1 – 24

Abstract

Read online

Abstract Background The teleost medaka (Oryzias latipes) is a well-established vertebrate model system, with a long history of genetic research, and multiple high-quality reference genomes available for several inbred strains. Medaka has a high tolerance to inbreeding from the wild, thus allowing one to establish inbred lines from wild founder individuals. Results We exploit this feature to create an inbred panel resource: the Medaka Inbred Kiyosu-Karlsruhe (MIKK) panel. This panel of 80 near-isogenic inbred lines contains a large amount of genetic variation inherited from the original wild population. We use Oxford Nanopore Technologies (ONT) long read data to further investigate the genomic and epigenomic landscapes of a subset of the MIKK panel. Nanopore sequencing allows us to identify a large variety of high-quality structural variants, and we present results and methods using a pan-genome graph representation of 12 individual medaka lines. This graph-based reference MIKK panel genome reveals novel differences between the MIKK panel lines and standard linear reference genomes. We find additional MIKK panel-specific genomic content that would be missing from linear reference alignment approaches. We are also able to identify and quantify the presence of repeat elements in each of the lines. Finally, we investigate line-specific CpG methylation and performed differential DNA methylation analysis across these 12 lines. Conclusions We present a detailed analysis of the MIKK panel genomes using long and short read sequence technologies, creating a MIKK panel-specific pan genome reference dataset allowing for investigation of novel variation types that would be elusive using standard approaches.

Keywords