BMC Bioinformatics (May 2018)

Computing the family-free DCJ similarity

  • Diego P. Rubert,
  • Edna A. Hoshino,
  • Marília D. V. Braga,
  • Jens Stoye,
  • Fábio V. Martinez

DOI
https://doi.org/10.1186/s12859-018-2130-5
Journal volume & issue
Vol. 19, no. S6
pp. 31 – 42

Abstract

Read online

Abstract Background The genomic similarity is a large-scale measure for comparing two given genomes. In this work we study the (NP-hard) problem of computing the genomic similarity under the DCJ model in a setting that does not assume that the genes of the compared genomes are grouped into gene families. This problem is called family-free DCJ similarity. Results We propose an exact ILP algorithm to solve the family-free DCJ similarity problem, then we show its APX-hardness and present four combinatorial heuristics with computational experiments comparing their results to the ILP. Conclusions We show that the family-free DCJ similarity can be computed in reasonable time, although for larger genomes it is necessary to resort to heuristics. This provides a basis for further studies on the applicability and model refinement of family-free whole genome similarity measures.

Keywords