BMC Bioinformatics (Nov 2017)

GFF3sort: a novel tool to sort GFF3 files for tabix indexing

  • Tao Zhu,
  • Chengzhen Liang,
  • Zhigang Meng,
  • Sandui Guo,
  • Rui Zhang

DOI
https://doi.org/10.1186/s12859-017-1930-3
Journal volume & issue
Vol. 18, no. 1
pp. 1 – 4

Abstract

Read online

Abstract Background The traditional method of visualizing gene annotation data in JBrowse is converting GFF3 files to JSON format, which is time-consuming. The latest version of JBrowse supports rendering sorted GFF3 files indexed by tabix, a novel strategy that is more convenient than the original conversion process. However, current tools available for GFF3 file sorting have some limitations and their sorting results would lead to erroneous rendering in JBrowse. Results We developed GFF3sort, a script to sort GFF3 files for tabix indexing. Specifically designed for JBrowse rendering, GFF3sort can properly deal with the order of features that have the same chromosome and start position, either by remembering their original orders or by conducting parent-child topology sorting. Based on our test datasets from seven species, GFF3sort produced accurate sorting results with acceptable efficiency compared with currently available tools. Conclusions GFF3sort is a novel tool to sort GFF3 files for tabix indexing. We anticipate that GFF3sort will be useful to help with genome annotation data processing and visualization.

Keywords