PLoS ONE (Jan 2013)

CoCiter: an efficient tool to infer gene function by assessing the significance of literature co-citation.

  • Nan Qiao,
  • Yi Huang,
  • Hammad Naveed,
  • Christopher D Green,
  • Jing-Dong J Han

DOI
https://doi.org/10.1371/journal.pone.0074074
Journal volume & issue
Vol. 8, no. 9
p. e74074

Abstract

Read online

A routine approach to inferring functions for a gene set is by using function enrichment analysis based on GO, KEGG or other curated terms and pathways. However, such analysis requires the existence of overlapping genes between the query gene set and those annotated by GO/KEGG. Furthermore, GO/KEGG databases only maintain a very restricted vocabulary. Here, we have developed a tool called "CoCiter" based on literature co-citations to address the limitations in conventional function enrichment analysis. Co-citation analysis is widely used in ranking articles and predicting protein-protein interactions (PPIs). Our algorithm can further assess the co-citation significance of a gene set with any other user-defined gene sets, or with free terms. We show that compared with the traditional approaches, CoCiter is a more accurate and flexible function enrichment analysis method. CoCiter is freely available at www.picb.ac.cn/hanlab/cociter/.