BMC Bioinformatics (Apr 2008)

GraphFind: enhancing graph searching by low support data mining techniques

  • Ferro Alfredo,
  • Giugno Rosalba,
  • Mongiovì Misael,
  • Pulvirenti Alfredo,
  • Skripin Dmitry,
  • Shasha Dennis

DOI
https://doi.org/10.1186/1471-2105-9-S4-S10
Journal volume & issue
Vol. 9, no. Suppl 4
p. S10

Abstract

Read online

Abstract Background Biomedical and chemical databases are large and rapidly growing in size. Graphs naturally model such kinds of data. To fully exploit the wealth of information in these graph databases, a key role is played by systems that search for all exact or approximate occurrences of a query graph. To deal efficiently with graph searching, advanced methods for indexing, representation and matching of graphs have been proposed. Results This paper presents GraphFind. The system implements efficient graph searching algorithms together with advanced filtering techniques that allow approximate search. It allows users to select candidate subgraphs rather than entire graphs. It implements an effective data storage based also on low-support data mining. Conclusions GraphFind is compared with Frowns, GraphGrep and gIndex. Experiments show that GraphFind outperforms the compared systems on a very large collection of small graphs. The proposed low-support mining technique which applies to any searching system also allows a significant index space reduction.