BioData Mining (Aug 2018)

TRIQ: a new method to evaluate triclusters

  • David Gutiérrez-Avilés,
  • Raúl Giráldez,
  • Francisco Javier Gil-Cumbreras,
  • Cristina Rubio-Escudero

DOI
https://doi.org/10.1186/s13040-018-0177-5
Journal volume & issue
Vol. 11, no. 1
pp. 1 – 29

Abstract

Read online

Abstract Background Triclustering has shown to be a valuable tool for the analysis of microarray data since its appearance as an improvement of classical clustering and biclustering techniques. The standard for validation of triclustering is based on three different measures: correlation, graphic similarity of the patterns and functional annotations for the genes extracted from the Gene Ontology project (GO). Results We propose TRIQ, a single evaluation measure that combines the three measures previously described: correlation, graphic validation and functional annotation, providing a single value as result of the validation of a tricluster solution and therefore simplifying the steps inherent to research of comparison and selection of solutions. TRIQ has been applied to three datasets already studied and evaluated with single measures based on correlation, graphic similarity and GO terms. Triclusters have been extracted from this three datasets using two different algorithms: TriGen and OPTricluster. Conclusions TRIQ has successfully provided the same results as a the three single evaluation measures. Furthermore, we have applied TRIQ to results from another algorithm, OPTRicluster, and we have shown how TRIQ has been a valid tool to compare results from different algorithms in a quantitative straightforward manner. Therefore, it appears as a valid measure to represent and summarize the quality of tricluster solutions. It is also feasible for evaluation of non biological triclusters, due to the parametrization of each component of TRIQ.

Keywords