Functional Enrichment Analysis of Regulatory Elements
Adrian Garcia-Moreno,
Raul López-Domínguez,
Juan Antonio Villatoro-García,
Alberto Ramirez-Mena,
Ernesto Aparicio-Puerta,
Michael Hackenberg,
Alberto Pascual-Montano,
Pedro Carmona-Saez
Affiliations
Adrian Garcia-Moreno
Bioinformatics Unit, Centre for Genomics and Oncological Research, GENYO, Pfizer/University of Granada/Andalusian Regional Government, PTS, 18016 Granada, Spain
Raul López-Domínguez
Bioinformatics Unit, Centre for Genomics and Oncological Research, GENYO, Pfizer/University of Granada/Andalusian Regional Government, PTS, 18016 Granada, Spain
Juan Antonio Villatoro-García
Bioinformatics Unit, Centre for Genomics and Oncological Research, GENYO, Pfizer/University of Granada/Andalusian Regional Government, PTS, 18016 Granada, Spain
Alberto Ramirez-Mena
Bioinformatics Unit, Centre for Genomics and Oncological Research, GENYO, Pfizer/University of Granada/Andalusian Regional Government, PTS, 18016 Granada, Spain
Ernesto Aparicio-Puerta
Department of Genetics, University of Granada, 18071 Granada, Spain
Michael Hackenberg
Department of Genetics, University of Granada, 18071 Granada, Spain
Alberto Pascual-Montano
Data Science & Analytics at IDBS (Danaher Group), 68 Chertsey Road, Woking GU21 5BJ, UK
Pedro Carmona-Saez
Bioinformatics Unit, Centre for Genomics and Oncological Research, GENYO, Pfizer/University of Granada/Andalusian Regional Government, PTS, 18016 Granada, Spain
Statistical methods for enrichment analysis are important tools to extract biological information from omics experiments. Although these methods have been widely used for the analysis of gene and protein lists, the development of high-throughput technologies for regulatory elements demands dedicated statistical and bioinformatics tools. Here, we present a set of enrichment analysis methods for regulatory elements, including CpG sites, miRNAs, and transcription factors. Statistical significance is determined via a power weighting function for target genes and tested by the Wallenius noncentral hypergeometric distribution model to avoid selection bias. These new methodologies have been applied to the analysis of a set of miRNAs associated with arrhythmia, showing the potential of this tool to extract biological information from a list of regulatory elements. These new methods are available in GeneCodis 4, a web tool able to perform singular and modular enrichment analysis that allows the integration of heterogeneous information.