BMC Bioinformatics (Aug 2017)

Cancerouspdomains: comprehensive analysis of cancer type-specific recurrent somatic mutations in proteins and domains

  • Seirana Hashemi,
  • Abbas Nowzari Dalini,
  • Adrin Jalali,
  • Ali Mohammad Banaei-Moghaddam,
  • Zahra Razaghi-Moghadam

DOI
https://doi.org/10.1186/s12859-017-1779-5
Journal volume & issue
Vol. 18, no. 1
pp. 1 – 18

Abstract

Read online

Abstract Background Discriminating driver mutations from the ones that play no role in cancer is a severe bottleneck in elucidating molecular mechanisms underlying cancer development. Since protein domains are representatives of functional regions within proteins, mutations on them may disturb the protein functionality. Therefore, studying mutations at domain level may point researchers to more accurate assessment of the functional impact of the mutations. Results This article presents a comprehensive study to map mutations from 29 cancer types to both sequence- and structure-based domains. Statistical analysis was performed to identify candidate domains in which mutations occur with high statistical significance. For each cancer type, the corresponding type-specific domains were distinguished among all candidate domains. Subsequently, cancer type-specific domains facilitated the identification of specific proteins for each cancer type. Besides, performing interactome analysis on specific proteins of each cancer type showed high levels of interconnectivity among them, which implies their functional relationship. To evaluate the role of mitochondrial genes, stem cell-specific genes and DNA repair genes in cancer development, their mutation frequency was determined via further analysis. Conclusions This study has provided researchers with a publicly available data repository for studying both CATH and Pfam domain regions on protein-coding genes. Moreover, the associations between different groups of genes/domains and various cancer types have been clarified. The work is available at http://www.cancerouspdomains.ir .

Keywords