Inteligencia Artificial (Mar 2022)
A Gene Expression Clustering Method to Extraction of Cell-to-Cell Biological Communication
Abstract
Graph-based clustering identification is a practical method to detect the communication between nodes in complex networks that has obtained considerable comments. Since identifying different communities in large-scale data is a challenging task, by understanding the communication between the behaviors of the elements in a community (a cluster), the general characteristics of clusters can be predicted. Graph-based clustering methods have played an important role in clustering gene expression data because of their ability to show the relations between the data. In order to be able to identify genes that lead to the development of diseases, the communication between the cells must be established. The communication between different cells can be indicated by the expression of different genes within them. In this study, the problem of cell-to-cell communication is expressed as a graph and the communication are extracted by recognizing the communities. The FANTOM5 dataset is used to simulate and calculate the similarity between cells. After preprocessing and normalizing the data, to convert this data into graphs, the expression of genes in different cells was examined and by considering a threshold and Wilcoxon test, the communication between them were identified through using clustering.
Keywords