IEEE Access (Jan 2018)
Correlation Analysis for Exploring Multivariate Data Sets
Abstract
Correlation analysis is of great significance for exploring the multivariate data sets as it helps researchers toward an in-depth understanding of the complex interactions and relationships among variables. In this paper, we propose a correlation analysis method that identifies salient scalars for multivariate data exploration. We exploit specific mutual information metric to measure the information overlap and analyze the relationships between one scalar and other variables. Moreover, we define the information flow and introduce another metric, influence to quantify the associations among scalars of different variables. Furthermore, we integrate these two information metrics and construct a surprise-influence map for users' interaction to identify the salient scalars. By investigating the relationships among these salient scalars, we analyze the correlations among variables. We demonstrate the applicability and effectiveness of our proposed method by applying it to different data sets.
Keywords