Analysis of a Similarity Measure for Non-Overlapped Data

Sanghyuk Lee; Jaehoon Cha; Nipon Theera-Umpon; Kyeong Soo Kim

doi:10.3390/sym9050068

Symmetry (May 2017)

Analysis of a Similarity Measure for Non-Overlapped Data

Sanghyuk Lee,
Jaehoon Cha,
Nipon Theera-Umpon,
Kyeong Soo Kim

Affiliations

Sanghyuk Lee: Department of Electrical and Electronic Engineering, Xi’an Jiaotong-Liverpool University, Xi’an 215123, China
Jaehoon Cha: Department of Electrical and Electronic Engineering, Xi’an Jiaotong-Liverpool University, Xi’an 215123, China
Nipon Theera-Umpon: Biomedical Engineering Centre, Chiang Mai University, Chiang Mai 50200, Thailand
Kyeong Soo Kim: Department of Electrical and Electronic Engineering, Xi’an Jiaotong-Liverpool University, Xi’an 215123, China

DOI: https://doi.org/10.3390/sym9050068
Journal volume & issue: Vol. 9, no. 5
p. 68

Abstract

Read online

A similarity measure is a measure evaluating the degree of similarity between two fuzzy data sets and has become an essential tool in many applications including data mining, pattern recognition, and clustering. In this paper, we propose a similarity measure capable of handling non-overlapped data as well as overlapped data and analyze its characteristics on data distributions. We first design the similarity measure based on a distance measure and apply it to overlapped data distributions. From the calculations for example data distributions, we find that, though the similarity calculation is effective, the designed similarity measure cannot distinguish two non-overlapped data distributions, thus resulting in the same value for both data sets. To obtain discriminative similarity values for non-overlapped data, we consider two approaches. The first one is to use a conventional similarity measure after preprocessing non-overlapped data. The second one is to take into account neighbor data information in designing the similarity measure, where we consider the relation to specific data and residual data information. Two artificial patterns of non-overlapped data are analyzed in an illustrative example. The calculation results demonstrate that the proposed similarity measures can discriminate non-overlapped data.

Published in Symmetry

ISSN: 2073-8994 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Mathematics
Website: http://www.mdpi.com/journal/symmetry/

About the journal

Abstract

Keywords