Zhejiang dianli (Jan 2024)
Homologous matching of recording channels in intelligent substations based on regular expression and Jaccard similarity coefficient
Abstract
To address the challenge of homologous matching between the dual sets of recording channels in intelligent substations of 220 kV and above, this paper presents a novel method based on regular expressions and the Jaccard similarity coefficient. To overcome the irregular naming of recording channels, regular expressions are used to preprocess the channel name texts into a standardized format. The Jieba word segmentation algorithm and stopword removal are then applied to eliminate potentially redundant information from the channel name texts. Subsequently, the Jaccard similarity coefficient is calculated between recording channel names, and homologous channels are screened out according to their degree of similarity. To validate the proposed method, simulations are conducted using actual recording file data from the power grid. The results affirm the effectiveness of the proposed method in achieving homologous matching of recording channels in intelligent substations.
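The pipeline summarized above (regular-expression normalization, segmentation with stopword removal, then Jaccard matching) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The channel names, normalization patterns, stopword list, and the 0.8 threshold are all hypothetical. A simple regex tokenizer stands in for Jieba so that the example runs with the standard library alone; for Chinese channel names, a segmenter such as `jieba.lcut` would presumably take its place. The Jaccard similarity coefficient of two token sets A and B is |A ∩ B| / |A ∪ B|.

```python
import re

# Hypothetical stopwords: tokens that only identify which of the dual sets
# a channel belongs to, and so carry no information about the signal itself.
STOPWORDS = {"set1", "set2"}


def normalize(name: str) -> str:
    """Bring an irregular channel name into a standardized text form."""
    text = name.lower()
    text = re.sub(r"[_\-/()\[\]]", " ", text)                # unify separators and brackets
    text = re.sub(r"\b(set|line)\s+(\d+)\b", r"\1\2", text)  # "line 1" -> "line1"
    text = re.sub(r"\s+", " ", text)                         # collapse repeated spaces
    return text.strip()


def tokens(name: str) -> set:
    """Segment a normalized name and drop stopwords.

    The regex tokenizer is a stand-in (assumption) for Jieba segmentation.
    """
    words = re.findall(r"[a-z0-9]+", normalize(name))
    return {w for w in words if w not in STOPWORDS}


def jaccard(a: set, b: set) -> float:
    """Jaccard similarity coefficient |A & B| / |A | B| of two token sets."""
    union = a | b
    if not union:
        return 0.0  # convention chosen here for two empty sets
    return len(a & b) / len(union)


def match_channels(set_a, set_b, threshold=0.8):
    """For each channel in set_a, return its most similar channel in set_b
    as (name_a, name_b, score), keeping only scores >= threshold."""
    tokens_b = [(name, tokens(name)) for name in set_b]
    matches = []
    for name_a in set_a:
        tok_a = tokens(name_a)
        best_name, best_score = None, 0.0
        for name_b, tok_b in tokens_b:
            score = jaccard(tok_a, tok_b)
            if score > best_score:
                best_name, best_score = name_b, score
        if best_name is not None and best_score >= threshold:
            matches.append((name_a, best_name, best_score))
    return matches


if __name__ == "__main__":
    # Hypothetical channel names from the two recording sets.
    first = ["220kV Line 1 Ia (Set1)", "220kV Line 1 Ub (Set1)"]
    second = ["220kV_Line1-Ub_Set2", "220kV_Line1-Ia_Set2"]
    for a, b, score in match_channels(first, second):
        print(f"{a!r} <-> {b!r} (similarity {score:.2f})")
```

In this sketch each channel is paired with its single best-scoring counterpart. Whether the original method uses a best-match rule, a fixed threshold, or both is not stated in the abstract, so the screening step here should be read as one plausible choice.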
Keywords