Electronic Research Archive (Sep 2023)

Word-level dual channel with multi-head semantic attention interaction for community question answering

  • Jinmeng Wu,
  • HanYu Hong,
  • YaoZong Zhang,
  • YanBin Hao,
  • Lei Ma ,
  • Lei Wang

DOI
https://doi.org/10.3934/era.2023306
Journal volume & issue
Vol. 31, no. 10
pp. 6012 – 6026

Abstract

Read online

The semantic matching problem detects whether the candidate text is related to a specific input text. Basic text matching adopts the method of statistical vocabulary information without considering semantic relevance. Methods based on Convolutional neural networks (CNN) and Recurrent networks (RNN) provide a more optimized structure that can merge the information in the entire sentence into a single sentence-level representation. However, these representations are often not suitable for sentence interactive learning. We design a multi-dimensional semantic interactive learning model based on the mechanism of multiple written heads in the transformer architecture, which not only considers the correlation and position information between different word levels but also further maps the representation of the sentence to the interactive three-dimensional space, so as to solve the problem and the answer can select the best word-level matching pair, respectively. Experimentally, the algorithm in this paper was tested on Yahoo! and StackEx open-domain datasets. The results show that the performance of the method proposed in this paper is superior to the previous CNN/RNN and BERT-based methods.

Keywords