Complexity (Jan 2021)

A Discourse Coherence Analysis Method Combining Sentence Embedding and Dimension Grid

  • Lanlan Jiang,
  • Shengjun Yuan,
  • Jun Li

DOI
https://doi.org/10.1155/2021/6654925
Journal volume & issue
Vol. 2021

Abstract

Read online

Discourse coherence is strongly associated with text quality, making it important to natural language generation and understanding. However, existing coherence models focus on measuring individual aspects of coherence, such as lexical overlap, entity centralization, rhetorical structure, etc., lacking measurement of the semantics of text. In this paper, we propose a discourse coherence analysis method combining sentence embedding and the dimension grid, we obtain sentence-level vector representation by deep learning, and we introduce a coherence model that captures the fine-grained semantic transitions in text. Our work is based on the hypothesis that each dimension in the embedding vector is exactly assigned a stated certainty and specific semantic. We take every dimension as an equal grid and compute its transition probabilities. The document feature vector is also enriched to model the coherence. Finally, the experimental results demonstrate that our method achieves excellent performance on two coherence-related tasks.