Mathematics (Oct 2021)

Definition Extraction from Generic and Mathematical Domains with Deep Ensemble Learning

  • Natalia Vanetik,
  • Marina Litvak

DOI
https://doi.org/10.3390/math9192502
Journal volume & issue
Vol. 9, no. 19
p. 2502

Abstract

Read online

Definitions are extremely important for efficient learning of new materials. In particular, mathematical definitions are necessary for understanding mathematics-related areas. Automated extraction of definitions could be very useful for automated indexing educational materials, building taxonomies of relevant concepts, and more. For definitions that are contained within a single sentence, this problem can be viewed as a binary classification of sentences into definitions and non-definitions. In this paper, we focus on automatic detection of one-sentence definitions in mathematical and general texts. We experiment with different classification models arranged in an ensemble and applied to a sentence representation containing syntactic and semantic information, to classify sentences. Our ensemble model is applied to the data adjusted with oversampling. Our experiments demonstrate the superiority of our approach over state-of-the-art methods in both general and mathematical domains.

Keywords