EAI Endorsed Transactions on Scalable Information Systems (Jun 2019)
An Experimental Study with Tensor Flow for Characteristic mining of Mathematical Formulae from a Document
Abstract
Through this article a deep learning technique is proposed for the extraction and classification of mathematical keywords from textual documents. Extraction of math keywords from textual data is predominant problem as textual documents contain a culmination of mathematical symbols and literals from natural language such as alphabets and words. Separation of these textual words embedded in the mathematical formulae is a complex task. Our proposed technique solves this critical problem of extracting mathematical keywords from textual documents using techniques such as stemming,tokenization and clustering mathematical keywords based on a training set of mathematical keyword and formulae pairs. The performance of the proposed technique is measured using the metrics such as retrieval time, Sensitivity, Accuracy, FPR, FNR, and FDR are used for appraisal of the proposed technique.