Proceedings of the XXth Conference of Open Innovations Association FRUCT (Apr 2019)
Analysis of Natural Language Sentences by Methods of the Theory of Graphs and the Theory of Sets
Abstract
Natural language sentence can be represented by means of graphs, where words, groups of pixels or variants of decisions are used as vertices, and as edges is the relationship between words in a sentence, elements of images or decisions. In most sentences, the relations of subordination and linear order are related. The representation of the syntactic structure of the sentence in the form of a subordination tree is used in generative grammars of the language and in the algorithms of syntactic analysis. The tree is built, starting from the distribution of lexical units of the sentence by parts of speech, and then there is a transfer from the subordination tree to the tree of the components. A binary search tree is a kind of data structure that corresponds to the representation of the sentence in the form of a tree of components. When a tree graph is built it's possible to proceed to the analysis of a lexical expression by comparing the intersection of sets representing it with the dictionary expressions in order to reveal the maximum coincidence between them.