CAAI Transactions on Intelligence Technology (Feb 2024)

Multi‐modal knowledge graph inference via media convergence and logic rule

  • Feng Lin,
  • Dongmei Li,
  • Wenbin Zhang,
  • Dongsheng Shi,
  • Yuanzhou Jiao,
  • Qianzhong Chen,
  • Yiying Lin,
  • Wentao Zhu

DOI
https://doi.org/10.1049/cit2.12217
Journal volume & issue
Vol. 9, no. 1
pp. 211–221

Abstract

Media convergence works by processing information from different modalities and applying it across domains. Conventional knowledge graphs struggle to utilise multi‐media features because introducing a large amount of information from other modalities degrades representation learning and thereby weakens knowledge graph inference. To address this issue, an inference method based on a Media Convergence and Rule‐guided Joint Inference model (MCRJI) is proposed. The authors not only converge the multi‐media features of entities but also introduce logic rules to improve the accuracy and interpretability of link prediction. First, a multi‐headed self‐attention approach is used to weight the different media features of an entity during semantic synthesis. Second, logic rules of different lengths are mined from the knowledge graph to learn new entity representations. Finally, knowledge graph inference is performed on the entity representations that converge the multi‐media features. Extensive experimental results show that MCRJI outperforms other advanced baselines in exploiting multi‐media features for knowledge graph inference, demonstrating that MCRJI is an effective approach to knowledge graph inference with converged multi‐media features.
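The first step described above, using multi‐head self‐attention to weight an entity's media features during semantic synthesis, can be sketched as follows. This is an illustrative NumPy sketch, not the authors' implementation: the random projection matrices stand in for learned parameters, and the three input rows stand in for hypothetical text, image, and structural feature vectors of one entity.

```python
import numpy as np

def fuse_media_features(X, num_heads=2, seed=0):
    """Fuse per-modality entity features via multi-head self-attention.

    X: (num_modalities, d) array, one row per media feature of an entity
    (e.g. text, image, graph structure). Projection weights are random
    here purely for illustration; in a trained model they are learned.
    Returns a single fused embedding of dimension d.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    dh = d // num_heads  # per-head dimension
    heads = []
    for _ in range(num_heads):
        # Random query/key/value projections (stand-ins for learned weights).
        Wq, Wk, Wv = (rng.standard_normal((d, dh)) / np.sqrt(d)
                      for _ in range(3))
        Q, K, V = X @ Wq, X @ Wk, X @ Wv
        # Scaled dot-product attention across the modality axis.
        scores = Q @ K.T / np.sqrt(dh)
        attn = np.exp(scores - scores.max(axis=1, keepdims=True))
        attn /= attn.sum(axis=1, keepdims=True)
        heads.append(attn @ V)
    H = np.concatenate(heads, axis=1)  # (num_modalities, d)
    # Pool attended modality rows into one fused entity embedding.
    return H.mean(axis=0)

# Hypothetical entity with 3 modality feature vectors of dimension 8.
features = np.random.default_rng(1).standard_normal((3, 8))
fused = fuse_media_features(features, num_heads=2)
```

The attention weights let each modality attend to the others, so a noisy modality contributes less to the fused representation than in a plain average, which is the motivation the abstract gives for semantic synthesis before inference.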

Keywords