Digital Human Intelligent Interaction System Based on Multimodal Pre-training Mode

Xuliang Yang; Yong Fang; Lili Wang; Rodolfo C. Raga

doi:10.1080/08839514.2024.2405953

Applied Artificial Intelligence (Dec 2024)

Digital Human Intelligent Interaction System Based on Multimodal Pre-training Mode

Xuliang Yang,
Yong Fang,
Lili Wang,
Rodolfo C. Raga

Affiliations

Xuliang Yang: Dongguan City University, University and Urban Integration Development Research Center, Dongguan, China
Yong Fang: Dongguan City University, University and Urban Integration Development Research Center, Dongguan, China
Lili Wang: Dongguan City University, University and Urban Integration Development Research Center, Dongguan, China
Rodolfo C. Raga: College of Computing & Information Technologies, National University, Manila, Philippines

DOI: https://doi.org/10.1080/08839514.2024.2405953
Journal volume & issue: Vol. 38, no. 1

Abstract

Read online

As the forefront of human–computer intelligent interaction, digital humans have increasingly diverse application scenarios, but also face many challenges. In order to enhance the intelligence level and interaction capability of digital human, the study first optimizes the traditional UniLM model, then introduces the mechanism of multi-head attention, and mitigates the exposure bias by using adversarial training and random replacement of decoder to design an improved UniLM model, and finally applies the improved model to the digital human management system. The results show that the precision rate of the improved UniLM model is improved by 7.68%, 6.4%, and 4.96%, the recall rate is improved by 11.94%, 9.69%, and 8.83%, and the F1-score is improved by 8.34%, 6.41%, and 7.68% compared with the other three models, which proves that it has a better precision rate, robustness, and generalization ability. The perplexity of the improved UniLM model is 135, 95, 76, 71, and 55 under five text lengths, which is significantly lower than the other models, proving that its text generation ability is better. The above results demonstrate the performance of the research-designed digital human system based on the Improved UniLM model, which provides a direction for the further development of digital human technology.

Published in Applied Artificial Intelligence

ISSN: 0883-9514 (Print); 1087-6545 (Online)
Publisher: Taylor & Francis Group
Country of publisher: United Kingdom
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science; Science: Science (General): Cybernetics
Website: https://www.tandfonline.com/journals/uaai

About the journal