Applied Sciences (Sep 2023)

Improved Lightweight Multi-Target Recognition Model for Live Streaming Scenes

  • Zongwei Li,
  • Kai Qiao,
  • Jianing Chen,
  • Zhenyu Li,
  • Yanhui Zhang

DOI
https://doi.org/10.3390/app131810170
Journal volume & issue
Vol. 13, no. 18
p. 10170

Abstract

Read online

Nowadays, the commercial potential of live e-commerce is being continuously explored, and machine vision algorithms are gradually attracting the attention of marketers and researchers. During live streaming, the visuals can be effectively captured by algorithms, thereby providing additional data support. This paper aims to consider the diversity of live streaming devices and proposes an extremely lightweight and high-precision model to meet different requirements in live streaming scenarios. Building upon yolov5s, we incorporate the MobileNetV3 module and the CA attention mechanism to optimize the model. Furthermore, we construct a multi-object dataset specific to live streaming scenarios, including anchor facial expressions and commodities. A series of experiments have demonstrated that our model realized a 0.4% improvement in accuracy compared to the original model, while reducing its weight to 10.52%.

Keywords