Lightweight Model for Fish Recognition Based on YOLOV5-MobilenetV3 and Sonar Images

Yizhi LUO; Huazhong LU; Xingxing ZHOU; Yu YUAN; Haijun QI; Bin LI; Zhichang LIU

doi:10.16768/j.issn.1004-874X.2023.07.004

Guangdong nongye kexue (Jul 2023)

Lightweight Model for Fish Recognition Based on YOLOV5-MobilenetV3 and Sonar Images

Yizhi LUO,
Huazhong LU,
Xingxing ZHOU,
Yu YUAN,
Haijun QI,
Bin LI,
Zhichang LIU

Affiliations

Yizhi LUO: Institute of Facility Agriculture, Guangdong Academy of Agricultural Sciences, Guangzhou 510640, China
Huazhong LU: Guangdong Academy of Agricultural Sciences, Guangzhou 510640, China
Xingxing ZHOU: Institute of Facility Agriculture, Guangdong Academy of Agricultural Sciences, Guangzhou 510640, China
Yu YUAN: Institute of Facility Agriculture, Guangdong Academy of Agricultural Sciences, Guangzhou 510640, China
Haijun QI: Institute of Facility Agriculture, Guangdong Academy of Agricultural Sciences, Guangzhou 510640, China
Bin LI: Institute of Facility Agriculture, Guangdong Academy of Agricultural Sciences, Guangzhou 510640, China
Zhichang LIU: Institute of Animal Science (Fisheries Research Institute), Guangdong Academy of Agricultural Sciences, Guangzhou 510645, China

DOI: https://doi.org/10.16768/j.issn.1004-874X.2023.07.004
Journal volume & issue: Vol. 50, no. 7
pp. 37 – 46

Abstract

Read online

【Objective】Cage biometrics and statistics are one of the key reference factors for marine pasture farming management. Aiming at the interference of reverberation noise and complex background, this paper constructs fish detection data sets under different lighting conditions, and uses forward-looking sonar imaging technology to propose a fish recognition lightweight model based on YOLOV5-MobilenetV3 and sonar images (LAPR-Net) to realize fish recognition in water cages in turbid or dark scenes.【Method】Taking tilapia as the research object, based on the frame structure of the YOLOV5 model, the backbone network module ado pts the lightweight Mob ileNetV3 bneck block, using the linear bottleneck inverse residual structure and depth separable convolution extract the features of fish in sonar images, applying the attention mechanism SE-Net to obtain multi-scale semantic features of sonar images and enhance the correlation between features; the neck network adopts the path aggregation network structure to perform multi-scale fusion of target features, to enhance the feature fusion ability; the prediction part adopts the maximum local search based on the non-maximum suppression method, removes the redundant detection frame, screens the detection frame with the highest confidence, and finally outputs and displays the detection result of the fish, including the position, category and detection probability of detecting an object.【Result】Four other mainstream detection models were selected for comparative experiments, including YOLOV3-ting (Darknet53), YOLOV5 (CSPdarknet53), YOLOV5 (Repvgg), and YOLOV5s (Transformer). It proposes the model parameter quantityof 3 545 453, FLOPs of 6.3 G, and the mAP of 0.957, and the average inference speed of each picture of the model is 0.08868 s. Compared with the YOLOV5 model, the mAP of the improved model has increased by 9.7%.【Conclusion】The proposed network improves the speed of training and recognition, reduces the requirements for hardware equipment, and provides a reference for the detection model of cage cultured fish in marine pastures.

Published in Guangdong nongye kexue

ISSN: 1004-874X (Print)
Publisher: Guangdong Academy of Agricultural Sciences
Country of publisher: China
LCC subjects: Agriculture
Website: http://gdnykx.cnjournals.org/gdnykx/ch/index.aspx

About the journal

Abstract

Keywords