Quantifying the Effects of Ground Truth Annotation Quality on Object Detection and Instance Segmentation Performance

Cathaoir Agnew; Ciaran Eising; Patrick Denny; Anthony Scanlan; Pepijn Van De Ven; Eoin M. Grua

doi:10.1109/ACCESS.2023.3256723

IEEE Access (Jan 2023)

Quantifying the Effects of Ground Truth Annotation Quality on Object Detection and Instance Segmentation Performance

Cathaoir Agnew,
Ciaran Eising,
Patrick Denny,
Anthony Scanlan,
Pepijn Van De Ven,
Eoin M. Grua

Affiliations

Cathaoir Agnew: ORCiD; Department of Electronic and Computer Engineering, Data-Driven Computer Engineering (D2iCE) Group, University of Limerick, Limerick, Ireland
Ciaran Eising: ORCiD; CONFIRM Centre for Smart Manufacturing, University of Limerick, Limerick, Ireland
Patrick Denny: Department of Electronic and Computer Engineering, Data-Driven Computer Engineering (D2iCE) Group, University of Limerick, Limerick, Ireland
Anthony Scanlan: ORCiD; Department of Electronic and Computer Engineering, Data-Driven Computer Engineering (D2iCE) Group, University of Limerick, Limerick, Ireland
Pepijn Van De Ven: Department of Electronic and Computer Engineering, Data-Driven Computer Engineering (D2iCE) Group, University of Limerick, Limerick, Ireland
Eoin M. Grua: ORCiD; Department of Electronic and Computer Engineering, Data-Driven Computer Engineering (D2iCE) Group, University of Limerick, Limerick, Ireland

DOI: https://doi.org/10.1109/ACCESS.2023.3256723
Journal volume & issue: Vol. 11
pp. 25174 – 25188

Abstract

Read online

Fully-supervised object detection and instance segmentation models have accomplished notable results on large-scale computer vision benchmark datasets. However, fully-supervised machine learning algorithms’ performances are immensely dependent on the quality of the training data. Preparing computer vision datasets for object detection and instance segmentation is a labor-intensive task requiring each instance in an image to be annotated. In practice, this often results in the quality of bounding box and polygon mask annotations being suboptimal. This paper quantifies empirically the ground truth annotation quality and COCO’s mean average precision (mAP) performance by introducing two separate noise measures, uniform and radial, into the ground truth bounding box and polygon mask annotations for the COCO and Cityscapes datasets. Mask-RCNN models are trained on various levels of noise measures to investigate the performance of each level of noise. The results showed degradation of mAP as the level of both noise measures increased. For object detection and instance segmentation respectively, using the highest level of noise measure resulted in a mAP degradation of 0.185 & 0.208 for uniform noise with reductions of 0.118 & 0.064 for radial noise on the COCO dataset. As for the Cityscapes datasets, reductions of mAP performance of 0.147 & 0.142 for uniform noise and 0.101 & 0.033 for radial noise were recorded. Furthermore, a decrease in average precision is seen across all classes, with the exception of the class motorcycle. The reductions between classes vary, indicating the effects of annotation uncertainty are class-dependent.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords