Boosting AI applications: Labeling format for complex datasets

Marcos Nieto; Orti Senderos; Oihana Otaegui

SoftwareX (Jan 2021)

Affiliations

Marcos Nieto (corresponding author): Vicomtech Foundation, Basque Research and Technology Alliance (BRTA), Mikeletegi 57, 20009 San Sebastian, Spain
Orti Senderos: Vicomtech Foundation, Basque Research and Technology Alliance (BRTA), Mikeletegi 57, 20009 San Sebastian, Spain
Oihana Otaegui: Vicomtech Foundation, Basque Research and Technology Alliance (BRTA), Mikeletegi 57, 20009 San Sebastian, Spain

Journal volume & issue: Vol. 13, p. 100653

Abstract

Data labeling has become a major problem for industries that need to create and use ground-truth labels from massive multi-sensor archives to feed Artificial Intelligence (AI) applications. Annotating multi-sensor set-ups that combine multiple cameras and LIDAR is now particularly relevant for the automotive industry, which aims to build Autonomous Driving (AD) functions. In this paper, we present the Video Content Description (VCD), the first open-source metadata structure and set of tools able to structure annotations for such complex scenes. It offers unprecedented flexibility to label 2D and 3D objects, pixel-wise labels, actions, events, contexts, semantic relations, odometry, and calibration. Several example cases are reported to demonstrate the flexibility of the VCD.
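
To make the idea of a single metadata structure for multi-sensor annotations more concrete, the following is a minimal sketch in Python. It is not the VCD schema or the VCD API: every name in it (new_scene, add_stream, add_object, add_geometry, add_relation, frames_of, and all dictionary keys) is a hypothetical choice made for illustration. It only shows how 2D boxes, 3D cuboids, and semantic relations from several sensors might share one JSON-serializable container.

```python
import json


def new_scene(name):
    """Create an empty annotation container (hypothetical layout, not the VCD schema)."""
    return {"name": name, "streams": {}, "objects": {}, "relations": [], "frames": {}}


def add_stream(scene, stream_name, sensor_type, calibration=None):
    """Register a sensor stream, e.g. a camera or a LIDAR, with optional calibration data."""
    scene["streams"][stream_name] = {"type": sensor_type, "calibration": calibration or {}}


def add_object(scene, uid, obj_type):
    """Register an object once; per-frame geometry is attached separately."""
    scene["objects"][uid] = {"type": obj_type}


def add_geometry(scene, frame, uid, stream, kind, values):
    """Attach a geometry (e.g. a 2D 'bbox' or a 3D 'cuboid') to an object at one frame."""
    if uid not in scene["objects"]:
        raise KeyError(f"unknown object: {uid}")
    if stream not in scene["streams"]:
        raise KeyError(f"unknown stream: {stream}")
    frame_entry = scene["frames"].setdefault(frame, {"objects": {}})
    frame_entry["objects"].setdefault(uid, []).append(
        {"kind": kind, "stream": stream, "val": list(values)}
    )


def add_relation(scene, subject_uid, predicate, object_uid):
    """Store a semantic relation as a subject-predicate-object triple."""
    scene["relations"].append(
        {"subject": subject_uid, "predicate": predicate, "object": object_uid}
    )


def frames_of(scene, uid):
    """Return the sorted frame numbers in which an object has any geometry."""
    return sorted(f for f, entry in scene["frames"].items() if uid in entry["objects"])


# Build a tiny two-sensor scene with made-up values.
scene = new_scene("demo")
add_stream(scene, "camera_front", "camera")
add_stream(scene, "lidar_top", "lidar")
add_object(scene, "car-1", "Car")
add_object(scene, "ped-1", "Pedestrian")

# 2D boxes as (x, y, width, height) in pixels.
add_geometry(scene, 0, "car-1", "camera_front", "bbox", (100, 200, 50, 40))
add_geometry(scene, 1, "car-1", "camera_front", "bbox", (104, 200, 50, 40))
add_geometry(scene, 1, "ped-1", "camera_front", "bbox", (300, 180, 20, 60))

# 3D cuboid as (x, y, z, roll, pitch, yaw, size_x, size_y, size_z).
add_geometry(scene, 1, "car-1", "lidar_top", "cuboid",
             (12.0, -1.5, 0.8, 0.0, 0.0, 0.1, 4.2, 1.8, 1.5))

add_relation(scene, "ped-1", "isNear", "car-1")

# The whole structure serializes to JSON; integer frame keys become strings.
serialized = json.dumps(scene, indent=2)
```

In this sketch, objects are declared once and their geometries are indexed by frame, so the same object can carry a 2D box from a camera and a 3D cuboid from a LIDAR at the same frame. Calling frames_of(scene, "car-1") lists the frames in which that object is annotated.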

Published in SoftwareX

ISSN: 2352-7110 (Online)
Publisher: Elsevier
Country of publisher: Netherlands
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: http://www.journals.elsevier.com/softwarex/
