From a Visual Scene to a Virtual Representation: A Cross-Domain Review

Americo Pereira; Pedro Carvalho; Nuno Pereira; Paula Viana; Luis Corte-Real

doi:10.1109/ACCESS.2023.3283495

IEEE Access (Jan 2023)

From a Visual Scene to a Virtual Representation: A Cross-Domain Review

Americo Pereira,
Pedro Carvalho,
Nuno Pereira,
Paula Viana,
Luis Corte-Real

Affiliations

Americo Pereira: ORCiD; Centre for Telecommunications and Multimedia, Institute for Systems and Computer Engineering, Technology and Science (INESC TEC), Porto, Portugal
Pedro Carvalho: ORCiD; Centre for Telecommunications and Multimedia, Institute for Systems and Computer Engineering, Technology and Science (INESC TEC), Porto, Portugal
Nuno Pereira: Centre for Telecommunications and Multimedia, Institute for Systems and Computer Engineering, Technology and Science (INESC TEC), Porto, Portugal
Paula Viana: ORCiD; Centre for Telecommunications and Multimedia, Institute for Systems and Computer Engineering, Technology and Science (INESC TEC), Porto, Portugal
Luis Corte-Real: ORCiD; Centre for Telecommunications and Multimedia, Institute for Systems and Computer Engineering, Technology and Science (INESC TEC), Porto, Portugal

DOI: https://doi.org/10.1109/ACCESS.2023.3283495
Journal volume & issue: Vol. 11
pp. 57916 – 57933

Abstract

Read online

The widespread use of smartphones and other low-cost equipment as recording devices, the massive growth in bandwidth, and the ever-growing demand for new applications with enhanced capabilities, made visual data a must in several scenarios, including surveillance, sports, retail, entertainment, and intelligent vehicles. Despite significant advances in analyzing and extracting data from images and video, there is a lack of solutions able to analyze and semantically describe the information in the visual scene so that it can be efficiently used and repurposed. Scientific contributions have focused on individual aspects or addressing specific problems and application areas, and no cross-domain solution is available to implement a complete system that enables information passing between cross-cutting algorithms. This paper analyses the problem from an end-to-end perspective, i.e., from the visual scene analysis to the representation of information in a virtual environment, including how the extracted data can be described and stored. A simple processing pipeline is introduced to set up a structure for discussing challenges and opportunities in different steps of the entire process, allowing to identify current gaps in the literature. The work reviews various technologies specifically from the perspective of their applicability to an end-to-end pipeline for scene analysis and synthesis, along with an extensive analysis of datasets for relevant tasks.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords