Audio-Visual Biometric Recognition and Presentation Attack Detection: A Comprehensive Survey

Hareesh Mandalapu; Aravinda Reddy P N; Raghavendra Ramachandra; Krothapalli Sreenivasa Rao; Pabitra Mitra; S. R. Mahadeva Prasanna; Christoph Busch

doi:10.1109/ACCESS.2021.3063031

IEEE Access (Jan 2021)

Audio-Visual Biometric Recognition and Presentation Attack Detection: A Comprehensive Survey

Hareesh Mandalapu,
Aravinda Reddy P N,
Raghavendra Ramachandra,
Krothapalli Sreenivasa Rao,
Pabitra Mitra,
S. R. Mahadeva Prasanna,
Christoph Busch

Affiliations

Hareesh Mandalapu: ORCiD; Department of Information Security and Communication Technology, Norwegian University of Science and Technology (NTNU), Gjøvik, Norway
Aravinda Reddy P N: Advanced Technology Development Centre, Indian Institute of Technology Kharagpur, Kharagpur, India
Raghavendra Ramachandra: ORCiD; Department of Information Security and Communication Technology, Norwegian University of Science and Technology (NTNU), Gjøvik, Norway
Krothapalli Sreenivasa Rao: Department of Computer Science and Engineering, Indian Institute of Technology Kharagpur, Kharagpur, India
Pabitra Mitra: Department of Computer Science and Engineering, Indian Institute of Technology Kharagpur, Kharagpur, India
S. R. Mahadeva Prasanna: Department of Electrical Engineering, Indian Institute of Technology Dharwad, Dharwad, India
Christoph Busch: ORCiD; Department of Information Security and Communication Technology, Norwegian University of Science and Technology (NTNU), Gjøvik, Norway

DOI: https://doi.org/10.1109/ACCESS.2021.3063031
Journal volume & issue: Vol. 9
pp. 37431 – 37455

Abstract

Read online

Biometric recognition is a trending technology that uses unique characteristics data to identify or verify/authenticate security applications. Amidst the classically used biometrics, voice and face attributes are the most propitious for prevalent applications in day-to-day life because they are easy to obtain through restrained and user-friendly procedures. The pervasiveness of low-cost audio and face capture sensors in smartphones, laptops, and tablets has made the advantage of voice and face biometrics more exceptional when compared to other biometrics. For many years, acoustic information alone has been a great success in automatic speaker verification applications. Meantime, the last decade or two has also witnessed a remarkable ascent in face recognition technologies. Nonetheless, in adverse unconstrained environments, neither of these techniques achieves optimal performance. Since audio-visual information carries correlated and complementary information, integrating them into one recognition system can increase the system's performance. The vulnerability of biometrics towards presentation attacks and audio-visual data usage for the detection of such attacks is also a hot topic of research. This paper made a comprehensive survey on existing state-of-the-art audio-visual recognition techniques, publicly available databases for benchmarking, and Presentation Attack Detection (PAD) algorithms. Further, a detailed discussion on challenges and open problems is presented in this field of biometrics.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords