HybridMatch: Semi-Supervised Facial Landmark Detection via Hybrid Heatmap Representations

Seoungyoon Kang; Minhyun Lee; Minjae Kim; Hyunjung Shim

doi:10.1109/ACCESS.2023.3257180

IEEE Access (Jan 2023)

HybridMatch: Semi-Supervised Facial Landmark Detection via Hybrid Heatmap Representations

Seoungyoon Kang,
Minhyun Lee,
Minjae Kim,
Hyunjung Shim

Affiliations

Seoungyoon Kang: ORCiD; School of Integrated Technology, Yonsei University, Yeonsu-gu, Incheon, Republic of Korea
Minhyun Lee: School of Integrated Technology, Yonsei University, Yeonsu-gu, Incheon, Republic of Korea
Minjae Kim: ORCiD; Vision AI Laboratory, AI Center, NCSOFT, Bundang-gu, Seongnam-si, Gyeonggi-do, Republic of Korea
Hyunjung Shim: ORCiD; Kim Jaechul Graduate School of Artificial Intelligence, KAIST, Dongdaemun-gu, Seoul, Republic of Korea

DOI: https://doi.org/10.1109/ACCESS.2023.3257180
Journal volume & issue: Vol. 11
pp. 26125 – 26135

Abstract

Read online

Facial landmark detection is an essential task in face-processing techniques. Traditional methods, however, require expensive pixel-level labels. Semi-supervised facial landmark detection has been explored as an alternative, but previous approaches only focus on training-oriented issues (e.g., noisy pseudo-labels in semi-supervised learning), neglecting task-oriented issues (i.e., the quantization error in landmark detection). We argue that semi-supervised landmark detectors should resolve the two technical issues simultaneously. Through a simple experiment, we found that task- and training-oriented solutions may negatively influence each other, thus eliminating their negative interactions is important. To this end, we devise a new heatmap regression framework via hybrid representation, namely HybridMatch. We utilize both 1-D and 2-D heatmap representations. Here, the 1-D and 2-D heatmaps help alleviate the task-oriented and training-oriented issues, respectively. To exploit the advantages of our hybrid representation, we introduce curriculum learning; relying more on the 2-D heatmap at the early training stage and gradually increasing the effects of the 1-D heatmap. By resolving the two issues simultaneously, we can capture more precise landmark points than existing methods with only a few annotated data. Extensive experiments show that HybridMatch achieves state-of-the-art performance on three benchmark datasets, especially showing 26.3% NME improvement over the existing method in the 300-W full set at 5% data ratio. Surprisingly, our method records a comparable performance, 5.04 (challenging set in the 300-W) to the fully-supervised facial landmark detector 5.03. The remarkable performance of HybridMatch shows its potential as a practical alternative to the fully-supervised model.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords