Twin-Net Descriptor: Twin Negative Mining With Quad Loss for Patch-Based Matching

Aman Irshad; Rehan Hafiz; Mohsen Ali; Muhammad Faisal; Yongju Cho; Jeongil Seo

doi:10.1109/ACCESS.2019.2940737

IEEE Access (Jan 2019)

Twin-Net Descriptor: Twin Negative Mining With Quad Loss for Patch-Based Matching

Aman Irshad,
Rehan Hafiz,
Mohsen Ali,
Muhammad Faisal,
Yongju Cho,
Jeongil Seo

Affiliations

Aman Irshad: Information Technology University, Lahore, Pakistan
Rehan Hafiz: Information Technology University, Lahore, Pakistan
Mohsen Ali: Information Technology University, Lahore, Pakistan
Muhammad Faisal: Information Technology University, Lahore, Pakistan
Yongju Cho: ORCiD; Tera Media Research Group, Electronics and Telecommunications Research Institute, Daejeon, South Korea
Jeongil Seo: Tera Media Research Group, Electronics and Telecommunications Research Institute, Daejeon, South Korea

DOI: https://doi.org/10.1109/ACCESS.2019.2940737
Journal volume & issue: Vol. 7
pp. 136062 – 136072

Abstract

Read online

Local keypoint matching is an important step for computer vision based tasks. In recent years, Deep Convolutional Neural Network (CNN) based strategies have been employed to learn descriptor generation to enhance keypoint matching accuracy. Recent state-of-art works in this direction primarily rely upon a triplet based loss function (and its variations) utilizing three samples: an anchor, a positive and a negative. In this work we propose a novel “Twin Negative Mining” based sampling strategy coupled with a Quad loss function to train a deep neural network based pipeline (Twin-Net) for generating a robust descriptor that provides an increased discriminatory power to differentiate between patches that do not correspond to each other. Our sampling strategy and choice of loss function is aimed at placing an upper bound that descriptors of two patches representing same location could be at worst no more dissimilar than the descriptors of two similar looking patches that do-not belong to same 3D location. This results in an increase in the generalization capability of the network and outperforms its existing counterparts when trained over the same datasets. Twin-Net outputs a 128-dimensional descriptor and uses L2 Distance as the similarity metric, and hence conforms to the classical descriptor matching pipelines such as that of SIFT. Our results on Brown and HPatches datasets demonstrate Twin-Net's consistently better performance as well as better discriminatory and generalization capability as compared to the state-of-art.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords