ETRI Knowledge Sharing Platform : Twin-Net Descriptor: Twin Negative Mining With Quad Loss for Patch-Based Matching

BROWSE

Titles

논문 검색
Type		SCI
Year	~	Keyword

Detail

List

Journal Article Twin-Net Descriptor: Twin Negative Mining With Quad Loss for Patch-Based Matching

Cited 5 time in scopus

Download 138 time Share share

Authors: Aman Irshad, Rehan Hafiz, Mohsen Ali, Muhammad Faisal, Yong Ju Cho, Jeongil Seo

Issue Date: 2019-10

Citation: IEEE Access, v.7, pp.136062-136072

ISSN: 2169-3536

Publisher: IEEE

Language: English

Type: Journal Article

DOI: https://dx.doi.org/10.1109/ACCESS.2019.2940737

Abstract: Local keypoint matching is an important step for computer vision based tasks. In recent years, Deep Convolutional Neural Network (CNN) based strategies have been employed to learn descriptor generation to enhance keypoint matching accuracy. Recent state-of-art works in this direction primarily rely upon a triplet based loss function (and its variations) utilizing three samples: An anchor, a positive and a negative. In this work we propose a novel 'Twin Negative Mining' based sampling strategy coupled with a Quad loss function to train a deep neural network based pipeline (Twin-Net) for generating a robust descriptor that provides an increased discriminatory power to differentiate between patches that do not correspond to each other. Our sampling strategy and choice of loss function is aimed at placing an upper bound that descriptors of two patches representing same location could be at worst no more dissimilar than the descriptors of two similar looking patches that do-not belong to same 3D location. This results in an increase in the generalization capability of the network and outperforms its existing counterparts when trained over the same datasets. Twin-Net outputs a 128-dimensional descriptor and uses $L{2}$ Distance as the similarity metric, and hence conforms to the classical descriptor matching pipelines such as that of SIFT. Our results on Brown and HPatches datasets demonstrate Twin-Net's consistently better performance as well as better discriminatory and generalization capability as compared to the state-of-art.

KSP Keywords: 3D location, Belong to, Computer Vision(CV), Convolution neural network(CNN), Coupled with, Deep convolutional neural networks, Deep neural network(DNN), Descriptor matching, Generalization capability, Local keypoint, Matching accuracy

This work is distributed under the term of Creative Commons License (CCL)
(CC BY)

ETRI-Knowledge Sharing Plaform

BROWSE

Titles

Detail

ETRI