SuperCATs: Cost Aggregation with Transformers for Sparse Correspondence
ICCE-Asia 2022

  • Seungjun Lee
  • Seungjun An
  • Sunghwan Hong
  • Seokju Cho
  • Jisu Nam
  • Susung Hong
  • Seungryong Kim
Korea University

Comparison between SuperCATs (left) and SuperGlue (right).

Abstract

In this work, we introduce a novel network, namely SuperCATs, which aims to find a correspondence field between visually similar images. SuperCATs stands on the shoulder of the recently proposed matching networks, SuperGlue and CATs, taking the merits of both for constructing an integrative framework. Specifically, given keypoints and corresponding descriptors, we first apply attentional aggregation consisting of self- and cross- graph neural network to obtain feature descriptors. Subsequently, we construct a cost volume using the descriptors, which then undergoes a tranformer aggregator for cost aggregation. With this approach, we manage to replace the handcrafted module based on solving an optimal transport problem initially included in SuperGlue with a transformer well known for its global receptive fields, making our approach more robust to severe deformations. We conduct experiments to demonstrate the effectiveness of the proposed method, and show that the proposed model is on par with SuperGlue for both indoor and outdoor scenes.

Architecture

Overall network architecture of SuperCATs.

Structure of Transformer Aggregator.

Citation

Acknowledgements

Thanks to our family Podo (cat), Aru (dog) and Dubu (dog) for their support. We love you.
The website template was borrowed from Michaël Gharbi.