TKwinFormer: Top k Window Attention in Vision Transformers for Feature Matching
TKwinFormer: Top k Window Attention in Vision Transformers for Feature Matching
About this item
Full title
Author / Creator
Liao, Yun , Di, Yide , Zhou, Hao , Zhu, Kaijun , Lu, Mingyu , Zhang, Yijia , Duan, Qing and Liu, Junhui
Publisher
Ithaca: Cornell University Library, arXiv.org
Journal title
Language
English
Formats
Publication information
Publisher
Ithaca: Cornell University Library, arXiv.org
Subjects
More information
Scope and Contents
Contents
Local feature matching remains a challenging task, primarily due to difficulties in matching sparse keypoints and low-texture regions. The key to solving this problem lies in effectively and accurately integrating global and local information. To achieve this goal, we introduce an innovative local feature matching method called TKwinFormer. Our approach employs a multi-stage matching strategy to optimize the efficiency of information interaction. Furthermore, we propose a novel attention mechanism called Top K Window Attention, which facilitates global information interaction through window tokens prior to patch-level matching, resulting in improved matching accuracy. Additionally, we design an attention block to enhance attention between channels. Experimental results demonstrate that TKwinFormer outperforms state-of-the-art methods on various benchmarks. Code is available at: https://github.com/LiaoYun0x0/TKwinFormer....
Alternative Titles
Full title
TKwinFormer: Top k Window Attention in Vision Transformers for Feature Matching
Authors, Artists and Contributors
Author / Creator
Identifiers
Primary Identifiers
Record Identifier
TN_cdi_proquest_journals_2858809827
Permalink
https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_journals_2858809827
Other Identifiers
E-ISSN
2331-8422