AgileFormer: Spatially Agile Transformer UNet for Medical Image Segmentation

| Ask | Become a Library member

AgileFormer: Spatially Agile Transformer UNet for Medical Image Segmentation

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_journals_3030959776

AgileFormer: Spatially Agile Transformer UNet for Medical Image Segmentation

About this item

Full title

AgileFormer: Spatially Agile Transformer UNet for Medical Image Segmentation

Author / Creator

Qiu, Peijie , Yang, Jin , Kumar, Sayantan , Ghosh, Soumyendu Sekhar and Sotiras, Aristeidis

Publisher

Ithaca: Cornell University Library, arXiv.org

Journal title

arXiv.org, 2024-09

Language

English

Formats

Articles

Publication information

Publisher

Ithaca: Cornell University Library, arXiv.org

Subjects

Subjects and topics

More information

Scope and Contents

Contents

In the past decades, deep neural networks, particularly convolutional neural networks, have achieved state-of-the-art performance in a variety of medical image segmentation tasks. Recently, the introduction of the vision transformer (ViT) has significantly altered the landscape of deep segmentation models. There has been a growing focus on ViTs, driven by their excellent performance and scalability. However, we argue that the current design of the vision transformer-based UNet (ViT-UNet) segmentation models may not effectively handle the heterogeneous appearance (e.g., varying shapes and sizes) of objects of interest in medical image segmentation tasks. To tackle this challenge, we present a structured approach to introduce spatially dynamic components to the ViT-UNet. This adaptation enables the model to effectively capture features of target objects with diverse appearances. This is achieved by three main components: \textbf{(i)} deformable patch embedding; \textbf{(ii)} spatially dynamic multi-head attention; \textbf{(iii)} deformable positional encoding. These components were integrated into a novel architecture, termed AgileFormer. AgileFormer is a spatially agile ViT-UNet designed for medical image segmentation. Experiments in three segmentation tasks using publicly available datasets demonstrated the effectiveness of the proposed method. The code is available at \href{https://github.com/sotiraslab/AgileFormer}{https://github.com/sotiraslab/AgileFormer}....

Alternative Titles

Full title

AgileFormer: Spatially Agile Transformer UNet for Medical Image Segmentation

Authors, Artists and Contributors

Author / Creator

Qiu, Peijie
Yang, Jin
Kumar, Sayantan
Ghosh, Soumyendu Sekhar
Sotiras, Aristeidis

Identifiers

Primary Identifiers

Record Identifier

TN_cdi_proquest_journals_3030959776

Permalink

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_journals_3030959776

Other Identifiers

E-ISSN

2331-8422

How to access this item

Full text available

View in old catalogue

AgileFormer: Spatially Agile Transformer UNet for Medical Image Segmentation

AgileFormer: Spatially Agile Transformer UNet for Medical Image Segmentation

AgileFormer: Spatially Agile Transformer UNet for Medical Image Segmentation

About this item

Publication information

Subjects

More information

Scope and Contents

Alternative Titles

Authors, Artists and Contributors

Identifiers

Primary Identifiers

Other Identifiers

How to access this item

Connecting people and collections

Indigenous engagement

Learning

Stories