Leveraging two-dimensional pre-trained vision transformers for three-dimensional model generation vi...
Leveraging two-dimensional pre-trained vision transformers for three-dimensional model generation via masked autoencoders
About this item
Full title
Author / Creator
Publisher
London: Nature Publishing Group UK
Journal title
Language
English
Formats
Publication information
Publisher
London: Nature Publishing Group UK
Subjects
More information
Scope and Contents
Contents
Although the Transformer architecture has established itself as the industry standard for jobs involving natural language processing, it still has few uses in computer vision. In vision, attention is used in conjunction with convolutional networks or to replace individual convolutional network elements while preserving the overall network design. D...
Alternative Titles
Full title
Leveraging two-dimensional pre-trained vision transformers for three-dimensional model generation via masked autoencoders
Authors, Artists and Contributors
Identifiers
Primary Identifiers
Record Identifier
TN_cdi_doaj_primary_oai_doaj_org_article_3143c9ff3a0846e5bbefb25e2bfc10b7
Permalink
https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_doaj_primary_oai_doaj_org_article_3143c9ff3a0846e5bbefb25e2bfc10b7
Other Identifiers
ISSN
2045-2322
E-ISSN
2045-2322
DOI
10.1038/s41598-025-87376-y