A novel iteration scheme with conjugate gradient for faster pruning on transformer models
A novel iteration scheme with conjugate gradient for faster pruning on transformer models
About this item
Full title
Author / Creator
Li, Jun , Zhu, Yuchen and Sun, Kexue
Publisher
Cham: Springer International Publishing
Journal title
Language
English
Formats
Publication information
Publisher
Cham: Springer International Publishing
Subjects
More information
Scope and Contents
Contents
Pre-trained models based on the Transformer architecture have significantly advanced research within the domain of Natural Language Processing (NLP) due to their superior performance and extensive applicability across multiple technological sectors. Despite these advantages, there is a significant challenge in optimizing these models for more effic...
Alternative Titles
Full title
A novel iteration scheme with conjugate gradient for faster pruning on transformer models
Authors, Artists and Contributors
Author / Creator
Identifiers
Primary Identifiers
Record Identifier
TN_cdi_doaj_primary_oai_doaj_org_article_c357960ba562483a9b42cd02800ef899
Permalink
https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_doaj_primary_oai_doaj_org_article_c357960ba562483a9b42cd02800ef899
Other Identifiers
ISSN
2199-4536
E-ISSN
2198-6053
DOI
10.1007/s40747-024-01595-w