Improving Multilingual Instruction Finetuning via Linguistically Natural and Diverse Datasets
Improving Multilingual Instruction Finetuning via Linguistically Natural and Diverse Datasets
About this item
Full title
Author / Creator
Publisher
Ithaca: Cornell University Library, arXiv.org
Journal title
Language
English
Formats
Publication information
Publisher
Ithaca: Cornell University Library, arXiv.org
Subjects
More information
Scope and Contents
Contents
Advancements in Large Language Models (LLMs) have significantly enhanced instruction-following capabilities. However, most Instruction Fine-Tuning (IFT) datasets are predominantly in English, limiting model performance in other languages. Traditional methods for creating multilingual IFT datasets such as translating existing English IFT datasets or...
Alternative Titles
Full title
Improving Multilingual Instruction Finetuning via Linguistically Natural and Diverse Datasets
Authors, Artists and Contributors
Identifiers
Primary Identifiers
Record Identifier
TN_cdi_proquest_journals_3075443614
Permalink
https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_journals_3075443614
Other Identifiers
E-ISSN
2331-8422