El Departamento de Nosotros: How Machine Translated Corpora Affects Language Models in MRC Tasks
About this item
Publisher
Ithaca: Cornell University Library, arXiv.org
Language
English
More information
Scope and Contents
Contents
Pre-training large-scale language models (LMs) requires huge text corpora. LMs for English enjoy ever-growing corpora of diverse language resources. However, less-resourced languages and their mono- and multilingual LMs often struggle to obtain larger datasets. A typical approach in this case is to use machine translation of English...
Alternative Titles
Full title
El Departamento de Nosotros: How Machine Translated Corpora Affects Language Models in MRC Tasks
Identifiers
Primary Identifiers
Record Identifier
TN_cdi_proquest_journals_2420906813
Permalink
https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_journals_2420906813
Other Identifiers
E-ISSN
2331-8422