A General Language Assistant as a Laboratory for Alignment
A General Language Assistant as a Laboratory for Alignment
About this item
Full title
Author / Creator
Askell, Amanda , Bai, Yuntao , Chen, Anna , Drain, Dawn , Ganguli, Deep , Henighan, Tom , Jones, Andy , Nicholas, Joseph , Mann, Ben , DasSarma, Nova , Nelson Elhage , Hatfield-Dodds, Zac , Hernandez, Danny , Jackson Kernion , Ndousse, Kamal , Olsson, Catherine , Amodei, Dario , Brown, Tom , Clark, Jack , McCandlish, Sam , Olah, Chris and Kaplan, Jared
Publisher
Ithaca: Cornell University Library, arXiv.org
Journal title
Language
English
Formats
Publication information
Publisher
Ithaca: Cornell University Library, arXiv.org
Subjects
More information
Scope and Contents
Contents
Given the broad capabilities of large language models, it should be possible to work towards a general-purpose, text-based assistant that is aligned with human values, meaning that it is helpful, honest, and harmless. As an initial foray in this direction we study simple baseline techniques and evaluations, such as prompting. We find that the benef...
Alternative Titles
Full title
A General Language Assistant as a Laboratory for Alignment
Authors, Artists and Contributors
Author / Creator
Bai, Yuntao
Chen, Anna
Drain, Dawn
Ganguli, Deep
Henighan, Tom
Jones, Andy
Nicholas, Joseph
Mann, Ben
DasSarma, Nova
Nelson Elhage
Hatfield-Dodds, Zac
Hernandez, Danny
Jackson Kernion
Ndousse, Kamal
Olsson, Catherine
Amodei, Dario
Brown, Tom
Clark, Jack
McCandlish, Sam
Olah, Chris
Kaplan, Jared
Identifiers
Primary Identifiers
Record Identifier
TN_cdi_proquest_journals_2605773955
Permalink
https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_journals_2605773955
Other Identifiers
E-ISSN
2331-8422