Measuring Progress on Scalable Oversight for Large Language Models
About this item
Author / Creator
Bowman, Samuel R; Jeeyoon Hyun; Perez, Ethan; Chen, Edwin; Pettit, Craig; Scott, Heiner; Lukošiūtė, Kamilė; Askell, Amanda; Jones, Andy; Chen, Anna; Goldie, Anna; Mirhoseini, Azalia; McKinnon, Cameron; Olah, Christopher; Amodei, Daniela; Amodei, Dario; Drain, Dawn; Li, Dustin; Tran-Johnson, Eli; Jackson Kernion; Kerr, Jamie; Mueller, Jared; Ladish, Jeffrey; Landau, Joshua; Ndousse, Kamal; Lovitt, Liane; Nelson Elhage; Schiefer, Nicholas; Nicholas, Joseph; Mercado, Noemí; DasSarma, Nova; Larson, Robin; McCandlish, Sam; Kundu, Sandipan; Johnston, Scott; Kravec, Shauna; Sheer El Showk; t, Stanislav; Telleen-Lawton, Timothy; Brown, Tom; Henighan, Tom; Hume, Tristan; Bai, Yuntao; Hatfield-Dodds, Zac; Mann, Ben; and Kaplan, Jared
Publisher
Ithaca: Cornell University Library, arXiv.org
Language
English
More information
Scope and Contents
Contents
Developing safe and useful general-purpose AI systems will require us to make progress on scalable oversight: the problem of supervising systems that potentially outperform us on most skills relevant to the task at hand. Empirical work on this problem is not straightforward, since we do not yet have systems that broadly exceed our abilities. This p...
Alternative Titles
Full title
Measuring Progress on Scalable Oversight for Large Language Models
Identifiers
Primary Identifiers
Record Identifier
TN_cdi_proquest_journals_2733847578
Permalink
https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_journals_2733847578
Other Identifiers
E-ISSN
2331-8422