Lockpicking LLMs: A Logit-Based Jailbreak Using Token-level Manipulation

About this item

Full title

Lockpicking LLMs: A Logit-Based Jailbreak Using Token-level Manipulation

Publisher

Ithaca: Cornell University Library, arXiv.org

Journal title

arXiv.org, 2024-06

Language

English

More information

Scope and Contents

Contents

Large language models (LLMs) have transformed the field of natural language processing, but they remain susceptible to jailbreaking attacks that exploit their capabilities to generate unintended and potentially harmful content. Existing token-level jailbreaking techniques, while effective, face scalability and efficiency challenges, especially as m...

Identifiers

Primary Identifiers

Record Identifier

TN_cdi_proquest_journals_3059626576

Permalink

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_journals_3059626576

Other Identifiers

E-ISSN

2331-8422
