New AI Jailbreak Method 'Bad Likert Judge' Boosts Attack Success Rates by Over 60%
Friday, 3 January 2025
Cybersecurity researchers have shed light on a new jailbreak technique that could be used to get past a large language model's (LLM) safety guardrails and produce potentially harmful or malicious responses. The multi-turn (aka many-shot) attack strategy has been codenamed Bad Likert Judge by Palo Alto Networks Unit 42 researchers Yongzhe Huang, Yang Ji, Wenjun Hu, Jay Chen, Akshata Rao, and
from The Hacker News https://ift.tt/fmAEuHs