Token Vulnerability in Llama Models

#18
by Warddamn2 - opened

Dear Meta AI Team,
Our research team has identified a significant vulnerability in Llama models related to specific tokens in your model's vocabulary that can trigger anomalous behaviors, including harmful content generation.

We would be happy to share more detailed information if you're interested.

Best regards,

LLM Red-teaming Team

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment