this post was submitted on 28 Jul 2023
25 points (96.3% liked)

LocalLLaMA

2296 readers
2 users here now

Community to discuss about LLaMA, the large language model created by Meta AI.

This is intended to be a replacement for r/LocalLLaMA on Reddit.

founded 2 years ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[โ€“] [email protected] 5 points 1 year ago (1 children)

I skimmed through the llama 2 research paper, there were some sections about them working to prevent users from circumventing the language model's programming. IIRC one of the examples of model hijacking was to disguise the request as a creative/fictional prompt. perhaps it's some part of that training gone wrong.

[โ€“] [email protected] 4 points 1 year ago

Just goes to show the importance of being able to produce uncensored models.