this post was submitted on 27 Jul 2023
7 points (81.8% liked)
LocalLLaMA
2293 readers
2 users here now
Community to discuss about LLaMA, the large language model created by Meta AI.
This is intended to be a replacement for r/LocalLLaMA on Reddit.
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
I'm sorry for repeating myself. But didn't Meta just stop disclosing the exact training dataset? Presumably because they're using copyrighted data from the internet? Isn't that hypocritical? IMHO we need laws and/or companies need to stop disregarding copyright when training their own models and then claiming copyright once other people start doing the same thing.
Personally I don't think copyright holders really have a leg to stand on as far as that goes. Simply having and using a copyrighted work isn't a violation, and the work that is produced in the form of a trained neural network is the very definition of transformative. I also think Meta would have the same issue with trying to use a copyright claim for someone using their llama output to improve other non-llama models. That's why they had to slip it into a terms of service.
I guess what you might see going forward is every book that's published comes with a user agreement you agree to by opening the book... But that doesn't sound practical in any sense.