this post was submitted on 26 Jul 2023
859 points (96.5% liked)
Technology
59689 readers
4100 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Nobody has provided any evidence that this is the case. Until this is proven it should not be assumed. Bandwagoning (and repeating this over and over again without any evidence or proof) against the ML people without evidence is not fair. The whole point of the Justice system is innocent until proven guilty.
Derivative works are 100% protected under copyright law. https://www.legalzoom.com/articles/what-are-derivative-works-under-copyright-law
This is the same premise that allows "fair use" that we all got up and arms about on youtube. Claiming that this doesn't exist now in this case means that all that stuff we fought for on Youtube needs to be rolled back.
Why not? Why can't someone grab a book, scan it... chuck it into an OCR and get the same content? There are plenty of ways that snippets of raw content could make it into these repositories WITHOUT asserting legal problems.
No... You could have for all intents and purposes have recorded all your songs from the radio onto a cassette... That would be 100% legal for personal consumption... which would be what the ML authors are doing. ChatGPT and others could have sources information from published sources that are completely legit. No "Author" has provided any evidence otherwise yet to believe that ChatGPT and others have actually broken a law yet. For all we know the authors of these tools have library cards, and fed in screenshots of the digital scans of the book or hand scanned the book. Or didn't even use the book at all and contextually grabbed a bunch of content from the internet at large.
Since the ML bots are all making derivative works, rather than spitting out original content... they'd be covered by copyright as a derivative work.
This only becomes an actual problem if you can prove that these tools have done BOTH