this post was submitted on 22 Aug 2023
769 points (95.7% liked)

Technology

59689 readers
4100 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling's Harry Potter series::A new research paper laid out ways in which AI developers should try and avoid showing LLMs have been trained on copyrighted material.

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 21 points 1 year ago (5 children)

what if they scraped a whole lot of the internet, and those excerpts were in random blogs and posts and quotes and memes etc etc all over the place? They didnt injest the material directly, or knowingly.

[–] beetus 3 points 1 year ago (1 children)

Not knowing something is a crime doesn't stop you from being prosecuted for committing it.

It doesn't matter if someone else is sharing copyright works and you don't know it and use it in ways that infringes on that copyright.

"I didn't know that was copyrighted" is not a valid defence.

[–] [email protected] 1 points 1 year ago

Is reading a passage from a book actually a crime though?

Sure, you could try to regenerate the full text from quotes you read online, much like you could open a lot of video reviews and recreate larger portions of the original text, but you would not blame the video editing program for that, you would blame the one who did it and decided to post it online.

load more comments (3 replies)