ChatGPT

9144 readers

11 users here now

Unofficial ChatGPT community to discuss anything ChatGPT

founded 2 years ago

MODERATORS

2 authors say OpenAI 'ingested' their books to train ChatGPT. Now they're suing, and a 'wave' of similar court cases may follow. (www.businessinsider.com)

submitted 2 years ago by [email protected] to c/[email protected]

24 comments fedilink hide all child comments

cross-posted from: https://lemmy.world/post/1246165

Two authors sued OpenAI, accusing the company of violating copyright law. They say OpenAI used their work to train ChatGPT without their consent.

you are viewing a single comment's thread
view the rest of the comments

[–] [email protected] 4 points 2 years ago* (last edited 2 years ago) (1 children)

Soo, if I read a book without asking the author first, he can sue me for reading the book?

[–] [email protected] 2 points 2 years ago (2 children)

Yes, apparently we do. It's like there's a correct way of reading a book, and if you read that book to improve your English you are doing it wrong

This is going to be interesting. We'll end up having to sign an EULA before reading soon...

[–] [email protected] 1 points 2 years ago

To be fair, GPT is not a person. It's like a fuzzy database with lossy-compression. If they over-trained GPT on specific books, it could cite the books verbatim, which would then violate copyright and IP laws. (Not that I'm a fan of IP laws).

[–] [email protected] 1 points 2 years ago* (last edited 2 years ago)

While I appreciate thinking of this in absurdity, you're being disingenuous here. It's like reading a book for a person with eidetic memory then asking for "writing in the style of so and so." And so you use exactly the sentence structure, the verbiage and even the paragraph style. When inspected, you perfectly reproduced the writing style, but effectively only changed a couple words to match the request.

You reproduced 95% of an essay, and 5% of it is yours. You didn't improve on the work, you simply changed the least amount of it you could to suit your purpose.

The way these systems retain the relative symbols is irrelevant if the structure and form of the original is what gives it it's value. The parameters are simply those things that are elements of someone elses copyrighted material. The lawsuit alleges that the books were used, well it's not too hard to get GPT to spit out gutenberg books, or to lie to it and get it to think other books it knows are now public domain and have it do the same. Paragraph and page you can get it to barf them back out verbatim.