this post was submitted on 24 Jul 2024
22 points (95.8% liked)

Hacker News

A mirror of Hacker News' best submissions.

[–] [email protected] 4 points 3 months ago (1 children)

We find that preservation of the original data allows for better model fine-tuning and leads to only minor degradation of performance

That means, as long as generated content isn't like 90% of the Internet, they'll be fine. Even then, you can find relatively easy ways to sift data for generated content. Doesn't even have to be perfect.
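A rough back-of-the-envelope sketch of why the filter doesn't have to be perfect. Everything here is an assumption for illustration: the function name, the 50% starting share, and the detector's hit and false-alarm rates are made up, not taken from the paper.

```python
def synthetic_share_after_filter(synthetic_share, recall, false_positive_rate):
    """Fraction of the kept data that is still generated after filtering.

    synthetic_share: fraction of the raw data that is generated (0..1)
    recall: fraction of generated items the detector catches and drops
    false_positive_rate: fraction of human items it drops by mistake
    All three are hypothetical inputs, not measured values.
    """
    kept_synthetic = synthetic_share * (1.0 - recall)
    kept_human = (1.0 - synthetic_share) * (1.0 - false_positive_rate)
    total_kept = kept_synthetic + kept_human
    if total_kept == 0:
        raise ValueError("the filter removed everything")
    return kept_synthetic / total_kept


# Assumed numbers: half the crawl is generated, the detector catches 80%
# of it and wrongly throws away 5% of the human-written text.
share = synthetic_share_after_filter(0.5, recall=0.8, false_positive_rate=0.05)
print(f"{share:.3f}")
```

With those made-up numbers, a detector that misses one in five generated texts still cuts the generated share from 50% to roughly 17%, which is the sense in which an imperfect filter can be good enough.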

What really bothers me here is that we might create a world where the typical AI style of writing takes over, because the AI learns from its own output and the companies simply don't care. That's not really a collapse as such, but a narrowing.

[–] [email protected] 3 points 3 months ago

That means, as long as generated content isn't like 90% of the Internet, they'll be fine

this sounds so much like the 2° Celsius target for climate change