Technology

59038 readers

3047 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each another!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed

Approved Bots

founded 1 year ago

MODERATORS

[email protected]

249

AI models collapse when trained on recursively generated data (www.nature.com)

submitted 3 months ago by [email protected] to c/[email protected]

34 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] [email protected] 14 points 3 months ago (2 children)

Eventually an AI will be developed that can learn with much less data. In the end we don't need to read the entire internet to get through our education. But, that's not going to be LLM. No matter how much you tweak LLM models, it won't get there. It's like trying to tune a coal fired steam powered car until you can compete in a formula 1 race.

[–] conciselyverbose 17 points 3 months ago (1 children)

Yeah, it's entirely plausible that LLMs are a small part of the answer as basically the language center of the brain, but the brain is a hell of a lot more complex than that. The language center isn't your whole brain, and is only loosely connected to actual decision making. It confabulates a lot.

[–] [email protected] 19 points 3 months ago

OpenAI stumbled on something that worked and ran with it, and people started proclaiming it to be the answer to everything. The same happened with Deep Learning and every AI invention so far. It's all just another stepping stone on the way.

[–] [email protected] 15 points 3 months ago

It's already happening. A quote from Andrej Karpathy :

Turns out that LLMs learn a lot better and faster from educational content as well. This is partly because the average Common Crawl article (internet pages) is not of very high value and distracts the training, packing in too much irrelevant information. The average webpage on the internet is so random and terrible it's not even clear how prior LLMs learn anything at all.