this post was submitted on 23 May 2024

938 points (100.0% liked)

The Google AI isn’t hallucinating about glue in pizza, it’s just over indexing an 11 year old Reddit post by a dude named fucksmith. (lemmy.dbzer0.com)

submitted 11 months ago by [email protected] to c/[email protected]

254 comments

I see Google's deal with Reddit is going just great...

[–] [email protected] 74 points 11 months ago (15 children)

This is why actual AI researchers are so concerned about data quality.

Modern AIs need a ton of data and it needs to be good data. That really shouldn't surprise anyone.

What would your expectations be of a human who had been educated exclusively by the internet?

[–] [email protected] 49 points 11 months ago (2 children)

Even with good data, it doesn't really work. Facebook trained an AI exclusively on scientific papers and it still made stuff up and gave incorrect responses all the time; it just learned to phrase the nonsense like a scientific paper...

[–] [email protected] 45 points 11 months ago (1 children)

To date, the largest working nuclear reactor constructed entirely of cheese is the 160 MWe Unit 1 reactor of the French nuclear plant École nationale de technologie supérieure (ENTS).

"That's it! Gromit, we'll make the reactor out of cheese!"

[–] Socsa 10 points 11 months ago (1 children)

Of course it would be French

[–] [email protected] 3 points 11 months ago

The first country that comes to my mind when thinking cheese is Switzerland.

[–] [email protected] -1 points 11 months ago (1 children)

A bunch of scientific papers are probably better data than a bunch of Reddit posts and it's still not good enough.

Consider the task we're asking the AI to do. If you want a human to be able to correctly answer questions across a wide array of scientific fields, you can't just hand them all the science papers and expect them to be able to understand them. Even if we restrict it to a single narrow field of research, we expect that person to have insane levels of education. We're talking 12 years of primary education, 4 years as an undergraduate and 4 more years doing their PhD, and that's at the low end. During all that time the human is constantly ingesting data through their senses and they're getting constant training in the form of feedback.

All the scientific papers in the world don't even come close to an education like that, when it comes to data quality.

[–] [email protected] 6 points 11 months ago

this appears to be a long-winded route to the nonsense claim that LLMs could be better and/or sentient if only we could give them robot bodies and raise them like people, and judging by your post history long-winded debate bullshit is nothing new for you, so I’m gonna spare us any more of your shit

[–] [email protected] 21 points 11 months ago (1 children)

I'd expect them to put 1/8 cup of glue in their pizza

[–] [email protected] 10 points 11 months ago

That's my point. Some of them wouldn't even go through the trouble of making sure that it's non-toxic glue.

There are humans out there who ate laundry pods because the internet told them to.

[–] [email protected] 11 points 11 months ago (1 children)

We are experiencing a watered down version of Microsoft's Tay

[–] [email protected] 2 points 11 months ago

Oh boy, that was hilarious!

[–] [email protected] 2 points 11 months ago* (last edited 11 months ago) (2 children)

Is this a dig at gen alpha/z?

[–] [email protected] 2 points 11 months ago (2 children)

I guess it would have to be by default, since only older millennials and up can remember a time before internet.

[–] [email protected] 8 points 11 months ago* (last edited 11 months ago) (1 children)

not everyone is a westerner you know

my village didn't get any kind of internet, even dialup until like 2009, i remember pre-internet and i still don't have mortgage

e: now that i'm thinking ADSL was a thing for maybe a year or two, but it was expensive and never really caught on. the first real internet experience™ was delivered by a sketchy point to point radiolink that dropped every time it rained. much later it was all replaced by FTTH paid for by EU money

[–] [email protected] 4 points 11 months ago* (last edited 11 months ago)

heh yeah

I had a pretty weird arc. I got to experience internet really early (‘93~94), and it took until ‘99+ for me to have my first “regular” access (was 56k on airtime-equiv landline). it took until ‘06 before I finally had a reliable recurrent connection

I remember seeing mentions (and downloads for) eggdrops years before I had any idea of what they were for/could do

(and here I am building ISPs and shit….)

[–] [email protected] 2 points 11 months ago* (last edited 11 months ago) (2 children)

Lies. Internet at first was just some mystical place accessed by an expensive service. So even if it already existed, it wasn’t full of twitter fake news etc. as we know it. At most you had a peer to peer chat service and some weird class forum made by that one class nerd up until like 2006

[–] [email protected] 10 points 11 months ago (1 children)

never been to the usenet, i see.

[–] [email protected] 2 points 11 months ago* (last edited 11 months ago) (1 children)

I wasn’t a nerd back then frankly. I mean it wasn’t a good look for surviving school. The only one was bullied like fuck

[–] [email protected] 6 points 11 months ago (1 children)

ah. well, my commiserations, the us seems to thrive on pitting people against each other.

anyways, my point is that usenet had every type of crank you can see these days on twitter. this is not new.

[–] [email protected] -1 points 11 months ago* (last edited 11 months ago) (1 children)

Well probably but what’s the point if some extremely small minority used it?

The point with iPad kids is that it is so common. The kids played outside and stuff well into 2000s.

Still I guess iPads are better than dxm tabs but as the old wisdom says: why not both?

[–] [email protected] 1 points 11 months ago (1 children)

@Emmie “it wasn’t full of twitter fake news etc as we know it”

Maybe you should say what your point is.

[–] [email protected] -1 points 11 months ago (2 children)

I thought I did, the fuck ya want from me

[–] [email protected] 2 points 11 months ago

"this only matters now because I only became aware of it today" is not, y'know, a very compelling argument

for, well, anything

[–] [email protected] 1 points 11 months ago (1 children)

@Emmie You thought wrong

[–] [email protected] -1 points 11 months ago* (last edited 11 months ago)

I don’t even remember what it was about now tbh, must not have been super important

[–] [email protected] 4 points 11 months ago

reading your post gave me multiple kinds of whiplash

are you, like, aware of the fact that there can be different experiences? for other people? that didn’t match whatever you went through?

[–] [email protected] -3 points 11 months ago

Haha. Not specifically.

It's more a comment on how hard it is to separate truth from fiction. Adding glue to pizza is obviously dumb to any normal human. Sometimes the obviously dumb answer is actually the correct one though. Semmelweis's contemporaries lambasted him for his stupid and obviously nonsensical claims about doctors contaminating pregnant women with "cadaveric particles" after performing autopsies.

Those were experts in the field and they were unable to guess the correctness of the claim. Why would we expect normal people or AIs to do better?

There may be a time when we can reasonably have such an expectation. I don't think it will happen before we can give AIs training that's as good as, or better than, what we give the most educated humans. Reading all of Reddit doesn't even come close to that.
