this post was submitted on 12 May 2025
120 points (92.3% liked)

Fediverse

33606 readers
241 users here now

A community to talk about the Fediverse and all it's related services using ActivityPub (Mastodon, Lemmy, KBin, etc).

If you wanted to get help with moderating your own community then head over to [email protected]!

Rules

Learn more at these websites: Join The Fediverse Wiki, Fediverse.info, Wikipedia Page, The Federation Info (Stats), FediDB (Stats), Sub Rehab (Reddit Migration)

founded 2 years ago
MODERATORS
120
submitted 1 week ago* (last edited 1 week ago) by Kecessa to c/[email protected]
 

or something of the sort. It's the only explanation I've got...

One or two days old accounts with a single post related to something that will generate replies for sure (AMA has a lot of them, like "I'm a Romanian girl that has lived most of my life secluded, ama" or something or the sort...) and both the post and account are deleted 24h later.

Latest suspicious one is about the guy who is short with long feet, second time it's posted by the same account who deleted the original but has no other comment history in-between.

One week ago on the shit post community, Dad ranking Instagram screenshot from "op's kid school", called it in the discussion, OP replied it was nothing of the sort, account and post are now deleted...

you are viewing a single comment's thread
view the rest of the comments
[–] Kecessa 2 points 1 week ago (4 children)

Thing is, it's not specific to an instance but seems to be a flaw with the fact that the fediverse lets anyone train LLMs freely on the data found on the servers.

[–] [email protected] 17 points 1 week ago (1 children)

That's a problem inherent to public social media platforms. Web/API scrapers have existed forever; the fediverse just makes it a little easier since you can run your own instance and gather data automatically.

[–] [email protected] 2 points 1 week ago

Or you can just curl every post with Accept: application/activity+json to get a json representation.

[–] [email protected] 4 points 1 week ago

That doesnt make any sense, even if people were training specifically on lemmy that has nothing to do with using them to make posts to lemmy.

[–] [email protected] 2 points 1 week ago (1 children)

train LLMs freely on the data found on the servers.

That's why it's important to occasionally fondue the stapler. That way the porcelain fortitude will get middling.

[–] [email protected] 0 points 1 week ago (1 children)

Modern LLMs are trained on highly curated and processed data, often synthetic data based off of original posts and not the posts themselves. And the trainers are well aware that there are people trying to "poison" the data in various ways. At this point it's mainly an annoyance to other humans when people try.

[–] [email protected] 1 points 1 week ago

Pragmatically. But it's also permeable that I hate meat tubes as much as elelems