this post was submitted on 19 Feb 2024
1170 points (94.1% liked)

Lemmy Shitpost

[–] [email protected] 176 points 9 months ago* (last edited 9 months ago) (3 children)

That was cringe but I think a better reason NOT to return to reddit is the fact that they just sold out their users to an AI company that hasn't even been named.

[–] [email protected] 56 points 9 months ago (3 children)

Could you imagine, this is what we are training AI with!

[–] [email protected] 23 points 9 months ago

I can. Remember Tay?

[–] [email protected] 12 points 9 months ago

Lol yeah, other bot-made data

[–] [email protected] 11 points 9 months ago (1 children)

Yeah, all these bot replies are copied from other comments, and there are shit tons of r/confidentlyincorrect comments that are outright factually wrong, which then get regurgitated by other users and copied by bots, so good luck to the AI company filtering those.

[–] [email protected] 2 points 9 months ago

r/confidentlyincorrect comment that is outright factually wrong

Sounds like it would fit right in with other AI models

[–] CodeInvasion 41 points 9 months ago (1 children)

AFAIK, there’s nothing stopping any company from scraping Lemmy either. The whole point of reddit limiting API usage was so they could make money like this.

Outside of morals, there is nothing to stop anybody from training on data from Lemmy just like there’s nothing stopping me from using Wikipedia. Most conferences nowadays require a paragraph on ethics in the submission, but I and many of my colleagues would have no qualms saying we scraped our data from open source internet forums and blogs.

[–] [email protected] 21 points 9 months ago

You're right, anyone can scrape Lemmy. But that's not the issue (to me anyway) - Reddit have sold user data - user generated content. None of what they're profiting from was generated or created by them. Are Reddit users who did generate all this content getting a slice of the profits?

When I post on here I know it's all open for anyone to access, but that's true of any non-walled-garden space. I've accepted the fact that it's going to get fed into the hungry maw of some AI behemoth or two.

What Reddit have done is make money for doing absolutely nothing based on content others have created like some sort of technological tapeworm feeding second hand. And along the way they killed off a lot of tools that users loved, moderators found made their jobs easier and people with a visual disability found vital. And all this so u/spez can live out his mini-Musk fantasies.

[–] [email protected] 2 points 9 months ago* (last edited 9 months ago)

Fuck Reddit, but why does this matter? Them selling internal analytics and profile information isn't going to be nearly as valuable as post/comment history, which has already been public and scraped continuously since the site's founding. Practically every LLM has already scraped the entire site! Whatever company is buying their info is probably the only one doing it legitimately. You can also assume Lemmy is no different; it's all public and scrapable for LLMs to freely feast on.