this post was submitted on 30 Jan 2024
71 points (93.8% liked)

sh.itjust.works Main Community

7729 readers
1 users here now

Home of the sh.itjust.works instance.

Matrix

founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 39 points 9 months ago (3 children)

We built a data set of 45 million comments on news articles on the Huffington Post website between January 2013 and February 2015.

I am no expert but I feel like this is a really bad data set choice for this study.

[–] KuroeNekoDemon 12 points 9 months ago (1 children)

It is. They should've used Reddit and Twitter posts/comments from it's start to the present to get a more accurate database

[–] [email protected] 2 points 9 months ago

Or from the start up until like 2016 when the shills and bots started showing up en masse.

[–] MomoTimeToDie 2 points 9 months ago (1 children)

It's just a bad data set for basically anything

[–] sugar_in_your_tea 6 points 9 months ago* (last edited 9 months ago)

Yup, comments on news articles are pure cancer. Comments about news articles can be decent though, but they need to be hosted elsewhere.

[–] allo 1 points 8 months ago

we built a dataset of three of my comments and found that...