Technology

59598 readers

3432 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each another!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed

Approved Bots

founded 1 year ago

MODERATORS

[email protected]

825

The Internet Archive is under attack, with a popup claiming a ‘catastrophic’ breach (www.theverge.com)

submitted 1 month ago by [email protected] to c/[email protected]

84 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] [email protected] 30 points 1 month ago* (last edited 1 month ago) (1 children)

Wah wah wah, my stuff's been preserved and I dont like it.

Not to mention that I lose control over what's done with that content -- are they going to let Google train AI on it with their new partnership?

Lmao you think Google needs to go through Archive to scrape your site? Delusional.

Not to mention that I'm losing advertising revenue if someone views the site in an archive.

The mechanisms used to serve ads over the internet nowadays are nasty in a privacy sense, and a psychological manipulation sense. And you want people to be affected by them just to line your pockets? Are you also opposed to ad blockers by any chance?

I have fewer problems with archiving if the original site is gone, but to mirror and republish active content with no supported way to prevent it short of legal action is ridiculous.

And how do you suggest a site which has been wiped off the face of the internet gets archived? Maybe we need to invest in a time machine for the Internet Archive?

Sites like Wikipedia were archiving urls and then linking to the archive, effectively removing branding and blocking user engagement.

What do you mean by "engagement", exactly? Clicking on ads?

[–] [email protected] -3 points 1 month ago* (last edited 1 month ago) (1 children)

What do you mean by “engagement”, exactly? Clicking on ads?

In SEO terms user engagement refers to how people interact with the website. Do they click on another link? Does a new blog posting interest them?

Lmao you think Google needs to go through Archive to scrape your site? Delusional.

Any activiity from Google is easier to track and I have a record if who downloaded content if it's coming from my servers.

The mechanisms used to serve ads over the internet nowadays are nasty in a privacy sense, and a psychological manipulation sense. And you want people to be affected by them just to line your pockets? Are you also opposed to ad blockers by any chance?

I agree that many sites use advertising in a different way. I use it in the older internet sense -- someone contacts me to sponsor a page or portion of the site, and that page gets a single banner, created in-house, with no tracking. I've been using the internet for 36 years. I'm well aware of many uses that I view as unethical, and I take great pains not to replicate them on my own site.

I disapprove of ad blockers. I approve of things that block tracking.

As far as "lining my own pockets" goes, I want to recoup my hosting costs. I spend hours researching for each article/showcase, make the content free to view, and then I'm expected to pay to share it with anyone who's interested? I have a day job. This is my hobby, but it's also my blood, sweat, and tears.

And how do you suggest a site which has been wiped off the face of the internet gets archived? Maybe we need to invest in a time machine for the Internet Archive?

archive.org could archive the content and only publish it if the page has been dark for a certain amount of time.

[–] [email protected] 6 points 1 month ago* (last edited 1 month ago) (1 children)

archive.org could archive the content and only publish it if the page has been dark for a certain amount of time.

It's user-driven. Nothing would get archived in this case. And what if the content changes but the page remains up? What then? Fairly sure this is why Wikipedia uses archives.

I agree that many sites use advertising in a different way. I use it in the older internet sense -- someone contacts me to sponsor a page or portion of the site, and that page gets a single banner, created in-house, with no tracking. I've been using the internet for 36 years. I'm well aware of many uses that I view as unethical, and I take great pains not to replicate them on my own site.

Pretty sure mainstream ad blockers won't block a custom in-house banner. And if it has no tracking, then it doesn't matter whether it's on Archive or not, you're getting paid the same, no?

[–] [email protected] 0 points 1 month ago

It’s user-driven. Nothing would get archived in this case. And what if the content changes but the page remains up? What then? Fairly sure this is why Wikipedia uses archives.

That's a good point.

Pretty sure mainstream ad blockers won’t block a custom in-house banner. And if it has no tracking, then it doesn’t matter whether it’s on Archive or not, you’re getting paid the same, no?

Some of them do block those kinds of ads -- I've tried it out with a few. If it's at archive.org I lose the ability to report back to the sponsor that their ad was viewed 'n' times (unless, ironically, if I put a tracker in). It also means that if sponsorship changes, the main drivers of traffic like Wikipedia may not see that. It makes getting new sponsors more difficult because they want something timely for seasonal ads. Imagine sponsoring a page, but Wikipedia only links to the archived one. Your ad for gardening tools isn't reflected by one of the larger drivers of traffic until December, and nobody wants to buy gardening tools in December.

Yes, I could submit pages to archive.org as sponsorship changes if this model continues.

It was a much bigger deal when we used Google ads a decade ago, but we stopped in early 2018 because tracking was getting out of hand.

If I was submitting pages myself I'd be all for it because I could control when it happened. But there have times when I've edited a page and totally screwed it up, and archive.org just happened to grab it at that moment when the formatting was all weird or the wrong picture was loaded. I usually fix the page and forget about it until I see it on archive.org later.

I asked for pages like that to be removed, but archive.org was unresponsive until I used a DMCA takedown notice.