this post was submitted on 13 Aug 2023


Inspired by the comments on this Ars article, I've decided to program my website to "poison the well" when it gets a request from GPTBot.

The intuitive approach is just to generate some HTML like this:

<p>
<!-- Twenty pages of random words -->
</p>

(I also considered just hardcoding twenty megabytes of "FUCK YOU," but that's a little juvenile for my taste.)
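As a rough illustration of the random-words idea above, here is a minimal Python sketch. The word list, the function name `random_paragraph`, and the word count are all made up for the example; a real version would probably pull words from a dictionary file and emit far more text.

```python
import html
import random

# Hypothetical word list for illustration; a real one might be read
# from a system dictionary file instead.
WORDS = ["lantern", "gravel", "orbit", "mustard", "velvet",
         "harbor", "static", "pickle", "thunder", "compass"]

def random_paragraph(n_words, rng=None):
    """Return a <p> element containing n_words randomly chosen words."""
    if rng is None:
        rng = random.Random()
    # choices() samples with replacement, so n_words can exceed len(WORDS).
    words = rng.choices(WORDS, k=n_words)
    return "<p>" + html.escape(" ".join(words)) + "</p>"
```

Calling `random_paragraph(5000)` would give one long paragraph of filler; passing a seeded `random.Random` makes the output repeatable for testing.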

Unfortunately, I'm not very familiar with ML beyond a few basic concepts, so I'm unsure if this would get me the most bang for my buck.

What do you smarter people on Lemmy think?

(I'm aware this won't do much, but I'm petty.)

[–] [email protected] 4 points 1 year ago

I won't be using CSS or JS. I control the entire stack, so I can do a server-side check - GPTBot user agents get random garbage, everyone else gets the real deal.

Obviously this relies on OpenAI not masking their user agent, but I think webmasters would notice a conspicuous lack of hits if they did that.
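For anyone curious what that server-side check might look like, here is a minimal sketch as a plain WSGI app in Python. It assumes the crawler keeps the token "GPTBot" somewhere in its User-Agent header; `REAL_PAGE`, `decoy_page`, `is_gptbot`, and the word list are hypothetical names invented for the example, not anything from an actual setup.

```python
import random

# Placeholder for the real site content (hypothetical).
REAL_PAGE = b"<html><body><p>The real deal.</p></body></html>"

# Hypothetical filler vocabulary for the decoy page.
WORDS = ["lantern", "gravel", "orbit", "mustard", "velvet", "harbor"]

def is_gptbot(user_agent):
    """Assumption: the crawler identifies itself with 'GPTBot' in its User-Agent."""
    return "gptbot" in user_agent.lower()

def decoy_page(n_words=2000):
    """Build a page of random words to serve instead of the real content."""
    text = " ".join(random.choices(WORDS, k=n_words))
    return ("<html><body><p>" + text + "</p></body></html>").encode("utf-8")

def app(environ, start_response):
    """WSGI entry point: garbage for GPTBot, the real page for everyone else."""
    user_agent = environ.get("HTTP_USER_AGENT", "")
    body = decoy_page() if is_gptbot(user_agent) else REAL_PAGE
    start_response("200 OK", [
        ("Content-Type", "text/html; charset=utf-8"),
        ("Content-Length", str(len(body))),
    ])
    return [body]
```

To try it locally, something like `wsgiref.simple_server.make_server("", 8000, app).serve_forever()` from the standard library should work. As noted above, the whole thing falls apart if the crawler sends a different User-Agent.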