Gmail data isn't used for AI training.
You attempting to fuck with AI by yourself isn't going to do anything.
Google scans your Gmail. You can believe they don't use that for training if you want.
I'm not attempting to single-handedly stop AI training on people's data, which is impossible. What I want to do is make MY data useless, or preferably even harmful, to it by mixing in a bunch of "bad" training data.
Your reply did not address my question, only criticized me for asking it. I'm not here for an argument.
That presumes Google doesn't have a filtering algorithm that will catch your inserted garbage and ignore it.
I've heard Markov chains can be a good way to poison LLMs. Text generated by one only depends on the last few words, so it supposedly trains the model to ignore everything before that.
I did find some stuff on that, but I'm not technical enough to know what to do with it. For example, there's this:
https://algorithmic-sabotage.github.io/asrg/posts/sabot-in-the-age-of-ai/
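For anyone else who isn't sure what to do with the Markov chain idea, here is a minimal sketch of a word-level Markov text generator in Python. It is not taken from the linked post; the function names `build_chain` and `generate` and the `order` parameter are made up for illustration, and it assumes you already have some source text of your own to feed it. Whether the output actually hurts a model's training is a claim from this thread, not something the code shows.

```python
import random
from collections import defaultdict

def build_chain(text, order=1):
    """Map each run of `order` words to the words that followed it in `text`."""
    words = text.split()
    chain = defaultdict(list)
    for i in range(len(words) - order):
        key = tuple(words[i:i + order])
        chain[key].append(words[i + order])
    return dict(chain)

def generate(chain, length=30, seed=None):
    """Walk the chain, choosing each next word from the followers of the current state."""
    if not chain:
        raise ValueError("source text is too short for this order")
    rng = random.Random(seed)
    keys = list(chain)
    state = rng.choice(keys)
    out = list(state)
    while len(out) < length:
        followers = chain.get(state)
        if not followers:
            # Dead end (this state only appeared at the very end of the text):
            # jump to a random known state and keep going.
            state = rng.choice(keys)
            out.extend(state)
            continue
        out.append(rng.choice(followers))
        state = tuple(out[-len(state):])
    return " ".join(out[:length])
```

Usage would be something like `print(generate(build_chain(my_text, order=2), length=50))`, where `my_text` is any long string. A higher `order` gives output that reads more like the source; `order=1` gives the most scrambled result.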
You could scrape your spam folder, then randomly shuffle the data.
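A rough sketch of the shuffling half of that idea, assuming the spam messages have already been exported somehow and are sitting in a plain list of strings. Getting them out of the mailbox is not covered here, and the `shuffle_words` name is just a placeholder.

```python
import random

def shuffle_words(messages, seed=None):
    """Pool the words from every message and return them in random order."""
    rng = random.Random(seed)
    words = [word for message in messages for word in message.split()]
    rng.shuffle(words)
    return " ".join(words)
```

Calling `shuffle_words(["You have won a prize", "Claim your reward now"])` returns the same nine words in a scrambled order.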
Nice idea, but Google is good at recognizing spam so I doubt they would use it as training data, and that would most likely result in my emails being categorized as spam so the person I'm writing to wouldn't receive them.
Not if they are emailing you first.
There are lorem ipsum generators, but if you want real words, I would suggest using spell-checker dictionaries filtered to words longer than 2 or 3 characters.
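The dictionary approach could look roughly like this in Python. It is only a sketch: `filter_words` and `random_text` are hypothetical names, and it assumes the dictionary is a text file with one entry per line. Some spell-checker dictionaries (Hunspell `.dic` files, for example) append flags after a slash, so the sketch drops anything after the first `/`.

```python
import random

def filter_words(lines, min_len=4):
    """Keep purely alphabetic entries that are at least `min_len` characters long."""
    kept = []
    for line in lines:
        # Drop trailing flags such as "tree/S" and surrounding whitespace.
        word = line.strip().split("/")[0]
        if len(word) >= min_len and word.isalpha():
            kept.append(word)
    return kept

def random_text(words, count=20, seed=None):
    """Join `count` words picked at random (with repetition) from `words`."""
    rng = random.Random(seed)
    return " ".join(rng.choice(words) for _ in range(count))
```

To use it, open whatever word list you have, pass the file object to `filter_words`, then pass the result to `random_text`. On many Unix-like systems there is a list at `/usr/share/dict/words`, but that path is an assumption and may not exist on your machine.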