Technology

OpenAI confirms that AI writing detectors don’t work (arstechnica.com)

submitted 1 year ago by [email protected] to c/[email protected]


[–] [email protected] 27 points 1 year ago (2 children)

OpenAI discontinued its AI Classifier, which was an experimental tool designed to detect AI-written text. It had an abysmal 26 percent accuracy rate.

If you ask this thing whether or not some given text is AI generated, and it is only right 26% of the time, then I can think of a real quick way to make it 74% accurate.

[–] [email protected] 14 points 1 year ago (2 children)

I feel like this must stem from a misunderstanding of what 26% accuracy means, but for the life of me, I can't figure out what it would be.

[–] [email protected] 10 points 1 year ago* (last edited 1 year ago)

Looks like they got that number from this quote from another Ars Technica article: "…OpenAI admitted that its AI Classifier was not 'fully reliable,' correctly identifying only 26 percent of AI-written text as 'likely AI-written' and incorrectly labeling human-written works 9 percent of the time."

Seems like it mostly wasn’t confident enough to make a judgement: it correctly flagged 26% of AI-written text as AI, and it incorrectly flagged 9% of human-written text as AI. The quote doesn’t tell us how often it labeled AI text as human-written or how often it was just unsure.

EDIT: this article https://arstechnica.com/information-technology/2023/07/openai-discontinues-its-ai-writing-detector-due-to-low-rate-of-accuracy/
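To make the "just flip it and get 74%" idea concrete, here is a rough sketch in Python using only the two rates from the quote above. It rests on assumptions that are mine, not OpenAI's: every label other than "likely AI-written" is collapsed into "not flagged", and the share of AI-written text in the input (`ai_share`) is a made-up parameter, since the quote gives no such figure. The function names are hypothetical too.

```python
# Rates taken from the quoted Ars Technica passage.
TPR = 0.26  # share of AI-written text flagged as "likely AI-written"
FPR = 0.09  # share of human-written text flagged as "likely AI-written"


def accuracy(ai_share, tpr=TPR, fpr=FPR):
    """Accuracy when 'flagged' is read as AI and 'not flagged' as human.

    ai_share is an assumed fraction of AI-written text in the input.
    """
    return ai_share * tpr + (1 - ai_share) * (1 - fpr)


def inverted_accuracy(ai_share, tpr=TPR, fpr=FPR):
    """Accuracy if every verdict is flipped: 'flagged' now means human."""
    return ai_share * (1 - tpr) + (1 - ai_share) * fpr


def break_even_share(tpr=TPR, fpr=FPR):
    """AI share at which flipping the verdicts neither helps nor hurts."""
    return (1 - 2 * fpr) / ((1 - 2 * fpr) + (1 - 2 * tpr))


if __name__ == "__main__":
    for share in (0.1, 0.5, 0.9):
        print(
            f"ai_share={share:.1f}  "
            f"as-is={accuracy(share):.3f}  "
            f"flipped={inverted_accuracy(share):.3f}"
        )
    print(f"break-even ai_share={break_even_share():.3f}")
```

Under these assumptions, a 50/50 mix of AI and human text gives about 58.5% accuracy as-is and about 41.5% flipped, so inverting the output makes things worse. Flipping only wins once roughly 63% or more of the input is AI-written, and even then it is the base rate doing the work rather than the detector. The 26% figure is a detection rate for AI text, not an overall accuracy, which is why the flip-it trick doesn't apply.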

[–] [email protected] 4 points 1 year ago (1 children)

It seemed like a really weird decision for OpenAI to have an AI classifier in the first place. Their whole business is to generate output that's good enough that it can't be distinguished from what a human might produce, and then they went and made a tool to try and point out where they failed.

[–] [email protected] 2 points 1 year ago

That may have been the goal: "Look how good our AI is, even we can't tell if its output is human generated or not."