Futurology

A small study found ChatGPT outdid human physicians when assessing medical case histories, even when those doctors were using a chatbot. (www.nytimes.com)

submitted 1 month ago by [email protected] to c/[email protected]

[–] [email protected] 3 points 1 month ago (1 children)

They studied 52 doctors' responses to standardized (read: publicly available online) cases written in front of them.
...
Then they ran 3 trials solely with the LLM and found that these were significantly better.

How do they know that the answers to the "standardized," publicly available case studies were not in the LLM's training data? Isn't it extremely likely that they were?

[–] [email protected] 3 points 1 month ago

It's very likely that they were in the training data. I forgot to include that as a point. Unfortunately, though, that's a very difficult variable to control for in LLM research.