Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
LLMs exceed physicians on complex text-based differential diagnosis (arxiv.org)
3 points by rippeltippel 11 days ago | hide | past | favorite | 2 comments
 help



This was using o3. GPT 5.2/5.3 should be much improved.

Just like software engineering, it may be best to leave it up to the AI to do the work but let a human guide it and check it.


I wonder if we’ll have to develop strategies for battling confirmation bias. Human review only works if the review is independent.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: