I found this to be an interesting finding. Here are the detailed results: https://www.fertrevino.com/docs/gpt5_medhelm.pdf
From GPT-4 to GPT-5: Measuring progress through MedHELM [pdf]
- Posted 1 day ago by fertrevino
- 125 points
16 comments
Loading..
Loading..
Loading..
Loading..
Loading..
Loading..
Loading..
Loading..
Loading..
Loading..
Loading..
Loading..
Loading..
Loading..
Loading..
Loading..