Bluffbench is near saturation: LLMs can interpret counterintuitive plots
Posted 5 hours ago by ionychal
2 points
https://opensource.posit.co/blog/2026-06-19_ai-newsletter/
0 comments