A curated, non-BS library of the best resources for evaluating agents
Posted 2 hours ago by xdotli | 1 point
https://github.com/benchflow-ai/awesome-evals
0 comments