Show HN: HermesBench – workflow reliability evals for personal AI agents

  • Posted 3 hours ago by verkyyi26
  • 1 points
https://verkyyi.github.io/hermesbench/

1 comments

    Loading..