Position: Coding Benchmarks Are Misaligned with Agentic Software Engineering

  • Posted 3 hours ago by wek
  • 2 points
https://arxiv.org/abs/2606.17799