Position: Coding Benchmarks Are Misaligned with Agentic Software Engineering
Posted 3 hours ago by wek | 2 points
https://arxiv.org/abs/2606.17799
0 comments