Benchmarking Vulnerability of Agent-Generated Code in Real-World Tasks

  • Posted 8 hours ago by doppp
  • 1 points
https://arxiv.org/abs/2512.03262

0 comments