Benchmarking Vulnerability of Agent-Generated Code in Real-World Tasks
Posted 2 months ago by
doppp
1
points
https://arxiv.org/abs/2512.03262
0
comments