Benchmarking Vulnerability of Agent-Generated Code in Real-World Tasks
Posted 8 hours ago by
doppp
1
points
https://arxiv.org/abs/2512.03262
0
comments