Benchmarking Vulnerability of Agent-Generated Code in Real-World Tasks

Posted 2 months ago by doppp
1 point

https://arxiv.org/abs/2512.03262

0 comments