Counting as a minimal probe of language model reliability
Posted 4 hours ago by
nateb2022
4
points
https://arxiv.org/abs/2605.02028
0
comments