A Long-Tail Professional Forum-Based Benchmark for LLM Evaluation
Posted 7 hours ago by wslh
1 point
https://arxiv.org/abs/2511.06346
0 comments