Tide: Token-Informed Depth Execution for Per-Token Early Exit in LLM Inference
Posted 4 hours ago by OsamaJaber | 3 points
https://arxiv.org/abs/2603.21365
0 comments