Tide: Token-Informed Depth Execution for Per-Token Early Exit in LLM Inference

  • Posted 4 hours ago by OsamaJaber
  • 3 points
https://arxiv.org/abs/2603.21365

0 comments