FASTEST LLM decode engine on Apple Silicon. 658 tok/s on M4-Max,beats MLX by 19%

  • Posted 4 hours ago by sanchitmonga
  • 3 points
https://www.runanywhere.ai/blog/metalrt-fastest-llm-decode-engine-apple-silicon

1 comments

    Loading..