FASTEST LLM decode engine on Apple Silicon. 658 tok/s on M4-Max,beats MLX by 19%
Posted 4 hours ago by
sanchitmonga
3
points
https://www.runanywhere.ai/blog/metalrt-fastest-llm-decode-engine-apple-silicon
1
comments
Loading..