Qwen3.6-35B-A3B speculative decoding is net-negative on RTX 3090
Posted 3 hours ago by
thc1006
5
points
https://github.com/thc1006/qwen3.6-speculative-decoding-rtx3090
1
comments
Loading..