Qwen3.6-35B-A3B speculative decoding is net-negative on RTX 3090

  • Posted 3 hours ago by thc1006
  • 5 points
https://github.com/thc1006/qwen3.6-speculative-decoding-rtx3090

1 comments

    Loading..