We found a mixed-precision configuration that maintains virtually the same accuracy as the unquantized model while running efficiently on the Nano Super and other edge devices :)
Show HN: Cosmos-Reason2-2B on Nano Super
- Posted 8 hours ago by vottivott
- 1 point