ZAYA1-8B matches DeepSeek-R1 on math with less than 1B active parameters

  • Posted 1 day ago by steveharing1
  • 103 points
https://firethering.com/zaya1-8b-open-source-math-coding-model/

9 comments
