Hypura – A storage-tier-aware LLM inference scheduler for Apple Silicon

  • Posted 5 hours ago by tatef
  • 166 points
https://github.com/t8/hypura

20 comments

    Loading..
    Loading..
    Loading..
    Loading..
    Loading..
    Loading..
    Loading..
    Loading..
    Loading..
    Loading..
    Loading..
    Loading..
    Loading..
    Loading..
    Loading..
    Loading..
    Loading..
    Loading..
    Loading..
    Loading..