Sors: a Rust proxy that reorders prompts to maximize vLLM prefix cache hits

  • Posted 1 hour ago by flaccount
  • 2 points
https://github.com/flouthoc/sors
