Steering interpretable language models with concept algebra

  • Posted 23 hours ago by luulinh90s
  • 35 points
https://www.guidelabs.ai/post/steerling-steering-8b/

1 comments

    Loading..