Steering interpretable language models with concept algebra
Posted 23 hours ago by
luulinh90s
35
points
https://www.guidelabs.ai/post/steerling-steering-8b/
1
comments
Loading..