-
tinkering with LLMs: steering Gemma 3 with SAEs
A 2am side quest — using sparse autoencoders to bend a language model's emotional weather. Four steering experiments, qualitative only.
A 2am side quest — using sparse autoencoders to bend a language model's emotional weather. Four steering experiments, qualitative only.