Qwen3-0.6B activation steering: style vectors, lens contamination eval, CPRR methodology
transformer style-transfer property-based-testing literate-programming superposition mechanistic-interpretability llm-evaluation small-language-models representation-engineering activation-steering qwen3 steering-vectors actadd cprr conceptual-lens-drift
Updated Mar 8, 2026 - Python