Replicating Anthropic's Feature Steering Introspection on 7B Parameter Modelsjoshfonseca.com2 pointsvuciv7 months ago