CorrSteer: Steering Improves Task Performance and Safety in LLMs through Correlation-based Sparse Autoencoder Feature Selection
Paper • 2508.12535 • Published