Analyst memo
Nous Research Unveils New AI Steering Method
Nous Research introduces Contrastive Neuron Attribution (CNA), a method that enhances AI model steering without complex training, reducing refusal rates significantly while maintaining output quality.
Published May 24, 2026, 12:06 AMUpdated May 24, 2026, 12:06 AM
What happened
Nous Research developed Contrastive Neuron Attribution (CNA) to identify specific neurons in MLP layers that distinguish harmful from benign prompts without requiring SAE training or weight modification.
Why it matters
CNA offers a more precise approach to AI model steering, reducing refusal rates by over 50% in most models tested while maintaining high output quality, improving AI responsiveness.
Who is affected
This development could impact AI developers and tech companies looking to enhance AI model performance and reliability by providing a more effective and efficient steering technique.
Risks / uncertainty
The effectiveness of CNA may vary across different model architectures and sizes, and further testing is required to validate its application beyond the specific models tested.