Analyst memo

Research1 sourceDeveloping

Nous Research Unveils New AI Steering Method

Nous Research introduces Contrastive Neuron Attribution (CNA), a method that enhances AI model steering without complex training, reducing refusal rates significantly while maintaining output quality.

Published May 24, 2026, 12:06 AMUpdated May 24, 2026, 12:06 AM

What happened

Nous Research developed Contrastive Neuron Attribution (CNA) to identify specific neurons in MLP layers that distinguish harmful from benign prompts without requiring SAE training or weight modification.

Why it matters

CNA offers a more precise approach to AI model steering, reducing refusal rates by over 50% in most models tested while maintaining high output quality, improving AI responsiveness.

Who is affected

This development could impact AI developers and tech companies looking to enhance AI model performance and reliability by providing a more effective and efficient steering technique.

Risks / uncertainty

The effectiveness of CNA may vary across different model architectures and sizes, and further testing is required to validate its application beyond the specific models tested.