Research

The science behind the control layer

Envariant is built on mechanistic interpretability research. We publish the methods that power the SDK.

Interpretability · Feb 2026

Feature dictionaries for frontier models at production scale

A method for extracting human-interpretable features from large models with low enough overhead to run on live traffic.

Steering · Jan 2026

We show that bounded activation edits can reliably shift named behaviors while preserving general capability.

Safety · Dec 2025

A framework for declaring and enforcing behavioral guarantees, such as PII non-disclosure, at inference time.