Geometric Interpretability for Model Reliability
I develop geometric methods for measuring what happens inside AI models - how representations form, move, and can be measured - and apply them to interpretability, safety, and the reliability of the latent models the field is converging on.
The methods are peer-reviewed, open-source, and have been independently cited and extended by another research group. Underneath them sits FRESH, the geometric framework for self-models that generates the predictions these instruments test.
Subscribe to UpdatesPick your interest…
I want the research
Geometric interpretability for safety and model reliability: how representations form and move inside AI models, whether their uncertainty is calibrated, and the latent models they build.
Explore Research →
I want tools
Open-source geometric methods for measuring latent structure in AI systems. Curved Inference for measurement, Align-IT for measurement rigour, PRISM for experiments.
Explore Tools →
I want the theory
FRESH: a geometric framework treating consciousness as traversal through role-space. The theoretical engine that generates these measurement methods.
Explore Theory →Latest Research Updates
Loading latest updates from Substack...
Follow this research as it develops. Regular updates on methods, experiments, and findings.
Subscribe to Updates