April 7, 2026
EvaluationAccuracy Is the Floor, Not the Ceiling
Why 93% accuracy still leaves the failure boundary unknown, and what calibration, constraint adherence, and consequential error rate actually measure.
Essays on AI evaluation, constraint design, failure modes, and welfare measurement. Published here first, crossposted to LinkedIn.
April 7, 2026
EvaluationWhy 93% accuracy still leaves the failure boundary unknown, and what calibration, constraint adherence, and consequential error rate actually measure.
March 31, 2026
AI WelfareCurrent AI welfare frameworks are built to detect access consciousness, not valenced experience. That is the wrong target. Here is what a better framework would require.
March 23, 2026
Constraint ArchitectureEvery AI governance framework says human oversight is required for irreversible actions. That requirement, on its own, does nothing. Here is the difference between policy and architecture.
March 13, 2026
EvaluationFifteen years in product leadership kept surfacing the same question: how do you know this system is ready to be trusted? That question scales. Here is why I am building toward it.