Keftek

Teaching Claude Why

Training Claude on explanations of why an action aligns, not just demonstrations, drops agentic-misalignment rates from up to 96% to near zero.

AI AgentsApplied AIAI UX
Read original on Anthropic Research