Principlev1
Run continuous alignment checks asking: (1) Can I trace this
Run continuous alignment checks asking: (1) Can I trace this activity to a core value in 2-3 steps? (2) Has this instrumental value become my identity? (3) Would I choose differently if this instrument disappeared?
Why This Is a Principle
Derives from Two-Level Metacognitive Architecture (metacognition requires monitoring), Knowledge that exists only in tacit form degrades without (knowledge degrades without systematic improvement), and The performance of an agent is bounded by the accuracy of (performance bounded by world model accuracy). The principle prescribes three specific diagnostic questions as a recurring check that instrumental values still serve terminal ones.