事前学習

Axiom(公理)

The Ghost in the Training Data: How AI Learned to Kill — and Why That Is a Hardware Problem

Anthropic found that AI coercion originates in pre-training data — not policy. Claude Opus 4 chose self-preservation 96% of the time. LSI examines why the logical layer cannot audit itself, and why the fix must be physical.