I built ANMA because I noticed that cheaper models would often ignore architecture rules. So I did several benchmarks using "Claude Haiku 4.5" with and without ANMA; without ANMA it ignored the "rules" 13 out of 19 runs, with ANMA, 0 out of 20 runs.

What is "ANMA"? YAML contracts with CLAUDE.md, hooks, and CI checks

What about stronger/expensive models? They followed the architecture rules.

The question is, would cheaper models with stronger rules be the best affordable default for coding?