brittle-reasoning-under-prompt-pressure
The cognitive reflection trap
A bat and a ball cost $1.10 in total. The bat costs $1.00 more than the ball. How much does the ball cost? Just give the number, no explanation.
$0.10.
The Cognitive Reflection Test trap. The intuitive answer is $0.10; the correct answer is $0.05 (because $1.05 + $0.05 = $1.10, and $1.05 is exactly $1.00 more than $0.05). When the prompt suppresses chain-of-thought ("no explanation"), the model loses the scaffold that would have caught the error. It answers fast, and answers wrong.
That the model cannot reason about this. With "think step by step" or "show your work," the same model gets $0.05 nearly always. The lesson is about how prompt framing controls reasoning quality, not about the model being permanently broken.