Does CoT reasoning really reflect the reasoning process of an LLM?
Perhaps...
But then again, perhaps not
Recent work from Anthropic studies this question empirically:
"Measuring Faithfulness in Chain-of-Thought Reasoning" by T. Lanham et al.
An overview of the technical report 👇