LLM Meta-Cognition and Exploring the Adjacent Possible
Andrej Karpathy has a wonderful tweet on what he calls learned “cognitive strategies” but I think more generally is referred to as “meta cognition”. The piece I like is: …The models discover, in the process of trying to solve many diverse math/code/etc. problems, strategies that resemble the internal monologue of humans, which are very hard (/impossible) to directly program into the models. I call these “cognitive strategies” - things like approaching a problem from different angles, trying out different ideas, finding analogies, backtracking, re-examining, etc. Weird as it sounds, it’s plausible that LLMs can discover better ways of thinking, of solving problems, of connecting ideas across disciplines, and do so in a way we will find surprising, puzzling, but creative and brilliant in retrospect… ...