I referenced this paper (30-page PDF) from Apple in passing recently, but given how widely it was cited, it deserves its own listing. Recent large reasoning models (LRMs), the authors argue, "have limitations in exact computation: they fail to use explicit algorithms and reason inconsistently across puzzles."

