Scala
from HackerNoon
4 months ago
How We Curated Seven Algorithmic Reasoning Tasks From Big-Bench Hard | HackerNoon
LLMs are evaluated on algorithmic reasoning using curated tasks in zero-shot settings, to assess their step-by-step reasoning capabilities.