#zero-shot

Scala
from HackerNoon
4 months ago

How We Curated Seven Algorithmic Reasoning Tasks From Big-Bench Hard | HackerNoon

LLMs are evaluated on algorithmic reasoning using curated tasks in zero-shot settings, to assess their step-by-step reasoning capabilities.