April 24, 2026
Language Models are Few-Shot Learners (GPT-3)
Notes
175B parameters. In-context learning emerges at scale. Changed the field.
Browse posts by tag
175B parameters. In-context learning emerges at scale. Changed the field.
Showed large LMs can perform tasks zero-shot. Introduced the scaling intuition.
The evolution of neural sequence prediction, and how it connects to classical methods