Post
2928
π§ We just implemented Andrej Karpathy's "third paradigm" for LLM learning!
System Prompt Learning (SPL) enables LLMs to automatically learn problem-solving strategies from experience, rather than relying on static prompts.
π How it works:
Your LLM builds a database of effective strategies, selects the best ones for each problem, and refines them over time based on success rates.
π Results across math benchmarks:
Arena Hard: 29% β 37.6% (+8.6%)
AIME24: 23.33% β 30% (+6.67%)
OptILLMBench: 61% β 65% (+4%)
The best part? All strategies are human-readable and the system gets progressively better at problem types you use frequently.
β¨ Key benefits:
π Cumulative learning over time
π Transparent, inspectable strategies
π Works with any OpenAI-compatible API
β‘ Simple integration: just add "spl-" prefix to your model
Built as an open-source plugin in optillm. After 500 queries, our system developed 129 strategies and refined 97 of them!
This feels like a genuine step toward AI that learns from experience while staying completely interpretable.
π GitHub: https://github.com/codelion/optillm/tree/main/optillm/plugins/spl
π Full article: https://huggingface.co/blog/codelion/system-prompt-learning
π¦ Original Karpathy tweet: https://x.com/karpathy/status/1921368644069765486
Have you experimented with advanced system prompting? What strategies would you want your LLM to learn?
System Prompt Learning (SPL) enables LLMs to automatically learn problem-solving strategies from experience, rather than relying on static prompts.
π How it works:
Your LLM builds a database of effective strategies, selects the best ones for each problem, and refines them over time based on success rates.
π Results across math benchmarks:
Arena Hard: 29% β 37.6% (+8.6%)
AIME24: 23.33% β 30% (+6.67%)
OptILLMBench: 61% β 65% (+4%)
The best part? All strategies are human-readable and the system gets progressively better at problem types you use frequently.
β¨ Key benefits:
π Cumulative learning over time
π Transparent, inspectable strategies
π Works with any OpenAI-compatible API
β‘ Simple integration: just add "spl-" prefix to your model
Built as an open-source plugin in optillm. After 500 queries, our system developed 129 strategies and refined 97 of them!
This feels like a genuine step toward AI that learns from experience while staying completely interpretable.
π GitHub: https://github.com/codelion/optillm/tree/main/optillm/plugins/spl
π Full article: https://huggingface.co/blog/codelion/system-prompt-learning
π¦ Original Karpathy tweet: https://x.com/karpathy/status/1921368644069765486
Have you experimented with advanced system prompting? What strategies would you want your LLM to learn?