Shyam Sunder Kumar

theainerd

AI & ML interests

Natural Language Processing

Recent Activity

liked a dataset 2 days ago
ai4bharat/Indic-Rag-Suite
liked a model 7 days ago
deepseek-ai/DeepSeek-R1-0528
upvoted a collection 7 days ago
MobileLLM
View all activity

Organizations

Neuropark's profile picture Speech Recognition Community Event Version 2's profile picture Open-Source AI Meetup's profile picture Social Post Explorers's profile picture Hugging Face Discord Community's profile picture Hugging Face MCP Course's profile picture

theainerd's activity

reacted to codelion's post with šŸš€ 8 days ago
view post
Post
3344
🧠 We just implemented Andrej Karpathy's "third paradigm" for LLM learning!

System Prompt Learning (SPL) enables LLMs to automatically learn problem-solving strategies from experience, rather than relying on static prompts.

šŸš€ How it works:
Your LLM builds a database of effective strategies, selects the best ones for each problem, and refines them over time based on success rates.

šŸ“Š Results across math benchmarks:
Arena Hard: 29% → 37.6% (+8.6%)
AIME24: 23.33% → 30% (+6.67%)
OptILLMBench: 61% → 65% (+4%)

The best part? All strategies are human-readable and the system gets progressively better at problem types you use frequently.

✨ Key benefits:
šŸ”„ Cumulative learning over time
šŸ“– Transparent, inspectable strategies
šŸ”Œ Works with any OpenAI-compatible API
⚔ Simple integration: just add "spl-" prefix to your model

Built as an open-source plugin in optillm. After 500 queries, our system developed 129 strategies and refined 97 of them!

This feels like a genuine step toward AI that learns from experience while staying completely interpretable.

šŸ”— GitHub: https://github.com/codelion/optillm/tree/main/optillm/plugins/spl
šŸ“– Full article: https://huggingface.co/blog/codelion/system-prompt-learning
🐦 Original Karpathy tweet: https://x.com/karpathy/status/1921368644069765486

Have you experimented with advanced system prompting? What strategies would you want your LLM to learn?