Ksenia Se

Kseniase

AI & ML interests

None yet

Recent Activity

upvoted an article about 23 hours ago
Topic 28: What is Mixture-of-Mamba?
published an article about 23 hours ago
Topic 28: What is Mixture-of-Mamba?
reacted to their post with 😎 about 24 hours ago
8 New Applications of Test-Time Scaling We've noticed a huge interest in test-time scaling (TTS), so we decided to explore this concept further. Test-time compute (TTC) refers to the amount of computational power used by an AI model when generating a response. Many researchers are now focused on scaling TTC, as it enables slow, deep "thinking" and step-by-step reasoning, which improves overall models' performance. Here are 8 fresh studies on test-time scaling: 1. https://huggingface.co/papers/2502.05171 Introduces an LM that scales TTC by reasoning in latent space instead of generating more tokens with no special training. Here, a recurrent block to processes information iteratively. 2. https://huggingface.co/papers/2502.04728 Shows how TTS is applied to enhance model's Planning Domain Definition Language (PDDL) reasoning capabilities, which can be used to generate a symbolic world model. 3. https://huggingface.co/papers/2502.06703 Analyzes optimal TTS strategies and shows how small models can outperform much larger ones. 4. https://huggingface.co/papers/2502.04128 Shows how TTS improves expressiveness, timbre consistency and accuracy in speech synthesis with Llasa framework. It also dives into benefits of scaling train-time compute. 5. https://huggingface.co/papers/2502.07154 Suggests a modified training loss for better reasoning of LLMs when scaling TTC. 6. https://huggingface.co/papers/2502.05078 Unifies the strengths of chain, tree, and graph paradigms into one framework that expands reasoning only on necessary subproblems. 7. https://huggingface.co/papers/2502.01839 Explores scaling trends of self-verification and how to improve its capabilities with TTC. 8. https://huggingface.co/papers/2501.14723 Explores how scaling serial compute (iterations) and parallel compute (trajectories), can improve accuracy in real-world software engineering issues. Also, explore our article about TTS for more -> https://huggingface.co/blog/Kseniase/testtimecompute
View all activity

Organizations

Turing Post's profile picture Journalists on Hugging Face's profile picture Social Post Explorers's profile picture Hugging Face Discord Community's profile picture Sandbox's profile picture

Kseniase's activity

upvoted an article about 23 hours ago
view article
Article

Topic 28: What is Mixture-of-Mamba?

By Kseniase and 1 other β€’
β€’ 2
upvoted an article 3 days ago
view article
Article

🌁#88: Can DeepSeek Inspire Global Collaboration?

By Kseniase β€’
β€’ 3
upvoted an article 6 days ago
view article
Article

🦸🏻#10: Does Present-Day GenAI Actually Reason?

By Kseniase β€’
β€’ 5
upvoted an article 8 days ago
view article
Article

Topic 27: What are Chain-of-Agents and Chain-of-RAG?

By Kseniase and 1 other β€’
β€’ 10
upvoted an article 11 days ago
view article
Article

🌁#87: Why DeepResearch Should Be Your New Hire

By Kseniase β€’
β€’ 5
upvoted an article 15 days ago
view article
Article

What is test-time compute and how to scale it?

By Kseniase and 1 other β€’
β€’ 39
upvoted an article 18 days ago
view article
Article

🌁#86: Four Freedoms of truly open AI

By TuringPost and 1 other β€’
β€’ 5