4 252 51

Charles I Niswander II

charlesniswander

dhar174

AI & ML interests

None yet

Recent Activity

reacted to Kseniase's post with 🚀 1 day ago

10 Techniques for Boosting LLM Reasoning in 2025 Everyone’s chasing top reasoning, but sometimes it's still the bottleneck for many real-world tasks. This week, let's spotlight some powerful techniques that have shown promise in helping LLMs achieve more consistent logic, planning, and depth: 1. Retrieval-Augmented CoT Chaining (RAG+CoT) -> https://huggingface.co/papers/2504.13534 Combines Chain-of-Thought prompting with retrieval augmentation at intermediate steps. Relevant documents are fetched after each reasoning subgoal, updating context dynamically. Great for open-domain QA, math, logic and multi-hop fact-checking 2. Tool-use by example injection -> https://huggingface.co/papers/2502.05867 Injects few-shot tool interaction examples during training to implicitly teach calling patterns. Helps in plug-and-play tool use without training new architectures 3. Visual Scratchpads, or multimodal reasoning support -> https://huggingface.co/papers/2501.07542 Using structured visual inputs or sketchable intermediate steps (diagrams, grids, trees) boosts performance in tasks like planning, geometry, and multi-agent simulation. In real practice thanks to this GPT-4o, Claude, and Gemini show marked improvement 4. System 1 vs System 2 Prompt switching -> https://huggingface.co/papers/2505.20101 Changing a fast, intuitive response prompt with a slow, deliberate reasoning mode is among the most popular AI trends. E.g., models tend to respond more reliably when explicitly instructed to “think like a researcher.” This can also reduce hallucinations in open-ended generation and debate tasks 5. Adversarial Self-Chat Fine-Tuning -> https://huggingface.co/papers/2404.10642 Generate debates between model variants or model vs human, then fine-tune on the winner’s response. It helps models learn to better defend their reasoning. Used in Claude’s Constitutional AI and SPPO-style tuning Read further below👇 Also, subscribe to the Turing Post: https://www.turingpost.com/subscribe

upvoted a paper 4 days ago

Reasoning with Exploration: An Entropy Perspective

upvoted a paper 5 days ago

From Bytes to Ideas: Language Modeling with Autoregressive U-Nets

View all activity

Organizations

None yet

Collections 1

models 1

charlesniswander/fruit_freshness_demo

Image Classification • Updated Jan 20 • 51

datasets 0

None public yet

Charles I Niswander II

AI & ML interests

Recent Activity

Organizations

Collections 1

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling

Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning

models 1

charlesniswander/fruit_freshness_demo

datasets 0