-
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search
Paper • 2412.18319 • Published • 37 -
Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling
Paper • 2412.14860 • Published • 2 -
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding
Paper • 2411.04282 • Published • 33 -
Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning
Paper • 2412.15797 • Published • 17
Charles I Niswander II
charlesniswander
AI & ML interests
None yet
Recent Activity
liked
a model
about 10 hours ago
RWKV-Red-Team/ARWKV-7B-Preview-0.1
upvoted
a
paper
about 10 hours ago
ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language
Model Born from Transformer
upvoted
a
paper
about 15 hours ago
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model
Post-training
Organizations
None yet
Collections
1
datasets
None public yet