1 58 66

wei

fengwei

AI & ML interests

None yet

Recent Activity

liked a model 6 days ago

lightblue/lb-reranker-0.5B-v1.0

upvoted a paper 8 days ago

RM-R1: Reward Modeling as Reasoning

upvoted a paper 8 days ago

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

View all activity

Organizations

None yet

fengwei's activity

upvoted 2 papers 8 days ago

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published 12 days ago • 66

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Paper • 2505.04588 • Published 9 days ago • 59

upvoted 2 papers 13 days ago

ReasonIR: Training Retrievers for Reasoning Tasks

Paper • 2504.20595 • Published 18 days ago • 52

UniversalRAG: Retrieval-Augmented Generation over Multiple Corpora with Diverse Modalities and Granularities

Paper • 2504.20734 • Published 18 days ago • 61

upvoted a paper 28 days ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published 29 days ago • 89

upvoted a paper about 1 month ago

Kimi-VL Technical Report

Paper • 2504.07491 • Published Apr 10 • 124

upvoted a collection about 1 month ago

Kimi-VL-A3B

Collection

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated Apr 12 • 65

upvoted 3 papers about 1 month ago

ZClip: Adaptive Spike Mitigation for LLM Pre-Training

Paper • 2504.02507 • Published Apr 3 • 78

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 277

MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

Paper • 2504.00999 • Published Apr 1 • 89

upvoted a paper about 2 months ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 152

upvoted an article about 2 months ago

Article

Guiding Text Generation with Constrained Beam Search in 🤗 Transformers

•

Mar 11, 2022

• 11

upvoted 6 papers about 2 months ago

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published Mar 24 • 118

EuroBERT: Scaling Multilingual Encoders for European Languages

Paper • 2503.05500 • Published Mar 7 • 79

upvoted 2 papers 3 months ago

NeoBERT: A Next-Generation BERT

Paper • 2502.19587 • Published Feb 26 • 39

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 103