Millions of States: Designing a Scalable MoE Architecture with RWKV-7 Meta-learner Paper • 2504.08247 • Published Apr 11
AlphaGaO/DeepSeek-V3-0324-Fused-4E-29B-Unhealed-Preview Text Generation • 29B • Updated Apr 8 • 16 • 2
BlackGoose Rimer: Harnessing RWKV-7 as a Simple yet Superior Replacement for Transformers in Large-Scale Time Series Modeling Paper • 2503.06121 • Published Mar 8 • 5
ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer Paper • 2501.15570 • Published Jan 26 • 25