1 28 53

罗杰斯

rojasdiego

https://rojasdiego.com

AI & ML interests

LLMs for Code Generation

Recent Activity

liked a model 11 days ago

google/gemma-3-27b-it

updated a model 16 days ago

rojasdiego/Qwen2.5-Coder-7B-Next-Action-Prediction

published a model 16 days ago

rojasdiego/Qwen2.5-Coder-7B-Next-Action-Prediction

View all activity

Organizations

rojasdiego's activity

liked a model 11 days ago

google/gemma-3-27b-it

Image-Text-to-Text • Updated 4 days ago • 682k • • 980

updated a model 16 days ago

rojasdiego/Qwen2.5-Coder-7B-Next-Action-Prediction

Text Generation • Updated 16 days ago • 3

published a model 16 days ago

rojasdiego/Qwen2.5-Coder-7B-Next-Action-Prediction

Text Generation • Updated 16 days ago • 3

upvoted 2 papers 19 days ago

LLM as a Broken Telephone: Iterative Generation Distorts Information

Paper • 2502.20258 • Published 27 days ago • 26

HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization

Paper • 2503.04598 • Published 20 days ago • 18

upvoted a paper 28 days ago

The Curse of Depth in Large Language Models

Paper • 2502.05795 • Published Feb 9 • 37

liked a model about 1 month ago

mistralai/Mistral-Small-24B-Base-2501

Text Generation • Updated Jan 30 • 24.1k • 235

upvoted 2 papers about 2 months ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published Jan 30 • 58

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 116

liked 2 models 2 months ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated about 1 month ago • 1.5M • • 11.6k

deepseek-ai/DeepSeek-R1-Zero

Text Generation • Updated about 1 month ago • 8.72k • 872

liked a dataset 2 months ago

bigcode/the-stack-v2-train-smol-ids

Viewer • Updated Apr 23, 2024 • 40.1M • 920 • 32

liked a model 3 months ago

numind/NuExtract-1.5

Text Generation • Updated Nov 18, 2024 • 14.3k • • 222

updated a collection 3 months ago

Code LLMs

Collection

6 items • Updated Jan 3 • 1

liked 2 models 3 months ago

infly/OpenCoder-1.5B-Base

Text Generation • Updated Nov 11, 2024 • 310 • 21

infly/OpenCoder-8B-Instruct

Text Generation • Updated Nov 14, 2024 • 1.66k • 187

updated a collection 3 months ago

CoT Models

Collection

2 items • Updated Jan 1

liked a model 3 months ago

PowerInfer/SmallThinker-3B-Preview

Text Generation • Updated Jan 16 • 108k • • 392

updated a collection 3 months ago

Code LLMs

Collection

6 items • Updated Jan 3 • 1

liked a model 3 months ago

Qwen/QwQ-32B-Preview

Text Generation • Updated Jan 12 • 230k • • 1.72k