Edoardo Federici

efederici

https://banda-larga.github.io

AI & ML interests

llms, ir, graphs & co

Recent Activity

liked a dataset 2 days ago

FreedomIntelligence/ShareGPT-4o-Image

liked a dataset 3 days ago

THU-KEG/Crab-SFT

liked a dataset 3 days ago

THU-KEG/LongWriter-Zero-RLData

upvoted a paper 3 days ago

Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs

Paper • 2506.19290 • Published 5 days ago • 46

upvoted a paper about 2 months ago

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Paper • 2505.04588 • Published May 7 • 65

upvoted a collection 2 months ago

Qwen3

72 items • Updated 13 days ago • 806

upvoted a paper 2 months ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 128

upvoted an article 3 months ago

Topic 33: Slim Attention, KArAt, XAttention and Multi-Token Attention Explained – What’s Really Changing in Transformers?

Article • Apr 4 • 14

upvoted 2 papers 3 months ago

φ-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation

Paper • 2503.13288 • Published Mar 17 • 51

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 132

upvoted a paper 7 months ago

OminiControl: Minimal and Universal Control for Diffusion Transformer

Paper • 2411.15098 • Published Nov 22, 2024 • 61

upvoted an article 8 months ago

Visually Multilingual: Introducing mcdse-2b

Article • Oct 27, 2024 • 41

upvoted 2 papers 8 months ago

Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs

Paper • 2404.05719 • Published Apr 8, 2024 • 83

Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs

Paper • 2402.14740 • Published Feb 22, 2024 • 13

upvoted 3 papers 9 months ago

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1, 2024 • 151

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2, 2024 • 52

Improve Mathematical Reasoning in Language Models by Automated Process Supervision

Paper • 2406.06592 • Published Jun 5, 2024 • 30

upvoted a paper 10 months ago

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct

Paper • 2409.05840 • Published Sep 9, 2024 • 49

upvoted an article 10 months ago

Selective fine-tuning of Language Models with Spectrum

Article • Sep 3, 2024 • 36

upvoted a paper 10 months ago

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation

Paper • 2408.12528 • Published Aug 22, 2024 • 52

upvoted a collection 11 months ago

Probably function calling datasets

Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space. • 39 items • Updated Jul 17, 2024 • 38

upvoted 2 papers 12 months ago

Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion

Paper • 2407.01392 • Published Jul 1, 2024 • 46

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 132