11 29 292

Matricardi Fabio

FM-1976

https://medium.com/@fabio.matricardi

AI & ML interests

control system engineering, AI, LLM with python. ThePoorGPUguy on substack

Recent Activity

liked a model 1 day ago

openbmb/MiniCPM-o-2_6-gguf

liked a model 2 days ago

mradermacher/Llama-3.1-Nemotron-Nano-4B-v1.1-GGUF

liked a model 2 days ago

second-state/FLUX.1-Redux-dev-GGUF

View all activity

Organizations

None yet

FM-1976's activity

upvoted 2 papers about 1 month ago

ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations

Paper • 2505.02819 • Published May 5 • 24

Practical Efficiency of Muon for Pretraining

Paper • 2505.02222 • Published May 4 • 37

upvoted an article 2 months ago

Article

Bamba: Inference-Efficient Hybrid Mamba2 Model

and 28 others •

Dec 18, 2024

• 55

upvoted a collection 2 months ago

🌙 March 2025 - Open releases from the Chinese community

Collection

32 items • Updated 22 days ago • 13

upvoted a collection 3 months ago

RWKV v7

Collection

9 items • Updated Mar 17 • 4

upvoted 2 papers 3 months ago

How far can we go with ImageNet for Text-to-Image generation?

Paper • 2502.21318 • Published Feb 28 • 26

Chain of Draft: Thinking Faster by Writing Less

Paper • 2502.18600 • Published Feb 25 • 49

upvoted 3 papers 4 months ago

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 120

The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks

Paper • 2502.08235 • Published Feb 12 • 58

FoNE: Precise Single-Token Number Embeddings via Fourier Features

Paper • 2502.09741 • Published Feb 13 • 14

upvoted 3 collections 5 months ago

upvoted 7 papers 6 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 150

Hermes 3 Technical Report

Paper • 2408.11857 • Published Aug 15, 2024 • 53

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 56

FluidML: Fast and Memory Efficient Inference Optimization

Paper • 2411.09242 • Published Nov 14, 2024 • 1

Natural Language Reinforcement Learning

Paper • 2411.14251 • Published Nov 21, 2024 • 31

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22, 2024 • 65

Ultra-Sparse Memory Network

Paper • 2411.12364 • Published Nov 19, 2024 • 24