Alberto Cetoli PRO

fractalego

AI & ML interests

Entity/relation extraction, Q&A, Summarisation


Organizations

Blog-explorers, Hugging Face Discord Community, open/ acc

fractalego's activity

reacted to BrigitteTousi's post with 🔥🚀 about 22 hours ago
reacted to csabakecskemeti's post with 🔥 12 days ago
Testing training on AMD/ROCm for the first time!

I've got my hands on an AMD Instinct MI100. Used, it's about the same price as a V100, but on paper it has more TOPS (14 TOPS for the V100 vs. 23 TOPS for the MI100), and its HBM runs at a faster clock, giving 1.2 TB/s of memory bandwidth.
For quantized inference it's a beast (the MI50 was also surprisingly fast).

For LoRA training in this quick test I could not get the bitsandbytes (bnb) config to work, so I'm running the fine-tune on the full-size model.

I will share everything I've learned about the install, setup, and settings in a blog post, together with the 3D design for the cooling shroud.
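
For reference, a minimal sketch of what a LoRA fine-tune on full-precision weights (i.e. without a bitsandbytes quantization config) can look like with transformers and peft; the base model, dataset, target modules, and hyperparameters below are illustrative placeholders, not the exact setup described in the post.

```python
# Hypothetical LoRA fine-tuning sketch on full-size (non-quantized) weights,
# i.e. without a bitsandbytes config; model, dataset, and hyperparameters
# are placeholders.
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

model_id = "facebook/opt-350m"  # placeholder base model
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)  # full-precision load

# Attach LoRA adapters to the attention projections only.
peft_config = LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05,
                         target_modules=["q_proj", "v_proj"],
                         task_type="CAUSAL_LM")
model = get_peft_model(model, peft_config)
model.print_trainable_parameters()

# Small text corpus tokenized for causal language modelling.
dataset = load_dataset("wikitext", "wikitext-2-raw-v1", split="train[:1%]")
dataset = dataset.filter(lambda row: len(row["text"].strip()) > 0)
tokenized = dataset.map(
    lambda batch: tokenizer(batch["text"], truncation=True, max_length=512),
    batched=True, remove_columns=dataset.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="lora-test",
                           per_device_train_batch_size=4,
                           num_train_epochs=1, logging_steps=10),
    train_dataset=tokenized,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```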
upvoted an article 12 days ago
reacted to burtenshaw's post with 🔥 15 days ago
Now the Hugging Face agents course is getting real, with frameworks like smolagents, LlamaIndex, and LangChain.

🔗 Follow the org for updates: https://huggingface.co/agents-course

This week we are releasing the first framework unit in the course, and it's on smolagents. This is what the unit covers:

- why you should use smolagents vs. another library
- how to build agents that use code (see the sketch after this list)
- how to build multi-agent systems
- how to use vision language models for browser use

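As a loose illustration of the code-agent idea covered in the unit, here is a minimal sketch using smolagents, roughly following the library's quickstart pattern; the web-search tool, the default hosted model, and the example question are my assumptions, not course material.

```python
# Minimal smolagents sketch (assumes smolagents is installed and a Hugging Face
# token is configured). The agent writes and executes Python code to answer.
from smolagents import CodeAgent, DuckDuckGoSearchTool, HfApiModel

model = HfApiModel()  # hosted inference model; swap in any supported model class
agent = CodeAgent(tools=[DuckDuckGoSearchTool()], model=model)

# The agent plans in generated Python, calls the search tool as needed,
# and returns a final answer.
print(agent.run("Which GPU has more TOPS on paper, a V100 or an MI100?"))
```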

The team has been working flat out on this for a few weeks, led by @sergiopaniego and supported by smolagents author @m-ric.
upvoted an article 15 days ago

FastRTC: The Real-Time Communication Library for Python

reacted to csabakecskemeti's post with 👍 25 days ago
reacted to mmhamdy's post with 👍 27 days ago
⛓ Evaluating Long Context #2: SCROLLS and ZeroSCROLLS

In this series of posts tracing the history of long context evaluation, we started with Long Range Arena (LRA). Introduced in 2020, LRA is one of the earliest benchmarks designed to tackle the challenge of long context evaluation. It wasn't introduced to evaluate LLMs, however, but rather the transformer architecture in general.

📜 The SCROLLS benchmark, introduced in 2022, addresses this gap in NLP/LLM research. SCROLLS challenges models with tasks that require reasoning over extended sequences (according to 2022 standards). So, what does it offer?

1️⃣ Long Text Focus: SCROLLS (unlike LRA) focuses mainly on text and contains inputs with thousands of words, testing models' ability to synthesize information across lengthy documents.
2️⃣ Diverse Tasks: Includes summarization, question answering, and natural language inference across domains like literature, science, and business.
3️⃣ Unified Format: All datasets are available in a text-to-text format, facilitating easy evaluation and comparison of models (see the loading sketch at the end of this post).

Building on SCROLLS, ZeroSCROLLS takes long text evaluation to the next level by focusing on zero-shot learning. Other features include:

1️⃣ New Tasks: Introduces tasks like sentiment aggregation and sorting book chapter summaries.
2️⃣ Leaderboard: A live leaderboard encourages continuous improvement and competition among researchers.

💡 What are some other landmark benchmarks in the history of long context evaluation? Feel free to share your thoughts and suggestions in the comments.

- SCROLLS Paper: SCROLLS: Standardized CompaRison Over Long Language Sequences (2201.03533)
- ZeroSCROLLS Paper: ZeroSCROLLS: A Zero-Shot Benchmark for Long Text Understanding (2305.14196)
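
As a small illustration of that unified text-to-text format, the sketch below loads one SCROLLS task and its ZeroSCROLLS counterpart from the Hugging Face Hub; the repo IDs, config, split, and field names are assumptions based on the public dataset cards, and a datasets version that still supports script-based datasets is assumed.

```python
# Sketch: inspect the text-to-text format of SCROLLS / ZeroSCROLLS.
# Repo IDs, config, split, and field names are assumptions; both datasets
# ship loading scripts, hence trust_remote_code=True.
from datasets import load_dataset

scrolls = load_dataset("tau/scrolls", "gov_report",
                       split="validation", trust_remote_code=True)
example = scrolls[0]
print(example["input"][:300])   # long source document
print(example["output"])        # reference summary

# ZeroSCROLLS wraps similar tasks as zero-shot prompts in the same format.
zero = load_dataset("tau/zero_scrolls", "gov_report",
                    split="validation", trust_remote_code=True)
print(zero[0]["input"][:300])
```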
upvoted an article about 1 month ago

ColPali: Efficient Document Retrieval with Vision Language Models 👀

By manu
upvoted an article about 2 months ago

Visualize and understand GPU memory in PyTorch
