Post: I've run the Open LLM Leaderboard evaluations plus HellaSwag on deepseek-ai/DeepSeek-R1-Distill-Llama-8B and compared the results with meta-llama/Llama-3.1-8B-Instruct, and at first glance R1 does not beat Llama overall. If anyone wants to double-check, the results are posted here: https://github.com/csabakecskemeti/lm_eval_results Have I made a mistake, or is this distilled version (at least) simply not as good as or better than the competition? I'll run the same evaluations on the Qwen 7B distilled version too.
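The linked lm_eval_results repo suggests the numbers were produced with EleutherAI's lm-evaluation-harness. Below is a minimal sketch of how such a run could be reproduced via the harness's Python API, assuming the `lm-eval` package (v0.4+) is installed; the task choice (HellaSwag only) and model arguments are illustrative, not the author's exact configuration.

```python
# Hedged sketch: assumes EleutherAI's lm-evaluation-harness (pip install lm-eval).
# Task list and model_args are illustrative; the post's full Open LLM Leaderboard
# task set and settings are not shown here.
from lm_eval.evaluator import simple_evaluate

results = simple_evaluate(
    model="hf",  # Hugging Face transformers backend
    model_args="pretrained=deepseek-ai/DeepSeek-R1-Distill-Llama-8B,dtype=bfloat16",
    tasks=["hellaswag"],  # add the leaderboard tasks to match the comparison in the post
    batch_size="auto",
)

# Per-task metrics (accuracy, normalized accuracy, etc.)
print(results["results"])
```

Running the same call with `pretrained=meta-llama/Llama-3.1-8B-Instruct` would give the comparison baseline.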
Collection: Visual Language Models. Collection of OpenVINO-optimized models for visual-language assistance • 9 items • Updated 25 days ago • 3
Article: LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs. By wolfram • Dec 4, 2024 • 77
Post: Great blog post! 🔥 @wolfram https://huggingface.co/blog/wolfram/llm-comparison-test-2024-12-04
kaitchup/Qwen2.5-72B-Instruct-AutoRound-GPTQ-4bit • Text Generation • Updated Nov 26, 2024 • 20 • 6