
ponzi

ponzles


AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

DevQuasar/IQuestLab.IQuest-Coder-V1-14B-Thinking-GGUF

liked a model 1 day ago

mradermacher/IQuest-Coder-V1-40B-Thinking-i1-GGUF

liked a model 1 day ago

IQuestLab/IQuest-Coder-V1-7B-Instruct


Organizations

None yet

upvoted a paper 18 days ago

Delulu: A Verified Multi-Lingual Benchmark for Code Hallucination Detection in Fill-in-the-Middle Tasks

Paper • 2605.07024 • Published 24 days ago • 2

upvoted 2 papers about 1 month ago

SABER: Uncovering Vulnerabilities in Safety Alignment via Cross-Layer Residual Connection

Paper • 2509.16060 • Published Sep 19, 2025 • 1

Refusal in Language Models Is Mediated by a Single Direction

Paper • 2406.11717 • Published Jun 17, 2024 • 13

upvoted a changelog about 2 months ago

Hugging Face Changelog

Agent Traces on the Hub

Apr 7 • 138

upvoted a collection 3 months ago

Qwen 3.5 - 0.8, 2, 4, 9, 27, 35B - regular / uncensored

Min 256k context + images : Reg, Heretic, Heretic fine tunes of Qwen 3.5 in all sizes. • 43 items • Updated 1 day ago • 45

upvoted an article 3 months ago

Article

GGML and llama.cpp join HF to ensure the long-term progress of Local AI


ggerganov, ngxson, allozaur, lysandre, victor, julien-c • Feb 20 • 507

upvoted 3 articles 5 months ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular


itazap, ariG23498, ArthurZ, sergiopaniego, merve, pcuenq • Dec 18, 2025 • 124

Article

Shadow AI - Where are the CIOs?

jeffboudier • Dec 19, 2025 • 31

Article

Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models

nvidia • Dec 15, 2025 • 111

upvoted an article 6 months ago

Article

Norm-Preserving Biprojected Abliteration

grimjim • Nov 6, 2025 • 81

upvoted a collection 6 months ago

The Bestiary

Decensored language models made using Heretic (https://github.com/p-e-w/heretic) • 5 items • Updated 10 days ago • 114

upvoted a paper 7 months ago

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published Nov 12, 2025 • 129

upvoted 4 collections 7 months ago

abliterated loras

6 items • Updated Nov 25, 2025 • 1

Cerebras REAP

Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 30 items • Updated Feb 25 • 141

Qwen3

Models from the Qwen3 series • 10 items • Updated 20 days ago • 3

Granite Quantized Models

Quantized versions of IBM Granite models. • 47 items • Updated 4 days ago • 36

upvoted a collection 8 months ago

Qwen3-Omni

6 items • Updated Dec 31, 2025 • 201

upvoted a paper 11 months ago

On Path to Multimodal Generalist: General-Level and General-Bench

Paper • 2505.04620 • Published May 7, 2025 • 83

upvoted 2 collections 11 months ago

ERNIE 4.5

collection of ERNIE 4.5 models. • 27 items • Updated Nov 11, 2025 • 189

BGE

31 items • Updated Feb 4 • 160