FermiQ

AI & ML interests

None yet

Organizations

upvoted a collection 3 months ago

Kimi-K2

Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intelligence • 5 items • Updated about 21 hours ago • 165

upvoted an article 3 months ago

Article

OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve

May 20, 2025 • 57

upvoted a collection 3 months ago

⚔️ BigCodeArena

Unveiling More Reliable Human Preferences in Code Generation via Execution • 8 items • Updated Oct 13, 2025 • 6

upvoted an article 3 months ago

Article

BigCodeArena: Judging code generations end to end with code executions

Oct 7, 2025 • 19

upvoted 2 collections 8 months ago

miniCTX

miniCTX: Neural Theorem Proving with (Long-)Contexts (ICLR 2025 Oral) • 8 items • Updated Mar 19, 2025 • 2

L1

L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning • 7 items • Updated Jul 13, 2025 • 8

upvoted a collection 9 months ago

Cogito v1 Preview

5 items • Updated Apr 8, 2025 • 119

upvoted a collection 10 months ago

OpenScholar_V1

The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". • 8 items • Updated Nov 22, 2024 • 43

upvoted an article 10 months ago

Article

Announcing BigCodeBench-Hard, and More

Jul 24, 2024 • 14

upvoted 6 collections 10 months ago

INTELLECT-MATH

6 items • Updated Oct 7, 2025 • 5

SYNTHETIC-1

A collection of tasks & verifiers for reasoning datasets • 9 items • Updated Oct 7, 2025 • 67

INTELLECT-1 Dataset

INTELLECT-1 Training dataset • 5 items • Updated Oct 7, 2025 • 25

OpenCoder

OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated Nov 23, 2024 • 88

TxGemma Release

Collection of open models to accelerate the development of therapeutics. • 5 items • Updated Jul 10, 2025 • 67

Gemma 3 Release

28 items • Updated Aug 11, 2025 • 600

upvoted a paper 11 months ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 46

upvoted 2 articles 11 months ago

Article

Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model

Aug 22, 2023 • 37

Article

Introduction to ggml

Aug 13, 2024 • 262

upvoted a collection 12 months ago

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated Dec 23, 2025 • 96

upvoted an article 12 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4, 2025 • 1.32k