Lemon Mint's picture

Lemon Mint

lemon-mint

·

lemon-mint

AI & ML interests

None yet

Recent Activity

liked a model 18 days ago

arcee-ai/Trinity-Large-Preview

new activity about 1 month ago

lemon-mint/gemma-ko-7b-it-v0.40:Update README.md

liked a model about 1 month ago

LGAI-EXAONE/K-EXAONE-236B-A23B

View all activity

Organizations

upvoted a collection 9 months ago

Kanana-1.5

Open Source Kanana-1.5 • 16 items • Updated Dec 1, 2025 • 29

upvoted a paper 9 months ago

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12, 2025 • 82

upvoted a collection 9 months ago

Smoothie Qwen3

For more details, please visit https://github.com/dnotitia/smoothie-qwen • 9 items • Updated 20 days ago • 7

upvoted a collection 10 months ago

Mellum

Series of code models by JetBrains • 12 items • Updated Oct 1, 2025 • 36

upvoted a paper 10 months ago

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Paper • 2504.21233 • Published Apr 30, 2025 • 49

upvoted an article 10 months ago

Article

PipelineRL

Apr 25, 2025

•

43

upvoted a collection 11 months ago

TxGemma Release

Collection of open models to accelerate the development of therapeutics. • 5 items • Updated Jul 10, 2025 • 67

upvoted 5 collections about 1 year ago

R1-like Datasets

19 items • Updated May 27, 2025 • 6

Korean Reasoning Datasets 한국어 추론 데이터셋

5 items • Updated Feb 12, 2025 • 3

Korean Instructions 한국어 인스트럭션 데이터셋

3 items • Updated Feb 28, 2025 • 4

SYNTHETIC-1

A collection of tasks & verifiers for reasoning datasets • 9 items • Updated Oct 7, 2025 • 67

R1 Multilingual

5 items • Updated Jan 31, 2025 • 11

upvoted 8 collections over 1 year ago

Gemma-APS Release

Gemma models for text-to-propositions segmentation. The models are distilled from fine-tuned Gemini Pro model applied to multi-domain synthetic data. • 3 items • Updated Jul 10, 2025 • 24

Gemma 2 JPN Release

A Gemma 2 2B model fine-tuned on Japanese text. It supports the Japanese language the same level of performance of EN only queries on Gemma 2. • 3 items • Updated Jul 10, 2025 • 30

Molmo

Artifacts for open multimodal language models. • 5 items • Updated Dec 23, 2025 • 309

DataGemma Release

A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated Jul 10, 2025 • 87

DeepSeek-V2.5

2 items • Updated Nov 27, 2025 • 47

DeepSeek-V2

8 items • Updated Nov 27, 2025 • 35

Reranker Model

A collection of Korean-specific reranking models • 2 items • Updated Jul 19, 2025 • 3

Hermes 3

The Hermes 3 Series of Models • 11 items • Updated Sep 8, 2025 • 132