alkinun

AtAndDev

AI & ML interests

LLMs, Alignment, Merging, Unsloth, DPO, SFT, ORPO, SPIN...

Recent Activity

reacted to prithivMLmods's post with 🔥 19 days ago

I've made 8 Spaces in the Qwen-Image-Edit series, and out of them, 5 Spaces reached “Space of the Week”! A few Spaces are still topping the list even after many months. Cumulatively, the series has crossed 8.2 million+ ZeroGPU runs and nearly 4 million visitors overall. Thanks for all the community support! 🤗❤️ 🔗 Spaces: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection

liked a dataset 19 days ago

AtAndDev/delta-4lang-8k

updated a dataset 19 days ago

AtAndDev/delta-4lang-8k

upvoted a paper 22 days ago

Embarrassingly Simple Self-Distillation Improves Code Generation

Paper • 2604.01193 • Published Apr 1 • 55

upvoted 3 articles about 1 month ago

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

Pringled • Oct 14, 2024 • 104

Article

LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone!

medmekk, marcsun13 • Mar 7, 2025 • 98

Article

Granite 4.1 LLMs: How They’re Built

ibm-granite • Apr 29 • 78

upvoted an article 2 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 908

upvoted an article 3 months ago

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

loubnabnl, anton-l, davanstrien • Mar 20, 2024 • 114

upvoted 2 papers 3 months ago

SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale

Paper • 2602.23866 • Published Feb 27 • 89

Z1: Efficient Test-time Scaling with Code

Paper • 2504.00810 • Published Apr 1, 2025 • 27

upvoted a collection 3 months ago

Qwen3.5

21 items • Updated Mar 9 • 1.67k

upvoted an article 4 months ago

Article

Train AI models with Unsloth and Hugging Face Jobs for FREE

burtenshaw, danielhanchen, shimmyshimmer, mlabonne, davanstrien, evalstate • Feb 20 • 103

upvoted 2 articles 5 months ago

Article

Why Your AI Strategy Needs Hugging Face Storage

AdrianLepers • Jan 26 • 13

Article

One Year Since the “DeepSeek Moment”

huggingface • Jan 20 • 62

upvoted a changelog 5 months ago

Hugging Face Changelog

Sort Models by Parameter Size

Jan 22 • 38

upvoted a collection 8 months ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 16 items • Updated Dec 24, 2025 • 243

upvoted an article 9 months ago

Article

GRPO for GUI Grounding Done Right

HelloKKMe • Jun 11, 2025 • 37

upvoted a collection 9 months ago

Kimi-VL-A3B

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated Mar 2 • 82

upvoted a changelog 9 months ago

Hugging Face Changelog

Emoji Autocomplete in Discussions and Posts

Sep 11, 2025 • 68

upvoted 2 papers 9 months ago

Every Sample Matters: Leveraging Mixture-of-Experts and High-Quality Data for Efficient and Accurate Code LLM

Paper • 2503.17793 • Published Mar 22, 2025 • 24

Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization

Paper • 2508.07629 • Published Aug 11, 2025 • 43

upvoted a collection 9 months ago

Apertus LLM

Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages • 4 items • Updated Oct 1, 2025 • 355