Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
36.4
TFLOPS
15
96
400
alkinun
AtAndDev
Follow
Pent's profile picture
mhezhying's profile picture
MongTe0712's profile picture
48 followers
·
78 following
alkinun
alkinun
AI & ML interests
LLMs, Alignment, Merging, Unsloth, DPO, SFT, ORPO, SPIN..
Recent Activity
liked
a model
about 1 hour ago
unsloth/DeepSeek-R1-0528-Qwen3-8B-unsloth-bnb-4bit
reacted
to
FlameF0X
's
post
with 👍
about 10 hours ago
SnowflakeCore-G1 development update: We're building a 24-layer transformer with 32K context and 1024 embedding dimensions - pretty ambitious! Even running at batch_size=1 with heavy gradient accumulation, we're hitting memory walls at 300GB RAM. Scaling up to ~1TB will take some time, but the architecture is looking promising. Thanks for following along with the journey! 😅
upvoted
a
paper
about 11 hours ago
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
View all activity
Organizations
Posts
9
view post
Post
2787
deepseek-ai/DeepSeek-R1-0528
This is the end
See translation
view post
Post
3065
Llama 4 is out...
View all Posts
spaces
3
Sort: Recently updated
Sleeping
marco-qwq-7B
💻
Sleeping
AIDC AI Marco O1
💻
Generate responses for AI chat
Runtime error
Hallo
👋
Generate realistic talking heads from image+audio
models
9
Sort: Recently updated
AtAndDev/SelfCoder-v0.0
Text Generation
•
4B
•
Updated
May 19
•
7
AtAndDev/lora_model
Updated
Apr 5
AtAndDev/marco-qwq-7B
Text Generation
•
8B
•
Updated
Dec 8, 2024
•
17
AtAndDev/Ogno-Monarch-Neurotic-9B-Passthrough
Text Generation
•
9B
•
Updated
Mar 1, 2024
•
17
AtAndDev/Ogno-Monarch-Neurotic-7B-Dare-Ties
Text Generation
•
7B
•
Updated
Mar 1, 2024
•
17
AtAndDev/Marcoro14-7B-Slerp
Text Generation
•
7B
•
Updated
Mar 1, 2024
•
18
AtAndDev/CapybaraMarcoroni-7B
Text Generation
•
7B
•
Updated
Jan 7, 2024
•
690
AtAndDev/ShortKing-3b-v0.2
Text Generation
•
3B
•
Updated
Oct 2, 2023
•
92
•
2
AtAndDev/ShortKing-1.4b-v0.1
Text Generation
•
1B
•
Updated
Sep 29, 2023
•
3.81k
•
2
datasets
15
Sort: Recently updated
AtAndDev/SPRL-v0.1
Viewer
•
Updated
about 1 month ago
•
936
•
108
AtAndDev/SelfCoder-Test
Viewer
•
Updated
about 1 month ago
•
936
•
61
AtAndDev/ranky-dataset
Viewer
•
Updated
Mar 19
•
2.86k
•
33
AtAndDev/symbolm
Viewer
•
Updated
Jan 23
•
20k
•
46
AtAndDev/symlm
Viewer
•
Updated
Jan 16
•
10.1k
•
31
AtAndDev/chain-of-diffusion
Viewer
•
Updated
Jan 7
•
6.45k
•
35
AtAndDev/clip-bicycle-e-bike
Viewer
•
Updated
Jan 2
•
6k
•
61
AtAndDev/QwQ-LongCoT-59k-cleaned
Viewer
•
Updated
Dec 6, 2024
•
59.2k
•
46
•
1
AtAndDev/sedir-clean
Viewer
•
Updated
Dec 5, 2024
•
11.8k
•
38
AtAndDev/sedir-unclean
Viewer
•
Updated
Dec 5, 2024
•
19.9k
•
30
View 15 datasets