46 8 15

Zachary Mueller

muellerzr

AI & ML interests

None yet

Recent Activity

liked a model 13 days ago

deepseek-ai/DeepSeek-R1-0528

liked a Space about 1 month ago

Qwen/Qwen3-Demo

liked a model about 2 months ago

ibm-granite/granite-speech-3.3-8b

View all activity

Organizations

muellerzr's activity

upvoted a collection 2 months ago

Llama 4

Collection

Llama 4 release • 13 items • Updated Apr 29 • 526

upvoted a collection 4 months ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Apr 28 • 618

upvoted a paper 7 months ago

SelfCodeAlign: Self-Alignment for Code Generation

Paper • 2410.24198 • Published Oct 31, 2024 • 25

upvoted an article 9 months ago

Article

Accelerate 1.0.0

and 2 others •

Sep 13, 2024

• 53

upvoted 2 articles 12 months ago

Article

BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks

and 8 others •

Jun 18, 2024

• 48

Article

From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate

and 3 others •

Jun 13, 2024

• 54

upvoted an article about 1 year ago

Article

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

and 8 others •

Apr 29, 2024

• 78

upvoted a collection about 1 year ago

llama 3 self-align experiments

Collection

Replicating the pipeline for StarCoder-2 Instruct on Llama-3-8B with some tweaks https://huggingface.co/blog/sc2-instruct • 4 items • Updated May 9, 2024 • 6