Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
8
51
Troy Baker
jtroybaker
Follow
0 followers
·
19 following
jtroybaker
jtroybaker
AI & ML interests
Predictive Maintenance, Reinforcement Learning, Natural Language Processing
Recent Activity
liked
a model
about 11 hours ago
mradermacher/Qwen3-14B-Esper3-GGUF
liked
a model
about 11 hours ago
ValiantLabs/Qwen3-14B-Esper3
reacted
to
burtenshaw
's
post
with 👍
1 day ago
Qwen 3 Fine tuning >> MoE. Update the experiment thread to include config and script for fine-tuning the Qwen3-30B-A3B model. The goal is to make a low latency non-thinking model for a daily driver coding, so 3 billion parameters active should be perfect. ✔️ training running ✔️ evals running ⏭️ improve dataset The moe isn't going to fit into colab's A100 even with quantization (🙏 @UnslothAI ). So I've been working on HF spaces' H100s for this. Everything is available in the tread and I'll share more tomorrow. https://huggingface.co/burtenshaw/Qwen3-Code-Lite/discussions/1
View all activity
Organizations
spaces
1
Sleeping
Microsoft Orca 2 13b
😻
models
1
jtroybaker/rf-course-1-lunar
Updated
Aug 4, 2022
datasets
0
None public yet