Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Troy Baker's picture
8 51

Troy Baker

jtroybaker
·
  • jtroybaker
  • jtroybaker

AI & ML interests

Predictive Maintenance, Reinforcement Learning, Natural Language Processing

Recent Activity

liked a model about 11 hours ago
mradermacher/Qwen3-14B-Esper3-GGUF
liked a model about 11 hours ago
ValiantLabs/Qwen3-14B-Esper3
reacted to burtenshaw's post with 👍 1 day ago
Qwen 3 Fine tuning >> MoE. Update the experiment thread to include config and script for fine-tuning the Qwen3-30B-A3B model. The goal is to make a low latency non-thinking model for a daily driver coding, so 3 billion parameters active should be perfect. ✔️ training running ✔️ evals running ⏭️ improve dataset The moe isn't going to fit into colab's A100 even with quantization (🙏 @UnslothAI ). So I've been working on HF spaces' H100s for this. Everything is available in the tread and I'll share more tomorrow. https://huggingface.co/burtenshaw/Qwen3-Code-Lite/discussions/1
View all activity

Organizations

ZeroGPU Explorers's profile picture Hugging Face Discord Community's profile picture

spaces 1

Sleeping

Microsoft Orca 2 13b

😻

Dec 5, 2023

models 1

jtroybaker/rf-course-1-lunar

Updated Aug 4, 2022

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs