ben burtenshaw's picture

Open to Collab

ben burtenshaw PRO

burtenshaw

huggingface

·

AI & ML interests

None yet

Recent Activity

updated a dataset about 8 hours ago

burtenshaw/transformers-pr-slop-dataset

updated a dataset about 15 hours ago

huggingface-course/supervised-finetuning_quiz_student_responses

updated a dataset about 16 hours ago

mcp-course/certificates

View all activity

Organizations

buckets 4

burtenshaw/home

burtenshaw/deepseek-v4-figures

burtenshaw/critpt-harness

burtenshaw/autolab-blog-assets

Posts 39

Post

8055

Smol course has a distinctive approach to teaching post-training, so I'm posting about how it’s different to other post-training courses, including the llm course that’s already available.

In short, the smol course is just more direct that any of the other course, and intended for semi-pro post trainers.

- It’s a minimal set of instructions on the core parts.
- It’s intended to bootstrap real projects you're working on.
- The material handsover to existing documentation for details
- Likewise, it handsover to the LLM course for basics.
- Assessment is based on a leaderboard, without reading all the material.

To start the smol course, follow here:

Articles 37

Article

39

DeepSeek-V4: a million-token context that agents can actually use

View all Articles

Collections 12

View 12 collections

Papers 3

arxiv:2509.25397

arxiv:2502.02737

arxiv:2408.16961

spaces 88

HF Bucket MCP

Access and manage your HF bucket via the MCP endpoint

Qwen3.5 RMSNorm Experiment Lab

Explore the full Qwen3.5 RMSNorm run history.

Autolab Trackio

Visualize GPS tracks on an interactive map

Echo Environment Server

Autoresearch Environment Server

Trenches US Model

Generate strategic US policy advice

models 61

burtenshaw/ptc-optimized-kernel

Updated 23 days ago

burtenshaw/codex

burtenshaw/dave

burtenshaw/LFM2.5-1.2B-Instruct-FineTome100k

burtenshaw/lfm25-finetome-unsloth

burtenshaw/lfm25-1.2b-finetome

burtenshaw/qwen3-0.6b-fineproofs-sft

burtenshaw/prime-lab-wordle

Updated Feb 12 • 4

burtenshaw/lfm25-12b-openmed-medical-reasoning-sft-mega-unsloth

burtenshaw/unsloth-lfm-medical-lr-2e-4-20260211b

datasets 65

burtenshaw/transformers-pr-slop-dataset

Viewer • Updated about 8 hours ago • 1.46M • 854 • 2

burtenshaw/1-million-rows

Updated 1 day ago • 4

burtenshaw/european-cities

Viewer • Updated 2 days ago • 40 • 14

burtenshaw/european-countries

Viewer • Updated 2 days ago • 47 • 14

burtenshaw/ptc-optimized-kernel-job-inputs

Updated 23 days ago • 56

burtenshaw/kernel-skill-source

Updated 28 days ago • 144

burtenshaw/qwen3-5-0-8b-rmsnorm-experiment

Viewer • Updated Mar 28 • 33 • 21

burtenshaw/hub-stats-papers-last-week-2026-03-28

Preview • Updated Mar 28 • 58

burtenshaw/test-rlm-sft

Viewer • Updated Mar 24 • 11 • 8

burtenshaw/community-evals-monitoring

Updated Mar 16 • 2

View 65 datasets