AI & ML interests
LLM training in simple, pure C/CUDA
Organization Card
LLMs in simple, pure C/CUDA with no need for 245MB of PyTorch or 107MB of CPython. Developer coordination happens in the Discussions and on Discord, in either the #llmc channel on the Zero to Hero Discord or the #llmdotc channel on the CUDA MODE Discord.
Find the best model created by the llmc community here!
Fun experiments with llm.c
- yuchenj/gpt2_124M_100B_FinewebEdu_hf
  Text Generation • 0.1B • Updated • 28
- yuchenj/gpt2_350M_100B_FinewebEdu_hf
  Text Generation • 0.4B • Updated • 14
- yuchenj/gpt2_774M_100B_FinewebEdu_hf
  Text Generation • 0.8B • Updated • 17 • 1
- yuchenj/gpt2_1558M_100B_FinewebEdu_hf
  Text Generation • 2B • Updated • 11 • 1
Models: 0 (none public yet)
Datasets: 0 (none public yet)