Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
3
Nicholas Stranges
nstranges
Follow
strangeman99
AI & ML interests
Reinforcement learning, robotics, LLM agents.
Recent Activity
updated
a model
3 days ago
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0-RefModel-V2
published
a model
3 days ago
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0-RefModel-V2
updated
a model
21 days ago
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0-RefModel
View all activity
Organizations
None yet
models
7
Sort: Recently updated
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0-RefModel-V2
8B
•
Updated
3 days ago
•
6
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0-RefModel
8B
•
Updated
21 days ago
•
7
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0-V2
8B
•
Updated
22 days ago
•
7
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta0.0-V2
8B
•
Updated
22 days ago
•
6
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-Random
8B
•
Updated
22 days ago
•
4
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta0.0
8B
•
Updated
Jun 27
•
7
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0
8B
•
Updated
Jun 17
•
5
datasets
0
None public yet