Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
12
Nicholas Stranges
nstranges
Follow
0 followers
·
1 following
strangeman99
AI & ML interests
Reinforcement learning, robotics, LLM agents.
Recent Activity
liked
a dataset
5 days ago
osunlp/Mind2Web-2
liked
a dataset
5 days ago
openai/gdpval
liked
a dataset
about 1 month ago
open-r1/DAPO-Math-17k-Processed
View all activity
Organizations
None yet
models
10
Sort: Recently updated
nstranges/smollm2-finetuned-chat-instruct-lora-adapters
Updated
Nov 22, 2025
nstranges/CSC2516-HW10-Original-Model
0.1B
•
Updated
Nov 21, 2025
•
1
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-Random-V2
8B
•
Updated
Sep 21, 2025
•
2
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0-RefModel-V2
8B
•
Updated
Sep 12, 2025
•
2
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0-RefModel
8B
•
Updated
Aug 26, 2025
•
1
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0-V2
8B
•
Updated
Aug 25, 2025
•
2
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta0.0-V2
8B
•
Updated
Aug 25, 2025
•
1
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-Random
8B
•
Updated
Aug 24, 2025
•
1
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta0.0
8B
•
Updated
Jun 27, 2025
•
3
nstranges/Meta-Llama-3-8B-Instruct-OnlineDPO-WIM-Zeta1.0
8B
•
Updated
Jun 17, 2025
•
1
datasets
0
None public yet