Hugging Face
PeterChan
peterchanjaon
0 followers · 1 following
AI & ML interests
LLM
Recent Activity
upvoted a paper, 12 days ago: Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference Model
liked a dataset, 15 days ago: cfahlgren1/react-code-instructions
liked a dataset, 3 months ago: hkust-nlp/CodeIO-PyEdu-Reasoning
Organizations
None yet
Models (3)
peterchanjaon/q-Taxi-v3 • Reinforcement Learning • Updated Oct 20, 2024
peterchanjaon/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning • Updated Oct 20, 2024
peterchanjaon/lunartest • Reinforcement Learning • Updated Oct 11, 2024 • 8
Datasets (0)
None public yet