Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
1
26
Jin Zhu
mamba413
Follow
Eehan's profile picture
Kyleyee's profile picture
2 followers
·
2 following
https://mamba413.github.io/
Mamba413
AI & ML interests
reinforcement learning
Recent Activity
liked
a dataset
about 3 hours ago
microsoft/wiki_qa
liked
a dataset
2 days ago
legacy-datasets/wikipedia
upvoted
a
paper
4 days ago
AdaDetectGPT: Adaptive Detection of LLM-Generated Text with Statistical Guarantees
View all activity
Organizations
None yet
Papers
2
arxiv:
2510.01268
arxiv:
2212.14468
models
10
Sort: Recently updated
mamba413/Qwen2.5-1.5B-PPO-DR-HH-Seed1
2B
•
Updated
Mar 21
mamba413/Qwen2.5-1.5B-PPO-BENCH-HH-Seed1
2B
•
Updated
Mar 21
mamba413/Qwen2.5-1.5B-Instruct-Reward-BENCH-HH-Seed1
2B
•
Updated
Mar 21
mamba413/Qwen2.5-1.5B-Instruct-Reward-BENCH-HH-Seed0
Updated
Mar 20
mamba413/Qwen2.5-1.5B-Instruct-Reward-DR-HH-Seed0
Updated
Mar 20
mamba413/Qwen2-0.5B-Reward-DR-HH-Seed0
Text Classification
•
0.5B
•
Updated
Mar 19
mamba413/Qwen2.5-1.5B-Reward-DR-IMDB-Seed0
Updated
Mar 18
mamba413/Qwen2.5-1.5B-Reward-DR-SIMU-Seed0
Updated
Mar 18
mamba413/Qwen2-0.5B-Reward-DR-SIMU-Seed0
Text Classification
•
0.5B
•
Updated
Mar 16
mamba413/Qwen2-0.5B-Reward-DR-SIMU
Text Classification
•
0.5B
•
Updated
Mar 15
datasets
8
Sort: Recently updated
mamba413/GenerateText_Qwen2.5-1.5B-Instruct_GRPO_HH_Seed1
Viewer
•
Updated
Jun 10
•
7.06k
•
12
mamba413/GenerateText_HH_Seed1
Viewer
•
Updated
Mar 25
•
11.8k
•
12
mamba413/GenerateText_HH_Seed1_new
Viewer
•
Updated
Mar 24
•
640
•
16
mamba413/RewardModel-BENCH-HH-Seed1
Viewer
•
Updated
Mar 23
•
64
•
7
mamba413/RewardModel-DR-HH-Seed1
Viewer
•
Updated
Mar 23
•
64
•
6
mamba413/train_data_imdb_simu_valid
Viewer
•
Updated
Mar 16
•
48.1k
•
8
mamba413/train_data_imdb_simu
Viewer
•
Updated
Mar 15
•
48.1k
•
20
mamba413/train_data_imdb
Viewer
•
Updated
Mar 3
•
2
•
6