Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
Reda alami
RedaAlami
Follow
21world's profile picture
mouadjer's profile picture
PaoloM's profile picture
8 followers
·
3 following
AI & ML interests
Reinforcement Learning
Recent Activity
updated
a dataset
7 days ago
RedaAlami/stage1_76k_final
published
a dataset
7 days ago
RedaAlami/stage1_76k_final
updated
a dataset
7 days ago
RedaAlami/stage1_76k_v3
View all activity
Organizations
spaces
1
Sleeping
TestRecommenderSystem
👁
models
13
Sort: Recently updated
RedaAlami/Falcon3-7B-Instruct-OpenR1-Math
Text Generation
•
Updated
23 days ago
•
71
RedaAlami/Qwen-2.5-7B-Simple-RL
Updated
Feb 15
RedaAlami/Falcon3-7B-Instruct-Distill-DS-v1
Text Generation
•
Updated
Feb 12
•
53
RedaAlami/Qwen2-0.5B-GRPO-test
Updated
Feb 10
RedaAlami/merged-dataset0-dataset1
Updated
Aug 28, 2024
RedaAlami/zephyr-7b-gemma-dpo
Updated
Jul 31, 2024
•
3
RedaAlami/ultrafeedback_binarized_custom2
Updated
Jul 17, 2024
RedaAlami/ultrafeedback_binarized_custom
Updated
Jul 17, 2024
RedaAlami/ultrafeedback_binarized_processed
Updated
Jul 12, 2024
RedaAlami/falcon-11b-instruct-dpo-full
Updated
Jul 1, 2024
Expand 13 models
datasets
147
Sort: Recently updated
RedaAlami/stage1_76k_final
Viewer
•
Updated
7 days ago
•
75.9k
•
28
RedaAlami/stage1_76k_v3
Viewer
•
Updated
7 days ago
•
75.9k
•
12
RedaAlami/OpenR1-Math-split-v2
Viewer
•
Updated
21 days ago
•
93.7k
•
131
RedaAlami/OpenR1-Math-split-v1
Viewer
•
Updated
28 days ago
•
93.7k
•
134
RedaAlami/OpenR1-Math-split-modified
Viewer
•
Updated
28 days ago
•
93.7k
•
89
RedaAlami/OpenR1-Math-split
Viewer
•
Updated
28 days ago
•
93.7k
•
129
RedaAlami/OpenR1-Math-220k-default-50percent
Viewer
•
Updated
Feb 22
•
46.9k
•
95
RedaAlami/OpenR1-Math-220k-default
Viewer
•
Updated
Feb 21
•
93.7k
•
140
RedaAlami/merged-dpo-safety
Viewer
•
Updated
Feb 3
•
3.95k
•
42
RedaAlami/eng-batch-3-dpo-safety_test
Viewer
•
Updated
Feb 3
•
36
•
43
Expand 147 datasets