Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
2
1
MZ
Shahradmz
Follow
0 followers
·
3 following
https://emzedi.github.io/website/#
EMZEDI
AI & ML interests
LLMs, Graph Learning, Temporal Graph Learning, RL, Continual RL, Optimization
Recent Activity
updated
a dataset
about 2 months ago
LifelongAlignment/aifgen-merged
updated
a collection
about 2 months ago
AIFGEN
published
a dataset
about 2 months ago
LifelongAlignment/aifgen-merged
View all activity
Organizations
Shahradmz
's models
115
Sort: Recently updated
Shahradmz/Qwen2.5-0.5B-Instruct_cppo-reward_REWARD_1
0.5B
•
Updated
May 12
•
5
Shahradmz/Qwen2.5-0.5B-Instruct_cppo-reward_REWARD_0
0.5B
•
Updated
May 12
•
8
Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_CPPO_1
Updated
May 1
Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_CPPO_0
Updated
May 1
Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_PPO_1
Updated
Apr 28
Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_PPO_0
Updated
Apr 28
Shahradmz/Qwen2-1.5B-Instruct_cppo-reward_REWARD_0
2B
•
Updated
Mar 25
•
6
Shahradmz/Qwen2-1.5B-Instruct_cppo-reward_REWARD_1
Updated
Mar 25
Shahradmz/Qwen2-0.5B-Reward_debug_mas
Text Classification
•
0.5B
•
Updated
Mar 19
•
9
Shahradmz/Qwen2-0.5B-Reward
Updated
Mar 19
Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_PPO_EWC_1
Updated
Mar 19
Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_PPO_EWC_0
Updated
Mar 19
Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_REWARD_0
Updated
Mar 12
•
1
Shahradmz/Qwen2-0.5B-Instruct_continual_data_debug_REWARD_1
Updated
Mar 12
Shahradmz/Qwen2-0.5B-Reward-LoRA
Updated
Mar 3
Shahradmz/llama8b_SEND_1B-alpaca-5
Text Generation
•
1B
•
Updated
Feb 13
•
10
•
Shahradmz/llama8b_SEND_1B-legalbench-5
Text Generation
•
1B
•
Updated
Feb 13
•
32
•
Shahradmz/llama8b_SEND_1B-codesearchnet-5
Text Generation
•
1B
•
Updated
Feb 13
•
44
•
Shahradmz/llama8b_SEND_1B-helm-5
Text Generation
•
1B
•
Updated
Feb 13
•
10
•
Shahradmz/llama8b_SEND_1B-alpaca-4
Text Generation
•
1B
•
Updated
Feb 13
•
10
•
Shahradmz/llama8b_SEND_1B-codesearchnet-4
Text Generation
•
1B
•
Updated
Feb 13
•
9
•
Shahradmz/llama8b_SEND_1B-legalbench-4
Updated
Feb 13
Shahradmz/llama8b_SEND_1B-helm-4
Text Generation
•
1B
•
Updated
Feb 13
•
7
•
Shahradmz/llama8b_SEND_1B-legalbench-3
Text Generation
•
1B
•
Updated
Feb 13
•
10
•
Shahradmz/llama8b_SEND_1B-codesearchnet-3
Text Generation
•
1B
•
Updated
Feb 13
•
10
•
Shahradmz/llama8b_SEND_1B-alpaca-3
Text Generation
•
1B
•
Updated
Feb 13
•
45
•
Shahradmz/llama8b_SEND_1B-helm-3
Text Generation
•
1B
•
Updated
Feb 13
•
9
•
Shahradmz/llama8b_SEND_1B-legalbench-2
Text Generation
•
1B
•
Updated
Feb 13
•
8
•
Shahradmz/llama8b_SEND_1B-alpaca-2
Text Generation
•
1B
•
Updated
Feb 13
•
25
•
Shahradmz/llama8b_SEND_1B-codesearchnet-2
Text Generation
•
1B
•
Updated
Feb 13
•
31
•
Previous
1
2
3
4
Next