sravanthi pulijala
sravanthib
AI & ML interests
None yet
Recent Activity
Updated a model 7 days ago: sravanthib/testing
Updated a model 13 days ago: sravanthib/custom-accele-Qwen-4b-500-steps
Published a model 13 days ago: sravanthib/custom-accele-Qwen-4b-500-steps
Organizations
None yet
sravanthib's models (163)
sravanthib/RLFireworks270steps • 8B • Updated May 9 • 3
sravanthib/RLFireworks_caluse_code_40steps • 8B • Updated May 6 • 3
sravanthib/RLFireworks290steps • 8B • Updated May 4 • 3
sravanthib/fireworks290steps • Updated May 4
sravanthib/RL_fireworks_290steps • 8B • Updated May 4 • 3
sravanthib/RL_fireworks_300steps • 8B • Updated May 4 • 3
sravanthib/RL_fireworks_310steps • 8B • Updated Apr 30 • 3
sravanthib/longcontext_RL_GRPO • 8B • Updated Apr 25 • 3
sravanthib/RL_on_longcontext_SFT_Qwen2.5_Simple-RL • Updated Apr 25
sravanthib/RL-glaive-steps • 8B • Updated Apr 25 • 3
sravanthib/filetred-dataset-checkpoint10 • 8B • Updated Apr 23 • 3
sravanthib/function-calling-Finetuned-RL-llama-100-steps • 8B • Updated Apr 23 • 3
sravanthib/Finetuned-qwen-2.5-7b-instruct-Nemo-10000steps • 8B • Updated Apr 15 • 3
sravanthib/NeMo-qwen-7b-merged • 8B • Updated Apr 13 • 3
sravanthib/OpenR1-Qwen2.5-7B-SFT • 8B • Updated Apr 7 • 3
sravanthib/RL-10steps • 8B • Updated Apr 6 • 3
sravanthib/new-OpenR1-llama3.1-8b-SFT • 8B • Updated Apr 6 • 3
sravanthib/OpenR1-Qwen-2.5-Math-instruct-SFT • Updated Apr 6
sravanthib/OpenR1-llama3.1-8b-SFT • 8B • Updated Apr 4 • 3
sravanthib/RL-tuned-toolcalls-100records • 8B • Updated Apr 2 • 3
sravanthib/new-Qwen2.5-7B-Base-GRPO • Updated Apr 2
sravanthib/Qwen2.5-7B-Base-GRPO • Updated Apr 2
sravanthib/with_deepspeed_llama_RL_on_SFT • Updated Apr 1
sravanthib/Qwen-2.5-7B-Base-Only-RL-without-SFT • Updated Mar 29
sravanthib/function_calling_RL • Updated Mar 24
sravanthib/Base-2-Qwen-7B-GRPO • Updated Mar 23
sravanthib/Base-Qwen-7B-GRPO • Updated Mar 23
sravanthib/llama-toolcall • Updated Mar 21
sravanthib/non-math-Simple-RL • 8B • Updated Mar 20 • 2
sravanthib/qwen-base-RL • Updated Mar 20