sravanthi pulijala
sravanthib
AI & ML interests
None yet
Recent Activity
Updated a model 8 days ago: sravanthib/testing
Updated a model 13 days ago: sravanthib/custom-accele-Qwen-4b-500-steps
Published a model 13 days ago: sravanthib/custom-accele-Qwen-4b-500-steps
Organizations
None yet
sravanthib's models (163)
sravanthib/Qwen-base-open-RL • Updated Mar 20
sravanthib/tool_llama_test • Updated Mar 20
sravanthib/qwen-72b-base • Updated Mar 19
sravanthib/with_accelarate_output_Qwen2-0.5B-GRPO-test • Updated Mar 19
sravanthib/Qwen-GRPO • Updated Mar 17
sravanthib/new-Qwen-2.5-7b-non-math-Simple-RL • Updated Mar 16
sravanthib/Qwen-2.5-7b-non-math-Simple-RL • 8B • Updated Mar 16 • 1
sravanthib/Llama-Simple-RL • 8B • Updated Mar 16 • 2
sravanthib/Last-Llama-Simple-RL • Updated Mar 15
sravanthib/llama3-8b-math-solver • Updated Mar 15
sravanthib/Last-Qwen-2.5-7B-Simple-RL • 8B • Updated Mar 15 • 2
sravanthib/Qwen-2.5-7B-Simple-RL • Text Generation • 8B • Updated Mar 15 • 2
sravanthib/Qwen-math-open-RL • Updated Mar 14
sravanthib/Qwen-math-Simple-RL • Updated Mar 14
sravanthib/qwen-32b-multinode-try • Updated Mar 13
sravanthib/new-multinode-try • Updated Mar 13
sravanthib/multinode-try • Updated Mar 13
sravanthib/with_accelerate_output_Qwen2-0.5B-GRPO-test • Updated Mar 13
sravanthib/tokenizer-aded-Llama3.1-8b-instruct-RL • Updated Mar 13
sravanthib/single_node_llama_custom-code-test • Updated Mar 12
sravanthib/Final-try-Llama3.1-8b-instruct-RL • Text Generation • 8B • Updated Mar 11 • 2
sravanthib/grpo-output • Updated Mar 11
sravanthib/Simple-RL • Updated Mar 11
sravanthib/SFT_and_RL_final-Simple-RL • Updated Mar 10
sravanthib/llama-3b-Simple-RL • Updated Mar 10
sravanthib/RL_on_SFT • Updated Mar 10
sravanthib/Monday_SFT_and_RLinstruct-Llama-3.1-8B-open-RL • Updated Mar 10
sravanthib/SFT_and_RLinstruct-Llama-3.1-8B-open-RL • Updated Mar 9
sravanthib/grpo_finetuned • Updated Mar 9
sravanthib/weights_sft_new_grpo-output • Updated Mar 9