Vaidik
VaidikML0508
AI & ML interests
exploring another way to use gradient decent
Recent Activity
liked
a model
26 days ago
manycore-research/SpatialLM-Llama-1B
new activity
about 1 month ago
VaidikML0508/SharkTank-Offer-V1:[bot] Conversion to Parquet
published
a model
about 1 month ago
VaidikML0508/llama3.2-3B-Instruct-DPO-16bits-V1
Organizations
None yet
Collections
1
models
14
VaidikML0508/llama3.2-3B-Instruct-DPO-16bits-V1
Text Generation
•
Updated
•
5
VaidikML0508/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
VaidikML0508/Reinforce-pixel-copte-1
Reinforcement Learning
•
Updated
VaidikML0508/Reinforce-pixel-copter
Updated
VaidikML0508/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
VaidikML0508/ML-Agents-Pyramids
Reinforcement Learning
•
Updated
•
9
VaidikML0508/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
VaidikML0508/taxi-V3
Reinforcement Learning
•
Updated
VaidikML0508/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
VaidikML0508/ppo-SnowballTarget
Reinforcement Learning
•
Updated