Tanay's picture

Tanay

Tanaybh

·

tanaybhardwaj

AI & ML interests

Exploring RLHF/RLAIF techniques, LoRA adapters, and dialogue optimization. Building models that better understand and respond to human intent

Organizations

Tanaybh 's models 9

Tanaybh/microllm-v1

Updated Oct 20, 2025 • 1

Tanaybh/gpt-rope-swiglu

7.88M • Updated Oct 17, 2025 • 1

Tanaybh/nano-gpt-from-scratch

Text Generation • 1.07M • Updated Oct 5, 2025 • 8

Tanaybh/gpt2-rlhf-anthropic

Text Generation • 0.1B • Updated Oct 2, 2025 • 20

Tanaybh/gpt2-got-therapy

Text Generation • 0.1B • Updated Sep 30, 2025 • 4 • 1

Tanaybh/bipedal-walker-ppo

Reinforcement Learning • Updated Sep 21, 2025 • 2

Tanaybh/lunar-lander-ppo

Reinforcement Learning • Updated Sep 21, 2025 • 5

Tanaybh/my-first-lora-trash-model

Updated Sep 3, 2025 • 1

Tanaybh/dialogpt-medium-qlora-alpaca

Updated Sep 3, 2025