Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4.6
TFLOPS
Tanay
PRO
Tanaybh
Follow
yunihg's profile picture
karlpeterson's profile picture
sudanenator's profile picture
9 followers
·
5 following
tanaybhardwaj
AI & ML interests
Exploring RLHF/RLAIF techniques, LoRA adapters, and dialogue optimization. Building models that better understand and respond to human intent
Recent Activity
updated
a model
5 days ago
Tanaybh/microllm-v1
published
a model
5 days ago
Tanaybh/microllm-v1
updated
a model
8 days ago
Tanaybh/gpt-rope-swiglu
View all activity
Organizations
Tanaybh
's models
9
Sort: Recently updated
Tanaybh/microllm-v1
Updated
5 days ago
•
13
Tanaybh/gpt-rope-swiglu
7.88M
•
Updated
8 days ago
•
516
Tanaybh/nano-gpt-from-scratch
Text Generation
•
1.07M
•
Updated
20 days ago
•
394
Tanaybh/gpt2-rlhf-anthropic
Text Generation
•
0.1B
•
Updated
24 days ago
•
12
Tanaybh/gpt2-got-therapy
Text Generation
•
0.1B
•
Updated
26 days ago
•
16
Tanaybh/bipedal-walker-ppo
Reinforcement Learning
•
Updated
Sep 21
•
14
Tanaybh/lunar-lander-ppo
Reinforcement Learning
•
Updated
Sep 21
•
10
Tanaybh/my-first-lora-trash-model
Updated
Sep 3
•
5
Tanaybh/dialogpt-medium-qlora-alpaca
Updated
Sep 3
•
6