placebomancer
placebomancer
ยท
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
Offline Regularised Reinforcement Learning for Large Language Models
Alignment
upvoted
a
paper
about 1 month ago
Concise Reasoning via Reinforcement Learning
new activity
10 months ago
TheDrummer/Tiger-Gemma-9B-v1:Differences between Tiger Gemma, Smegmma and Broken Gemma
Organizations
None yet
models
0
None public yet
datasets
0
None public yet