placebomancer's picture

2 2 1

placebomancer

placebomancer

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Offline Regularised Reinforcement Learning for Large Language Models Alignment

upvoted a paper about 1 month ago

Concise Reasoning via Reinforcement Learning

new activity 10 months ago

TheDrummer/Tiger-Gemma-9B-v1:Differences between Tiger Gemma, Smegmma and Broken Gemma

View all activity

Organizations

None yet

placebomancer's activity

upvoted 2 papers about 1 month ago

Offline Regularised Reinforcement Learning for Large Language Models Alignment

Paper • 2405.19107 • Published May 29, 2024 • 15

Concise Reasoning via Reinforcement Learning

Paper • 2504.05185 • Published Apr 7 • 2

New activity in TheDrummer/Tiger-Gemma-9B-v1 10 months ago

Differences between Tiger Gemma, Smegmma and Broken Gemma

#1 opened 10 months ago by

liked a Space 10 months ago

Gemma 2 llama.cpp 2B/9B/27B

Chat with Gemma 2 for text-based conversations

New activity in open-llm-leaderboard/open_llm_leaderboard 10 months ago

WizardLM-8x22B Evaluation failed

#823 opened 11 months ago by