David Stanojevic's picture

2

David Stanojevic

david-stan

·

david-stan

AI & ML interests

None yet

Recent Activity

updated a model about 2 months ago

JetBrains-Research/premia-nes-7B-unsloth-mixed-v9-zeta-prompt

published a model about 2 months ago

JetBrains-Research/premia-nes-7B-unsloth-mixed-v9-zeta-prompt

upvoted a paper about 2 months ago

The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

View all activity

Organizations

updated a model about 2 months ago

JetBrains-Research/premia-nes-7B-unsloth-mixed-v9-zeta-prompt

Text Generation • 8B • Updated Nov 6 • 30 • 1

published a model about 2 months ago

JetBrains-Research/premia-nes-7B-unsloth-mixed-v9-zeta-prompt

Text Generation • 8B • Updated Nov 6 • 30 • 1

upvoted a paper about 2 months ago

The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

Paper • 2510.23393 • Published Oct 27 • 20

upvoted a paper 3 months ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

Paper • 2509.25455 • Published Sep 29 • 37

updated a model 10 months ago

david-stan/roberta-large-lora-class

published a model 10 months ago

david-stan/roberta-large-lora-class