PommesPeter
AI & ML interests
MM-LLM
Recent Activity
updated a model about 5 hours ago: PommesPeter/dp_ckpts
reacted to IliaLarchenko's post 2 days ago
I am presenting the Decoder-Only Transformer (DOT) Policy, a simple behavioral control policy that outperforms SOTA models on two simple benchmark tasks:
- PushT (pushing an object to a goal): 84% success on keypoints, 74% on images (previous best: 75% / 69%)
- ALOHA Insert (precise bimanual insertion): 30% success (previous best: ~21%)
The best part? DOT is much smaller (sometimes 100 times fewer parameters) than previous SOTA models, trains faster, and avoids complexity:
- No generative models (Diffusion, VAE, GANs)
- No discretization/tokenization of actions
- No reinforcement learning or multi-stage training
- Just learns from human demos, plain and simple
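To make the "plain supervised learning" idea concrete, here is a minimal PyTorch sketch of a decoder-only transformer policy trained by behavioral cloning: observations go in, continuous action chunks come out, and the loss is plain regression against human demonstrations. This is not the author's implementation; all dimensions, class names, and the learned-query design are invented for illustration (see the linked repo for the real code).

```python
import torch
import torch.nn as nn

class DOTPolicySketch(nn.Module):
    """Hypothetical sketch: a decoder-only transformer that maps an
    observation history to a chunk of continuous future actions.
    No diffusion, no action tokenization, no RL."""

    def __init__(self, obs_dim=16, act_dim=2, d_model=64, n_layers=2, horizon=8):
        super().__init__()
        self.obs_proj = nn.Linear(obs_dim, d_model)
        # one learned query per future action step
        self.action_queries = nn.Parameter(torch.zeros(horizon, d_model))
        layer = nn.TransformerDecoderLayer(d_model, nhead=4, batch_first=True)
        self.decoder = nn.TransformerDecoder(layer, num_layers=n_layers)
        self.head = nn.Linear(d_model, act_dim)

    def forward(self, obs_seq):
        # obs_seq: (batch, T_obs, obs_dim) -> actions: (batch, horizon, act_dim)
        memory = self.obs_proj(obs_seq)
        queries = self.action_queries.unsqueeze(0).expand(obs_seq.size(0), -1, -1)
        return self.head(self.decoder(queries, memory))

# Behavioral cloning step: plain MSE against demonstrated action chunks
policy = DOTPolicySketch()
obs = torch.randn(4, 5, 16)          # batch of observation histories
demo_actions = torch.randn(4, 8, 2)  # matching human demo action chunks
loss = nn.functional.mse_loss(policy(obs), demo_actions)
loss.backward()
```

The point of the sketch is that the whole training signal is one regression loss on continuous actions, which is what removes the generative-model and tokenization machinery listed above.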
This is still early: more complex real-life tasks need testing, and there are no guarantees it will work well there, but I think it's worth sharing. Sometimes simpler approaches can be just as effective (or even better) than complex ones.
Open-source code and detailed description: https://github.com/IliaLarchenko/dot_policy
Trained models on Hugging Face:
https://huggingface.co/IliaLarchenko/dot_pusht_keypoints
https://huggingface.co/IliaLarchenko/dot_pusht_images
https://huggingface.co/IliaLarchenko/dot_bimanual_insert