Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
One-RL-to-See-Them-All
/
Orsta-7B
like
10
Follow
One-RL-to-See-Them-All
18
Image-Text-to-Text
Transformers
Safetensors
One-RL-to-See-Them-All/Orsta-Data-47k
English
qwen2_5_vl
image-to-text
VLM
multimodal
reinforcement-learning
conversational
text-generation-inference
arxiv:
2505.18129
License:
mit
Model card
Files
Files and versions
xet
Community
2
Train
Deploy
Use this model
main
Orsta-7B
Commit History
Add project page link (
#2
)
14a1097
verified
ManTle
nielsr
HF Staff
commited on
Jun 4
Add reinforcement-learning tag (
#1
)
bcfb87a
verified
ManTle
nielsr
HF Staff
commited on
Jun 2
Update README.md
f747095
verified
ManTle
commited on
May 26
Update README.md
7f16915
verified
ManTle
commited on
May 26
Update README.md
f517dac
verified
ManTle
commited on
May 26
Update README.md
aa21535
verified
ManTle
commited on
May 26
Update README.md
df74f69
verified
ManTle
commited on
May 26
Upload folder using huggingface_hub
492ab70
verified
ManTle
commited on
May 25
initial commit
12b15fb
verified
ManTle
commited on
May 25