Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
One-RL-to-See-Them-All
/
Orsta-7B
like
9
Follow
One-RL-to-See-Them-All
15
Image-Text-to-Text
Transformers
Safetensors
One-RL-to-See-Them-All/Orsta-Data-47k
English
qwen2_5_vl
VLM
multimodal
reinforcement-learning
conversational
text-generation-inference
arxiv:
2505.18129
License:
mit
Model card
Files
Files and versions
xet
Community
2
Train
Deploy
Use this model
main
Orsta-7B
Commit History
Add project page link (
#2
)
14a1097
verified
ManTle
nielsr
HF Staff
commited on
8 days ago
Add reinforcement-learning tag (
#1
)
bcfb87a
verified
ManTle
nielsr
HF Staff
commited on
10 days ago
Update README.md
f747095
verified
ManTle
commited on
17 days ago
Update README.md
7f16915
verified
ManTle
commited on
17 days ago
Update README.md
f517dac
verified
ManTle
commited on
17 days ago
Update README.md
aa21535
verified
ManTle
commited on
17 days ago
Update README.md
df74f69
verified
ManTle
commited on
17 days ago
Upload folder using huggingface_hub
492ab70
verified
ManTle
commited on
18 days ago
initial commit
12b15fb
verified
ManTle
commited on
18 days ago