LEGION X
faisalhr1997
AI & ML interests
None yet
Recent Activity
reacted to salma-remyx's post with 🔥 22 days ago
SpaceThinker-Qwen2.5VL-3B shows a 3B VLM can compete with closed, frontier APIs in quantitative spatial reasoning, a key capability for embodied AI applications like drones and robotics.
Check out how it stacks up against Gemini and OpenAI on Q-Spatial-Bench in the model card. Includes .gguf, a Colab quickstart, and Docker images.
SpaceThinker adopts the Qwen2.5VL-3B architecture, fine-tuned on the SpaceThinker dataset of synthetic spatial reasoning traces created with VQASynth.
This model builds on the SpaceLLaVA series of VLMs, which are fine-tuned on synthetic data for enhanced spatial reasoning, by adding test-time compute for multimodal thinking.
Model: https://huggingface.co/remyxai/SpaceThinker-Qwen2.5VL-3B
Dataset: https://huggingface.co/datasets/remyxai/SpaceThinker
Space: https://huggingface.co/spaces/remyxai/SpaceThinker-Qwen2.5VL-3B
Code: https://github.com/remyxai/VQASynth
Discussion: https://huggingface.co/spaces/open-r1/README/discussions/10
liked a model about 1 month ago
reducto/RolmOCR
liked a Space 2 months ago
TheStinger/UVR5_UI
Organizations
None yet
faisalhr1997's activity
Demo
🔥 5
6
#5 opened 11 months ago by merve

Llama CPP GGUF version of Autocoder?
2
#1 opened 12 months ago by Ekolawole
Epoch 3 final? Or 4 coming?
1
#6 opened almost 2 years ago by faisalhr1997

upload large v2
2
#1 opened about 2 years ago by faisalhr1997

.Bin format for embedding
2
#4 opened about 2 years ago by afshin2098
Was it trained on NSFW or SFW version of N_I?
4
#1 opened over 2 years ago by alexds9
Add Diffusers weights
#2 opened over 2 years ago by faisalhr1997
