geronimo PRO
g-ronimo
AI & ML interests
fafo
Recent Activity
liked
a model
6 days ago
nielsr/my-awesome-nanovlm-model
Organizations
g-ronimo's activity
vidore/colpali-v1.3
#1541 opened 12 days ago
by
g-ronimo

Qwen/Qwen3-30B-A3B
š„
1
#1316 opened 18 days ago
by
g-ronimo

Qwen/Qwen3-0.6B
š
29
#1299 opened 18 days ago
by
g-ronimo

Add License?
1
#2 opened 4 months ago
by
g-ronimo

GPL-2/CC-BY-SA/etc. copyleft content
3
#1 opened 5 months ago
by
mjbommar
Apply for community grant: Academic project (gpu and storage)
4
#1 opened 6 months ago
by
Hmrishav

Fine-tuning options?
7
#10 opened about 1 year ago
by
yukiarimo

How to access function calling capabilities?
š
š
3
3
#15 opened about 1 year ago
by
pyraminded
No base model?
2
#45 opened about 1 year ago
by
ucalyptus

Batch inference seems to be done sequentially
š
10
3
#50 opened almost 2 years ago
by
yard1
tokenizer doesn't work with the old API ?
2
#43 opened about 1 year ago
by
teddyyyy123
How can I run it on multiple GPUs?
š
3
11
#181 opened about 1 year ago
by
barbery
[Support] Community Articles
š
š¤
1
83
#5 opened about 1 year ago
by
victor

Note on adding new elements to the vocabulary
š
1
2
#21 opened about 1 year ago
by
johnhew

RuntimeError: FlashAttention backward for head dim > 192 requires A100/A800 or H100/H800
š
1
3
#18 opened about 1 year ago
by
g-ronimo

mistral 7b instruct v0.2
4
#4 opened over 1 year ago
by
cognitivetech

Pretrain?
3
#125 opened over 1 year ago
by
limha
How to finetune the model?
2
#129 opened over 1 year ago
by
akasranjan
Upvoting a blog post
š
25
7
#4 opened over 1 year ago
by
santiviquez

qlora adapter merge
1
#1 opened over 1 year ago
by
ajmoreno