geronimo's picture

geronimo PRO

g-ronimo

·

https://medium.com/@geronimo7

geronimi73

AI & ML interests

fafo

Organizations

g-ronimo's activity

New activity in huggingface/InferenceSupport about 1 month ago

vidore/colpali-v1.3

#1541 opened about 1 month ago by

Qwen/Qwen3-30B-A3B

#1316 opened about 1 month ago by

Qwen/Qwen3-0.6B

#1299 opened about 1 month ago by

New activity in mit-han-lab/dc-ae-f32c32-sana-1.0 5 months ago

Add License?

#2 opened 5 months ago by

New activity in PleIAs/Pleias-350m-Preview 6 months ago

GPL-2/CC-BY-SA/etc. copyleft content

#1 opened 6 months ago by

New activity in Hmrishav/FlipSketch 6 months ago

Apply for community grant: Academic project (gpu and storage)

#1 opened 6 months ago by

New activity in apple/OpenELM about 1 year ago

Fine-tuning options?

#10 opened about 1 year ago by

New activity in cognitivecomputations/dolphin-2.9-llama3-8b about 1 year ago

How to access function calling capabilities?

#15 opened about 1 year ago by

New activity in microsoft/Phi-3-mini-128k-instruct about 1 year ago

No base model?

#45 opened about 1 year ago by

New activity in tiiuae/falcon-40b about 1 year ago

Batch inference seems to be done sequentially

#50 opened almost 2 years ago by

New activity in meta-llama/Meta-Llama-3-8B about 1 year ago

tokenizer doesn't work with the old API ?

#43 opened about 1 year ago by

New activity in mistralai/Mixtral-8x7B-Instruct-v0.1 about 1 year ago

How can I run it on multiple GPUs?

#181 opened about 1 year ago by

New activity in blog-explorers/README about 1 year ago

[Support] Community Articles

#5 opened about 1 year ago by

New activity in google/gemma-2b over 1 year ago

Note on adding new elements to the vocabulary

#21 opened over 1 year ago by

New activity in google/gemma-7b over 1 year ago

RuntimeError: FlashAttention backward for head dim > 192 requires A100/A800 or H100/H800

#18 opened over 1 year ago by

New activity in cognitivecomputations/samantha-mistral-7b over 1 year ago

mistral 7b instruct v0.2

#4 opened over 1 year ago by

New activity in mistralai/Mistral-7B-v0.1 over 1 year ago

Pretrain?

#125 opened over 1 year ago by

New activity in mistralai/Mixtral-8x7B-Instruct-v0.1 over 1 year ago

How to finetune the model?

#129 opened over 1 year ago by

New activity in blog-explorers/README over 1 year ago

Upvoting a blog post

#4 opened over 1 year ago by

New activity in g-ronimo/phi-2-OpenHermes-2.5 over 1 year ago

qlora adapter merge

#1 opened over 1 year ago by