Juan CM

jucamohedano
ยท

AI & ML interests

AI Systems MSc at Trento ๐Ÿš€๐Ÿค–

Recent Activity

Organizations

SomosNLP's profile picture scikit-learn's profile picture Hugging Face Discord Community's profile picture

jucamohedano's activity

upvoted an article 4 months ago
view article
Article

Introducing smolagents: simple agents that write actions in code.

By m-ric and 2 others โ€ข
โ€ข 1.06k
upvoted 2 articles 4 months ago
view article
Article

Open-source DeepResearch โ€“ Freeing our search agents

By m-ric and 4 others โ€ข
โ€ข 1.25k
view article
Article

SmolVLM Grows Smaller โ€“ Introducing the 250M & 500M Models!

By andito and 2 others โ€ข
โ€ข 180
upvoted an article about 1 year ago
view article
Article

PaliGemma โ€“ Google's Cutting-Edge Open Vision Language Model

By merve and 2 others โ€ข
โ€ข 253
reacted to merve's post with ๐Ÿš€ about 1 year ago
view post
Post
1776
New open Vision Language Model by @Google : PaliGemma ๐Ÿ’™๐Ÿค

๐Ÿ“ Comes in 3B, pretrained, mix and fine-tuned models in 224, 448 and 896 resolution
๐Ÿงฉ Combination of Gemma 2B LLM and SigLIP image encoder
๐Ÿค— Supported in transformers

PaliGemma can do..
๐Ÿงฉ Image segmentation and detection! ๐Ÿคฏ
๐Ÿ“‘ Detailed document understanding and reasoning
๐Ÿ™‹ Visual question answering, captioning and any other VLM task!

Read our blog ๐Ÿ”– hf.co/blog/paligemma
Try the demo ๐Ÿช€ hf.co/spaces/google/paligemma
Check out the Spaces and the models all in the collection ๐Ÿ“š google/paligemma-release-6643a9ffbf57de2ae0448dda
Collection of fine-tuned PaliGemma models google/paligemma-ft-models-6643b03efb769dad650d2dda
ยท
upvoted an article about 1 year ago
view article
Article

SeeMoE: Implementing a MoE Vision Language Model from Scratch

By AviSoori1x โ€ข
โ€ข 34