Kevin

kvnptl

AI & ML interests

Robot perception

Recent Activity

Organizations

None yet

kvnptl's activity

upvoted an article about 1 month ago
view article
Article

SmolVLM - small yet mighty Vision Language Model

239
reacted to maxiw's post with 👍 about 2 months ago
view post
Post
3298
The new Qwen-2 VL models seem to perform quite well in object detection. You can prompt them to respond with bounding boxes in a reference frame of 1k x 1k pixels and scale those boxes to the original image size.

You can try it out with my space maxiw/Qwen2-VL-Detection

·
upvoted an article 2 months ago
view article
Article

Welcome PaliGemma 2 – New vision language models by Google

152