Post
2450
Gemma-3-4B : Image and Video Inference πΌοΈπ₯
π§€Space: prithivMLmods/Gemma-3-Multimodal
π₯ Git : https://github.com/PRITHIVSAKTHIUR/Gemma-3-Multimodal
@gemma3 : {Tag + Space_+ 'prompt'}
@video-infer : {Tag + Space_+ 'prompt'}
+ Gemma3-4B : google/gemma-3-4b-it
+ By default, it runs : prithivMLmods/Qwen2-VL-OCR-2B-Instruct
Gemma 3 Technical Report : https://storage.googleapis.com/deepmind-media/gemma/Gemma3Report.pdf
π§€Space: prithivMLmods/Gemma-3-Multimodal
π₯ Git : https://github.com/PRITHIVSAKTHIUR/Gemma-3-Multimodal
@gemma3 : {Tag + Space_+ 'prompt'}
@video-infer : {Tag + Space_+ 'prompt'}
+ Gemma3-4B : google/gemma-3-4b-it
+ By default, it runs : prithivMLmods/Qwen2-VL-OCR-2B-Instruct
Gemma 3 Technical Report : https://storage.googleapis.com/deepmind-media/gemma/Gemma3Report.pdf