MambaVision Collection MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models. โข 13 items โข Updated 1 day ago โข 29
MoshiVis v0.1 Collection MoshiVis is a Vision Speech Model built as a perceptually-augmented version of Moshi v0.1 for conversing about image inputs โข 8 items โข Updated 4 days ago โข 14
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 13 days ago โข 342