ConvLLaVA A collection of ConvLLaVA models. ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models Paper • 2405.15738 • Published May 24, 2024 • 46 ConvLLaVA/ConvLLaVA-sft-768 Text Generation • Updated May 28, 2024 • 16 • 1 ConvLLaVA/ConvLLaVA-sft-1024 Text Generation • Updated May 28, 2024 • 10 ConvLLaVA/ConvLLaVA-sft-1536 Text Generation • Updated May 28, 2024 • 8
ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models Paper • 2405.15738 • Published May 24, 2024 • 46
ConvLLaVA A collection of ConvLLaVA models. ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models Paper • 2405.15738 • Published May 24, 2024 • 46 ConvLLaVA/ConvLLaVA-sft-768 Text Generation • Updated May 28, 2024 • 16 • 1 ConvLLaVA/ConvLLaVA-sft-1024 Text Generation • Updated May 28, 2024 • 10 ConvLLaVA/ConvLLaVA-sft-1536 Text Generation • Updated May 28, 2024 • 8
ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models Paper • 2405.15738 • Published May 24, 2024 • 46