lei's picture

1 2 1

lei

lqlz

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Generic Token Compression in Multimodal Large Language Models from an Explainability Perspective

liked a dataset 6 months ago

lmms-lab/LLaVA-Video-178K

new activity 7 months ago

llava-hf/llava-onevision-qwen2-0.5b-ov-hf:Error when attempting to run either model... ValueError: embed_dim must be divisible by num_heads (got `embed_dim`: 1152 and `num_heads`: 14).

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

Generic Token Compression in Multimodal Large Language Models from an Explainability Perspective

Paper • 2506.01097 • Published Jun 1 • 3

upvoted a collection 7 months ago

LLaVA-OneVision

a model good at arbitrary types of visual input • 15 items • Updated Oct 5, 2024 • 25