Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model Paper • 2504.08685 • Published 10 days ago • 119
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated 21 days ago • 446
Retentive Network: A Successor to Transformer for Large Language Models Paper • 2307.08621 • Published Jul 17, 2023 • 170