inference Star Attention: Efficient LLM Inference over Long Sequences Paper • 2411.17116 • Published Nov 26, 2024 • 53
Star Attention: Efficient LLM Inference over Long Sequences Paper • 2411.17116 • Published Nov 26, 2024 • 53
llm_model PygmalionAI/mythalion-13b Text Generation • 13B • Updated Sep 15, 2023 • 1.38k • • 160 Nitral-AI/Poppy_Porpoise-0.72-L3-8B Text Generation • 8B • Updated Jul 4, 2024 • 53 • • 35 Lewdiculous/KukulStanta-7B-GGUF-IQ-Imatrix 7B • Updated Apr 7, 2024 • 621 • 9 Lewdiculous/L3-8B-Stheno-v3.2-GGUF-IQ-Imatrix 8B • Updated Feb 2 • 20k • 208
inference Star Attention: Efficient LLM Inference over Long Sequences Paper • 2411.17116 • Published Nov 26, 2024 • 53
Star Attention: Efficient LLM Inference over Long Sequences Paper • 2411.17116 • Published Nov 26, 2024 • 53
llm_model PygmalionAI/mythalion-13b Text Generation • 13B • Updated Sep 15, 2023 • 1.38k • • 160 Nitral-AI/Poppy_Porpoise-0.72-L3-8B Text Generation • 8B • Updated Jul 4, 2024 • 53 • • 35 Lewdiculous/KukulStanta-7B-GGUF-IQ-Imatrix 7B • Updated Apr 7, 2024 • 621 • 9 Lewdiculous/L3-8B-Stheno-v3.2-GGUF-IQ-Imatrix 8B • Updated Feb 2 • 20k • 208