
Zjcxy-SmartAI/Eagle-Qwen2.5-14B-Instruct
Updated
•
366
•
2
None defined yet.
Welcome to the Inference Acceleration Team under China Mobile (Zhejiang) Innovation Research Institute. We are dedicated to achieving efficient large model inference on NVIDIA and domestic GPU platforms, with a focus on cutting-edge inference acceleration technologies such as speculative decoding and model quantization. Feel free to explore our space!