view article Article Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs By wenhuach and 8 others • Apr 29 • 33
view article Article Build AI on premise with Dell Enterprise Hub By jeffboudier and 3 others • May 21, 2024 • 27
view article Article Benchmarking Language Model Performance on 5th Gen Xeon at GCP By MatrixYao and 2 others • Dec 17, 2024 • 5
view article Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others • Jan 28 • 867
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 404
view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference By mfuntowicz and 1 other • Jan 16 • 74