SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models Paper • 2502.09390 • Published Feb 13 • 16
Speculative Decoding Draft Models Collection Collection of OpenVINO optimized efficient draft models for speculative decoding • 2 items • Updated Dec 23, 2024 • 9
view article Article A Chatbot on your Laptop: Phi-2 on Intel Meteor Lake By juliensimon and 5 others • Mar 20, 2024 • 6
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation Paper • 2408.02545 • Published Aug 5, 2024 • 37
view article Article A Chatbot on your Laptop: Phi-2 on Intel Meteor Lake By juliensimon and 5 others • Mar 20, 2024 • 6
view article Article Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding By ofirzaf and 10 others • Jan 30, 2024 • 9
An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs Paper • 2306.16601 • Published Jun 28, 2023 • 4
Running on CPU Upgrade 12.8k 12.8k Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots
Intel/distilbert-base-uncased-squadv1.1-sparse-80-1x4-block-pruneofa Question Answering • Updated Sep 20, 2022 • 13
Intel/bert-large-uncased-squadv1.1-sparse-80-1x4-block-pruneofa Question Answering • Updated Aug 1, 2022 • 26 • 1