view article Article Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques By jmamou and 8 others • 5 days ago • 15
view article Article Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques By jmamou and 8 others • 5 days ago • 15
Term Set Expansion based on Multi-Context Term Embeddings: an End-to-end Workflow Paper • 1807.10104 • Published Jul 26, 2018 • 1
ABSApp: A Portable Weakly-Supervised Aspect-Based Sentiment Extraction System Paper • 1909.05608 • Published Sep 12, 2019
Cross-Domain Aspect Extraction using Transformers Augmented with Knowledge Graphs Paper • 2210.10144 • Published Oct 18, 2022
Accelerating Speculative Decoding using Dynamic Speculation Length Paper • 2405.04304 • Published May 7, 2024 • 2
view article Article Universal Assisted Generation: Faster Decoding with Any Assistant Model By danielkorat and 7 others • Oct 29, 2024 • 54
view article Article Faster Assisted Generation with Dynamic Speculation By jmamou and 6 others • Oct 8, 2024 • 46
view article Article Faster Assisted Generation with Dynamic Speculation By jmamou and 6 others • Oct 8, 2024 • 46
Distributed Speculative Inference of Large Language Models Paper • 2405.14105 • Published May 23, 2024 • 18
view article Article Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon By danielkorat and 5 others • Apr 3, 2024 • 11
view article Article Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding By ofirzaf and 10 others • Jan 30, 2024 • 9
view article Article SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit By ronenlap and 5 others • Dec 6, 2023 • 8