Just tried LitServe from the good folks at @LightningAI!

Between llama.cpp and vLLM there is a small gap where a few large models are not easily deployable, and that's where LitServe comes in. LitServe is a high-throughput serving engine for AI models built on FastAPI. Yes, built on FastAPI: that's where both the advantage and the issue lie. It's extremely flexible and supports multi-modality and a variety of models out of the box, but in my testing it lags far behind vLLM in speed. Also, no OpenAI API-compatible endpoint is available as of now.

Still, as we move toward multi-modal models and agents, this is a good starting point. It just has to get faster.

GitHub: https://github.com/Lightning-AI/LitServe
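To give a sense of the flexibility mentioned above, here is a minimal sketch of a LitServe server, based on the `LitAPI`/`LitServer` interface from the project's documentation. The `EchoAPI` class and its trivial "model" are hypothetical stand-ins for illustration, not part of the original post.

```python
import litserve as ls


class EchoAPI(ls.LitAPI):
    def setup(self, device):
        # Load the model once per worker; a real deployment would load
        # weights here. This identity function is a hypothetical stand-in.
        self.model = lambda x: x

    def decode_request(self, request):
        # Pull the payload out of the incoming JSON request.
        return request["input"]

    def predict(self, x):
        # Run inference on the decoded input.
        return self.model(x)

    def encode_response(self, output):
        # Wrap the raw prediction into a JSON-serializable response.
        return {"output": output}


if __name__ == "__main__":
    # FastAPI/uvicorn run under the hood, which is where the flexibility
    # (and, per the post, the throughput gap versus vLLM) comes from.
    server = ls.LitServer(EchoAPI(), accelerator="auto")
    server.run(port=8000)
```

Because the request/response hooks are plain Python methods, swapping in a vision or audio model only means changing `setup`, `decode_request`, and `encode_response`, which is what makes the multi-modal story straightforward compared to engines specialized for text.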