Jeremy Udit

jcudit

jcudit

AI & ML interests

None yet

Recent Activity

upvoted an article 4 days ago

Efficient Request Queueing – Optimizing LLM Performance

upvoted an article 4 days ago

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

upvoted an article 4 days ago

How Long Prompts Block Other Requests - Optimizing LLM Performance

View all activity

Organizations

upvoted 3 articles 4 days ago

Article

Efficient Request Queueing – Optimizing LLM Performance

•

Apr 2

• 13

Article

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

•

Apr 16

• 25

Article

How Long Prompts Block Other Requests - Optimizing LLM Performance

•

Jun 12

• 5

liked a model 5 days ago

MadeAgents/Hammer2.1-3b

3B • Updated Jun 12 • 941 • 16

upvoted 3 articles 11 days ago

Article

What's Software 3.0? (Spoiler: You're Already Using It)

•

Jun 19

• 2

Article

Advanced Context Engineering for LLM Agents

•

28 days ago

• 1

Article

What Coding Agent Wins?

and 1 other •

29 days ago

• 7

upvoted 2 articles 13 days ago

Article

MCP is at a Tipping Point: Here's Why You Should Care

•

Jun 10

• 17

Article

ScreenEnv: Deploy your full stack Desktop Agent

and 1 other •

16 days ago

• 51

upvoted an article 16 days ago

Article

Nano-vLLM meets Inference Endpoints

•

about 1 month ago

• 9

liked a Space 18 days ago

1.17k

chat-ui

🔥

Redirect to HuggingChat for chatting

published an article 18 days ago

Article

Three Mighty Alerts Supporting Hugging Face’s Production Infrastructure

•

18 days ago

• 8

upvoted an article 19 days ago

Article

Transformers Are Getting Old: Variants and Alternatives Exist!

•

21 days ago

• 42

liked 2 models 22 days ago

Qwen/Qwen2.5-72B-Instruct

Text Generation • 73B • Updated Jan 12 • 95.6k • • 846

Qwen/Qwen2.5-Coder-32B

Text Generation • 33B • Updated Nov 18, 2024 • 6.03k • • 133

upvoted an article 22 days ago

Article

Should We Still Pretrain Encoders with Masked Language Modeling?

and 3 others •

24 days ago

• 21

published a Space 23 days ago

Huggingchat

🚀

reacted to merve's post with 🚀 25 days ago

Post

2536

so many multimodal releases these days 🤠
> ERNIE-4.5-VL: new vision language MoE models by Baidu https://huggingface.co/models?search=ernie-4.5-vl
> new visual document retrievers by NVIDIA (sota on ViDoRe!) nvidia/llama-nemoretriever-colembed-3b-v1 nvidia/llama-nemoretriever-colembed-1b-v1
> Ovis-3b: new image-text in image-text out models by Alibaba ⤵️ https://huggingface.co/spaces/AIDC-AI/Ovis-U1-

liked a model about 1 month ago

mistralai/Mistral-Small-3.1-24B-Instruct-2503

24B • Updated 1 day ago • 236k • 1.3k

liked a dataset about 1 month ago

tiny-agents/tiny-agents

Viewer • Updated 24 days ago • 9 • 688 • 28

Jeremy Udit

AI & ML interests

Recent Activity

Organizations

jcudit's activity

Efficient Request Queueing – Optimizing LLM Performance

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

How Long Prompts Block Other Requests - Optimizing LLM Performance

What's Software 3.0? (Spoiler: You're Already Using It)

Advanced Context Engineering for LLM Agents

What Coding Agent Wins?

MCP is at a Tipping Point: Here's Why You Should Care

ScreenEnv: Deploy your full stack Desktop Agent

Nano-vLLM meets Inference Endpoints

chat-ui

Three Mighty Alerts Supporting Hugging Face’s Production Infrastructure

Transformers Are Getting Old: Variants and Alternatives Exist!

Should We Still Pretrain Encoders with Masked Language Modeling?

Huggingchat