Jeremy Udit

jcudit

jcudit

AI & ML interests

None yet

Recent Activity

upvoted an article 4 days ago

Efficient Request Queueing – Optimizing LLM Performance

upvoted an article 4 days ago

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

upvoted an article 4 days ago

How Long Prompts Block Other Requests - Optimizing LLM Performance

View all activity

Organizations

upvoted 3 articles 4 days ago

Article

Efficient Request Queueing – Optimizing LLM Performance

•

Apr 2

• 13

Article

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

•

Apr 16

• 25

Article

How Long Prompts Block Other Requests - Optimizing LLM Performance

•

Jun 12

• 5

upvoted 3 articles 11 days ago

Article

What's Software 3.0? (Spoiler: You're Already Using It)

•

Jun 19

• 2

Article

Advanced Context Engineering for LLM Agents

•

28 days ago

• 1

Article

What Coding Agent Wins?

and 1 other •

29 days ago

• 7

upvoted 2 articles 13 days ago

Article

MCP is at a Tipping Point: Here's Why You Should Care

•

Jun 10

• 17

Article

ScreenEnv: Deploy your full stack Desktop Agent

and 1 other •

16 days ago

• 51

upvoted an article 16 days ago

Article

Nano-vLLM meets Inference Endpoints

•

about 1 month ago

• 9

upvoted an article 19 days ago

Article

Transformers Are Getting Old: Variants and Alternatives Exist!

•

21 days ago

• 42

upvoted an article 22 days ago

Article

Should We Still Pretrain Encoders with Masked Language Modeling?

and 3 others •

24 days ago

• 21

upvoted 2 articles about 1 month ago

Article

Tiny Agents: a MCP-powered agent in 50 lines of code

•

Apr 25

• 289

Article

Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub

and 6 others •

Jun 12

• 115

upvoted a changelog about 2 months ago

Changelog

New Inference Providers Dashboard

Jun 5

• 61

upvoted an article about 2 months ago

Article

Microsoft and Hugging Face expand collaboration

and 2 others •

May 19

• 23

upvoted a collection 2 months ago

Granite Time Series Models

Collection

A collection of time series models trained by IBM licensed under Apache 2.0 license. • 7 items • Updated Jun 16 • 29

upvoted 2 articles 2 months ago

Article

The New and Fresh analytics in Inference Endpoints

and 4 others •

Mar 21

• 21

Article

Blazingly fast whisper transcriptions with Inference Endpoints

and 5 others •

May 13

• 72

upvoted 2 articles 8 months ago

Article

Fine-tuning Mistral on Your Dataset

•

Jul 22, 2024

• 25

Article

Releasing the largest multilingual open pretraining dataset

and 2 others •

Nov 13, 2024

• 102

Jeremy Udit

AI & ML interests

Recent Activity

Organizations

jcudit's activity

Efficient Request Queueing – Optimizing LLM Performance

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

How Long Prompts Block Other Requests - Optimizing LLM Performance

What's Software 3.0? (Spoiler: You're Already Using It)

Advanced Context Engineering for LLM Agents

What Coding Agent Wins?

MCP is at a Tipping Point: Here's Why You Should Care

ScreenEnv: Deploy your full stack Desktop Agent

Nano-vLLM meets Inference Endpoints

Transformers Are Getting Old: Variants and Alternatives Exist!

Should We Still Pretrain Encoders with Masked Language Modeling?

Tiny Agents: a MCP-powered agent in 50 lines of code

Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub

New Inference Providers Dashboard

Microsoft and Hugging Face expand collaboration

The New and Fresh analytics in Inference Endpoints

Blazingly fast whisper transcriptions with Inference Endpoints

Fine-tuning Mistral on Your Dataset

Releasing the largest multilingual open pretraining dataset