Jeff Boudier's picture

Jeff Boudier

jeffboudier

AI & ML interests

Hugging Face!

Recent Activity

Articles

Organizations

Hugging Face's profile picture Renault Group's profile picture Intel's profile picture Spaces-explorers's profile picture julsimon-test's profile picture AWS Inferentia and Trainium's profile picture Spotify's profile picture Qualcomm's profile picture Amazon SageMaker Community's profile picture Demo Corp's profile picture Hugging Face Infinity's profile picture Habana AI's profile picture Hugging Face Optimum's profile picture Hugging Test Lab's profile picture WIP's profile picture Evaluation on the Hub's profile picture HuggingFaceM4's profile picture Hackathon Team 1's profile picture Open-Source AI Meetup's profile picture model-attribution-challenge's profile picture model-attribution-challenge-admin's profile picture Inference Endpoints's profile picture Hugging Face OSS Metrics's profile picture EU org's profile picture Enterprise Explorers's profile picture Optimum Nvidia's profile picture Social Post Explorers's profile picture Optimum-Intel's profile picture Hugging Face Machine Learning Optimization's profile picture Hugging Face Discord Community's profile picture Hugging Face Party @ PyTorch Conference's profile picture Google Cloud 🀝🏻 Hugging Face's profile picture Huggingface HUGS's profile picture Nerdy Face's profile picture open/ acc's profile picture

jeffboudier's activity

upvoted an article 1 day ago
view article
Article

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

β€’ 42
reacted to andrewrreed's post with πŸ”₯ 10 days ago
view post
Post
2610
πŸš€ Supercharge your LLM apps with Langfuse on Hugging Face Spaces!

Langfuse brings end-to-end observability and tooling to accelerate your dev workflow from experiments through production

Now available as a Docker Space directly on the HF Hub! πŸ€—

πŸ” Trace everything: monitor LLM calls, retrieval, and agent actions with popular frameworks
1⃣ One-click deployment: on Spaces with persistent storage and integrated OAuth
πŸ›  Simple Prompt Management: Version, edit, and update without redeployment
βœ… Intuitive Evals: Collect user feedback, run model/prompt evaluations, and improve quality
πŸ“Š Dataset Creation: Build datasets directly from production data to enhance future performance

Kudos to the Langfuse team for this collab and the awesome, open-first product they’re building! πŸ‘ @marcklingen @Clemo @MJannik

πŸ”— Space: langfuse/langfuse-template-space
πŸ”— Docs: https://huggingface.co/docs/hub/spaces-sdks-docker-langfuse
  • 1 reply
Β·
posted an update 11 days ago
view post
Post
521
NVIDIA just announced the Cosmos World Foundation Models, available on the Hub: nvidia/cosmos-6751e884dc10e013a0a0d8e6

Cosmos is a family of pre-trained models purpose-built for generating physics-aware videos and world states to advance physical AI development.
The release includes Tokenizers nvidia/cosmos-tokenizer-672b93023add81b66a8ff8e6

Learn more in this great community article by @mingyuliutw and @PranjaliJoshi https://huggingface.co/blog/mingyuliutw/nvidia-cosmos
  • 1 reply
Β·
upvoted an article 11 days ago
updated a Space 11 days ago
New activity in reach-vb/2024-ai-timeline 11 days ago
reacted to MoritzLaurer's post with πŸ”₯ 12 days ago
view post
Post
2191
πŸš€ Releasing a new zeroshot-classifier based on ModernBERT! Some key takeaways:

- ⚑ Speed & efficiency: It's multiple times faster and uses significantly less memory than DeBERTav3. You can use larger batch sizes and enabling bf16 (instead of fp16) gave me a ~2x speed boost as well
- πŸ“‰ Performance tradeoff: It performs slightly worse than DeBERTav3 on average across my zeroshot classification task collection
- 🧠 Use cases: I recommend using it for scenarios requiring speed and a larger context window (8k).
- πŸ’‘ What’s next? I’m preparing a newer version trained on better + longer synthetic data to fully leverage the 8k context window and improve upon the training mix of my older zeroshot-v2.0 models. I also hope that there will be a multilingual variant in the future.

Great work by https://huggingface.co/answerdotai !

If you’re looking for a high-speed zeroshot classifier, give it a try!

πŸ“„ Resources below: πŸ‘‡
Base model: MoritzLaurer/ModernBERT-base-zeroshot-v2.0
Large model: MoritzLaurer/ModernBERT-large-zeroshot-v2.0
Updated zeroshot collection: MoritzLaurer/zeroshot-classifiers-6548b4ff407bb19ff5c3ad6f
ModernBERT collection with paper: answerdotai/modernbert-67627ad707a4acbf33c41deb
reacted to burtenshaw's post with πŸ€—β€οΈ 30 days ago
view post
Post
2941
People are flexing their end of year stats, so I made this app to show hub stats in a tidy design!

Thanks @Ameeeee and @jfcalvo for the feature from Argilla!
burtenshaw/recap
  • 1 reply
Β·
reacted to julien-c's post with πŸ€— about 1 month ago
view post
Post
8423
After some heated discussion πŸ”₯, we clarify our intent re. storage limits on the Hub

TL;DR:
- public storage is free, and (unless blatant abuse) unlimited. We do ask that you consider upgrading to PRO and/or Enterprise Hub if possible
- private storage is paid above a significant free tier (1TB if you have a paid account, 100GB otherwise)

docs: https://huggingface.co/docs/hub/storage-limits

We optimize our infrastructure continuously to scale our storage for the coming years of growth in Machine learning, to the benefit of the community πŸ”₯

cc: @reach-vb @pierric @victor and the HF team
Β·