Olivier Dehaene

olivierdehaene

AI & ML interests

None yet

Recent Activity

Organizations

BigScience Workshop, OpenAssistant, LLHF, SLLHF, blhf

olivierdehaene's activity

upvoted an article 22 days ago
Article

The Transformers Library: standardizing model definitions

By lysandre and 3 others • 110
New activity in Alibaba-NLP/gte-Qwen2-1.5B-instruct 9 months ago
New activity in mistralai/Mistral-Nemo-Instruct-2407 10 months ago
replied to mayank-mishra's post about 1 year ago

Nice blog!
@osanseviero we have been doing this in TGI and TEI for a while ;)
Padding-free implementations also make dynamic batching easier to implement and make memory usage more predictable.

reacted to loubnabnl's post with ❤️🤯🤗 over 1 year ago
Post
⭐ Today we’re releasing The Stack v2 & StarCoder2: a series of 3B, 7B & 15B code generation models trained on 3.3 to 4.5 trillion tokens of code:

- StarCoder2-15B matches or outperforms CodeLlama 34B, and approaches DeepSeek-33B on multiple benchmarks.
- StarCoder2-3B outperforms StarCoderBase-15B and similar sized models.
- The Stack v2 is a 4x larger dataset than The Stack v1, resulting in 900B unique code tokens 🚀
As always, we released everything from models and datasets to curation code. Enjoy!

🔗 StarCoder2 collection: bigcode/starcoder2-65de6da6e87db3383572be1a
🔗 Paper: https://drive.google.com/file/d/17iGn3c-sYNiLyRSY-A85QOzgzGnGiVI3/view
🔗 BlogPost: https://huggingface.co/blog/starcoder2
🔗 Code Leaderboard: bigcode/bigcode-models-leaderboard
published an article over 1 year ago
Article

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

By lewtun and 6 others • 12
New activity in BAAI/bge-reranker-large over 1 year ago

Add fast tokenizer

1
#4 opened over 1 year ago by
olivierdehaene
New activity in BAAI/bge-reranker-base over 1 year ago

Add fast tokenizer

1
#5 opened over 1 year ago by
olivierdehaene
New activity in HuggingFaceH4/zephyr-chat over 1 year ago
published an article about 2 years ago
Article

The Falcon has landed in the Hugging Face ecosystem

By lvwerra and 7 others • 14
New activity in uwnlp/guanaco-playground-tgi about 2 years ago
New activity in bigscience/bloomz about 2 years ago

disable inference API

5
#43 opened about 2 years ago by
olivierdehaene
New activity in OpenAssistant/oasst-sft-4-pythia-12b-epoch-3.5 about 2 years ago

How to run that?

23
#2 opened about 2 years ago by
Guilherme34