Stas Bekman
stas
AI & ML interests
Toolmaker. Software creator, optimizer and harmonizer.
Makes things work and fly at Contextual.AI
Training LLM/RAG/Generative AI/Machine Learning/Scalability
Recent Activity
posted an update about 1 month ago
Do you want ArcticTraining at @SnowflakeDB to add the ability to post-train DeepSeek V3/R1 models with DPO using just a few GPU nodes?
Please vote here and tell others about it: https://github.com/snowflakedb/ArcticTraining/discussions/58
ArcticTraining is an open-source, easy-to-use post-training framework for NVIDIA GPUs built on top of DeepSpeed.
updated a model 2 months ago
stas/ml-engineering-book
posted an update 4 months ago
If you remember my work on MAMF - to find the realistic achievable TFLOPS ceiling - the Intel AI team has shared their measurements and they scored ...
an incredible 99.4% TFLOPS efficiency for Gaudi 2!
That's quite amazing! Your ROI on these accelerators will be very high.
The full table is here: https://github.com/stas00/ml-engineering/tree/master/compute/accelerator#maximum-achievable-matmul-flops-comparison-table
As we have seen the competitors' achievable efficiency get worse with each new generation, I'm looking forward to seeing whether Gaudi 3 will keep the bar high!
Thanks to Avi Rubin, Lakshman Chari, Imtiaz Sajwani, Ramy J and Zhiqi Tao for helping to get these numbers to the community.
stas's activity
Space isn't working because there is a runtime error
1
#9 opened 7 months ago by stas

Fix FileNotFoundError
3
#2 opened 8 months ago by lhoestq

Casting Issue?
4
#40 opened 9 months ago by FelixLabelle

Upload book cover
1
#1 opened 12 months ago by julien-c

metadata: set license
1
#2 opened 12 months ago by julien-c

set the correct vision_config.hidden_act
#4 opened over 1 year ago by stas

set the correct vision_config.hidden_act
#19 opened over 1 year ago by stas

set the correct vision_config.hidden_act
#4 opened over 1 year ago by stas

set the correct vision_config.hidden_act
#2 opened over 1 year ago by stas

set the correct vision_config.hidden_act
#6 opened over 1 year ago by stas

Update README.md
1
#17 opened over 1 year ago by stas

Update config.json
#3 opened over 1 year ago by ybelkada

Update config.json
#3 opened over 1 year ago by stas

Update config.json
#5 opened over 1 year ago by stas

Update config.json
1
#2 opened over 1 year ago by ybelkada

Update config.json
#2 opened over 1 year ago by ybelkada

Update config.json
#4 opened over 1 year ago by ybelkada

Update config.json
#3 opened over 1 year ago by stas

Update config.json
#1 opened over 1 year ago by stas

Update config.json
#1 opened over 1 year ago by ybelkada
