Mayank Mishra

mayank-mishra

AI & ML interests

Large Language Models, Distributed Training and Inference

Recent Activity

Organizations

IBM's profile picture BigCode's profile picture Aurora-M/MDEL's profile picture Blog-explorers's profile picture Aurora-M's profile picture IBM Granite's profile picture IBM Research's profile picture

mayank-mishra's activity

upvoted an article 10 months ago
view article
Article

Improving Hugging Face Training Efficiency Through Packing with Flash Attention

By lwtr and 5 others
35
upvoted an article 12 months ago