A collection of ablation and final models trained on the Outlier-Safe Pre-Training (OSP) framework.
Data Mining and Information Systems Lab
dmis-lab
AI & ML interests
None yet
Recent Activity
updated
a collection
6 days ago
Outlier-Safe Pre-Training (OSP)
upvoted
a
paper
6 days ago
Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large
Language Models
updated
a model
7 days ago
dmis-lab/OSP-1.4B-100B-Shampoo-SSNorm-EmbProj
Organizations
None yet