Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Louis Owen's picture
6 6 2

Louis Owen

louisowen6
nahiar's profile picture 21world's profile picture onkarsus13's profile picture
·
https://louisowen6.github.io/
  • louisowen6
  • louisowen

AI & ML interests

GenAI, LLM, NLP

Organizations

Indonesian NLP's profile picture Yellow.ai's profile picture bluorion's profile picture

commented 3 papers 5 months ago

ZClip: Adaptive Spike Mitigation for LLM Pre-Training

Paper • 2504.02507 • Published Apr 3 • 90 •
2

A Refined Analysis of Massive Activations in LLMs

Paper • 2503.22329 • Published Mar 28 • 14 •
3

Variance Control via Weight Rescaling in LLM Pre-training

Paper • 2503.17500 • Published Mar 21 • 5 •
2
New activity in Yellow-AI-NLP/komodo-7b-base over 1 year ago

weird response when using vllm

2
#3 opened over 1 year ago by
raihan2345

Why llama2 and not mistral 7b?

1
#1 opened over 1 year ago by
Kernel
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs