Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

stefan-it
/
ModernBERT-large-tokenizer-fix

Fill-Mask
Transformers
PyTorch
ONNX
Safetensors
English
modernbert
masked-lm
long-context
Model card Files Files and versions
xet
Community
ModernBERT-large-tokenizer-fix
Ctrl+K
Ctrl+K
  • 7 contributors
History: 30 commits
stefan-it's picture
stefan-it
docs: introduce section about tokenizer fixes
125ed8f verified 30 days ago
  • onnx
    Upload ONNX weights (#1) 6 months ago
  • .gitattributes
    1.52 kB
    initial commit 6 months ago
  • README.md
    9.02 kB
    docs: introduce section about tokenizer fixes 30 days ago
  • config.json
    1.19 kB
    Bump `max_position_embeddings` to 8192 6 months ago
  • model.safetensors
    1.58 GB
    xet
    Purge duplicate "decoder.weight", rely on tied weights instead 6 months ago
  • pytorch_model.bin
    1.58 GB
    xet
    Purge duplicate "decoder.weight", rely on tied weights instead 6 months ago
  • special_tokens_map.json
    694 Bytes
    Also update tokenizer/special_tokens_map 6 months ago
  • tokenizer.json
    2.13 MB
    Also update tokenizer/special_tokens_map 6 months ago
  • tokenizer_config.json
    20.8 kB
    fix: also use `add_prefix_space = True` in tokenizer config 30 days ago