Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
CelesteChen 's Collections
confidence
deepsearch
models
code
diffusion
multilingual
reasoning
RAG
others
long-context
math
Align
LLM-general

LLM-general

updated Oct 25, 2024
Upvote
-

  • Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free

    Paper • 2410.10814 • Published Oct 14, 2024 • 52

  • MiniPLM: Knowledge Distillation for Pre-Training Language Models

    Paper • 2410.17215 • Published Oct 22, 2024 • 17

  • CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution

    Paper • 2410.16256 • Published Oct 21, 2024 • 61

  • CCI3.0-HQ: a large-scale Chinese dataset of high quality designed for pre-training large language models

    Paper • 2410.18505 • Published Oct 24, 2024 • 11
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs