hllj's Collections

(Continued) Pretraining

updated Jul 18, 2024

  • Adapting Large Language Models via Reading Comprehension

    Paper • 2309.09530 • Published Sep 18, 2023 • 79

  • Gemma: Open Models Based on Gemini Research and Technology

    Paper • 2403.08295 • Published Mar 13, 2024 • 50

  • Simple and Scalable Strategies to Continually Pre-train Large Language Models

    Paper • 2403.08763 • Published Mar 13, 2024 • 52

  • DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

    Paper • 2401.02954 • Published Jan 5, 2024 • 49

  • Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

    Paper • 2404.14219 • Published Apr 22, 2024 • 257

  • OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

    Paper • 2404.14619 • Published Apr 22, 2024 • 128

  • PaliGemma: A versatile 3B VLM for transfer

    Paper • 2407.07726 • Published Jul 10, 2024 • 71