Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
lgaalves 's Collections
table-data-extraction
language-models
mixture-of-experts

language-models

updated Feb 27, 2024
Upvote
-

  • Mistral 7B

    Paper • 2310.06825 • Published Oct 10, 2023 • 48

  • BloombergGPT: A Large Language Model for Finance

    Paper • 2303.17564 • Published Mar 30, 2023 • 23

  • BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

    Paper • 1810.04805 • Published Oct 11, 2018 • 18

  • DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter

    Paper • 1910.01108 • Published Oct 2, 2019 • 14

  • Llama 2: Open Foundation and Fine-Tuned Chat Models

    Paper • 2307.09288 • Published Jul 18, 2023 • 243

  • Attention Is All You Need

    Paper • 1706.03762 • Published Jun 12, 2017 • 61

  • Universal Language Model Fine-tuning for Text Classification

    Paper • 1801.06146 • Published Jan 18, 2018 • 7

  • Language Models are Few-Shot Learners

    Paper • 2005.14165 • Published May 28, 2020 • 13

  • BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

    Paper • 2211.05100 • Published Nov 9, 2022 • 31

  • Self-Rewarding Language Models

    Paper • 2401.10020 • Published Jan 18, 2024 • 148
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs