appliedml42's Collections

Learn: LLM Architecture 2025

updated Jan 4

  • RoFormer: Enhanced Transformer with Rotary Position Embedding

    Paper • 2104.09864 • Published Apr 20, 2021 • 13

  • DeepSeek-V3 Technical Report

    Paper • 2412.19437 • Published Dec 27, 2024 • 64