YedsonUQ's Collections

Findings

updated Apr 23

  • Large Language Models Think Too Fast To Explore Effectively

    Paper • 2501.18009 • Published Jan 29 • 24

  • SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

    Paper • 2501.17161 • Published Jan 28 • 122

  • Intuitive physics understanding emerges from self-supervised pretraining on natural videos

    Paper • 2502.11831 • Published Feb 17 • 19

  • Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity

    Paper • 2502.13063 • Published Feb 18 • 73

  • LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

    Paper • 2502.15007 • Published Feb 20 • 175

  • LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities

    Paper • 2504.16078 • Published Apr 22 • 20