Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
CelesteChen 's Collections
confidence
deepsearch
models
code
diffusion
multilingual
reasoning
RAG
others
long-context
math
Align
LLM-general

code

updated Jan 13
Upvote
-

  • HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation

    Paper • 2412.21199 • Published Dec 30, 2024 • 14

  • Training Software Engineering Agents and Verifiers with SWE-Gym

    Paper • 2412.21139 • Published Dec 30, 2024 • 24
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs