BigCode

Team

non-profit

https://www.bigcode-project.org/

BigCodeProject

bigcode-project

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

Elfsong authored a paper about 1 month ago

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Elfsong authored a paper about 1 month ago

Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model

Elfsong authored a paper about 1 month ago

EffiBench-X: A Multi-Language Benchmark for Measuring Efficiency of LLM-Generated Code

View all activity

Papers

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

View all Papers

Articles

BigCodeArena: Judging code generations end to end with code executions

Oct 7, 2025

•

gagan3012

authored 2 papers 11 days ago

Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QA

Paper • 2603.08501 • Published 25 days ago

What Really Controls Temporal Reasoning in Large Language Models: Tokenisation or Representation of Time?

Paper • 2603.19017 • Published 15 days ago • 3

gagan3012

submitted a paper to Daily Papers 14 days ago

What Really Controls Temporal Reasoning in Large Language Models: Tokenisation or Representation of Time?

Paper • 2603.19017 • Published 15 days ago • 3

gagan3012

submitted a paper to Daily Papers 15 days ago

Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QA

Paper • 2603.08501 • Published 25 days ago

RTT1

authored a paper 17 days ago

EvoClaw: Evaluating AI Agents on Continuous Software Evolution

Paper • 2603.13428 • Published 21 days ago • 21

sivareddyg

authored a paper 21 days ago

LLM2Vec-Gen: Generative Embeddings from Large Language Models

Paper • 2603.10913 • Published 23 days ago • 43

juyongjiang

authored 3 papers 22 days ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published Oct 9, 2025 • 39

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

Paper • 2602.08676 • Published Feb 9 • 70

ReflexiCoder: Teaching Large Language Models to Self-Reflect on Generated Code and Self-Correct It via Reinforcement Learning

Paper • 2603.05863 • Published 28 days ago • 5

juyongjiang

submitted a paper to Daily Papers 23 days ago

ReflexiCoder: Teaching Large Language Models to Self-Reflect on Generated Code and Self-Correct It via Reinforcement Learning

Paper • 2603.05863 • Published 28 days ago • 5

patryk-bartkowiak

authored a paper about 1 month ago

Seamlessly Integrating Tree-Based Positional Embeddings into Transformer Models for Source Code Representation

Paper • 2507.04003 • Published Jul 5, 2025

albertvillanova

posted an update about 1 month ago

Post

2204

🚀 TRL v0.29.0 introduces trl-training: an agent-native training skill.

This makes the TRL CLI a structured, agent-readable capability, allowing AI agents to reliably execute training workflows such as:
- Supervised Fine-Tuning (SFT)
- Direct Preference Optimization (DPO)
- Group Relative Policy Optimization (GRPO)

We’re excited to see what the community builds on top of this.

If you’re working on AI agents, alignment research, or scalable RL training infrastructure: give TRL v0.29.0 a try! 🤗

The future of ML tooling is agent-native.
🔗 https://github.com/huggingface/trl/releases/tag/v0.29.0