Learning UnkNown librAry

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

SivilTaram authored a paper about 1 month ago

General-Reasoner: Advancing LLM Reasoning Across All Domains

huybery authored a paper about 1 month ago

Qwen3 Technical Report

SivilTaram authored a paper about 1 month ago

General-Reasoner: Advancing LLM Reasoning Across All Domains

Paper • 2505.14652 • Published May 20 • 22

huybery authored 4 papers about 1 month ago

SivilTaram authored 7 papers 3 months ago

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

Paper • 2411.07763 • Published Nov 12, 2024 • 2

When Attention Sink Emerges in Language Models: An Empirical View

Paper • 2410.10781 • Published Oct 14, 2024

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Paper • 2502.12982 • Published Feb 18 • 18

Predictive Data Selection: The Data That Predicts Is the Data That Teaches

Paper • 2503.00808 • Published Mar 2 • 57

Scaling up Masked Diffusion Models on Text

Paper • 2410.18514 • Published Oct 24, 2024

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published Mar 24 • 31

SkyLadder: Better and Faster Pretraining via Context Window Scheduling

Paper • 2503.15450 • Published Mar 19 • 11

huybery authored a paper 4 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 114

terryyz authored a paper 4 months ago

CodeArena: A Collective Evaluation Platform for LLM Code Generation

Paper • 2503.01295 • Published Mar 3 • 8

SivilTaram authored a paper 4 months ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 104

huybery authored 5 papers 6 months ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2 • 53

Iterative Forward Tuning Boosts In-Context Learning in Language Models

Paper • 2305.13016 • Published May 22, 2023 • 1

PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts

Paper • 2305.14839 • Published May 24, 2023 • 1

One Shot Learning as Instruction Data Prospector for Large Language Models

Paper • 2312.10302 • Published Dec 16, 2023 • 3

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Paper • 2406.15877 • Published Jun 22, 2024 • 48

Team members 3

luna-code's activity