SemiNAT

non-profit

AI & ML interests

None defined yet.

Recent Activity

sheep33333 authored a paper 9 days ago

ARIA: Training Language Agents with Intention-Driven Reward Aggregation

sheep33333 authored a paper 13 days ago

SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

sheep33333 authored a paper 13 days ago

Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles

View all activity

SemiNAT's activity

sheep33333

authored a paper 9 days ago

ARIA: Training Language Agents with Intention-Driven Reward Aggregation

Paper • 2506.00539 • Published 12 days ago • 29

sheep33333

authored 7 papers 13 days ago

SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

Paper • 2505.19641 • Published 17 days ago • 64

Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles

Paper • 2505.19914 • Published 17 days ago • 42

ARM: Adaptive Reasoning Model

Paper • 2505.20258 • Published 16 days ago • 43

Think Thrice Before You Act: Progressive Thought Refinement in Large Language Models

Paper • 2410.13413 • Published Oct 17, 2024

SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals

Paper • 2406.04784 • Published Jun 7, 2024 • 2

TravelAgent: An AI Assistant for Personalized Travel Planning

Paper • 2409.08069 • Published Sep 12, 2024

From Persona to Personalization: A Survey on Role-Playing Language Agents

Paper • 2404.18231 • Published Apr 28, 2024 • 1

jiangjiechen

authored 5 papers 16 days ago

TimeArena: Shaping Efficient Multitasking Language Agents in a Time-Aware Simulation

Paper • 2402.05733 • Published Feb 8, 2024

SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals

Paper • 2406.04784 • Published Jun 7, 2024 • 2

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 128

Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning

Paper • 2504.13914 • Published Apr 10 • 1

Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles

Paper • 2505.19914 • Published 17 days ago • 42

sheep33333

updated a dataset 30 days ago

SemiNAT/sft-v3

Updated 30 days ago • 54

ykzhang721

updated a dataset about 1 month ago

SemiNAT/sft-v3

Updated 30 days ago • 54

ykzhang721

published a dataset about 1 month ago

SemiNAT/sft-v3

Updated 30 days ago • 54

ykzhang721

updated a model about 1 month ago

SemiNAT/sft-v2

ykzhang721

published a model about 1 month ago

SemiNAT/sft-v2

sheep33333

updated a dataset about 1 month ago

SemiNAT/sft_prob_chunk_0429_57w

Viewer • Updated May 1 • 579k • 53

sheep33333

published a dataset about 1 month ago

SemiNAT/sft_prob_chunk_0429_57w

Viewer • Updated May 1 • 579k • 53