- RedPajama: an Open Dataset for Training Large Language Models
  Paper • 2411.12372 • Published • 57
- SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration
  Paper • 2411.10958 • Published • 57
- Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens
  Paper • 2508.01191 • Published • 225
Kristinn Vikar
KristinnVikarJ
AI & ML interests
None yet
Recent Activity
updated a collection 16 days ago: to-read
upvoted a paper 16 days ago: Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens
updated a collection 9 months ago: to-read
Organizations
None yet