Jay P

jayomb

AI & ML interests

None yet

Recent Activity

liked a dataset about 12 hours ago

soob3123/GrayLine-QA-Reasoning

liked a dataset about 12 hours ago

soob3123/GrayLine-QA

liked a dataset about 12 hours ago

entfane/psychotherapy-dpo

Organizations

jayomb's activity

upvoted a paper 12 days ago

RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale

Paper • 2505.03005 • Published 14 days ago • 27

upvoted a collection 2 months ago

🪿 RWKV7

RWKV7 models • 13 items • Updated 6 days ago • 7

upvoted a paper 2 months ago

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18 • 148

upvoted 3 articles 2 months ago

LeRobot goes to driving school: World’s largest open-source self-driving dataset

Article • Mar 11 • 80

FastRTC: The Real-Time Communication Library for Python

Article • Feb 25 • 161

Open R1: Update #3

Article • Mar 11 • 291

upvoted a collection 2 months ago

cool datasets

170 items • Updated 2 days ago • 16

upvoted a collection 3 months ago

Synthetic Data and Self-Improvement

82 items • Updated 25 days ago • 7

upvoted 5 papers 3 months ago

Non-instructional Fine-tuning: Enabling Instruction-Following Capabilities in Pre-trained Language Models without Instruction-Following Data

Paper • 2409.00096 • Published Aug 27, 2024 • 1

RNR: Teaching Large Language Models to Follow Roles and Rules

Paper • 2409.13733 • Published Sep 10, 2024 • 1

Response Tuning: Aligning Large Language Models without Instruction

Paper • 2410.02465 • Published Oct 3, 2024 • 13

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 229

Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning

Paper • 2502.06060 • Published Feb 9 • 38

upvoted a paper 4 months ago

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2, 2024 • 52

upvoted a collection 12 months ago

abliterated-v3

Latest gen of the abliterated models I've produced • 17 items • Updated Jun 3, 2024 • 119

upvoted a paper 12 months ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 89

upvoted a paper about 1 year ago

Piccolo2: General Text Embedding with Multi-task Hybrid Loss Training

Paper • 2405.06932 • Published May 11, 2024 • 21

upvoted an article about 1 year ago

Fine-tune Llama 3 with ORPO

Article • Apr 22, 2024 • 237

upvoted 2 papers about 1 year ago

Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

Paper • 2404.07647 • Published Apr 11, 2024 • 4

Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

Paper • 2404.05892 • Published Apr 8, 2024 • 39