Mining Tasky Data

non-profit

Activity Feed

AI & ML interests

Mining Tasky Data

craffel

authored a paper about 17 hours ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published 2 days ago • 23

craffel

authored a paper 21 days ago

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published 22 days ago • 42

jordiclive

authored 2 papers about 1 month ago

Lessons from the Trenches on Reproducible Evaluation of Language Models

Paper • 2405.14782 • Published May 23, 2024

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19 • 36

Muennighoff

authored 2 papers about 2 months ago

Crosslingual Reasoning through Test-Time Scaling

Paper • 2505.05408 • Published May 8 • 8

ReasonIR: Training Retrievers for Reasoning Tasks

Paper • 2504.20595 • Published Apr 29 • 55

craffel

authored a paper 5 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 235

Muennighoff

authored a paper 5 months ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 126

manandey

in taskydata/deberta-v3-base_10xp3nirstbbflanse_5xc4 7 months ago

Adding `safetensors` variant of this model

#1 opened 8 months ago by SFconvertbot

Muennighoff

authored a paper 9 months ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 120

Muennighoff

authored a paper 10 months ago

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3, 2024 • 79

manandey

updated a dataset 11 months ago

taskydata/Pile-T5-Instruction_updated

Viewer • Updated Jul 25, 2024 • 23.5k • 43

Muennighoff

authored a paper 11 months ago

OpenDevin: An Open Platform for AI Software Developers as Generalist Agents

Paper • 2407.16741 • Published Jul 23, 2024 • 73

manandey

updated a model 11 months ago

taskydata/pile-t5-xl-instruction

Text2Text Generation • 3B • Updated Jul 23, 2024 • 15

jordiclive

updated a model 11 months ago

taskydata/pile-t5-base-instruction

Text2Text Generation • 0.2B • Updated Jul 23, 2024 • 25

jordiclive

updated a dataset 11 months ago

taskydata/C4-Pile-T5-xl-Instructions

Viewer • Updated Jul 23, 2024 • 100 • 9

Muennighoff

authored a paper 12 months ago

RegMix: Data Mixture as Regression for Language Model Pre-training

Paper • 2407.01492 • Published Jul 1, 2024 • 39

craffel

authored a paper about 1 year ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 98

hails

authored a paper about 1 year ago

From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models

Paper • 2406.16838 • Published Jun 24, 2024 • 2

Muennighoff

authored a paper about 1 year ago

C-Pack: Packaged Resources To Advance General Chinese Embedding

Paper • 2309.07597 • Published Sep 14, 2023 • 1

Team members 7
