Stella Biderman

stellaathena

http://www.stellabiderman.com

AI & ML interests

None yet

Recent Activity

new activity 19 days ago

common-pile/comma_v0.1_training_dataset:Add task category, license and tags

authored a paper 21 days ago

Emergent and Predictable Memorization in Large Language Models

authored a paper 21 days ago

KMMLU: Measuring Massive Multitask Language Understanding in Korean

Organizations

authored 10 papers 21 days ago

The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources

Paper • 2406.16746 • Published Jun 24, 2024

Consent in Crisis: The Rapid Decline of the AI Data Commons

Paper • 2407.14933 • Published Jul 20, 2024 • 12

Lessons from the Trenches on Reproducible Evaluation of Language Models

Paper • 2405.14782 • Published May 23, 2024

Bridging the Data Provenance Gap Across Text, Speech and Video

Paper • 2412.17847 • Published Dec 19, 2024 • 9

Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon

Paper • 2406.17746 • Published Jun 25, 2024

When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research

Paper • 2505.11855 • Published May 17 • 9

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published 22 days ago • 42

authored 2 papers 5 months ago

Open Problems in Mechanistic Interpretability

Paper • 2501.16496 • Published Jan 27 • 19

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14 • 64

authored a paper about 1 year ago

Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?

Paper • 2406.04391 • Published Jun 6, 2024 • 9

authored 7 papers over 1 year ago

On the Societal Impact of Open Foundation Models

Paper • 2403.07918 • Published Feb 27, 2024 • 17

Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets

Paper • 2103.12028 • Published Mar 22, 2021 • 3

Datasheet for the Pile

Paper • 2201.07311 • Published Jan 13, 2022

BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting

Paper • 2212.09535 • Published Dec 19, 2022 • 1

Documenting Geographically and Contextually Diverse Data Sources: The BigScience Catalogue of Language Data and Resources

Paper • 2201.10066 • Published Jan 25, 2022

What Language Model to Train if You Have One Million GPU Hours?

Paper • 2210.15424 • Published Oct 27, 2022 • 2

Recasting Self-Attention with Holographic Reduced Representations

Paper • 2305.19534 • Published May 31, 2023 • 2
