Peter Szemraj PRO

pszemraj

https://pszemraj.carrd.co/

AI & ML interests

metallic intuition

pszemraj's activity

New activity in pszemraj/opt-peter-2.7B about 23 hours ago

Adding `safetensors` variant of this model

#5 opened 1 day ago by SFconvertbot

upvoted a paper 1 day ago

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published 6 days ago • 66

updated 2 models 3 days ago

BEE-spoke-data/wordpiece-tokenizer-32k-en_code-msp

Updated 3 days ago

BEE-spoke-data/wordpiece-tokenizer-32k-en_code-orig

Updated 3 days ago

New activity in pszemraj/opt-peter-1.3B 5 days ago

Adding `safetensors` variant of this model

#4 opened 5 days ago by SFconvertbot

upvoted a collection 6 days ago

SuperBPE

Collection

SuperBPE tokenizers and models trained with them • 8 items • Updated 6 days ago • 14

New activity in HuggingFaceTB/dclm-edu 7 days ago

size of parquet files

#2 opened 7 days ago by pszemraj

updated a collection 8 days ago

tokenizers

Collection

trained and adapted tokenizers - various • 19 items • Updated 8 days ago

New activity in reducto/RolmOCR 8 days ago

Compatibility with olmOCR repo

#2 opened 10 days ago by pszemraj

liked a model 9 days ago

reducto/RolmOCR

Image-Text-to-Text • Updated 13 days ago • 7.12k • 361

updated a model 10 days ago

BEE-spoke-data/bpe-tokenizer-32k-smolNeoX

Updated 10 days ago

published a model 10 days ago

BEE-spoke-data/bpe-tokenizer-32k-smolNeoX

Updated 10 days ago

New activity in reducto/RolmOCR 10 days ago

Compatibility with olmOCR repo

#2 opened 10 days ago by pszemraj

liked 2 models 10 days ago

meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8

Image-Text-to-Text • Updated 6 days ago • 30.3k • 96

meta-llama/Llama-4-Scout-17B-16E-Instruct

Image-Text-to-Text • Updated 6 days ago • 657k • 773

liked a model 11 days ago

rasbt/llama-3.2-from-scratch

Updated 14 days ago • 252

updated a dataset 12 days ago

pszemraj/survivorlib-chemistry-hatmaking

Viewer • Updated 12 days ago • 3.13k • 46

upvoted 2 papers 12 days ago

PaperBench: Evaluating AI's Ability to Replicate AI Research

Paper • 2504.01848 • Published 13 days ago • 34

Efficient Inference for Large Reasoning Models: A Survey

Paper • 2503.23077 • Published 17 days ago • 45