Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
11
17
6
Benjamin Feuer
penfever
Follow
NekoMikoReimu's profile picture
Mi6paulino's profile picture
Nighttell's profile picture
11 followers
·
25 following
FeuerBenjamin
penfever
AI & ML interests
Deep learning, computer vision, large language models, large vision language models
Recent Activity
updated
a model
about 20 hours ago
mlfoundations-dev/a1_math_formulas
authored
a paper
5 days ago
When Judgment Becomes Noise: How Design Failures in LLM Judge Benchmarks Silently Undermine Validity
commented
on
a paper
5 days ago
When Judgment Becomes Noise: How Design Failures in LLM Judge Benchmarks Silently Undermine Validity
View all activity
Organizations
Papers
12
arxiv:
2509.20293
arxiv:
2507.01544
arxiv:
2506.04178
arxiv:
2501.18511
Expand 12 papers
models
7
Sort: Recently updated
penfever/qwen2.5_7b_tbench_traces_sharegptv1
8B
•
Updated
Jul 28
•
6
penfever/qwen2_5_vl_7b_walton-multimodal-cold-start-r1-format-30k
8B
•
Updated
Jul 23
•
6
penfever/qwen2.5-7b-limo-fft
8B
•
Updated
Jul 6
•
2
penfever/qwen3_8b_limo
Updated
Jul 6
penfever/qwen2_5_vl_7b_vlaa-thinking-sft-to-r1-format
8B
•
Updated
Jul 5
•
5
penfever/oumi-l8b-ultrachat
Text Generation
•
8B
•
Updated
Feb 14
•
5
penfever/llama8b-wildchat
Updated
Feb 1
datasets
61
Sort: Recently updated
penfever/hellaswag-sb-trainable
Viewer
•
Updated
6 days ago
•
200
•
106
penfever/limo-vis-mid-resize-safe
Updated
6 days ago
•
5
penfever/lm-eval-harness-sandboxes
Viewer
•
Updated
7 days ago
•
77k
•
196
penfever/marvis_fft_results
Updated
16 days ago
•
426
penfever/open_genome_packing
Viewer
•
Updated
Aug 2
•
22.8k
•
46
penfever/walton-multimodal-cold-start-r1-format-30k
Viewer
•
Updated
Jul 22
•
30k
•
13
•
1
penfever/LiveXiv-VLMEvalKit
Viewer
•
Updated
Jul 21
•
17.9k
•
68
penfever/s1k_img_full_v3
Viewer
•
Updated
Jul 9
•
944
•
7
penfever/openthoughts_filtered
Viewer
•
Updated
Jul 9
•
100k
•
31
penfever/GAIR_LIMO_img_v1
Viewer
•
Updated
Jul 9
•
778
•
10
View 61 datasets