Jonas Geiping

JonasGeiping

https://jonasgeiping.github.io/

AI & ML interests

Machine Learning Safety, Security and Privacy; Optimization in Deep Learning; Mathematical Optimization: Federated Learning

Recent Activity

upvoted a paper 4 days ago

Scaling Open-Ended Reasoning to Predict the Future

upvoted a paper about 2 months ago

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

liked a model about 2 months ago

smcleish/Recurrent-TinyLlama-3T-train-recurrence-16

View all activity

Organizations

authored a paper 3 months ago

Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLM

Paper • 2509.18058 • Published Sep 22, 2025 • 12

authored 11 papers 6 months ago

Canary in a Coalmine: Better Membership Inference with Ensembled Adversarial Queries

Paper • 2210.10750 • Published Oct 19, 2022 • 1

Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs

Paper • 2406.10209 • Published Jun 14, 2024 • 8

Universal Guidance for Diffusion Models

Paper • 2302.07121 • Published Feb 14, 2023

Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation

Paper • 2502.19414 • Published Feb 26, 2025 • 20

Capability-Based Scaling Laws for LLM Red-Teaming

Paper • 2505.20162 • Published May 26, 2025 • 4

GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching

Paper • 2506.20480 • Published Jun 25, 2025 • 7

authored a paper 7 months ago

Pitfalls in Evaluating Language Model Forecasters

Paper • 2506.00723 • Published May 31, 2025 • 3

authored 2 papers 11 months ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7, 2025 • 151

Great Models Think Alike and this Undermines AI Oversight

Paper • 2502.04313 • Published Feb 6, 2025 • 33

authored a paper over 1 year ago

Transformers Can Do Arithmetic with the Right Embeddings

Paper • 2405.17399 • Published May 27, 2024 • 54

authored 4 papers almost 2 years ago

Coercing LLMs to do and reveal (almost) anything

Paper • 2402.14020 • Published Feb 21, 2024 • 13

Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text

Paper • 2401.12070 • Published Jan 22, 2024 • 45

Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models

Paper • 2212.03860 • Published Dec 7, 2022 • 1

A Cookbook of Self-Supervised Learning

Paper • 2304.12210 • Published Apr 24, 2023 • 6

Jonas Geiping

AI & ML interests

Recent Activity

Organizations

JonasGeiping's activity