Marian Kannwischer

canwiper

AI & ML interests

RLHF & Computer Vision

Recent Activity

liked a dataset about 1 month ago

Rapidata/Flux-2-pro_t2i_human_preference

liked a dataset about 1 month ago

Rapidata/text-2-audio-human-preference-benchmark

liked a dataset about 2 months ago

Rapidata/Face_Generation_Benchmark

liked 2 datasets about 1 month ago

Rapidata/Flux-2-pro_t2i_human_preference

Viewer • Updated Dec 2, 2025 • 44.9k • 1.02k • 8

Rapidata/text-2-audio-human-preference-benchmark

Viewer • Updated Nov 27, 2025 • 4.27k • 33 • 7

liked 2 datasets about 2 months ago

Rapidata/Face_Generation_Benchmark

Viewer • Updated Nov 10, 2025 • 2.31k • 38 • 16

Rapidata/text-2-video-human-preferences-veo3.1

Viewer • Updated Nov 6, 2025 • 1.64k • 82 • 8

liked a dataset 3 months ago

Rapidata/HunyuanImage-2.1_t2i_human_preference

Viewer • Updated Sep 26, 2025 • 44.8k • 115 • 8

upvoted 2 articles 4 months ago

Article

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

Jun 19, 2025

Article

Using LoRA for Efficient Stable Diffusion Fine-Tuning

Jan 26, 2023

liked 2 datasets 4 months ago

Rapidata/Recraft-v3-24-7-25_t2i_human_preference

Viewer • Updated Aug 25, 2025 • 65.9k • 223 • 10

Rapidata/Imagen-4-ultra-24-7-25_t2i_human_preference

Viewer • Updated Aug 25, 2025 • 55.9k • 494 • 8

liked 4 datasets 5 months ago

liked a dataset 6 months ago

Rapidata/multilingual-llm-jokes-4o-claude-gemini

Viewer • Updated Jul 4, 2025 • 9.98k • 71 • 13

reacted to jasoncorkill's post with 🚀 7 months ago

Post

2440

Imagine you could have an Image Arena score equivalent at each checkpoint during training. We released the first version of just that:
Crowd-Eval

Add one line of code to your training loop and you will have a new real human loss curve in your W&B dashboard.

Thousands of real humans from around the world rating your model in real time at the cost of a few dollars per checkpoint is a game changer.

Check it out here: https://github.com/RapidataAI/crowd-eval

First 5 people to put it in their loop get 100'000 human responses for free! (ping me)

reacted to jasoncorkill's post with 👀 7 months ago

Post

4003

Benchmark Update: @google Veo3 (Text-to-Video)

Two months ago, we benchmarked @google's Veo2 model. It fell short, struggling with style consistency and temporal coherence, trailing behind Runway, Pika, @tencent, and even @alibaba-pai.

That’s changed.

We just wrapped up benchmarking Veo3, and the improvements are substantial. It outperformed every other model by a wide margin across all key metrics. Not just better, dominating across style, coherence, and prompt adherence. It's rare to see such a clear lead in today’s hyper-competitive T2V landscape.

Dataset coming soon. Stay tuned.

5 replies

reacted to jasoncorkill's post with ❤️ 8 months ago

Post

2882

🔥 Hidream I1 is online! 🔥

We just added Hidream I1 to our T2I leaderboard (https://www.rapidata.ai/leaderboard/image-models), benchmarked using 195k+ human responses from 38k+ annotators, all collected in under 24 hours.

It landed #3 overall, right behind:
- @openai 4o
- @black-forest-labs Flux 1 Pro
...and just ahead of @black-forest-labs Flux 1.1 Pro, @xai-org Aurora and @google Imagen3.

Want to dig into the data? Check out our dataset here:
Rapidata/Hidream_t2i_human_preference

What model should we benchmark next?

reacted to jasoncorkill's post with ❤️ 8 months ago

Post

5545

🚀 Building Better Evaluations: 32K Image Annotations Now Available

Today, we're releasing an expanded version: 32K images annotated with 3.7M responses from over 300K individuals, completed in under two weeks using the Rapidata Python API.

Rapidata/text-2-image-Rich-Human-Feedback-32k

A few months ago, we published one of our most-liked datasets, with 13K images based on the @data-is-better-together dataset, following Google's research on "Rich Human Feedback for Text-to-Image Generation" (https://arxiv.org/abs/2312.10240). It collected over 1.5M responses from 150K+ participants.

Rapidata/text-2-image-Rich-Human-Feedback

In the examples below, users highlighted words from prompts that were not correctly depicted in the generated images. Higher word scores indicate more frequent issues. If an image captured the prompt accurately, users could select [No_mistakes].

We're continuing to work on large-scale human feedback and model evaluation. If you're working on related research and need large, high-quality annotations, feel free to get in touch: [email protected].

liked a dataset 8 months ago

Rapidata/text-2-image-Rich-Human-Feedback-32k

Viewer • Updated Apr 29, 2025 • 31.9k • 337 • 24

liked a dataset 9 months ago

Rapidata/2k-ranked-images-open-image-preferences-v1

Viewer • Updated Apr 10, 2025 • 2k • 21 • 26
