Model Card for Model ID

This model is trained using Flow-GRPO with LoRA. We provide only the LoRA weights here, so you will need to download the SD 3.5 Medium base model first.

Model Details

Model Sources

Repository: https://github.com/yifan123/flow_grpo
Paper: https://www.arxiv.org/pdf/2505.05470

Uses

import torch
from diffusers import StableDiffusion3Pipeline
from diffusers.schedulers import FlowMatchEulerDiscreteScheduler
from peft import PeftModel

model_id = "stabilityai/stable-diffusion-3.5-medium"
lora_ckpt_path = "jieliu/SD3.5M-FlowGRPO-PickScore"
device = "cuda"


pipe = StableDiffusion3Pipeline.from_pretrained(model_id, torch_dtype=torch.float16)
pipe.transformer = PeftModel.from_pretrained(pipe.transformer, lora_ckpt_path)
pipe.transformer = pipe.transformer.merge_and_unload()
pipe = pipe.to(device)

prompt = 'a jung male cyborg with white hair sitting down on a throne in a dystopian world, digital art, epic'
image = pipe(prompt, height=512, width=512, num_inference_steps=40,guidance_scale=4.5,negative_prompt="").images[0]  
image.save("flow_grpo_pickscore.png")

jieliu
/

SD3.5M-FlowGRPO-PickScore

Model Card for Model ID

Model Details

Model Sources

Uses

Model tree for jieliu/SD3.5M-FlowGRPO-PickScore

Space using jieliu/SD3.5M-FlowGRPO-PickScore 1

Collection including jieliu/SD3.5M-FlowGRPO-PickScore

SD3.5M-FlowGRPO