IdolSankaku SwinV2 Tagger v1

Supports ratings, characters and general tags.

Trained using https://github.com/SmilingWolf/JAX-CV.
TPUs used for training kindly provided by the TRC program.

Dataset

Trained on a human annotated dataset of real world photos.

Validation results

v1.0: P=R: threshold = 0.3094, F1 = 0.6161

What's new

Model v1.0/Dataset v1:
First version of the dataset, tags updated on 2024-08-31.
timm compatible! Load it up and give it a spin using the canonical one-liner!
ONNX model is compatible with code developed for the v3 series of WD tagger models.
The batch dimension of the ONNX model is not fixed to 1 anymore. Now you can go crazy with batch inference.
Switched to Macro-F1 to measure model performance since it gives me a better gauge of overall training progress.

Runtime deps

ONNX model requires onnxruntime >= 1.17.0

Inference code examples

For timm: https://github.com/neggles/wdv3-timm
For ONNX: https://huggingface.co/spaces/SmilingWolf/wd-tagger
For JAX: https://github.com/SmilingWolf/wdv3-jax

Final words

Subject to change and updates.
Downstream users are encouraged to use tagged releases rather than relying on the head of the repo.

Thanks

Thanks to the whole DeepGHS team for data gathering and encouraging me to push the models much further than they had any reason to attempt to reach, much less succeed.

Downloads last month
0
Safetensors
Model size
87.5M params
Tensor type
F32
Β·
Inference API
Unable to determine this model’s pipeline type. Check the docs .

Spaces using deepghs/idolsankaku-swinv2-tagger-v1 12