Illustrious XL v0.1
trained by Onoma AI

Illustrious XL is an illustration-focused Stable Diffusion XL model, continued from Kohaku XL Beta 5 and trained by the OnomaAI Research Team. The model focuses on making use of a large-scale annotated dataset, Danbooru2023. We release the v0.1 and v0.1-GUIDED models here under the fair-ai-public-license; however, we discourage using the model for monetization or for any closed-source purpose. For full technical details, please refer to our technical report.

Model Information:

Name: Illustrious-XL-v0.1
Model Type: Stable Diffusion XL Model
Dataset: Fine-tuned on Danbooru2023 Dataset

Description:

Illustrious-XL is a powerful generative model series, fine-tuned on the comprehensive Danbooru2023 dataset and its variants. It includes a wide variety of character designs, styles, and artistic knowledge derived from the dataset, making it suitable for creative and artistic AI generation tasks.
Illustrious-XL-v0.1 is the untuned BASE model, which can serve as a base for all future model variants. LoRAs / adapters can be trained on this model, which keeps future use cases open. The model is intended for research purposes only, as it is not tuned for aesthetics / preferences.
Illustrious-XL-v0.1-GUIDED is a minimally safety-controlled model, which is a better option for typical use cases.

We plan to release several aesthetic-finetuned model variants in the near future.

Technical Details:

https://arxiv.org/abs/2409.19946

Terms and Conditions:

We recommend using the official repositories to guard against malicious attacks.
Users must agree to the LICENSE to use the model. As mentioned in the LICENSE, we do NOT take any action regarding generated results or possible variants.
As mentioned in the LICENSE, users must NOT use the generated results for any prohibited purposes, including but not limited to:

Harmful or malicious activities: This includes harassment, threats, spreading misinformation, or any use intended to harm individuals or groups.
Illegal activities: Using generated content to violate any applicable laws or regulations.
Unethical, offensive content generation: Generating offensive, defamatory, or controversial content that violates ethical guidelines.

By using this model, users agree to comply with the conditions outlined in the LICENSE and acknowledge responsibility for how they utilize the generated content.

Safety Control Recommendation:

Generative models can occasionally produce unintended or harmful outputs.
To minimize this risk, it is strongly recommended to use the GUIDED model variant, which incorporates additional safety mechanisms for responsible content generation.
By choosing this variant, users can significantly reduce the likelihood of generating harmful or unintended content.
We plan to update the GUIDED model variants and their methodology through extensive research.

Training/Merging Policy:
You may fine-tune, merge, or train LoRAs based on this model. However, to foster an open-source community, you are required to:

Openly share details of any derived models, including references to the original model licensed under the fair-ai-public-license.
Provide information on datasets and "merge recipes" used for fine-tuning or training.
Adhere to the fair-ai-public-license, ensuring that any derivative works are also open source.

Uploading / Generation Policy:
We do not restrict uploading or sharing generation results, as we do not own any rights to generated materials. This includes results from personally trained models, fine-tuned models, and trained LoRAs. However, we kindly ask you to disclose the generation details, to foster open-source communities and research.

Monetization Prohibition:

You are prohibited from monetizing any closed-source fine-tuned / merged model, that is, one that prevents the public from accessing the model's source code / weights and its usage.
As per the license, you must openly publish any derivative models and variants. This model is intended for open-source use, and all derivatives must follow the same principles.

Usage:
We do not recommend overusing critical composition tags such as 'close-up', 'upside-down', or 'cowboy shot': they can conflict with one another and confuse the model, which degrades the results.
Recommended sampling method: Euler a, Sampling Steps: 20–28, CFG: 5–7.5 (may vary based on use case).
We suggest using suitable composition tags like "upper body," "cowboy shot," "portrait," or "full body" depending on your use case.
The model supports quality tags such as: "worst quality," "bad quality," "average quality," "good quality," "best quality," and "masterpiece (quality)."
Note: The model does not have any default style. This is intended behavior for the base model.
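
The sampling recommendations above could be applied with the Hugging Face diffusers library roughly as follows. This is a minimal sketch, not an official loading recipe. The local checkpoint path "Illustrious-XL-v0.1.safetensors" and the helper names sampling_kwargs and generate are assumptions made for illustration. It also assumes a CUDA GPU and that EulerAncestralDiscreteScheduler is the diffusers counterpart of the "Euler a" sampler.

```python
# Recommended ranges, taken from the Usage notes above.
STEP_RANGE = (20, 28)
CFG_RANGE = (5.0, 7.5)


def sampling_kwargs(steps=24, cfg=6.0):
    """Return (pipeline kwargs, notes); notes list any value outside the recommended range."""
    notes = []
    if not STEP_RANGE[0] <= steps <= STEP_RANGE[1]:
        notes.append(f"steps={steps} is outside the recommended {STEP_RANGE}")
    if not CFG_RANGE[0] <= cfg <= CFG_RANGE[1]:
        notes.append(f"cfg={cfg} is outside the recommended {CFG_RANGE}")
    # Values are passed through unchanged, since the ranges "may vary based on use case".
    return {"num_inference_steps": steps, "guidance_scale": cfg}, notes


def generate(prompt, negative_prompt,
             checkpoint="Illustrious-XL-v0.1.safetensors",  # hypothetical local path
             steps=24, cfg=6.0):
    """Generate one image with an Euler ancestral scheduler (assumes a CUDA GPU)."""
    # Imported lazily so sampling_kwargs works without torch/diffusers installed.
    import torch
    from diffusers import EulerAncestralDiscreteScheduler, StableDiffusionXLPipeline

    kwargs, notes = sampling_kwargs(steps, cfg)
    for note in notes:
        print("note:", note)

    pipe = StableDiffusionXLPipeline.from_single_file(checkpoint, torch_dtype=torch.float16)
    # Swap the default scheduler for Euler ancestral, reusing the existing config.
    pipe.scheduler = EulerAncestralDiscreteScheduler.from_config(pipe.scheduler.config)
    pipe.to("cuda")
    return pipe(prompt=prompt, negative_prompt=negative_prompt, **kwargs).images[0]
```

Calling generate(...) with a prompt and negative prompt returns a PIL image. Because the base model has no default style, any style has to come from the prompt itself.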

Prompt:
1boy, holding knife, blue eyes, jewelry, jacket, shirt, open mouth, hand up, simple background, hair between eyes, vest, knife, tongue, holding weapon, grey vest, upper body, necktie, solo, looking at viewer, smile, pink blood, weapon, dagger, open clothes, collared shirt, blood on face, tongue out, blonde hair, holding dagger, red necktie, white shirt, blood, short hair, holding, earrings, long sleeves, black jacket, dark theme

Negative Prompt:
worst quality, comic, multiple views, bad quality, low quality, lowres, displeasing, very displeasing, bad anatomy, bad hands, scan artifacts, monochrome, greyscale, signature, twitter username, jpeg artifacts, 2koma, 4koma, guro, extra digits, fewer digits

Prompt:
1girl, extremely dark, black theme, silhouette, rim lighting, black, looking at viewer, low contrast, masterpiece

Negative Prompt:
worst quality, comic, multiple views, bad quality, low quality, lowres, displeasing, very displeasing, bad anatomy, bad hands, scan artifacts, monochrome, greyscale, twitter username, jpeg artifacts, 2koma, 4koma, guro, extra digits, fewer digits, jaggy lines, unclear
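
Prompts like the ones above are comma-separated tag lists, so they can be assembled and checked programmatically. The sketch below is only an illustration. The helper names build_prompt and composition_tags_in are hypothetical, the composition tag set is limited to the tags named in the Usage notes, and treating more than one composition tag as a possible conflict is an assumed heuristic, not a rule stated by the model authors.

```python
# Composition tags named in the Usage notes; this set is not exhaustive.
COMPOSITION_TAGS = {
    "close-up", "upside-down", "cowboy shot",
    "upper body", "portrait", "full body",
}


def build_prompt(tags):
    """Join tags with ', ' after trimming whitespace and dropping repeats (first-seen order)."""
    seen = set()
    ordered = []
    for tag in tags:
        tag = tag.strip()
        if tag and tag not in seen:
            seen.add(tag)
            ordered.append(tag)
    return ", ".join(ordered)


def composition_tags_in(prompt):
    """List the known composition tags present in a comma-separated prompt."""
    parts = [part.strip() for part in prompt.split(",")]
    return [part for part in parts if part in COMPOSITION_TAGS]


tags = ["1girl", "upper body", "cowboy shot", "looking at viewer", "upper body"]
prompt = build_prompt(tags)
found = composition_tags_in(prompt)
if len(found) > 1:
    # Assumed heuristic: several composition tags in one prompt may conflict.
    print("multiple composition tags:", found)
```

The same helpers work for negative prompts, since those use the same comma-separated tag format.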

Illustrious XL Series Update

It’s been a while since we released Illustrious XL v0.1, and we know many of you have been eagerly waiting for updates. We also recognize that many are disappointed with the closed-source nature of Illustrious XL v1.0, and we want to address this directly. A lot has happened since then, and we’re truly grateful for the open-source community’s contributions—whether it’s large-scale fine-tuned models, ControlNets, or the countless LoRAs and adapters that have been developed.

Development Journey

When we started working on the Illustrious XL series, our goal was simple: there weren’t any strong pretrained models available for illustrations, so we decided to build one ourselves—a pretrain-level fine-tuned model that artists and researchers could actually use.

We also knew that keeping everything in-house wouldn’t help the field move forward. That’s why we released v0.1 to the public and focused on training newer variations, pushing the model’s capabilities further with improved quality, deeper knowledge, and architectural refinements.

Along the way, we discovered something unexpected. The model wasn’t just good at illustrations—it could also interpret natural language, handle complex prompts, and generate high-resolution images, far beyond what we originally planned.

Our Model Versions

v0.1 (trained in May 2024)
v1.0 (July 2024)
v1.1 (August 2024)
v2.0 (September 2024)
v3 (November 2024)
v3.5 (a special variant incorporating Google’s v-parameterization)

These models take another step forward in natural language composition and image generation.

That said, we can’t drop everything all at once. There’s a clear roadmap ahead, and open-source releases are part of it. But rather than rushing, we want to do this the right way—with explanations, insights, and research-backed improvements.

Our Future Plans

Now, after months of work behind the scenes, we’re finally ready to move forward. We’ll be rolling out our latest models step by step while progressively open-sourcing previous versions so they can be studied and improved upon. Expect breakthroughs like true 2K-resolution generation and better natural language alignment along the way.

Commitment to Open Source

This will take time, but we’re moving fast. Our next-generation models are already in development, tackling some of the fundamental limitations of the base SD XL architecture. As we progress, older models will naturally be deprecated, and weight releases will follow accordingly. Our team aims to proceed thoughtfully, ensuring that each release is accompanied by comprehensive explanations and insights.

Backward Compatibility

One last thing—we’re not just here to release models. Every model we’ve built is designed with backward compatibility in mind, because Illustrious XL wasn’t just about making something new—it was about creating a better foundation for fine-tuning. That’s why we’ve put so much effort into training LoRAs properly, and soon, we’ll be sharing insights on how to train them more effectively.

Summary

In summary, Onoma AI plans to roll out open-source weights step by step and encourages the community to stay tuned for upcoming developments—we’re just getting started.
