ThinkDiffusionXL – v1.0

Related Keywords & Tags

Highlighted images

A menacing alien creature with sharp fangs, glowing red eyes, and detailed, textured skin, created using Stable Diffusion AI.

Alien Zombie in Detailed Horror Art

A dramatic AI generated image using Stable Diffusion of a grim reaper with a skeletal face, wearing tattered robes, standing on a rock with arms outstretched as lightning strikes. The background features a dark, stormy sky and rugged landscape.

Dark Skeleton Mage Ghostly Style Scene

A high-detail AI-generated image of a demonic creature with an intricately designed face, menacing teeth, and multiple horns emerging from its head, in a dark setting.

Demonic Creature with Horns in Dark Setting

A grim reaper in a dark forest with a misty, eerie atmosphere, created using stable diffusion.

GhostlyStyle Skeleton Mage by River

Highly detailed AI generated image using stable diffusion of a demonic creature with skeletal features and intricate, spiked armor. Dark, eerie atmosphere.

Renaissance Macabre Zombie Portrait

Tattoo Elegance on Female Nape

A terrifying monster with sharp fangs and a grotesque appearance, generated using Stable Diffusion AI.

Zombie Ghost Dragon in Ancient Stone City

Recommended Parameters

samplers	DPM++ 2M Karras
steps	30
cfg	4-8
resolution	1365×2048 to 4622×6753

Recommended Hires (High-Resolution) Parameters

upscale

None

Creator Sponsors

All sponsors are not affiliates of Diffus. Diffus provides an alternative online Stable Diffusion WebUI experience.

ThinkDiffusion

ThinkDiffusionXL (TDXL)

ThinkDiffusionXL is the result of our goal to build a go-to model capable of amazing photorealism that’s also versatile enough to generate high-quality images across a variety of styles and subjects without needing to be a prompting genius.

Please leave a review if you’re happy with it, this will encourage us to create more and improve on it.

The work

Data source: TDXL is trained on over 10,000 diverse images that span photorealism, digital art, anime, and more. The smallest resolution in our dataset is 1365×2048, but many images go up to resolutions as high as 4622×6753. In total, our dataset takes up 42GB.
Training: With 1.8 million steps, we’ve put in the work. For comparison, Juggernaut is at 600k steps and RealVisXL is at 348k steps
Hand-captioned images: Each image is carefully captioned by hand, enhancing the model’s ability to generate accurate and high-quality results from minimal prompts.
NSFW capabilities: The model includes over 1,000 tastefully curated NSFW images.

Our thoughts

Detail and quality: Most XL models in the Realistic category suffer from poor detail, especially in the background and even in basic features like eyes, teeth, and skin. We believe TDXL outperforms in these areas due to its large, high-quality dataset. For comparison, Juggernaut has about half the image material, and RealVisXL has only 1,700 images. Ultimately, TDXL simply possesses much more “knowledge”.
Less-Bias: We made sure to use an equal number of images for each style, gender, etc. Other models we tested over the past few months had some kind of bias, sometimes it was bias toward portrait shots, gender bias, certain ethnicities, etc. For instance, Juggernaut has a bias in the Close-Up area, and the Cinematic Light is quite dominant in that model. RealVisXL also has a bias towards Portrait shots. On the other hand, TDXL gives you what you want: Landscape, Midshot, Full Body, Close-Up, Portrait, Sideview, Backview, Action Shots, Cinematic…whatever you want without always being pushed in a certain direction due to a bias.
Versatile base: Because of its large balanced quality dataset, TDXL is versatile to serve as a base model for future trainings. You can create new finetunes in entirely different directions, add LoRAs to fill in missing concepts, or do additional trainings with more balanced quality data.

Contributor

Kate Thompson

I'm the gallery editor at Diffus and I write blogs on topics related to AI art. With expertise in Midjourney, Dalle 3, and Stable Diffusion, I actively contribute to Reddit, Facebook, and Discord communities. I meticulously curate top AI-generated content, ensuring our gallery's excellence.

Model collection - ThinkDiffusionXL

Ethereal dragon with translucent wings in a ruined palace, cloudy sky in the background, AI generated image using Stable Diffusion.

Checkpoints Models