Juggernaut XL v9 is one of the most important checkpoints in the
entire family because it sits exactly at the boundary between two eras.
It still belongs to the incremental-merge period of Juggernaut XL, but
its own release note openly says that this approach was getting harder
to sustain without quality regressions. In practical terms, v9 is the
moment where Juggernaut became a stronger photo model and, at the same
time, admitted it needed a deeper reboot.
If you only want the short version, this is the headline: standard
Juggernaut XL v9 + RunDiffusion Photo v2 was published on February
18, 2024, and a separate v9 Lightning variant followed on February
23, 2024. Those are related releases, but they are not the same
checkpoint and they should not be treated as the same model in guides or
benchmarks.
What V9 Actually Is
The official Hugging Face README calls the checkpoint
Juggernaut XL v9 + RunDiffusion Photo v2 Official. That wording is the
key to understanding the model.
V9 is not a pure single-source finetune. According to the release note:
- the Juggernaut-side training focused on skin details, lighting, and overall contrast
- the biggest change came from integrating RunDiffusion Photo Model v2
- the intended result was stronger photographic output than previous versions
That means v9 should be read as a hybrid release. The “v9” label tells you where it sits in the Juggernaut line, but the visual shift most users notice comes from the RunDiffusion photo merge.
Why V9 Matters in the Release History
The v9 README is unusually candid about the state of the project. The
author says that with each new release it had become harder to improve
Juggernaut without degrading some other area, and that this was already
visible in v8.
That admission matters because it explains three things at once:
- why v9 leans so heavily on RunDiffusion Photo v2
- why the model feels more intentionally photographic than some prior versions
- why the author immediately started talking about a complete
v10reboot with GPT-4 Vision recaptioning
So v9 is not just “the next checkpoint.” It is the public pivot point between incremental optimization and dataset-first rebuilding.
Release Dates and Naming Confusion
There are two public dates people often collapse into one:
- February 18, 2024:
V9 + RunDiffusionPhoto 2 - February 23, 2024:
V9+RDPhoto2-Lightning_4S
The second release is a speed-oriented Lightning derivative. It is not the base v9 release. If you see someone recommending four to six steps for “v9,” check whether they are actually talking about the Lightning version. The standard v9 checkpoint has much more traditional SDXL sampling guidance.
The Hugging Face repository for RunDiffusion/Juggernaut-XL-v9 was
created on February 18, 2024, which aligns with the base public v9
release date rather than the later Lightning release.
How V9 Differs From V8
V8 + RunDiffusion and v9 are close enough that people sometimes talk
about them interchangeably, but the release notes suggest they solved
different problems.
The public v8 summary emphasizes:
- hands
- feet
- skin details
- general photographic output
The public v9 summary shifts emphasis:
- targeted work on skin, lighting, and contrast
- bigger reliance on RunDiffusion Photo v2
- stronger overall photographic output than previous versions
- explicit discussion of the limits of continued incremental training
In plain language, v8 reads like a cleanup and quality push. v9
reads like a more opinionated photo-forward merge that also exposed the
ceiling of the current training strategy.
What V9 Looks Like in Practice
The signature of Juggernaut XL v9 is not raw novelty. It is controlled photographic polish.
Strengths
In the official notes and in community usage, v9 is especially strong for:
- portraits with better skin texture and local contrast
- editorial-style images with clean lighting and camera-like framing
- product-adjacent photography where realism matters more than painterly exaggeration
- prompts that benefit from a more photographic default without large negative prompt stacks
The bundled keyword list in the official README is revealing. It highlights categories like:
- architecture photography
- wildlife photography
- car photography
- food photography
- interior photography
- landscape photography
- hyperdetailed photography
- cinematic movie
- still mid shot photo
- full body photo
- skin details
That is less like a random token dump and more like a map of the model's intended comfort zone.
Tradeoffs
V9 is not documented as a magical all-purpose replacement for every other Juggernaut version. The author explicitly says training new versions was becoming difficult without quality sacrifices in some areas. The implication is important: v9 may be stronger photographically than earlier versions, but it is also the point where the project acknowledges that this tuning direction had costs.
That is one reason later releases moved toward recaptioning and broader retraining rather than simply piling on one more merge.
Official Recommended Settings
For the standard v9 checkpoint, the official Hugging Face README recommends:
- Resolution:
832x1216 - Sampler:
DPM++ 2M Karras - Steps:
30-40 - CFG:
3-7 - Negative prompt: start with none, then add only what you need
- VAE: already baked in
- Hi-res:
4xNMKD-Siax_200k,15steps,0.3denoise,1.5xupscale
Those settings are the standard-v9 baseline. They should not be confused
with the much lower-step Lightning setup used by the later
V9+RDPhoto2-Lightning_4S release.
Prompting Strategy for V9
V9 is most useful when you prompt like you are directing a photo shoot rather than dumping style soup into the prompt box.
1. Start With Shot Intent
The official token list makes it clear that camera framing language is
helpful. Phrases such as still mid shot photo, full body photo, or
domain-specific photography labels work because the model leans into
photographic categorization.
2. Let the Model Breathe Before Adding Negatives
The README specifically says to begin with no negative prompt. That is a meaningful instruction. It suggests the base visual prior of the model is already tuned enough that heavy negative prompt templates can easily overconstrain it.
3. Use Domain Cues Instead of Generic Hype Words
Architecture Photography is more informative than “masterpiece.”
Interior Photography is more informative than a pile of vague quality
tokens. V9 tends to reward prompts that identify subject, shot type,
lighting, and domain.
4. Keep CFG Moderate
The official CFG 3-7 guidance is fairly restrained for SDXL. Lower CFG
is explicitly described as looking more realistic. If your generations
start looking brittle or overcooked, pushing CFG higher is usually the
wrong move.
V9 vs V9 Lightning
This deserves its own section because the confusion is common.
Standard V9
- published February 18, 2024
- tuned for conventional SDXL sampling
- official recommendation is
30-40steps - best thought of as the full-quality base release
V9 Lightning
- published February 23, 2024
- separate speed-oriented derivative
- designed around a much faster workflow
- useful when generation speed matters more than preserving the exact sampling behavior of the base checkpoint
If you are evaluating image quality, compare standard v9 to other base checkpoints. If you are evaluating turnaround time, compare Lightning to other accelerated variants. Mixing those categories produces bad conclusions.
V9 vs Later Juggernaut Releases
V9 also becomes clearer when you compare it with what came after it.
V9 vs Juggernaut X
X belongs to the post-v9 reboot mindset. By that point, the public
conversation had shifted toward better prompt precision and coherence,
not just one more photo-oriented merge. V9 is therefore a late
incremental-era checkpoint; X is the start of the next chapter.
V9 vs XI
XI is publicly described as using the whole Juggernaut 9 dataset
recaptioned across 15k images. That makes XI feel more data- and
caption-driven. If v9 is about improving photographic output through
targeted training and a strong photo-model merge, XI is about improving
prompt adherence by rebuilding the underlying descriptive layer.
V9 vs XII
RunDiffusion materials position XII as the more expressive, more cinematic sibling to XI. That makes v9 feel narrower and more grounded: it is a photorealistic workhorse rather than the later, more stylized branch.
Should You Still Use V9?
Yes, if what you want is a practical SDXL checkpoint with:
- a strong photographic bias
- clear, usable official settings
- good portrait and editorial behavior
- less interest in “latest architecture” bragging rights and more interest in reliable visual output
V9 is especially relevant if you are studying the Juggernaut family historically. It is the clearest example of the old formula reaching its peak and its limit at the same time.
Bottom Line
Juggernaut XL v9 is not famous because it is merely old. It is famous because it captures the exact moment when Juggernaut's photo realism, RunDiffusion collaboration, and architectural constraints all became visible in one release.
It improved skin detail, lighting, and contrast. It absorbed a major upgrade from RunDiffusion Photo v2. It shipped with practical settings that many users could follow immediately. And it exposed the need for a more fundamental reboot that shaped the models that came after it.
That is why v9 still deserves its own discussion instead of being folded into a generic “mid-series release” paragraph.
Sources
Try it live
Explore Juggernaut XL in the workspace
Use the built-in workspace to test prompts, compare results, and turn the model notes from this article into actual output.
