
A realistic model, built for quality and creative range.
ze·nith
noun. the time at which something is most powerful or successful.
RobMix Zenith is the next iteration on my series of artisan photorealistic model merges with unnecessarily superlative version names.
This merge is like a classic martini—simple, with just a couple of ingredients, but mixed with precision and handled with care. It blends RobMix Evolution with Corcel's fantastic Mobius base model, with block-by-block tuning to draw out the best of the RobMix style with the best of Mobius' quality and creativity.
Note: Mobius requires a clip skip of -3. This merge doesn't, but you can get some interesting results by experimenting with clip skip values between -1 and -3.
Like my other merges, it's geared toward a photographic style with an emphasis on balancing realism with creativity, but also has some gems with illustrated or artistic styles if you prompt for them.
This model works great as a plug-and-play model out of the box, but it shines with some workflow optimizations. I've made some suggestions at the end of this post, and you can try them out with my workflows here.
Recommended Settings
In the sample images, second pass is a 1.5x latent upscale, 0.3 to 0.4 denoise, 40 steps. Everything was generated in Comfy.
-
Sampler: DPM++ 3M SDE
-
Scheduler: AlignYourSteps
-
CFG: 3-4 (or use Automatic CFG)
-
Steps: 30-40
-
Clip Skip: -2 or -3
-
Aspect Ratio: 1:1, 2:3, 3:4, 16:9, 21:9, vertical or horizontal
Advanced Settings
-
FreeU v2
-
b1: 1.05
-
b2: 1.08
-
s1: 0.95
-
s2: 0.8
-
-
Perturbed Attention Guidance
-
Scale: 0.5-1
-
Adaptive Scale: 0.1
-
How to prompt this model
This model works best with natural language style prompting. I've gotten the very best results by separating CLIP-G and CLIP-L, using natural language in CLIP-G and SD 1.5-style keyword based prompting in CLIP-L.
I've created a custom GPT to help with this. By default, it will generate CLIP-G style prompts, but you can optionally ask it for CLIP-L and/or T5 style prompts. The GPT follows my Prompt Pyramid style of prompting, which may not be the best, but it's how I do things.
Example CLIP-G*
A high-resolution, atmospheric photograph capturing a serene sunset over a mountainous landscape. The composition features a lone tree standing on a hillside, silhouetted against the warm, golden light of the setting sun. The sky is a gradient of soft oranges and yellows, blending into the horizon. Rays of sunlight stretch across the scene, creating long shadows and adding depth to the rolling hills. The overall mood is peaceful and contemplative, with a harmonious balance between light and shadow. The exposure is perfectly balanced, emphasizing the natural beauty and tranquility of the landscape.
* If your prompt exceeds 75 tokens, be sure to properly handle concatenation.
Example CLIP-L
High-resolution photograph, young woman, leaning out of vintage red car window, arms crossed on door, head tilted, calm contemplative expression, curious gaze, engaging connection, framed upper body, smooth vintage vehicle lines, nostalgic feel, softly blurred background, serene reflective mood, muted warm tones, timeless quality.
描述:
RobMix Evolution builds on my first merge, RobMix Ultimate.
This version starts with RobMix Ultimate, then merged block-by-block with some fantastic models to draw out the best parts of each and mitigate the downsides. My goal was to expand the creative range of the model while retaining the photorealistic style.
You can read my process for early experiments with Evolution on Medium. Note that the models referenced in the post went into a personal merge that I called Evolution A and Evolution B—the models referenced in that post aren't included in Evolution D.
The final stage of Evolution, Evolution D, started by blending:
-
Photonium for photorealism
That blend was merged back into RobMix Ultimate.
Next, I added a block-by-block merge of Juggernaut v9, followed by a dash of ICBINP XL v4.
This model can be sensitive to high CFGs.
Recommended settings:
-
DPM++ 3M SDE
-
AlignYourSteps scheduler
-
CFG 3.0-4.0
-
40 steps
-
Clip Skip -3
The higher the CFG, the more steps you'll need. If it's speed you're looking for, you may want to look elsewhere. Alternatively, consider using the genius Automatic CFG node in Comfy and set your CFG wherever you'd like.
This model is generally very capable with only a single pass, but really shines in quality with a 1.5x latent upscale at 0.30 to 0.40 denoise with an additional 40 steps.
With this full workflow, a generation on my 3080 takes around 40 seconds.
To get the very best results, add the following optimizations:
Free U
-
b1: 1.05
-
b2: 1.08
-
s1: 0.95
-
s2: 0.88
Perturbed Attention Guidance
-
Scale: 1.5
-
Adaptive scale: 0.100
Token Downsampling
-
Depth 1: 2.00
-
Depth 2: 2.00
训练词语:
名称: robmix_evolutionV20.safetensors
大小 (KB): 6775458
类型: Model
Pickle 扫描结果: Success
Pickle 扫描信息: No Pickle imports
病毒扫描结果: Success