
This is a Stable Cascade Stage C (full) and text encoder fine-tune specifically designed to generate images of the 1000-THR "EARTHMOVER" from ULTRAKILL. It's mostly a test of how well Cascade can be trained, and judging by the results, the answer is very well. I did several tests using various models (SD 1.5, SD 2.1, SDXL), and this one is by far the best. It handles small details such as the number of eyes, shape and position of the spear, and the environment very well, whereas other models will constantly get even basic things wrong.
Note that, however, here, the text encoder is doing most of the heavy lifting; you could generate half-coherent images of the Earthmover even with the base stage C model, however, this fine-tuned one puts the pieces in the right places.
Note that these are the raw U-Net and Text encoder checkpoints, so to use them in ComfyUI, you'll need to load them with the UNETLoader and CLIPLoader nodes respectively, with the files being placed in the unet and clip folders, in the models directory.
Have fun!
描述:
The text encoder. Very important!
训练词语: earthmover,charging spear,lights
名称: 1000THRStableCascade_textEncoder.safetensors
大小 (KB): 1356819
类型: Model
Pickle 扫描结果: Success
Pickle 扫描信息: No Pickle imports
病毒扫描结果: Success