
Description
(for best results, read the full description - usage guide below)
This is a merge of some random anime based and cartoon based models to achieve a somewhat cartoony anime style, more similar to what you would actually see in anime as opposed to the more common hyper-detailed anime models.
Versions 3, 4, and 4.5 include some custom training to further enhance the style. More details available in "About this Version" on the sidebar. Most positive prompts for the v3 sample images were randomly generated.
Usage Guide
-
(highly recommended) Use a negative embedding for best results
-
I use verybadimagenegative_v1.3 (all examples use this)
-
verybadimagenegative_v1.3
-
Place the downloaded file into the "embeddings" folder of the SD WebUI
-
In the negative prompt, paste "verybadimagenegative_v1.3"
-
-
(highly recommended) Upscaling at 2x (or more) is important to getting a good result. I would recommend the following settings:
-
Denoising strength of 0.45
-
Use the "R-ESRGAN 4x+ Anime6B" upscaler for a flatter look, or use "Latent" for a bit more detail
-
Leave hires steps at default of 0 (equal to your generation steps)
-
-
(highly recommended) Use DPM++2M Karas as the sampler. Other samplers can yield odd artifacts, though your mileage may vary depending on your specific setup.
-
(only for version 3 and below) Use the dynamic thresholding plugin (all example images do with cfg scale 10 mimic 7): https://github.com/mcmonkeyprojects/sd-dynamic-thresholding
-
Set the CFG scale to 10.0
-
Click the checkbox "Enable Dynamic Thresholding (CFG Scale Fix)"
-
Set the Mimic CFG Scale to 7
-
If you don't want to use this plugin, then set the config scale to 5 or 6
-
-
This model is very easy to prompt, and does not require a ton of prompt engineering to get good results. The following format will yield decent results:
-
Prompt:
-
(best-quality:0.8), perfect anime illustration, <normal description of the image, e.g. a woman running in tokyo at night, a flaming meteor, etc.>
-
-
Negative:
-
(worst quality:0.8), verybadimagenegative_v1.3, (surreal:0.8), (modernism:0.8), (art deco:0.8), (art nouveau:0.8)
-
-
-
The model is capable of NSFW
描述:
The new version was trained against a photorealistic dataset, then this trained version was subtracted against v2 using block merging to effectively invert the training - amplifying the cartoony parts and de-emphasizing the photoreal parts. The result is better performance at lower CFG scales, and a style that corrects the slight over-correction towards realistic in the previous version. The results are much more like what I had originally intended v2 to be - cleaner lines than v1, better colors at lower CFG scales (CFG of 6 will work quite well now if you don't have the dynamic threshold scaling plugin), and a more consistently hand-drawn look.
训练词语:
名称: flat2DAnimerge_v30.safetensors
大小 (KB): 2082643
类型: Model
Pickle 扫描结果: Success
Pickle 扫描信息: No Pickle imports
病毒扫描结果: Success