MakkiXL版本v0.1 (ID: 958667) 综合资源合集综合资源合集

Resumed from Illustrious XL v0.1 by makkishizu.

Mainly trained on high-rated Danbooru 2024 images, using 40k images on a 4060Ti over 2+8 epochs.

Due to computing limitations, I did not train on an extremely large dataset. The primary goal was to improve the “default art style”(no artists tag) of Illustrious XL while tracking user preferences. Refer to the following comparison sample images:

If you prefer models trained on larger datasets, you can try noobai-xl-nai-xl.

Usage Guidelines

This model is trained using Danbooru-style tags, not natural language! Please use tags only for optimal results.

The usage is basically the same as Illustrious XL, you can adjust the art style using the artist list. If you are interested in the artist list but don't know how to write it, you can try using novelai_300artist

This model supports resolutions from ARB 1024x1024, with a minimum resolution of 256 and a maximum resolution of 2048. While standard SDXL resolution can be used, it is recommended to opt for a slightly higher resolution than 1024x1024. Applying a hires-fix is also suggested for better output quality.

For more details, check the sample images provided.

In addition to following the usage instructions of Illustrious XL:

Recommended sampling method: Euler a, Sampling Steps: 20–28, CFG: 5–7.5 (may vary based on use case).

The model supports quality tags such as: "worst quality," "bad quality," "average quality," "good quality," "best quality," and "masterpiece (quality)."

This model also balances the distribution of rating tags, allowing you to distinguish images by different rating levels:

Rating Modifier	    Rating Criterion

safe	            General

sensitive	        Sensitive

nsfw	            Questionable

explicit, nsfw	    Explicit

Recommended prompt format:

<|special|>, 

<|characters|>, <|copyrights|>, 

<|artist|>, 



<|general|>, 



<|quality|>, <|meta|>, <|rating|>

Recommended Negative Prompt:

worst quality, comic, multiple views, bad quality, low quality, lowres, displeasing, very displeasing, bad anatomy, bad hands, scan artifacts, monochrome, greyscale, twitter username, jpeg artifacts, 2koma, 4koma, guro, extra digits, fewer digits, jaggy lines, unclear

Training Details:

The dataset for training this model was sourced from hakubooru.

The training of MakkiXL was facilitated by the LyCORIS project and the trainer from lora-scripts.

The original LoKr file is also provided as the "makki_illustrious_lokr" version. For detailed settings, refer to the LyCORIS config file from makki_illustrious_lokr.

Hardware: RTX 4060Ti



Num Train Images: 43,216



Total Epoch: 2+8



Total Steps: 6760



Batch Size: 1



Grad Accumulation Step: 64



Equivalent Batch Size: 64



Optimizer: Lion8bit



Learning Rate: 5e-5 for UNet /NO train TE



LR Scheduler: Constant



Min SNR Gamma: 5



Noise Offset: 0.03



Resolution: 1024x1024



Min Bucket Resolution: 256



Max Bucket Resolution: 2048



Mixed Precision: BF16

License

This model is released under the Fair-AI-Public-License-1.0-SD.

Please check this website for more information:

Freedom of Development freedevproject.org

Contributors' Repositories

lora-scripts

hakubooru

Thanks to onommai open source for providing such a powerful base model.

描述:

v0.1:
中文版本

MakkiXL

基于 Illustrious XL v0.1版本进行训练。

主要针对 Danbooru 2024 的高评分图片进行训练，使用 4w 张图片在 4060Ti 上训练了 2+8 个 epoch。

由于算力限制，我并没有使用特别大量的数据集进行训练。最初目的为以相对用户偏好改善 Illustrious XL 的“默认画风”（无画师串）。

如果你喜欢使用更大量数据集训练的模型，可以尝试使用 noobai-xl-nai-xl。

使用指南

该模型使用的是 Danbooru 的标签形式进行训练，而非自然语言！请务必以 tag only 的形式进行使用。

用法与 Illustrious XL 基本相同，你可以使用画师列表对画风进行调整。（如果你对画师列表感兴趣但不知道如何编写，可以尝试使用novelai_300artist）

该模型以 1024x1024 分辨率为基准，最低分辨率为 256，最高分辨率为 2048。虽然可以使用标准 SDXL 分辨率，但建议使用比 1024x1024 稍高的分辨率。同时建议应用 hires-fix 以获得更好的结果。

如需更多详细信息，请查看提供的示例图片。

除了遵循 Illustrious XL 公开的使用方法外：

推荐的采样方法：Euler a，采样步数：20–28，CFG：5–7.5（可能因使用场景不同而变化）。

该模型支持质量标签，如："worst quality," "bad quality," "average quality," "good quality," "best quality," 和 "masterpiece (quality),"

该模型还针对评分的分布进行了平衡，您可以使用以下评分标签来区分不同评分的图片：

评分修改	          评分标准

safe	            安全

sensitive	        敏感

nsfw	            色色

explicit, nsfw	    色色+

推荐的提示词格式：

<|special|>, 

<|characters|>, <|copyrights|>, 

<|artist|>, 



<|general|>, 



<|quality|>, <|meta|>, <|rating|>

推荐的负面提示词:

worst quality, comic, multiple views, bad quality, low quality, lowres, displeasing, very displeasing, bad anatomy, bad hands, scan artifacts, monochrome, greyscale, twitter username, jpeg artifacts, 2koma, 4koma, guro, extra digits, fewer digits, jaggy lines, unclear

训练内容详情：

该模型的训练数据集来源于 hakubooru 。

MakkiXL 使用LoKr形式进行训练，我将同时提供原始的 LoKr 文件，名为 "makki_illustrious_lokr"。有关详细设置，请参考 makki_illustrious_lokr 的元数据。

硬件：RTX 4060Ti



训练图片数量：43216



总 Epoch：2+8



总步数：6760



批量大小：1



梯度累积步数：64



等效批量大小：64



优化器：Lion8bit



学习率：5e-5 用于 UNet / 不训练 TE



学习率调度器：Constant



最小 SNR Gamma：5



噪声偏移：0.03



分辨率：1024x1024



最小 分桶 分辨率：256



最大 分桶 分辨率：2048



混合精度：BF16