A5 Stabilizer - RLHF D3PO - A5稳定器版本v1.0 (ID: 288531)

Please use sd-webui-additional-networks to load this model !

请使用 sd-webui-additional-networks 来加载本模型！

Introduction 简介

Q0: What is this model? 这个模型是什么？

A0: ? This is a stabilizer trained for Anything V5. 这是一个为 Anything V5 训练的“稳定器”,

Q1: What's the use of this model? 这个模型有什么用？

A1.1: ? This model can to some extent fix the collapsed structure in the generated illustration. 该模型能够一定程度上修复画面的崩坏部分,

A1.2: ? This model can fix the pupils of the generated characters. 该模型能够修复生成人物的瞳孔,

A1.3: ? This model modifies the lightning effects of the generated illustrations, so that they will be less look like generated by AI (i.e., prefered by human). 该模型会调整生成画像的光影效果，使之看起来“没那么像是AI生成的” （换言之，被人类偏好）,

A1.4: ? This model seems to better align the generated image with the input prompts. 该模型似乎能够更好地对齐生成的图片与输入的提示词,

Q2: How is this model Trained? 这个模型是如何训练的？

A2.1: ? This model is trained by Reinforcement Learning from Human Feedback (RLHF), which has been widely used in LLMs. 该模型由人类反馈强化学习技术训练，该技术已被广泛应用于微调LLMs,

A2.2: ? To be specific, this model is trained by D3PO, see Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model (arxiv 2311.13231). 具体而言，该模型的训练基于D3PO方法，详见Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model (arxiv 2311.13231),

A2.3: ? This model is instructed by a self-trained reward model in the training process, using 4 metrics (likes, collections, AI-generated probability, and views). The reward model is available at chikoto/ConvNeXtV2-IllustrationScorer. 该模型在训练时由一个自己训练的奖励模型指导，使用点赞数，收藏数，由AI生成的概率，以及浏览量四个指标来评价,该奖励模型可以在 chikoto/ConvNeXtV2-IllustrationScorer 找到,

Some Nonsense Crap 一些无关紧要的东西

? Any idea and suggestion is welcomed! 欢迎任提供何想法和建议！?

? Welcome to challenge NAI3 with a mortal body. 欢迎各位以凡人之躯挑战NAI3 :) ?

描述:

2024/1/6 First released the 210-epoch checkpoint.

训练词语:

名称: RLe210.safetensors

大小 (KB): 24951

类型: Model

Pickle 扫描结果: Success

Pickle 扫描信息: No Pickle imports

病毒扫描结果: Success

A5 Stabilizer - RLHF D3PO - A5稳定器

资源下载

下载价格VIP专享

仅限VIP下载升级VIP

犹豫不决让我们错失一次又一次机会！！！

原文链接：https://1111down.com/946278.html，转载请注明出处

A5 Stabilizer - RLHF D3PO - A5稳定器版本v1.0 (ID: 288531)

Please use sd-webui-additional-networks to load this model !

请使用 sd-webui-additional-networks 来加载本模型！

Introduction 简介

Some Nonsense Crap 一些无关紧要的东西

描述:

2024/1/6 First released the 210-epoch checkpoint.

训练词语:

在线客服

升级VIP

全屏浏览

夜间模式

繁简切换

返回顶部

A5 Stabilizer - RLHF D3PO - A5稳定器版本v1.0 (ID: 288531)

Please use sd-webui-additional-networks to load this model !

请使用 sd-webui-additional-networks 来加载本模型！

Introduction 简介

Some Nonsense Crap 一些无关紧要的东西

描述: 2024/1/6 First released the 210-epoch checkpoint.

训练词语:

猜你喜欢

Kesha Lora版本V1 (ID: 1298881)

Iron Patriot版本v1.0 (ID: 1291511)

Dark Ishihara版本V1 (ID: 1303876)

Female dancer posing SDXL版本V1 (ID: 1249895)

ybqy版本V1 (ID: 1314652)

Halina Pawlowská版本v1.0 (ID: 1294642)

在线客服

升级VIP

全屏浏览

夜间模式

繁简切换

返回顶部

社交账号快速登录

社交账号快速登录

描述:

2024/1/6 First released the 210-epoch checkpoint.