
Ponydiffusion is an excellent model for 2D content, but it is rather inconsistent with 3D. This model is designed to produce photorealistic 3D images of a variety of subjects more consistently. The current beta still gives images a somewhat CGI look, which I believe is because I do not have enough sample images yet; hopefully future versions will be more realistic. For now, check the description of each version for details on what it does and what its drawbacks are.
Description:
While I am calling this release 2.1, there's a fairly significant number of changes in it. For one, I've shortened the trigger tag to "source_photo" for ease of use (though I could probably shorten it to just "photo" in the future), and captioned the images with Booru tags instead of sentences. It's still capable of processing sentence-style prompts, though it does seem to prefer Booru tags when specifying backgrounds. This version also doesn't need an excess of negatives like the prior versions, and seems to function best with a CFG scale of 4-5 and the DPM++ 2M Karras sampler at around 15-17 steps. After some additional testing, I have determined that a CFG scale around 7 with the Euler A Automatic sampler at 30 steps also yields very good results, though it takes a bit longer on lower-end hardware. Depending on the character, you may need to add more negative and positive tags to enforce realism and discourage a 2D or CGI effect. I also recommend putting "plastic,plastic skin,overexposure,blurry" in the negatives for most prompts unless you like the shiny skin effect that most realistic AI models seem to give people.
Trigger words: source_photo
Name: Ponyrealv3.safetensors
Size (KB): 111773
Type: Model
Pickle scan result: Success
Pickle scan message: No Pickle imports
Virus scan result: Success