Hitomi T版本v1.10 (ID: 60814)

Hitomi T版本v1.10 (ID: 60814)

Background:

This Textual Inversion embedding bears a striking resemblance to a famous JAV Actress with world-renowned assets.

Trained using base SD1.5 on 50 high quality 512x512 cropped images of the subject "modeling" with text removed from the images. The embedding is 16 tokens--that seems excessive, but it's the only way I got good results.

I'm very new to this, and I've found that training is, unfortunately, an art, not a science. So, this embedding isn't perfect and has some limitations. Please keep that in mind when you rate!

Note: Remove the ".zip" from the end of the file name and place in the embeddings folder. The default trigger is 1hit1.

Important Points:

  • For whatever reason, the embedding works worse for realistic generations and on the base SD1.5 model. Other models and artistic styles produce better results.

  • The results get much better with prompt engineering and, especially, lengthy prompts.

  • You definitely have to adjust the weight of the trigger word relative to the rest of the prompt.

  • Generations of the subject's assets can be prone to distortions, especially when they are exposed. This doesn't occur often, but does happen. I believe this was due to the fact that the extreme size of those assets made it impossible to fit both them and the subject's face in a close-up when cropping to a square image. This seems to be mitigated by using words in the prompt to describe those assets.

  • You will need to include words like naked, nude, and topless in the negative prompt to avoid accidental wardrobe malfunctions (we wouldn't want that, now would we?).

  • I've found that including open mouth and teeth in the negative prompt improves generations.

  • Unless you include strong and specific descriptions of your intended background scenery or setting in your prompts, generations will tend to incorporate the following elements: palm trees, desert plants, brick walls, distant buildings, sunny weather, and the interior of houses. Prompting for background scenery and setting elements seems to mitigate this effect.

  • For further tips & tricks, see the PNG info in the sample images. Yes, my prompting style is weird and complicated, but it works, right?

Samples:

These sample generations were done in Galaxy Time Machine Photo for You and Deliberate. The crappier looking ones were made using simple prompts; the better looking ones required extensive prompt engineering. I didn't use HiRes Fix, Face Restoration, ControlNet, Img2Img, Lycoris, or Negative Embeddings on any of these generations. I believe I used model LORAs on a few, but none for concepts or subjects. Your results will probably improve if you use any of those!

Note: The new religious image I uploaded uses a lot of tricks. Oh, also, the one demon picture uses a ton of stuff too. Just showing what it's possible to do with the embedding and some extreme engineering.

PREVIEW

I've been working on Version 2 and had a breakthrough--it was leaps and bounds better than Version 1... but, after playing around with it more, there were some inconsistencies with generation. When it worked, it worked much better, but it was way more difficult to prompt for. Basically, unless you prompted for specific "features" of the subject, they wouldn't show up well.

I'm going to try fixing what I think the issue was (likely captioning) and try training again sometime soon. My hope is that I can produce an embedding that gets results as good as the one below with simpler prompts and more consistency.

描述:

Hitomi T - Version 1.10:

  • Trigger Word: 8hit8

  • Rename 8hit8.pt.zip to 8hit8.pt and place in the embeddings directory

The new version of my Hitomi T textual inversion embedding is finally out! It solves some of the issues with my last embedding, but does admittedly have some of its own, as noted below.

I'm still new at this and I'm always learning, so feel free to drop me some tips and tricks. I suspect that the issues I'm having with getting a consistent likeness during generations are related to my dataset somehow.

Note: Some of the preview images use a different trigger word for the embedding. This is not because it's a different version of the embedding, but because I was changing the embedding name as I was training and testing. In order to replicate the previews, you will need to change the trigger to 8hit8 in the prompt.

Improvements Over Version 1.0:

  • When the embedding gets good results, it seems like it gets a much closer likeness.

  • Results seem to default to a more photorealistic style, i.e., they are more detailed regardless of other style prompting. So, if you prompt for a fantasy style painting, it will still look like a painting in that style, but with finer intricacies than if you just prompted using something like "woman."

  • Background elements from the dataset, i.e., desert plants and stone walls, no longer subtly influence generations.

  • It's much easier to change things like clothing, poses, and hair styles on the subject.

  • It seems to be easier to get the subject to keep their clothing on.

Challenges and Caveats:

  • For whatever reason, the embedding works worse for realistic generations and on the base SD1.5 model. Other models and artistic styles produce better results.

  • The subject's assets might not be distorted as much as in Version 1.0; however, the size isn't as true-to-life. You might need to prompt about those assets to make them an appropriate size. This doesn't seem to be much of an issue, but it still seems to crop up.

  • The likeness seems to blend very easily if you prompt for other people or types of people, i.e., "basketball player." Facial features will blend and not resemble the subject as much.

  • As such, you might have to mess around with the weight of the trigger word more in order to maintain a good likeness, moreso than in Version 1.0.

  • This embedding still favors lengthier and more complex prompts.

  • There seem to be more frequent issues with hands, specifically fingernails. I would recommend using negative prompts and embeddings to correct hand issues.

  • You still need to use negative prompts to get the subject to keep their clothing on, i.e., "naked," "topless," and "nudity."

  • For further tips & tricks, see the PNG info in the sample images. Yes, my prompting style is weird and complicated, but it works, right?

Sample Generations:

These sample generations were done in Galaxy Time Machine Photo for You. I didn't use HiRes Fix, ControlNet, Img2Img, Lora, Lycoris, or Negative Embeddings on any of these generations. I did use Face Restoration on a few.

The Future:

I would like to improve the consistency of generations with a good likeness, but I've hit a bit of a wall. If anyone has any tips to resolve this issue, please let me know. I'm still a beginner and have lots of room for improvement!

训练词语: 8hit8

名称: 8hit8.pt

大小 (KB): 49

类型: Model

Pickle 扫描结果: Success

Pickle 扫描信息: No Pickle imports

病毒扫描结果: Success

Hitomi T

Hitomi T

Hitomi T

Hitomi T

Hitomi T

Hitomi T

Hitomi T

Hitomi T

Hitomi T

Hitomi T

资源下载
下载价格VIP专享
仅限VIP下载升级VIP
犹豫不决让我们错失一次又一次机会!!!
原文链接:https://1111down.com/912022.html,转载请注明出处
由于网站升级,部分用户密码全部设置为111111,登入后自己修改, 并且VIP等级提升一级(包月提升至包季,包季提升到包年 包年提升至永久)
没有账号?注册  忘记密码?

社交账号快速登录