This Adetailer model will segment speech bubbles, text and watermarks commonly found in training data. Trained this so I could eventually automatically clean images in a dataset. Only tested on Comfy, but should work on other webUIs too. This is a WIP, and I have many things in mind on which could be improved:
Known issues:
-
make sure you don't set minimum confidence too low, or else undesired objects will be segmented
-
can misidentify watermarks for text, speech bubbles for logos etc. but this should not matter since they are segmented anyway
-
Some text that is transparent/partially hidden won't be identified
-
Trained primarily on NSFW images, may not work too well with comics, images with large/strange fonts etc.
描述:
Increased dataset size, better annotation
训练词语:
名称: adetailerForTextSpeech_v20.zip
大小 (KB): 130139
类型: Archive
Pickle 扫描结果: Success
Pickle 扫描信息: No Pickle imports
病毒扫描结果: Success