Qwen 2vl Flux, checkpoints version (ID: 1100356)

Original project: https://huggingface.co/Djrango/Qwen2vl-Flux

Qwen2vl-Flux is a state-of-the-art multimodal image generation model that enhances FLUX with Qwen2VL's vision-language understanding capabilities. This model excels at generating high-quality images based on both text prompts and visual references, offering superior multimodal understanding and control.

  • ComfyUI does not currently support the CLIP+LLM portion, and no nodes are available to load it

  • This release is only for reviewing and testing the fine-tuned part of the Flux model

  • CFG set to 1 on KSampler

  • Rendered one image in 150 s on an 8 GB GPU at 512 px and 10 steps, using the bf16 model

  • This model will be available in 3 formats, each named after the folder it belongs in

    • diffusion_models - Diffusers format; this is just the merged safetensors file from the HuggingFace page

    • checkpoints - Converted to Flux Transformers format and prefixed for stable_diffusion compatibility; does not include CLIP or VAE

    • unet - I will provide the q4_0 and q8 variants; leave a comment if you'd like to see any other quants
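
The folder naming above can be sketched as a small helper. This is only an illustration, assuming the usual ComfyUI layout in which model folders sit under a `models/` directory; the names `FORMAT_FOLDERS` and `target_dir` are hypothetical and not part of ComfyUI.

```python
from pathlib import Path

# Format name -> short description, taken from the list above.
# The format name doubles as the folder name.
FORMAT_FOLDERS = {
    "diffusion_models": "diffusers-format merged safetensors",
    "checkpoints": "converted checkpoint, no CLIP or VAE",
    "unet": "quantized variants (q4_0, q8)",
}


def target_dir(comfy_root: str, fmt: str) -> Path:
    """Return the folder a given format should be placed in.

    Assumption: model folders live under <comfy_root>/models/.
    """
    if fmt not in FORMAT_FOLDERS:
        raise ValueError(f"unknown format: {fmt!r}")
    return Path(comfy_root) / "models" / fmt
```

For example, `target_dir("ComfyUI", "checkpoints")` gives the folder for the file described on this page, provided your install follows that layout.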

Description:

  • This version goes in the checkpoints folder

  • This version is used with the Load Checkpoint node

  • VAE and CLIP are not included; use the standard Flux setup
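
To confirm for yourself that the file holds only the diffusion model weights, you can inspect the safetensors header without loading any tensors. The sketch below uses only the standard library and relies on the safetensors layout (an 8-byte little-endian header length followed by a JSON header); the helper names are hypothetical, and the exact key prefixes in this file are not stated on this page.

```python
import json
import struct
from collections import Counter


def read_safetensors_header(path: str) -> dict:
    """Read only the JSON header of a .safetensors file (no tensor data)."""
    with open(path, "rb") as f:
        # First 8 bytes: little-endian unsigned 64-bit header length.
        (header_len,) = struct.unpack("<Q", f.read(8))
        return json.loads(f.read(header_len))


def prefix_counts(header: dict, depth: int = 2) -> Counter:
    """Count tensor names by their first `depth` dot-separated components."""
    counts = Counter()
    for name in header:
        if name == "__metadata__":  # optional metadata entry, not a tensor
            continue
        counts[".".join(name.split(".")[:depth])] += 1
    return counts
```

Calling `prefix_counts(read_safetensors_header("qwen2vlFlux_checkpoints.safetensors"))` lists the key groups in the file. If the description above holds, you would expect to see a diffusion model prefix and no groups for a VAE or text encoder.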

Trained words:

Name: qwen2vlFlux_checkpoints.safetensors

Size (KB): 23245040

Type: Model

Pickle scan result: Success

Pickle scan message: No Pickle imports

Virus scan result: Success
