Flux Dev Q5_K_M GGUF quantization (a nice balance of speed and quality in under 9 gigabytes)版本v1.0 (ID: 795785)

Flux Dev Q5_K_M GGUF quantization (a nice balance of speed and quality in under 9 gigabytes)版本v1.0 (ID: 795785)

NOTE: Ignore the model format listed! This is not an NF4 ONNX model, it is a Q5_K_M GGUF model.

This is a GGUF of flux_dev quantized in Q5_K_M GGUF format that should provide a significant quality boost over 4-bit quantizations while being a lot smaller than the 8-bit version (and since it's a relatively small GGUF, load times should be significantly improved over FP8 as well). This model is ideal of mid-sized graphics cards, and in my tests (without any memory optimizations such as offloading t5 onto the CPU) fits comfortably in 16GB of VRAM, and may work on as low as 8GB (if you have under 16GB of VRAM, please test it and leave a comment about whether it works for you).

UPDATE: Per this comment, this quant will work on systems with 8G of VRAM (Thanks to @VolatileSupernova for testing and responding!)

Tested and working in ComfyUI on my RTX 3050 with 8GB VRAM using ViT-L-14-TEXT-detail-improved-hiT-GmP-TE-only-HF for CLIP-L and t5-v1_1-xxl-encoder-Q4_K_M for T5. I usually use the Q4_K-S model which gives me images in 6.4 seconds per iteration at 896x1152 resolution, this model with the same settings and only the model changed gives me them in 7.5 seconds, not a big change at all! It does mean that unfortunately I can't use any Loras with your K_M model since it just barely fits in my VRAM but I'd rather have the higher quality than use Loras!

EDIT: I can actually use the less than 20MB Loras without issue!

Apart from being quantized, this is an unmodified version of Flux Dev that has not been finetuned in any way. It should get along just fine with any LoRAs that will work with the full size or FP8 versions of the model.

描述:

训练词语:

名称: fluxDevQ5KMGGUFQuantizationA_v10.gguf

大小 (KB): 8230617

类型: Model

Pickle 扫描结果: Success

Pickle 扫描信息: No Pickle imports

病毒扫描结果: Success

名称: fluxDevQ5KMGGUFQuantizationA_v10.zip

大小 (KB): 8186165

类型: Model

Pickle 扫描结果: Success

Pickle 扫描信息: No Pickle imports

病毒扫描结果: Success

Flux Dev Q5_K_M GGUF quantization (a nice balance of speed and quality in under 9 gigabytes)

Flux Dev Q5_K_M GGUF quantization (a nice balance of speed and quality in under 9 gigabytes)

Flux Dev Q5_K_M GGUF quantization (a nice balance of speed and quality in under 9 gigabytes)

Flux Dev Q5_K_M GGUF quantization (a nice balance of speed and quality in under 9 gigabytes)

Flux Dev Q5_K_M GGUF quantization (a nice balance of speed and quality in under 9 gigabytes)

Flux Dev Q5_K_M GGUF quantization (a nice balance of speed and quality in under 9 gigabytes)

Flux Dev Q5_K_M GGUF quantization (a nice balance of speed and quality in under 9 gigabytes)

Flux Dev Q5_K_M GGUF quantization (a nice balance of speed and quality in under 9 gigabytes)

Flux Dev Q5_K_M GGUF quantization (a nice balance of speed and quality in under 9 gigabytes)

Flux Dev Q5_K_M GGUF quantization (a nice balance of speed and quality in under 9 gigabytes)

Flux Dev Q5_K_M GGUF quantization (a nice balance of speed and quality in under 9 gigabytes)

Flux Dev Q5_K_M GGUF quantization (a nice balance of speed and quality in under 9 gigabytes)

Flux Dev Q5_K_M GGUF quantization (a nice balance of speed and quality in under 9 gigabytes)

Flux Dev Q5_K_M GGUF quantization (a nice balance of speed and quality in under 9 gigabytes)

Flux Dev Q5_K_M GGUF quantization (a nice balance of speed and quality in under 9 gigabytes)

Flux Dev Q5_K_M GGUF quantization (a nice balance of speed and quality in under 9 gigabytes)

资源下载
下载价格VIP专享
仅限VIP下载升级VIP
犹豫不决让我们错失一次又一次机会!!!
原文链接:https://1111down.com/1101782.html,转载请注明出处
由于网站升级,部分用户密码全部设置为111111,登入后自己修改, 并且VIP等级提升一级(包月提升至包季,包季提升到包年 包年提升至永久)
没有账号?注册  忘记密码?

社交账号快速登录