
**Please, read the correspondent "About this version" for the model you download**
Flux1-Hybrid: Blending Dev's Realism with Schnell's Speed and Improved Prompt Adherence
Introduction:
Introduce your new diffusion model, highlighting its unique combination of realism, speed, and prompt adherence. Emphasize the benefits of using this model compared to other popular options like Dev and Schnell. it is perfect for users with limited computational resources who want to generate high quality images quickly.
Key Features:
4-step generation: Produces high-quality images in a fraction of the time compared to many other models.
Enhanced prompt adherence: Accurately captures the nuances of your prompts, resulting in more relevant and creative outputs.
Photorealistic quality: Achieves a level of realism that rivals Dev, while maintaining the versatility of Schnell.
CFG sensitivity: Provides fine-grained control over image generation through the CFG scale.
Comparison with Other Models:
Create a detailed comparison table:
Feature Flux1-Hybrid Dev Schnell
Prompt Adherence Excellent Very Good Good
Realism High Very High Medium
Versatility High High High
Efficiency High (4 steps) Low Medium
Lighting Quality Very Good Excellent Good
Detail High Very High Medium
Artistic Style Diverse Realistic Artistic
Analysis:
Prompt adherence: Our hybrid model consistently generates images that closely align with the provided prompts, outperforming Dev in this regard.
Realism: While Dev excels in photorealism, our model offers a more balanced approach, combining realistic elements with artistic flair.
Efficiency: The 4-step generation process makes our model significantly faster than Dev, without compromising on quality.
Versatility: Our hybrid is capable of producing a wide range of styles, from photorealistic portraits to abstract art.
Inference Time: Intel I5, 32 RAM, Nvidia 3060 12Gb, 45s (same settings, dev, 2m42s). tested with SwarmUI working on ComfyUI.
Caveat: The time of generation of images is measured with the model preloaded. From the click on generate button until the image ends. The model first makes an initial image, which is then refined in 4 steps, the 4 steps last much less than the generation of the initial image.
How to Use:
Recommended settings: Random seed, CFG=1 (0.8-1.2) range, sampler: Euler, Scheduler: Normal/Simple, in the examples Resolution: 5:8 (768x1216).
Clip,T5 encoders and vae are incorporated with Kijai Flux1-FP8. It´s recommended to use --lowvram interface flag to pass the handle of encoders to CPU
Limitations: the range of CFG for non distorted images is narrow. Realistic images are affected by CFG, abstract and landscapes not the same, so for images without humans, in dark images you can raise CFG, and for shinny ones you can lower it a bit more CFG (0.6-1.4). There´s no FluxGuidance as in dev model. Text is an issue to solve.
Acknowledgements:
Thank the teams: Express gratitude to the Black Forest Labs team for creating Flux1 and the ComfyUI community for their valuable insights.
License: CC BY-NC 4.0, No commercial
The following photos contain each one a workflow to generate an image with Hybrid models, Baked and NoBaked versions, and save it in ComfyUI/Output directory. Just click and drag the image to the ComfyUI desktop to see the respective workflow.
https://cdrive.page.link/KcLFPz9evDfMHgWA7
https://cdrive.page.link/aTwDun5S6KLjt8Cu7
FLUX.1 is a powerful tool, but the results may vary depending on the complexity of the prompt and the parameters used. Have fun experimenting and creating your own artwork!
Up: Flux1 Dev, Down: Flux1 Hybrid
Prompt: portrait photography of a alien supermodel, with glitter makeup.
Seed:1826245848, Dev: 25 steps, Guidance 3.5, CFG 1, Sampler Euler, Scheduler Normal/Simple, Hybrid: 4 steps, CFG 1, Sampler Euler, Scheduler Normal/Simple
Up. flux1 Dev. Down: Hybrid
Prompt: Devil woman wearing headphones, in the style of mysterious abstractions, double exposure, bathed in vibrant neon colors. Face illuminated by a kaleidoscope of electric hues - cyan, magenta, yellow, and emerald green. Striking contrast between cool blues and warm oranges across the features. Iridescent, glitter-like particles scattered over the skin, creating a cosmic effect. Piercing eyes with multicolored reflections
Seed:1959765719 Dev: 25 steps, Guidance 3.5, CFG 1 Sampler Euler, Scheduler Normal/Simple, Hybrid: 4 steps, CFG 1, Sampler Euler, Scheduler Normal/SimpleUP: Flux1 Dev Down: Flux1 Hybrid
prompt: (1girl, pale white skinny 18 year old redhead nurse, white nurse outfit, short skirt,white stockings deep cleavage, very big juicy ass and wide hips, blue eyes, thick thighs, very very big natural milky jiggly breasts. curvy milf body, wavy black and red hair, pawg, fat ass, huge ass, cleavage, massive tits, huge breasts, detailed face, heavy breasts, horny face, mssv breasts), hospital, playfull, posing, horny look, bubble butt, posing for a photo,photorealistic, 8k uhd natural lighting, raw, rich, intricate details, key visual, atmospheric lighting, 35mm photograph, film, bokeh, professional, 4k, highly detailed, cinematic, colorful hospital background, 8k, dramatic lighting, highly detailed, hyper realistic, intricate, intricate sharp details,realistic, high resolution,
Seed: 1176787681 Dev: 25 steps, Guidance, 3.5 CFG 1, Sampler Euler, Scheduler Normal/Simple, Hybrid: 4 steps, CFG 1, Sampler Euler, Scheduler Normal/Simple
UP: Flux1 Dev Down: Flux1 Hybrid
prompt: eye catching, Bombshell beauty, Honey wild hair,
Silky long legs high heels Swedish Lounge Spunky Leather
Portrait of beauty Elegant lady in sheer dress and vintage clothing,
Bone corset
Seed: 726579420 Dev: 25 steps, Guidance 3.5 CFG 1, Sampler Euler, Scheduler Normal/Simple, Hybrid: 4 steps, CFG 1, Sampler Euler, Scheduler Normal/Simple
Dev is excelent doing images of thin models women at the price of prompt adherence.
描述:
This version of NSFW2 offers two ways to generate images:
-
Full NSFW2 Model: Use this for a quick and easy experience.
-
Base Model + NSFW2 LORA: For more customization, combine the base model (Flux1-Dev-FP8) with the NSFW2 LORA. Think of the LORA as a special ingredient that gives the images a unique style.
Why use the LORA?
-
Save space: The LORA is much smaller than the full model.
-
More flexibility: Experiment with different styles by combining the LORA with other base models.
-
Something you can´t do with checkpoints and unets, you can play with the strength of the lora. For some features you can enhance the image, e.g, make a woman more busty (1.2 is enough) 2 is the limit
Using the base model and the NSFW2 LORA together saves you space and makes it easier to experiment with different styles. It's like having a modular system where you can customize your cake with different toppings
This is an attempt to distribute a modification of a basic model in the LORA format instead of a full trained or merged model. Every time we download a trained model, for each model we download we download again: the basic model, the VAE, Clip-L and T5, in total about 17Gb. If you believe that GGUF is a solution, the penalty is only reduced by half. That is to say that if we download "n" models based on FluxDev, we have a redundancy of n x 17Gb. SSD vendors are very happy and grateful. Using a distribution based on LORAs, you just download the base model Flux1.Dev, with the included VAE, Clip-L and T5 once and then the specific LORA.
So, You have two ways to do the same image.
1) using the full NSFW2 FP8 full model
2) use flux1-Dev-FP8 (17Gb) as a base model + NSFW2LORA16FP
In other words, if I merged the lora with the base Flux1.Dev model, I would get the full NSFW2 model.
The base model to make the sample images was:
https://huggingface.co/lllyasviel/flux1_dev/resolve/main/flux1-dev-fp8.safetensors
If you want an image in 6-8 steps download Bytedance also include in the prompt,
https://huggingface.co/ByteDance/Hyper-SD/resolve/main/Hyper-FLUX.1-dev-8steps-lora.safetensors
For those who like GGUF, there are several cuantizations versions with 6-8 steps accelerator included
https://huggingface.co/mhnakif/flux-hyp8/tree/main
Example:
-
Prompt: Concept design for a futuristic motorbike in a exposition. Hyper-realistic, 4K, HDR,
-
Model: flux1-dev-fp8.safetensors
-
Loras: <lora:Flux/NSFW2FP16LORA><lora:Flux/Hyper-FLUX.1-dev-8steps-lora:.125>
-
Steps: 6
-
CFG: 1
-
Flux guidance: 3.5
-
Sampler: Euler
Another way to see how the image is generated, is to click "i" on an image and see all the settings that generate the sample image
FAQ - NSFW2 Model
What is NSFW2?
NSFW2 is an image generation model that allows you to create realistic and detailed images. It's designed to be versatile and easy to use, both for beginners and advanced users.
What is a LORA?
A LORA (Lightweight Object Recognition Architecture) is a small neural network used to fine-tune a base model. In the case of NSFW2, the LORA adds a specific style or characteristics to the base model.
Why do I need the LORA?
The LORA allows you to take advantage of the capabilities of the base model (Flux1-Dev-FP8) and customize the results with the NSFW2 style. It's like adding a special ingredient to a recipe to give it a unique flavor.
What's the difference between using the full NSFW2 model and using the base model with the LORA?
-
Full NSFW2 Model: It's like using a pre-made recipe. Everything is ready to use.
-
Base model + LORA: It's like starting with a basic recipe and adding a special ingredient. It gives you more control over the final result.
Can I use the LORA with other base models?
It is not recommended to use the NSFW2 LORA with other base models as it has been specifically trained to work with Flux1-Dev-FP8}
What is a base model?
A base model is a pre-trained neural network on a massive dataset that serves as a starting point for creating more specific models. In the case of NSFW2, the base model (Flux1-Dev-FP8) provides the ability to generate realistic images
Think of a base model as a blank canvas. You can paint anything on it, but the canvas provides the basic surface and materials. In the case of NSFW2, the base model is the canvas and the LORA is the paint that gives it a specific style.
A base model is trained on a massive dataset to learn general patterns and features. This allows it to generate new content, such as images or text, in a creative way. However, a base model may be too general for specific tasks. That's why LORAs are used to fine-tune the base model and specialize it in a particular style or task
训练词语:
名称: flux1DevFP8NF4GGUF6Steps_nsfw2fp16lora.safetensors
大小 (KB): 299244
类型: Model
Pickle 扫描结果: Success
Pickle 扫描信息: No Pickle imports
病毒扫描结果: Success