Description
SDXL makes use of micro-conditioning, and it has a noticeable effect on the end results. For more details, refer to the paper here.
Currently, not all of our SDXL trainers make use of micro-conditioning. So, it'd be nice to have micro-conditioning support as in https://github.com/huggingface/diffusers/blob/main/examples/text_to_image/train_text_to_image_lora_sdxl.py.
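For anyone picking this up, here is a rough sketch of what the micro-conditioning inputs look like. It is not copied from the script linked above. The helper names (`center_crop_top_left`, `compute_time_ids`) and the `(height, width)` / `(top, left)` ordering are assumptions made for illustration, so please check them against the reference script before relying on them.

```python
from typing import List, Tuple


def center_crop_top_left(resized_size: Tuple[int, int], resolution: int) -> Tuple[int, int]:
    """Top-left corner (top, left) of a center crop taken from the resized image.

    Hypothetical helper: assumes sizes are given as (height, width).
    """
    height, width = resized_size
    top = max(0, (height - resolution) // 2)
    left = max(0, (width - resolution) // 2)
    return top, left


def compute_time_ids(
    original_size: Tuple[int, int],
    crop_top_left: Tuple[int, int],
    target_size: Tuple[int, int],
) -> List[int]:
    """Concatenate the three micro-conditioning signals into one 6-element list.

    Assumed layout: original size, crop coordinates, target size.
    """
    return [*original_size, *crop_top_left, *target_size]


# A 768x1024 (height x width) image, resized so the shorter side is 1024,
# then center-cropped to 1024x1024.
original_size = (768, 1024)
resized_size = (1024, 1365)
resolution = 1024

crop = center_crop_top_left(resized_size, resolution)
time_ids = compute_time_ids(original_size, crop, (resolution, resolution))
print(time_ids)
```

In a trainer, a list like this would typically be converted to a tensor per sample and handed to the UNet together with the pooled text embeddings (in diffusers, through `added_cond_kwargs`). The important part is that the values are recorded per image during preprocessing instead of being hard-coded.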
Below is a list of the training scripts where we'd like to have this change incorporated:
- DreamBooth SDXL LoRA
- SDXL LCM-LoRA (it needs to be slightly refactored as done in Fix a bug of flip in SDXL training script #6547 by @haofanwang)
- ControlNet SDXL
- T2I Adapter SDXL
- Textual inversion SDXL (same as what's mentioned for SDXL LCM-LoRA above)
- Advanced SDXL trainer
Feel free to open PRs targeting only ONE example at a time and tag me. Please also share an example training command when submitting the PRs. The command doesn't have to run the training for a large number of steps. Anything in the range of [4, 10] steps should suffice.