[Community pipeline] SD3 Differential Diffusion Img2Img Pipeline #8679

asomoza · 2024-06-24T08:23:56Z

What does this PR do?

Add the differential diffusion SD3 community pipeline.

Needs #8678 to work

How to test:

Gradient

import torch

from diffusers import FlowMatchEulerDiscreteScheduler
from diffusers.utils import load_image
from examples.community.pipeline_stable_diffusion_3_differential_img2img import (
    StableDiffusion3DifferentialImg2ImgPipeline,
)


pipe = StableDiffusion3DifferentialImg2ImgPipeline.from_pretrained(
    "stabilityai/stable-diffusion-3-medium-diffusers", torch_dtype=torch.float16, variant="fp16"
).to("cuda")

pipe.scheduler = FlowMatchEulerDiscreteScheduler.from_config(pipe.scheduler.config, shift=3.0)

prompt = "a green pear"

source_image = load_image(
    "https://huggingface.co/datasets/OzzyGT/testing-resources/resolve/main/differential/20240329211129_4024911930.png"
)
map = load_image(
    "https://huggingface.co/datasets/OzzyGT/testing-resources/resolve/main/differential/gradient_mask_2.png"
)

image = pipe(
    prompt=prompt,
    negative_prompt="",
    image=source_image,
    num_inference_steps=28,
    guidance_scale=4.5,
    strength=1.0,
    map=map,
).images[0]

source	gradient	result

Inpainting

import torch

from diffusers import FlowMatchEulerDiscreteScheduler
from diffusers.utils import load_image
from examples.community.pipeline_stable_diffusion_3_differential_img2img import (
    StableDiffusion3DifferentialImg2ImgPipeline,
)


pipe = StableDiffusion3DifferentialImg2ImgPipeline.from_pretrained(
    "stabilityai/stable-diffusion-3-medium-diffusers", torch_dtype=torch.float16, variant="fp16"
).to("cuda")

pipe.scheduler = FlowMatchEulerDiscreteScheduler.from_config(pipe.scheduler.config, shift=3.0)

prompt = "Photorealistic close-up portrait of a Golden Retriever puppy. In the background, on the rug beside the puppy, a red ball, enticing the pup to play."
prompt_3 = "Photorealistic close-up portrait of a Golden Retriever puppy, around 4 months old, with a playful expression bathed in warm sunlight streaming through the window of a modern living room. Capture the soft texture of its fluffy fur, the glint of light in its big brown eyes, and a colorful bandana around its neck. In the background, on the rug beside the puppy, a red ball, enticing the pup to play."

source_image = load_image(
    "https://huggingface.co/datasets/OzzyGT/testing-resources/resolve/main/differential/dog_source.png"
)
mask = load_image(
    "https://huggingface.co/datasets/OzzyGT/testing-resources/resolve/main/differential/dog_inpainting_mask.png"
)

image = pipe(
    prompt=prompt,
    prompt_3=prompt_3,
    negative_prompt="",
    image=source_image,
    num_inference_steps=28,
    guidance_scale=4.5,
    strength=0.78,
    map=mask,
    max_sequence_length=512,
).images[0]

image.save("diff-diff-inpaint-result.png")

source	mask	result

Depthmap

import torch

from diffusers import FlowMatchEulerDiscreteScheduler
from diffusers.utils import load_image
from examples.community.pipeline_stable_diffusion_3_differential_img2img import (
    StableDiffusion3DifferentialImg2ImgPipeline,
)


pipe = StableDiffusion3DifferentialImg2ImgPipeline.from_pretrained(
    "./models/stable_diffusion_3_medium",
    torch_dtype=torch.float16,
).to("cuda")

pipe.scheduler = FlowMatchEulerDiscreteScheduler.from_config(pipe.scheduler.config, shift=3.0)

prompt = "painting of a mountain landscape with a meadow and a forest, meadow background"

source_image = load_image(
    "https://huggingface.co/datasets/OzzyGT/testing-resources/resolve/main/differential/meadow.jpg"
)
depth_map = load_image(
    "https://huggingface.co/datasets/OzzyGT/testing-resources/resolve/main/differential/meadow_depth.png"
)

image = pipe(
    prompt=prompt,
    negative_prompt="",
    image=source_image,
    num_inference_steps=28,
    guidance_scale=4.5,
    strength=0.8,
    map=depth_map,
).images[0]

image.save("diff-diff-depth-result.png")

source	depth map	result

* depth map made with marigold and diffusers

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@yiyixuxu @vikm2o @exx8

HuggingFaceDocBuilderDev · 2024-06-24T08:29:45Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

vikm2o · 2024-06-24T22:54:05Z

@asomoza is generator required ?
When I run I still see the same issue

--> 914     latents = original_with_noise[i] * mask + latents * (1 - mask)
    915 # end diff diff
    916 
    917 # expand the latents if we are doing classifier free guidance
    918 latent_model_input = torch.cat([latents] * 2) if self.do_classifier_free_guidance else latents

IndexError: index 1 is out of bounds for dimension 0 with size 1

I am using sd3-diff-diff branch.

asomoza · 2024-06-25T01:24:21Z

@vikm2o

is generator required ?

No, that got in by mistake when I copied my code.

You need to use the fix-flow-match-scale-noise branch in my repo for it to work. They're in separate branches since I need to do separate PRs.

vikm2o · 2024-06-25T01:50:31Z

@asomoza Thanks, that worked!

examples/community/pipeline_stable_diffusion_3_differential_img2img.py

yiyixuxu

ready to merge too?

asomoza · 2024-06-27T15:14:44Z

I'll try to remove the additional code from it before, I mean the need for torchvision and the need to preprocess the images outside of the pipeline.

asomoza · 2024-06-29T00:32:11Z

@yiyixuxu now is ready

yiyixuxu · 2024-06-29T03:13:19Z

merged! it is really nice!

* new pipeline

new pipeline

7f2659d

yiyixuxu reviewed Jun 25, 2024

View reviewed changes

examples/community/pipeline_stable_diffusion_3_differential_img2img.py Outdated Show resolved Hide resolved

asomoza and others added 3 commits June 26, 2024 18:35

Merge branch 'main' into sd3-diff-diff

6b5f90f

apply suggestion

80316b3

Merge branch 'main' into sd3-diff-diff

76558bf

yiyixuxu approved these changes Jun 27, 2024

View reviewed changes

asomoza and others added 3 commits June 28, 2024 16:28

Merge branch 'main' into sd3-diff-diff

907bfec

removed need to preprocess images

c947fb6

removed unused imports

01e2581

asomoza requested a review from yiyixuxu June 29, 2024 00:30

yiyixuxu merged commit 9b7acc7 into huggingface:main Jun 29, 2024
7 of 8 checks passed

a-r-r-o-w mentioned this pull request Jul 22, 2024

Adding Differential Diffusion to Kolors, Auraflow, HunyuanDiT #8924

Closed

3 tasks

asomoza mentioned this pull request Oct 9, 2024

Add Differential Diffusion to Kolors #9423

Merged

sayakpaul pushed a commit that referenced this pull request Dec 23, 2024

[Community pipeline] SD3 Differential Diffusion Img2Img Pipeline (#8679)

2988416

* new pipeline

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Community pipeline] SD3 Differential Diffusion Img2Img Pipeline #8679

[Community pipeline] SD3 Differential Diffusion Img2Img Pipeline #8679

Uh oh!

asomoza commented Jun 24, 2024 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Jun 24, 2024

Uh oh!

vikm2o commented Jun 24, 2024 •

edited

Loading

Uh oh!

asomoza commented Jun 25, 2024

Uh oh!

vikm2o commented Jun 25, 2024

Uh oh!

Uh oh!

yiyixuxu left a comment

Uh oh!

asomoza commented Jun 27, 2024

Uh oh!

asomoza commented Jun 29, 2024 •

edited

Loading

Uh oh!

Uh oh!

yiyixuxu commented Jun 29, 2024

Uh oh!

Uh oh!

[Community pipeline] SD3 Differential Diffusion Img2Img Pipeline #8679

[Community pipeline] SD3 Differential Diffusion Img2Img Pipeline #8679

Uh oh!

Conversation

asomoza commented Jun 24, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Gradient

Inpainting

Depthmap

Who can review?

Uh oh!

HuggingFaceDocBuilderDev commented Jun 24, 2024

Uh oh!

vikm2o commented Jun 24, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

asomoza commented Jun 25, 2024

Uh oh!

vikm2o commented Jun 25, 2024

Uh oh!

Uh oh!

yiyixuxu left a comment

Choose a reason for hiding this comment

Uh oh!

asomoza commented Jun 27, 2024

Uh oh!

asomoza commented Jun 29, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

yiyixuxu commented Jun 29, 2024

Uh oh!

Uh oh!

asomoza commented Jun 24, 2024 •

edited

Loading

vikm2o commented Jun 24, 2024 •

edited

Loading

asomoza commented Jun 29, 2024 •

edited

Loading