[draft] refactor DPMSolverMultistepScheduler using sigmas #4690

yiyixuxu · 2023-08-21T03:29:34Z

This PR refactors DPMSolverMultistepScheduler so that the computation uses sigmas.
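As a rough illustration of what "using sigmas" can mean, the sketch below converts a beta schedule into k-diffusion style sigmas and maps each sigma back to the (alpha_t, sigma_t) pair that DPM-Solver works with. The beta values are an assumed Stable Diffusion style "scaled_linear" schedule, and the helper name `sigma_to_alpha_sigma_t` is hypothetical; this is not necessarily how the PR implements it.

```python
import numpy as np

# Assumption: a "scaled_linear" beta schedule with typical Stable Diffusion values.
num_train_timesteps = 1000
beta_start, beta_end = 0.00085, 0.012
betas = np.linspace(beta_start**0.5, beta_end**0.5, num_train_timesteps) ** 2
alphas_cumprod = np.cumprod(1.0 - betas)

# k-diffusion style noise level for each training timestep.
sigmas = np.sqrt((1.0 - alphas_cumprod) / alphas_cumprod)


def sigma_to_alpha_sigma_t(sigma):
    """Map a k-diffusion sigma to a variance-preserving (alpha_t, sigma_t) pair.

    Hypothetical helper for illustration only.
    """
    alpha_t = 1.0 / np.sqrt(sigma**2 + 1.0)
    sigma_t = sigma * alpha_t
    return alpha_t, sigma_t


alpha_t, sigma_t = sigma_to_alpha_sigma_t(sigmas)

# Half log-SNR; with this parameterization it reduces to -log(sigma).
lambda_t = np.log(alpha_t) - np.log(sigma_t)
```

With this mapping, alpha_t equals sqrt(alphas_cumprod) and alpha_t**2 + sigma_t**2 equals 1, so the sigma view and the alpha view describe the same noise schedule.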

Currently, I have only refactored "dpmsolver++" and "sde-dpmsolver++".
To-do list (the first two items are already done):

"dpmsolver++"
"sde-dpmsolver++"
"sde-dpmsolver"
"dpmsolver"

#4187


HuggingFaceDocBuilderDev · 2023-08-21T04:50:56Z

The documentation is not available anymore as the PR was closed or merged.

yiyixuxu · 2023-09-09T08:12:04Z

@patrickvonplaten

Do we really need "sde-dpmsolver" and "dpmsolver" here?

We have so many options here, and I think that really confuses people: there are 4 algorithm_type values, 2 solver_order values, and 2 solver_type values, so 16 combinations 😅

Can we possibly trim it a little as I refactor it? From what I understand, the algorithm types "dpmsolver" and "sde-dpmsolver" (proposed in this paper, https://huggingface.co/papers/2206.00927, before the same authors came up with dpmsolver++) are completely obsolete at this point.
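The option count above can be checked with a small enumeration. This is only a sketch: the four algorithm_type strings come from this thread, while the concrete solver_order and solver_type values listed here are assumptions, since the comment gives only how many there are.

```python
from itertools import product

algorithm_types = ["dpmsolver", "dpmsolver++", "sde-dpmsolver", "sde-dpmsolver++"]
solver_orders = [1, 2]               # assumption: the comment only says there are 2
solver_types = ["midpoint", "heun"]  # assumption: the comment only says there are 2

# Every (algorithm_type, solver_order, solver_type) configuration.
combinations = list(product(algorithm_types, solver_orders, solver_types))

# What would remain if only the "++" algorithm types were kept.
trimmed = [combo for combo in combinations if combo[0].endswith("++")]

print(len(combinations), len(trimmed))
```

Dropping the two older algorithm types would halve the number of configurations to test and document.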

yiyixuxu · 2023-09-11T08:00:59Z

testing "dpmsolver++"

testing the implementation against k-diffusion

I followed the same math used in k-diffusion, so the results should match almost exactly.
The maximum absolute difference is below 3e-4 for both tests below.
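The comparison metric used in the tests below is the maximum absolute element-wise difference between two image arrays. A minimal helper for it might look like this; the name `max_abs_diff` is hypothetical, and the arrays are synthetic stand-ins rather than real pipeline outputs.

```python
import numpy as np


def max_abs_diff(a, b):
    """Return the largest absolute element-wise difference between two arrays."""
    a = np.asarray(a, dtype=np.float64)
    b = np.asarray(b, dtype=np.float64)
    if a.shape != b.shape:
        raise ValueError(f"shape mismatch: {a.shape} vs {b.shape}")
    return float(np.max(np.abs(a - b)))


# Synthetic stand-ins for two pipeline outputs with values in [0, 1].
rng = np.random.default_rng(0)
reference = rng.random((64, 64, 3))
candidate = reference + rng.uniform(-1e-4, 1e-4, size=reference.shape)

diff = max_abs_diff(reference, candidate)
print(f"compare: {diff}")
```

Checking the shapes first avoids silent broadcasting, which could otherwise hide a mismatch between outputs of different resolution.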

use_karras_sigmas=False

import numpy as np
import torch
from diffusers import DPMSolverMultistepScheduler, StableDiffusionKDiffusionPipeline, StableDiffusionPipeline

# test1: use_karras_sigmas=False
# diffusers pipeline with DPMSolverMultistepScheduler from this PR
seed = 33

pipe = StableDiffusionPipeline.from_pretrained("CompVis/stable-diffusion-v1-4")
pipe = pipe.to("cuda")
pipe.scheduler = DPMSolverMultistepScheduler.from_config(
    pipe.scheduler.config, use_karras_sigmas=False
)

prompt = "an astronaut riding a horse on mars"

generator = torch.Generator(device="cuda").manual_seed(seed)
image_d = pipe(prompt, generator=generator, num_inference_steps=20, output_type='np').images[0]

# k-diffusion reference: same seed and prompt, sampled with sample_dpmpp_2m
seed = 33

pipe = StableDiffusionKDiffusionPipeline.from_pretrained(
    "CompVis/stable-diffusion-v1-4"
).to("cuda")
pipe.set_scheduler("sample_dpmpp_2m")

prompt = "an astronaut riding a horse on mars"

generator = torch.Generator(device="cuda").manual_seed(seed)
image_k = pipe(
    prompt, generator=generator, num_inference_steps=20, use_karras_sigmas=False, output_type='np'
).images[0]

print(f"compare: {np.max(np.abs((image_d - image_k)))}")

-> 0.0002550482749938965

use_karras_sigmas=True

# test2: use_karras_sigmas=True (reuses the imports from test1)
seed = 33

pipe = StableDiffusionPipeline.from_pretrained("CompVis/stable-diffusion-v1-4")
pipe = pipe.to("cuda")
pipe.scheduler = DPMSolverMultistepScheduler.from_config(
    pipe.scheduler.config, use_karras_sigmas=True
)

prompt = "an astronaut riding a horse on mars"

generator = torch.Generator(device="cuda").manual_seed(seed)
image_d = pipe(prompt, generator=generator, num_inference_steps=20, output_type='np').images[0]

seed = 33

pipe = StableDiffusionKDiffusionPipeline.from_pretrained(
    "CompVis/stable-diffusion-v1-4"
).to("cuda")
pipe.set_scheduler("sample_dpmpp_2m")

prompt = "an astronaut riding a horse on mars"

generator = torch.Generator(device="cuda").manual_seed(seed)
image_k = pipe(
    prompt, generator=generator, num_inference_steps=20, use_karras_sigmas=True, output_type='np'
).images[0]

print(f"compare: {np.max(np.abs((image_d - image_k)))}")

-> 0.000287860631942749
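For context on what use_karras_sigmas toggles in the two tests above, here is a sketch of the noise schedule from Karras et al. (2022), which spaces sigmas evenly in sigma^(1/rho). The function name and the sigma_min and sigma_max values are illustrative assumptions, not values read from either pipeline.

```python
import numpy as np


def karras_sigmas(n, sigma_min, sigma_max, rho=7.0):
    """Return n noise levels from sigma_max down to sigma_min (Karras et al. spacing)."""
    ramp = np.linspace(0.0, 1.0, n)
    min_inv_rho = sigma_min ** (1.0 / rho)
    max_inv_rho = sigma_max ** (1.0 / rho)
    # Interpolate linearly in sigma**(1/rho), then map back with the power rho.
    return (max_inv_rho + ramp * (min_inv_rho - max_inv_rho)) ** rho


# Illustrative values only; real pipelines derive these from the trained schedule.
sigmas = karras_sigmas(20, sigma_min=0.03, sigma_max=14.6)
print(sigmas[:3], sigmas[-3:])
```

Compared with taking sigmas at evenly spaced timesteps, this spacing places more of the steps at low noise levels.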

compare against current implementation (`main` branch)

There are some numerical differences if we compare the numpy outputs, but visually the images look practically identical, I think.
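One way to reason about why small numerical differences can be invisible: saved images are quantized to 8 bits per channel, so float differences well below 1/255 (about 0.0039) usually round to the same pixel value. The sketch below assumes float arrays in [0, 1]; the helper name and the magnitudes are illustrative, not measurements from this PR.

```python
import numpy as np


def uint8_mismatch_fraction(a, b):
    """Fraction of values that differ after quantizing [0, 1] floats to 8 bits."""
    a8 = np.round(np.clip(a, 0.0, 1.0) * 255).astype(np.uint8)
    b8 = np.round(np.clip(b, 0.0, 1.0) * 255).astype(np.uint8)
    return float(np.mean(a8 != b8))


base = np.zeros((8, 8, 3))
tiny = uint8_mismatch_fraction(base, base + 3e-4)   # far below one 8-bit step
large = uint8_mismatch_fraction(base, base + 1e-2)  # more than two 8-bit steps
print(tiny, large)
```

A tiny perturbation can still flip a value that sits right at a rounding boundary, so a small non-zero mismatch fraction on real images would not be surprising.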

use_karras_sigmas=False

import torch
from diffusers import DPMSolverMultistepScheduler, StableDiffusionPipeline

seed = 33

pipe = StableDiffusionPipeline.from_pretrained("CompVis/stable-diffusion-v1-4")
pipe = pipe.to("cuda")
pipe.scheduler = DPMSolverMultistepScheduler.from_config(
    pipe.scheduler.config, use_karras_sigmas=False
)

prompt = "an astronaut riding a horse on mars"

generator = torch.Generator(device="cuda").manual_seed(seed)
image = pipe(prompt, generator=generator, num_inference_steps=20).images[0]

image.save("dpmsolver++.png")

Images (not shown): current implementation, then this PR.

use_karras_sigmas=True

seed = 33

pipe = StableDiffusionPipeline.from_pretrained("CompVis/stable-diffusion-v1-4")
pipe = pipe.to("cuda")
pipe.scheduler = DPMSolverMultistepScheduler.from_config(
    pipe.scheduler.config, use_karras_sigmas=True
)

prompt = "an astronaut riding a horse on mars"

generator = torch.Generator(device="cuda").manual_seed(seed)
image = pipe(prompt, generator=generator, num_inference_steps=20).images[0]

image.save("dpmsolver++_k_sigma.png")

Images (not shown): current implementation, then this PR.

yiyixuxu · 2023-09-12T00:34:59Z

I think I'm overcomplicating things with this PR, so I will try a different approach!

yiyixuxu and others added 6 commits July 21, 2023 03:52

add index_counter

8f78025

update test

3b886af

update

a05a13a

add print lines add print lines and change

fix

670c782

style

c95b545

Merge branch 'main' into dpm-mstep-sigma

9eeb5e9

yiyixuxu added 2 commits August 22, 2023 00:29

sde-dpmsolver+++

515c105

v_prediction

a85a18c

remove round

f238e0d

yiyixuxu closed this Sep 12, 2023

yiyixuxu mentioned this pull request Sep 12, 2023

refactor DPMSolverMultistepScheduler using sigmas #4986

Merged

4 tasks
