Super-resolution in DeepFloyd is bugged for any non-64x64 to 256x256 upscales

### Describe the bug

The super-resolution pipeline does not accept `height` and `width`. Instead, it assigns both from `self.unet.config.sample_size`, so the output is always 256x256. As a result, any stage 1 output that is not 64x64 (for example 64x96) comes out squished after upscaling.

https://github.com/huggingface/diffusers/blob/eade4308dabc7f7ba75eab508d386b66b3764513/src/diffusers/pipelines/deepfloyd_if/pipeline_if_superresolution.py#L809-L810

![stage1-2](https://user-images.githubusercontent.com/110263573/235313740-deddf185-1bd6-4edb-86ac-f76d1b221fc5.png)

![ch62g01fYFz4](https://user-images.githubusercontent.com/110263573/235313759-476875e0-04fe-4389-bce4-8781f2804934.png)
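To put a number on the squish, here is a minimal sketch using only the sizes from this report (a 64x96 stage 1 image forced to 256x256). The helper name `aspect_distortion` is made up for illustration and is not part of diffusers.

```py
from fractions import Fraction


def aspect_distortion(in_h, in_w, out_h, out_w):
    """Return (vertical scale) / (horizontal scale).

    A value of 1 means the aspect ratio is preserved; anything else
    means the image is stretched or squished along one axis.
    """
    return Fraction(out_h, in_h) / Fraction(out_w, in_w)


# Stage 1 in the reproduction: height=96, width=64. Stage 2 forces 256x256.
print(aspect_distortion(96, 64, 256, 256))  # 2/3

# The square 64x64 case is unaffected.
print(aspect_distortion(64, 64, 256, 256))  # 1
```

With a 64x96 input, the height is scaled by only two thirds of the factor applied to the width, which matches the vertically compressed result shown above.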

Proposed fix: allow `height` and `width` to be passed in. I forked the class and made this change manually, and it works fine.
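A rough sketch of the size logic such a change might use, not the actual diffusers patch. The helper `resolve_sr_size` is hypothetical, and the default 4x factor is an assumption inferred from the 64x64 to 256x256 case. It takes plain integers; with `output_type="pt"` the stage 1 tensor should be laid out as (batch, channels, height, width), so the input size could be read from its last two dimensions.

```py
def resolve_sr_size(input_height, input_width, height=None, width=None, scale=4):
    """Pick the super-resolution output size (hypothetical helper).

    Explicit `height`/`width` win; otherwise each is derived from the
    stage 1 size times `scale` (assumed 4x, from 64 -> 256).
    """
    if height is None:
        height = input_height * scale
    if width is None:
        width = input_width * scale
    return height, width


# Stage 1 size from the reproduction: height=96, width=64.
print(resolve_sr_size(96, 64))  # (384, 256)

# Caller-supplied values override the derived ones.
print(resolve_sr_size(96, 64, height=256, width=256))  # (256, 256)
```

In a forked pipeline, the returned pair would stand in for the two hard-coded `self.unet.config.sample_size` assignments.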

### Reproduction

```py
from diffusers import DiffusionPipeline
from diffusers.utils import pt_to_pil
import torch

# stage 1
stage_1 = DiffusionPipeline.from_pretrained("DeepFloyd/IF-I-XL-v1.0", variant="fp16", torch_dtype=torch.float16)
stage_1.enable_model_cpu_offload()

# stage 2
stage_2 = DiffusionPipeline.from_pretrained(
    "DeepFloyd/IF-II-L-v1.0", text_encoder=None, variant="fp16", torch_dtype=torch.float16
)
stage_2.enable_model_cpu_offload()


prompt = 'a photo of a kangaroo wearing an orange hoodie and blue sunglasses standing in front of the eiffel tower holding a sign that says "very deep learning"'
generator = torch.manual_seed(1)

# text embeds
prompt_embeds, negative_embeds = stage_1.encode_prompt(prompt)

# stage 1: generate a non-square image (height=96, width=64)
image = stage_1(
    prompt_embeds=prompt_embeds,
    negative_prompt_embeds=negative_embeds,
    generator=generator,
    output_type="pt",
    height=96,
    width=64,
).images
pt_to_pil(image)[0].save("./if_stage_I.png")

# stage 2: no height/width can be passed here, so the output is forced to 256x256
image = stage_2(
    image=image,
    prompt_embeds=prompt_embeds,
    negative_prompt_embeds=negative_embeds,
    generator=generator,
    output_type="pt",
).images
pt_to_pil(image)[0].save("./if_stage_II.png")

```

### Logs

_No response_

### System Info

Python 3.10.6, diffusers on latest `main`

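Lines referenced by the permalink in the bug description (`pipeline_if_superresolution.py#L809-L810`):

    height = self.unet.config.sample_size
    width = self.unet.config.sample_size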
