[docs] Simplify loading guide #2694
Conversation
The documentation is not available anymore as the PR was closed or merged.
This is just a draft because I have some questions about variants :)
Nice cleanup!
I really like the new "explained" doc. But I would request @patrickvonplaten to take a closer look in case I missed anything.
Hmm, not too happy about the changes here. I see the benefit of simplifying the loading doc - however, all the info that is deleted is very important for users. I don't think we should do a split (loading / conceptual guide here). I think we should leave everything on the loading page (which is visited a lot) and just split everything into "basic" and "advanced" use cases. I don't think it makes sense to add a new "explanatory guide" here. IMO many users just want a quick look-up they can visit quickly to see how variants work instead of having to read through a guide/tutorial.
Thanks for the feedback! I've refactored to keep everything on one page (I'll delete the "explained" page once we're happy with this), and split it into practical stuff at the top and explanatory things at the bottom. I think the practical steps are important for the quick look-up, without necessarily needing to know how the folder is structured and the underlying details. But if they are interested, I added a section for that. I think all the info should still be on the page, but feel free to let me know if I'm missing anything!
Ready for final review @stevhliu - should we move out of draft mode?
Looks nice! Could we just add some more specification about the difference between `variant` and `torch_dtype`?
A checkpoint stored in [torch's half-precision / float16 format](https://pytorch.org/blog/accelerating-training-on-nvidia-gpus-with-pytorch-automatic-mixed-precision/) requires only half the bandwidth and storage when downloading the checkpoint, **but** cannot be used when continuing training or when running the checkpoint on CPU.
Similarly, the *non-exponential-averaged* (or non-EMA) version of the checkpoint should be used when continuing fine-tuning of the model checkpoint, **but** should not be used when using the checkpoint for inference.
Load a variant by specifying the `variant` argument in [`DiffusionPipeline.from_pretrained`], and the `torch_dtype` if the variant is a different floating point type. 🧨 Diffusers won't download a variant unless it is explicitly specified, so you don't have to worry about downloading and caching more checkpoints than you need.
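As a rough sketch of what `variant` selects on disk (the helper and filenames below are illustrative, not the library's actual implementation — though the `name.variant.ext` pattern matches how diffusers variant weight files are typically named):

```python
def variant_filename(base, variant=None, ext="safetensors"):
    """Illustrative helper: build the weight filename a given variant selects.

    Sketches the diffusers-style convention of inserting the variant name
    before the file extension; this is a simplified model, not library code.
    """
    if variant is None:
        return f"{base}.{ext}"
    return f"{base}.{variant}.{ext}"

# No variant: the default float32 weights.
print(variant_filename("diffusion_pytorch_model"))
# → diffusion_pytorch_model.safetensors

# fp16 variant: same model, half-precision weights.
print(variant_filename("diffusion_pytorch_model", "fp16"))
# → diffusion_pytorch_model.fp16.safetensors

# non_ema variant: weights intended for resuming training.
print(variant_filename("diffusion_pytorch_model", "non_ema"))
# → diffusion_pytorch_model.non_ema.safetensors
```

All three files live in the same repository folder, which is why specifying `variant` is enough for the library to pick the right weights without downloading the others.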
Here we need to be more precise, I think:
`variant` -> this just defines which files should be loaded.
`torch_dtype` -> this defines which `torch_dtype` the weights should be converted to.
Can we maybe try to say something along the following:
Load a variant by specifying the `variant` argument in [`DiffusionPipeline.from_pretrained`]. 🧨 Diffusers won't download a variant unless it is explicitly specified, so you don't have to worry about downloading and caching checkpoints you don't need.
The `torch_dtype` argument is unrelated to the `variant` argument and decides which floating point precision the loaded checkpoint should have. If you want to save bandwidth by loading the `"fp16"` variant, you should also pass `torch_dtype=torch.float16`, as otherwise the fp16 weights are converted to the default fp32 precision. Note that you can also load the original checkpoint without setting a variant and convert it to float16 by passing `torch_dtype=torch.float16`; in this case you download float32 weights and convert them to float16 after loading.
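The orthogonality described above can be modeled with a tiny toy loader (this is a sketch under stated assumptions: `toy_load`, the weight store, and the use of NumPy arrays standing in for torch tensors are all hypothetical, not diffusers internals):

```python
import numpy as np

def toy_load(weights_by_variant, variant=None, torch_dtype=None):
    """Toy model of the two orthogonal knobs.

    `variant` picks WHICH stored weights are read; `torch_dtype` decides
    what precision they are converted to AFTER loading. Real diffusers
    loading is far more involved -- this only models the interaction.
    """
    weights = weights_by_variant[variant]       # variant: file selection
    if torch_dtype is not None:
        weights = weights.astype(torch_dtype)   # torch_dtype: conversion
    return weights

# Hypothetical on-disk store: a default float32 checkpoint and an fp16 variant.
store = {
    None:   np.ones(4, dtype=np.float32),
    "fp16": np.ones(4, dtype=np.float16),
}

# fp16 variant kept in float16: half the download AND half the memory.
print(toy_load(store, variant="fp16", torch_dtype=np.float16).dtype)  # → float16

# float32 checkpoint downcast after loading: full download, half the memory.
print(toy_load(store, torch_dtype=np.float16).dtype)  # → float16
```

The two calls end in the same precision but differ in what was "downloaded", which is exactly the distinction the suggested wording draws.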
It would be great if the reader could understand the difference between `torch_dtype` and `variant` here (they are completely orthogonal).
Thanks for the explanation, this is super helpful!
Looks good to me!
* simplify loading guide * apply feedbacks * clarify variants * clarify torch_dtype and variant * remove conceptual pipeline doc
This PR moves the high-level explanations in the loading guide into a separate Conceptual Guide so users can focus on executing the code more quickly. The explanatory guide is linked for users who are interested in learning more about what's happening.