(bug) AudioLDM ckpt-to-diffusers script does not work with medium or large checkpoints.


Hi. I tried to convert AudioLDM checkpoints to a diffusers model.
The small checkpoints, such as audioldm-full-s-v2.ckpt and audioldm-s-text-ft.ckpt, convert fine,
but the medium and large ones (audioldm-m-text-ft.ckpt, audioldm-m-full.ckpt) do not work. The end of the error output is below:

shape torch.Size([960]) from checkpoint, the shape in current model is torch.Size([640]).
        size mismatch for mid_block.resnets.1.conv2.weight: copying a param with shape torch.Size([960, 960, 3, 3]) from checkpoint, the shape in current model is torch.Size([640, 640, 3, 3]).
        size mismatch for mid_block.resnets.1.conv2.bias: copying a param with shape torch.Size([960]) from checkpoint, the shape in current model is torch.Size([640]).
        size mismatch for conv_norm_out.weight: copying a param with shape torch.Size([192]) from checkpoint, the shape in current model is torch.Size([128]).
        size mismatch for conv_norm_out.bias: copying a param with shape torch.Size([192]) from checkpoint, the shape in current model is torch.Size([128]).
        size mismatch for conv_out.weight: copying a param with shape torch.Size([8, 192, 3, 3]) from checkpoint, the shape in current model is torch.Size([8, 128, 3, 3]).
(base) circulus@circulus:~/git/diffusers/scr
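
Every mismatch in the log differs by the same factor (960 vs 640, 192 vs 128). That may mean the medium checkpoint uses a wider UNet than the config the script builds, but this is an inference from the log and not something confirmed here. The sketch below uses only the Python standard library to extract the shape pairs from such a log and report the ratios. The helper names (`find_mismatches`, `width_ratios`) are hypothetical and not part of diffusers, and the sample text is copied from the log lines in this report.

```python
import re
from fractions import Fraction

# Matches one "size mismatch" entry after whitespace has been collapsed.
MISMATCH_RE = re.compile(
    r"size mismatch for (?P<name>[\w.]+): copying a param with shape "
    r"torch\.Size\(\[(?P<ckpt>[\d, ]*)\]\) from checkpoint, "
    r"the shape in current model is torch\.Size\(\[(?P<model>[\d, ]*)\]\)"
)


def parse_shape(text):
    """Turn '960, 960, 3, 3' into (960, 960, 3, 3)."""
    return tuple(int(part) for part in text.split(",") if part.strip())


def find_mismatches(log):
    """Return (param_name, checkpoint_shape, model_shape) for each entry."""
    # The terminal hard-wraps long lines, so collapse all whitespace first.
    flat = " ".join(log.split())
    return [
        (m["name"], parse_shape(m["ckpt"]), parse_shape(m["model"]))
        for m in MISMATCH_RE.finditer(flat)
    ]


def width_ratios(mismatches):
    """Collect checkpoint/model ratios over every dimension that differs."""
    ratios = set()
    for _name, ckpt_shape, model_shape in mismatches:
        for c, m in zip(ckpt_shape, model_shape):
            if c != m:
                ratios.add(Fraction(c, m))
    return ratios


# Sample copied from the log in this report (wrapping preserved).
SAMPLE_LOG = """
        size mismatch for mid_block.resnets.1.conv2.weight: copying a param with
shape torch.Size([960, 960, 3, 3]) from checkpoint, the shape in current model
is torch.Size([640, 640, 3, 3]).
        size mismatch for conv_norm_out.weight: copying a param with shape
torch.Size([192]) from checkpoint, the shape in current model is
torch.Size([128]).
        size mismatch for conv_out.weight: copying a param with shape
torch.Size([8, 192, 3, 3]) from checkpoint, the shape in current model is
torch.Size([8, 128, 3, 3]).
"""

if __name__ == "__main__":
    found = find_mismatches(SAMPLE_LOG)
    for name, ckpt_shape, model_shape in found:
        print(f"{name}: checkpoint {ckpt_shape} vs model {model_shape}")
    print("distinct ratios:", sorted(str(r) for r in width_ratios(found)))
```

If the script reports a single ratio for all parameters, the channel widths in the model config are a reasonable first thing to check against the checkpoint.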

(bug) AudioLDM ckpt-to-diffusers script does not work with medium or large checkpoints. #3255