Skip to content

(bug) audioldm ckpt to diffuser script is not working at medium or large size ckpt. #3255

Closed
@circuluspibo

Description

@circuluspibo

HI. i tried to convert diffusers model from audioldm.
small size ckpt like as audioldm-full-s-v2.ckpt audioldm-s-text-ft.ckpt are well,
but middle or large size are not woking (audioldm-m-text-ft.ckpt, audioldm-m-full.ckpt)below

shape torch.Size([960]) from checkpoint, the shape in current model is
torch.Size([640]).
size mismatch for mid_block.resnets.1.conv2.weight: copying a param with
shape torch.Size([960, 960, 3, 3]) from checkpoint, the shape in current model
is torch.Size([640, 640, 3, 3]).
size mismatch for mid_block.resnets.1.conv2.bias: copying a param with
shape torch.Size([960]) from checkpoint, the shape in current model is
torch.Size([640]).
size mismatch for conv_norm_out.weight: copying a param with shape
torch.Size([192]) from checkpoint, the shape in current model is
torch.Size([128]).
size mismatch for conv_norm_out.bias: copying a param with shape
torch.Size([192]) from checkpoint, the shape in current model is
torch.Size([128]).
size mismatch for conv_out.weight: copying a param with shape
torch.Size([8, 192, 3, 3]) from checkpoint, the shape in current model is
torch.Size([8, 128, 3, 3]).
(base) circulus@circulus:~/git/diffusers/scr

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions