Description
Hi. I tried to convert AudioLDM checkpoints to diffusers models.
The small checkpoints (audioldm-full-s-v2.ckpt, audioldm-s-text-ft.ckpt) convert fine,
but the medium and large ones do not work (audioldm-m-text-ft.ckpt, audioldm-m-full.ckpt). The error output is below:
shape torch.Size([960]) from checkpoint, the shape in current model is
torch.Size([640]).
size mismatch for mid_block.resnets.1.conv2.weight: copying a param with
shape torch.Size([960, 960, 3, 3]) from checkpoint, the shape in current model
is torch.Size([640, 640, 3, 3]).
size mismatch for mid_block.resnets.1.conv2.bias: copying a param with
shape torch.Size([960]) from checkpoint, the shape in current model is
torch.Size([640]).
size mismatch for conv_norm_out.weight: copying a param with shape
torch.Size([192]) from checkpoint, the shape in current model is
torch.Size([128]).
size mismatch for conv_norm_out.bias: copying a param with shape
torch.Size([192]) from checkpoint, the shape in current model is
torch.Size([128]).
size mismatch for conv_out.weight: copying a param with shape
torch.Size([8, 192, 3, 3]) from checkpoint, the shape in current model is
torch.Size([8, 128, 3, 3]).
(base) circulus@circulus:~/git/diffusers/scr
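
One thing that stands out in the log: every mismatched dimension in the checkpoint is 1.5x the one in the current model (960 vs 640, 192 vs 128). That might mean the medium checkpoints use a wider base channel count than the config the conversion builds, but that is a guess from the shapes alone. Here is a minimal sketch to pull the shapes out of such a log and compare them, assuming the message format shown above. `parse_mismatches` is a hypothetical helper written for this post, not part of diffusers or PyTorch.

```python
import re

# Matches one "size mismatch" entry after line wraps have been collapsed.
# Assumes the message format seen in the log above.
PATTERN = re.compile(
    r"size mismatch for (?P<name>[\w.]+): copying a param with shape "
    r"torch\.Size\(\[(?P<ckpt>[\d, ]+)\]\) from checkpoint, the shape in "
    r"current model is torch\.Size\(\[(?P<model>[\d, ]+)\]\)"
)


def parse_mismatches(log_text):
    """Return (name, checkpoint_shape, model_shape, ratios) per mismatch."""
    # The terminal wraps each message over several lines, so join them first.
    flat = " ".join(log_text.split())
    rows = []
    for m in PATTERN.finditer(flat):
        ckpt = [int(x) for x in m["ckpt"].split(",")]
        model = [int(x) for x in m["model"].split(",")]
        # Only dimensions that differ are interesting; kernel sizes match.
        ratios = sorted({c / d for c, d in zip(ckpt, model) if c != d})
        rows.append((m["name"], ckpt, model, ratios))
    return rows


# Two entries copied from the log above.
LOG = """
size mismatch for mid_block.resnets.1.conv2.weight: copying a param with
shape torch.Size([960, 960, 3, 3]) from checkpoint, the shape in current model
is torch.Size([640, 640, 3, 3]).
size mismatch for conv_out.weight: copying a param with shape
torch.Size([8, 192, 3, 3]) from checkpoint, the shape in current model is
torch.Size([8, 128, 3, 3]).
"""

for name, ckpt, model, ratios in parse_mismatches(LOG):
    print(name, ckpt, "vs", model, "ratio", ratios)
```

If the ratio is the same for every parameter, the problem is probably a single width setting in the model config and not a corrupted checkpoint.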