Description
Did you like the remote VAE solution?
I do. It's helpful when training or fine-tuning models, because it makes the infrastructure easier to set up.
What can be improved about the current solution?
Adding text encoders (TEs) would be very helpful; together with the VAE, that would give a complete solution for off-loading auxiliary models during training.
Also consider caching the encoded outputs, which could further simplify model fine-tuning setups.
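To illustrate the caching idea, here is a minimal sketch of what I have in mind on the client side. It assumes the remote call can be wrapped as a plain callable taking bytes; `CachedEncoder`, `encode_fn`, and `cache_dir` are hypothetical names of mine, not part of the current remote VAE API.

```python
import hashlib
import pickle
from pathlib import Path
from typing import Callable


class CachedEncoder:
    """Wrap an encode callable with an on-disk cache keyed by the input bytes."""

    def __init__(self, encode_fn: Callable[[bytes], object], cache_dir: str):
        # encode_fn is a placeholder (assumption) for whatever performs the
        # remote VAE or text-encoder request.
        self.encode_fn = encode_fn
        self.cache_dir = Path(cache_dir)
        self.cache_dir.mkdir(parents=True, exist_ok=True)

    def __call__(self, payload: bytes):
        # Identical inputs hash to the same key, so they map to the same file.
        key = hashlib.sha256(payload).hexdigest()
        path = self.cache_dir / f"{key}.pkl"
        if path.exists():
            # Cache hit: skip the remote call entirely.
            with path.open("rb") as f:
                return pickle.load(f)
        # Cache miss: call the encoder once and store the result.
        result = self.encode_fn(payload)
        with path.open("wb") as f:
            pickle.dump(result, f)
        return result
```

With something like this, a fine-tuning run would only hit the remote endpoint once per unique sample, and later epochs would read the stored outputs from disk. A server-side cache would work too; this is just the simplest version to show the intent.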
What other VAEs would you like to see if the pilot goes well?
StepVideo VAE: https://huggingface.co/stepfun-ai/stepvideo-t2v/tree/main/vae
HunYuan CLIP (for StepVideo): https://huggingface.co/stepfun-ai/stepvideo-t2v/tree/main/hunyuan_clip
TE for LuminaVision2: Gemma-2-2B