Replies: 6 comments 5 replies
-
According to your estimates, how much compute and how much video data would be required to train a Video-VAE (not a Video-VQVAE)? Approximately how long would the training take?
-
Does Sora use a VAE rather than a VQVAE?
-
It's unrealistic, bro; don't waste your time on this project.
-
We now support both VAE and VQVAE; please check our latest code.
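
For readers unfamiliar with the VAE side of the comparison, here is a minimal sketch (not the project's implementation) of the continuous Gaussian bottleneck a Video-VAE uses, to contrast with the discrete VQVAE bottleneck discussed further down. The `GaussianBottleneck` class, channel sizes, and latent shapes are illustrative assumptions, not the repo's API.

```python
# Minimal sketch of a continuous VAE bottleneck for video feature maps.
# Not the Open-Sora Plan code; shapes and names are placeholders.
import torch
import torch.nn as nn


class GaussianBottleneck(nn.Module):
    """Maps encoder features to a Gaussian latent via the reparameterization trick."""

    def __init__(self, in_channels: int = 256, latent_channels: int = 4):
        super().__init__()
        # 3D conv so the same bottleneck works on video features shaped (B, C, T, H, W).
        self.to_moments = nn.Conv3d(in_channels, 2 * latent_channels, kernel_size=1)

    def forward(self, h: torch.Tensor):
        mu, logvar = self.to_moments(h).chunk(2, dim=1)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)   # sample a latent
        # KL divergence to the standard normal prior, averaged over all elements.
        kl = -0.5 * torch.mean(1 + logvar - mu.pow(2) - logvar.exp())
        return z, kl


if __name__ == "__main__":
    h = torch.randn(2, 256, 4, 8, 8)        # toy encoder output for 2 clips
    z, kl = GaussianBottleneck()(h)
    print(z.shape, kl.item())               # (2, 4, 4, 8, 8) continuous latents
```

The continuous latent `z` is what a latent-diffusion model would be trained on; the KL term is what makes large-scale video training expensive to tune, which is part of the resource cost discussed in this thread.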
-
Refer to #93 for more details.
-
Looking at the intense resource requirements of a Video-VAE and the practicality of a Video-VQVAE, it seems like a smart compromise for now. Eager to see how it plays out 😊
-
In fact, we believe that both VQVAE and VAE can be used, but we noticed that there is already a Video-VQVAE in the community that can be used directly for testing.
Additionally, the combination of a VQVAE and a diffusion model works well, as in Latent Diffusion. We attempted to train a Video-VAE, but it was too resource-intensive. In the early stages of the Open-Sora Plan, the open-source Video-VQVAE can serve as a viable alternative.
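
To make the VQVAE alternative concrete, below is a minimal PyTorch sketch of the vector-quantization bottleneck that turns a video encoder's continuous features into discrete codebook tokens, which is what makes an off-the-shelf Video-VQVAE's latents usable for latent diffusion. This is not the community model or the Open-Sora Plan code; the `VectorQuantizer` class, codebook size, and latent grid shapes are assumptions for illustration.

```python
# Minimal sketch of a VQ bottleneck over video feature maps (B, C, T, H, W).
# Not the Open-Sora Plan or community Video-VQVAE code; names and shapes are placeholders.
import torch
import torch.nn as nn
import torch.nn.functional as F


class VectorQuantizer(nn.Module):
    """Nearest-neighbour codebook lookup with a straight-through gradient estimator."""

    def __init__(self, num_codes: int = 1024, dim: int = 256, beta: float = 0.25):
        super().__init__()
        self.codebook = nn.Embedding(num_codes, dim)
        self.codebook.weight.data.uniform_(-1.0 / num_codes, 1.0 / num_codes)
        self.beta = beta

    def forward(self, z_e: torch.Tensor):
        b, c, t, h, w = z_e.shape
        flat = z_e.permute(0, 2, 3, 4, 1).reshape(-1, c)       # (B*T*H*W, C)
        dist = torch.cdist(flat, self.codebook.weight)          # distance to every code
        idx = dist.argmin(dim=1)                                # discrete token per position
        z_q = self.codebook(idx).view(b, t, h, w, c).permute(0, 4, 1, 2, 3)
        # Codebook loss + commitment loss, then straight-through gradients to the encoder.
        loss = F.mse_loss(z_q, z_e.detach()) + self.beta * F.mse_loss(z_e, z_q.detach())
        z_q = z_e + (z_q - z_e).detach()
        return z_q, idx.view(b, t, h, w), loss


if __name__ == "__main__":
    z_e = torch.randn(2, 256, 4, 8, 8)      # toy encoder output: 2 clips, 4x8x8 latent grid
    z_q, tokens, vq_loss = VectorQuantizer()(z_e)
    print(z_q.shape, tokens.shape, vq_loss.item())
    # z_q (or the token grid) is what a latent-diffusion model would be trained on,
    # instead of raw video frames.
```

The appeal of this route, as the comment above notes, is that the quantizer and its codebook come pre-trained in the community Video-VQVAE, so only the diffusion model on top needs to be trained.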