Skip to content

Release v1.0.0

Compare
Choose a tag to compare
@LinB203 LinB203 released this 09 Apr 06:43
· 548 commits to main since this release
  • Added text conditional control to generate videos.
  • Support HUAWEI NPU in hw branch.
  • Released all training data and annotations.
  • Add training, sampling scripts.
  • Add CausalVideoVAE training details.

We trained all models to use 40K videos crawled from the web, most of which are landscape related content. The complete training process takes about 2048 GPU hours. More detailed changes can be found in our report.

We hope this release further benefits the community and makes text-to-video models more accessible.