Release Release v1.0.0 · PKU-YuanGroup/Open-Sora-Plan

Added text conditional control to generate videos.
Support HUAWEI NPU in hw branch.
Released all training data and annotations.
Add training, sampling scripts.
Add CausalVideoVAE training details.

We trained all models to use 40K videos crawled from the web, most of which are landscape related content. The complete training process takes about 2048 GPU hours. More detailed changes can be found in our report.

We hope this release further benefits the community and makes text-to-video models more accessible.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Release v1.0.0