-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
训练显存 #32
Comments
1、DeepSpeed Zero2模式,全参微调,单卡80G可能无法运行(可以尝试打开low_vram,vae.enable_tiling参数) |
好的,谢谢您。那请问batchsize设置成1大概要train多少个step呢 |
感谢关注,这个得具体看实验结果来判断了,我还没尝试过bs1训练。 |
我看您论文中是batchsize为80,1.8k个step,那我train 大概80×1.8k个step可能会有成效吗 |
这里面有两个变量,我也不是很好判断:
|
明白,谢谢您的回答 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
请问训练的显存需要多少呢?
我在单张H100 80G上也会爆显存,使用了CogVideoX-5b-I2V为初始权重,加入is_train_lora参数也会爆显存
The text was updated successfully, but these errors were encountered: