Does the repo support OFA pre-training? #1
Thanks for your support! I have just released a version of the pre-training script; please try pretrain.sh. However, I do not recommend using this framework for pre-training: we have tried pre-training under it before, and its performance was not as good as the original repo (the fairseq version).
Thank you for your prompt reply. Does the suboptimal performance refer to training from scratch? Have you tried loading a checkpoint (from the fairseq version) and continuing pre-training with new data or tasks? By the way, will OFA pre-training continue to be maintained in the fairseq version, or will it gradually move to the Huggingface version? Thank you!
Yes, the performance refers to training from scratch; we haven't tried continuing pre-training on new data from a loaded checkpoint. If you are interested, you are welcome to try it. In the future, OFA pre-training will still be maintained in the fairseq version, but the compression features (including distillation, pruning, and quantization) will be supported by this repo.
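For anyone who wants to experiment with that, here is a minimal sketch of what continuing pre-training from a converted checkpoint could look like. The `OFAModel` import, the checkpoint path, `build_new_data_loader`, and the `.loss` output field are all assumptions for illustration, not confirmed parts of this repo's API; pretrain.sh remains the actual entry point.

```python
# Rough sketch only -- OFAModel, the checkpoint path, and build_new_data_loader
# are placeholders/assumptions, not confirmed parts of this repo's API.
from torch.optim import AdamW

from ofa import OFAModel  # hypothetical import of the Huggingface-style OFA model

# Weights converted from the fairseq checkpoint (the conversion step is not shown).
model = OFAModel.from_pretrained("path/to/converted_fairseq_ckpt")
model.train()
optimizer = AdamW(model.parameters(), lr=1e-5)  # small LR for continued pre-training

data_loader = build_new_data_loader()  # hypothetical loader over the new data/tasks

for batch in data_loader:
    loss = model(**batch).loss  # assumes the model output exposes a .loss field
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```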
Thank you! In addition, I am a little confused about some details in three places (process_image_text_pair, L12&45 of pretrain.sh, and init_task.py): their configurations and dimensions seem inconsistent.
Awesome job! The Huggingface version of OFA looks more concise. Can this framework support multi-task pre-training like the original repo?
I see that this code file contains many different tasks. Could you provide more details about pre-training (such as data preparation and the submission script)?
Thank you again, and I look forward to your reply!
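To illustrate the kind of multi-task mixing being asked about, here is a small, self-contained sketch of sampling batches from several task-specific datasets by weight. The task names, datasets, and weights are dummies for illustration and are not taken from this repo's implementation.

```python
# Illustrative multi-task batch mixing -- task names, datasets, and weights are
# dummy placeholders, not this repo's actual pre-training implementation.
import random

import torch
from torch.utils.data import DataLoader, TensorDataset

# Dummy stand-ins for real task-specific datasets (e.g. captioning, VQA, matching).
task_loaders = {
    "caption": DataLoader(TensorDataset(torch.randn(128, 8)), batch_size=16, shuffle=True),
    "vqa": DataLoader(TensorDataset(torch.randn(128, 8)), batch_size=16, shuffle=True),
    "image_text_matching": DataLoader(TensorDataset(torch.randn(128, 8)), batch_size=16, shuffle=True),
}

def mixed_batches(loaders, weights, steps):
    """Yield (task_name, batch) pairs, sampling one task per step by weight."""
    iters = {name: iter(dl) for name, dl in loaders.items()}
    names = list(loaders)
    for _ in range(steps):
        task = random.choices(names, weights=weights, k=1)[0]
        try:
            batch = next(iters[task])
        except StopIteration:  # restart an exhausted task loader
            iters[task] = iter(loaders[task])
            batch = next(iters[task])
        yield task, batch

for task, batch in mixed_batches(task_loaders, weights=[0.4, 0.4, 0.2], steps=100):
    pass  # the forward/backward pass with the task-specific loss would go here
```

This only shows the batch-sampling idea; how the per-task losses are weighted and combined is a separate design choice.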