You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I was wondering whether you have a plan to share the code for semi-automated dataset generation (the pipeline of using Katna to extract keyframes -> using BLIP2 & GRIT to generate frame-wise captions -> filtering with Tag2Text). If not, is it possible to share the generated dense captions from these large vision models?
Thank you!
The text was updated successfully, but these errors were encountered:
I appreciate your interest in our work. We recently released our work called VideoGPT+ and an improved semi-automatic video annotation pipeline for dataset generation. All the scripts to run the pipeline are also released.
Hi @mmaaz60, thanks for sharing this great work!
I was wondering whether you have a plan to share the code for semi-automated dataset generation (the pipeline of using Katna to extract keyframes -> using BLIP2 & GRIT to generate frame-wise captions -> filtering with Tag2Text). If not, is it possible to share the generated dense captions from these large vision models?
Thank you!
The text was updated successfully, but these errors were encountered: