Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Long Term consistency metric. #3

Open
LastZhabka opened this issue Aug 5, 2024 · 1 comment
Open

Long Term consistency metric. #3

LastZhabka opened this issue Aug 5, 2024 · 1 comment

Comments

@LastZhabka
Copy link

Hi!

"LT consistency is evaluated by checking the presence of the target character in
selected frames across the test set. Finally, we calculate the ratio of frames containing the target
character to the total number of selected frames as a measure of the accuracy in maintaining long-term
character consistency."

Can you please describe how you select "selected frames" and what do you use to detect the target character in selected frames? If it's possible please describe this metric in more details.

Thank you.

@volcverse
Copy link
Contributor

Hi, sorry for the late reply!

  • To select the frames where a character appears, we first select an image of the main character and use FARL to obtain the character's embedding. We then compute the similarity of this embedding with FARL embeddings of all other frames and select images above a certain threshold. Although this is not exactly precise, it still provide a way to evaluate the consistency.

  • For short-term consistency, we select all the frames where a character appears. We then calculate the similarity of CLIP embeddings of selected consecutive keyframes, as consecutive keyframes where the same character appears usually have high similarity.

  • For long-term consistency, We calculate the proportion of generated frames where the character appears to the ground truth frames where the character appears. For example, if the GT frames where a character appears are 1, 2, 3, 4, 5, and the frames generated by a method where the character appears are 1, 2, 3, 6, then the long-term consistency is calculated as 3/5.

We will improve our writing and present a better paper in the revised version :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants