Performance issues when fine-tuning db_resnet50 on CORD #1621

nikokks · 2024-05-30T12:12:42Z

Hi all,

When I fine-tune a db_resnet50 on CORD, I get an F1-score of 83.44%.
But according to this paper (https://arxiv.org/pdf/2210.07903), we should get

Why doesn't docTR reach the same performance?

felixdittrich92 (Maintainer) · 2024-05-31T11:47:51Z

Hey @nikokks 🤗

That's a really hard question to answer because so many factors are involved: pre- and post-processing, the augmentations used, the training configuration, etc.

A quick question first: did you use the new augmentation pipeline (#1577) or the old one?

nikokks (Author) · 2024-05-31T12:06:10Z

I actually used the new script with the new data augmentation that you pushed (congratulations again!).
I obtained these results on CORD with db_resnet50, with rotation set to True and eval_straight set to False.

For my part, I investigated a bit, and it turns out that docTR uses rectangular (4-point) polygons, whereas most open-source detection models use polygons with more than 4 sides, which boosts their detection performance.
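To illustrate why the shape of the predicted region could matter, here is a minimal sketch in plain Python. It does not use docTR's API: the curved word, the helper names, and the axis-aligned rectangle are all assumptions made for illustration. It compares the area of a many-sided polygon that follows a curved word with the area of the rectangle needed to enclose it.

```python
import math


def polygon_area(points):
    """Area of a simple polygon [(x, y), ...] via the shoelace formula."""
    area = 0.0
    n = len(points)
    for i in range(n):
        x1, y1 = points[i]
        x2, y2 = points[(i + 1) % n]
        area += x1 * y2 - x2 * y1
    return abs(area) / 2.0


def bounding_rectangle(points):
    """Axis-aligned rectangle enclosing the points (a simplification of a 4-point box)."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    return [(min(xs), min(ys)), (max(xs), min(ys)),
            (max(xs), max(ys)), (min(xs), max(ys))]


def curved_text_polygon(n_points=8, radius=100.0, height=20.0):
    """Hypothetical curved word: a band following an arc from 45 to 135 degrees."""
    angles = [math.radians(45 + 90 * i / (n_points - 1)) for i in range(n_points)]
    outer = [((radius + height) * math.cos(a), (radius + height) * math.sin(a))
             for a in angles]
    inner = [(radius * math.cos(a), radius * math.sin(a)) for a in reversed(angles)]
    return outer + inner  # 2 * n_points vertices, no self-intersection


poly = curved_text_polygon()
rect = bounding_rectangle(poly)

# Fraction of the rectangle that is actually covered by the text region.
fill_ratio = polygon_area(poly) / polygon_area(rect)
print(f"polygon vertices: {len(poly)}, fill ratio of the rectangle: {fill_ratio:.2f}")
```

In this toy case the rectangle covers much more area than the text itself, so an IoU-based metric computed against a polygon ground truth would penalize the rectangle. Whether this explains the gap on CORD is not something the sketch can show.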

Here, for example, are open-source models that produce non-rectangular text masks:
https://github.com/GXYM/TextBPN
or
https://github.com/WenmuZhou/DBNet.pytorch

According to the paper mentioned above (https://arxiv.org/pdf/2210.07903), non-rectangular detection with more than 4 sides does not decrease text recognition performance.

Do you think docTR could allow polygons with more than 4 sides (i.e. non-rectangular ones)?

What do you think about it? =)
