number of trainable parameters #19

andrey999333 · 2019-02-02T07:37:08Z

I don't quite understand one point. When I downloaded your keras representation of BERT and check the number of trainable parameters in summary, it showed ~177 mil parameters, while in official bert it should be 110 mil for base model. Could you explain where this difference comes from?

Separius · 2019-02-02T07:46:43Z

Hi,
I'm not entirely sure, but maybe it's because of the subword embeddings?
most of the time people don't count input embeddings in their model parameters.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

number of trainable parameters #19

number of trainable parameters #19

andrey999333 commented Feb 2, 2019 •

edited

Loading

Separius commented Feb 2, 2019

number of trainable parameters #19

number of trainable parameters #19

Comments

andrey999333 commented Feb 2, 2019 • edited Loading

Separius commented Feb 2, 2019

andrey999333 commented Feb 2, 2019 •

edited

Loading