Why does BERT need to be combined with other feature extractors?

Time:10-12

BERT already produces very good text representations, so why do many NLP papers still combine BERT with other models, such as BERT + BiGRU? What is the role of the BiGRU here? Is it to extract deeper features from the text after BERT has represented it?
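To make the question concrete, here is a minimal sketch of what "BERT + BiGRU" usually means structurally: BERT gives one vector per token, and a bidirectional GRU reads those vectors left-to-right and right-to-left, concatenating the two hidden states at each position. Everything in it is an assumption for illustration: the BERT output is replaced by random stand-in vectors, the sizes are toy values (BERT-base uses 768 dimensions), biases are omitted, and the names `GRUCell`, `run`, and `bigru` are hypothetical helpers, not taken from any paper. In practice one would likely use a framework layer such as `torch.nn.GRU` with `bidirectional=True`.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(m, v):
    # Plain matrix-vector product on nested lists.
    return [sum(w * x for w, x in zip(row, v)) for row in m]


def rand_matrix(rows, cols, rng):
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


class GRUCell:
    """Toy GRU cell without biases (hypothetical helper for illustration)."""

    def __init__(self, input_size, hidden_size, rng):
        self.hidden_size = hidden_size
        # One input matrix W and one recurrent matrix U per gate:
        # update (z), reset (r), candidate (n).
        self.W = {g: rand_matrix(hidden_size, input_size, rng) for g in "zrn"}
        self.U = {g: rand_matrix(hidden_size, hidden_size, rng) for g in "zrn"}

    def step(self, x, h):
        z = [sigmoid(a + b) for a, b in zip(matvec(self.W["z"], x), matvec(self.U["z"], h))]
        r = [sigmoid(a + b) for a, b in zip(matvec(self.W["r"], x), matvec(self.U["r"], h))]
        rh = [ri * hi for ri, hi in zip(r, h)]
        n = [math.tanh(a + b) for a, b in zip(matvec(self.W["n"], x), matvec(self.U["n"], rh))]
        # New state interpolates between the candidate and the old state.
        return [(1 - zi) * ni + zi * hi for zi, ni, hi in zip(z, n, h)]


def run(cell, seq):
    # Run one direction over the sequence, returning the state at every step.
    h = [0.0] * cell.hidden_size
    states = []
    for x in seq:
        h = cell.step(x, h)
        states.append(h)
    return states


def bigru(fwd, bwd, seq):
    forward = run(fwd, seq)
    backward = run(bwd, seq[::-1])[::-1]  # re-align to original token order
    # Concatenate both directions at each token position.
    return [f + b for f, b in zip(forward, backward)]


rng = random.Random(0)
bert_dim, hidden = 8, 4  # toy sizes, assumed for the example
# Stand-in for BERT's per-token output vectors (5 tokens).
tokens = [[rng.uniform(-1, 1) for _ in range(bert_dim)] for _ in range(5)]
fwd = GRUCell(bert_dim, hidden, rng)
bwd = GRUCell(bert_dim, hidden, rng)
features = bigru(fwd, bwd, tokens)
print(len(features), len(features[0]))  # 5 8
```

The BiGRU output keeps one vector per token, now of size 2 × hidden, which a task-specific classifier or tagger can consume. Whether this extra layer actually helps on top of BERT is exactly what the question asks, and the sketch does not settle it.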