Bidirectional Encoder Representations from Transformers (BERT)

 
* [http://arxiv.org/abs/1909.10351 TinyBERT: Distilling BERT for Natural Language Understanding | X. Jiao, Y. Yin, L. Shang, X. Jiang, X. Chen, L. Li, F. Wang, and Q. Liu] - researchers at Huawei distill BERT into a model called TinyBERT that is 7.5 times smaller and nearly 10 times faster than the original, while reaching nearly the same language-understanding performance. A minimal sketch of the distillation idea follows this list.
 
* [[Google]]
 
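For context, knowledge distillation trains a small "student" network to mimic a larger "teacher". Below is a minimal sketch of the generic soft-target distillation loss in Python/PyTorch. It is illustrative only: the function name <code>distillation_loss</code>, the toy logits, and the hyperparameter values are assumptions made up for the example, and TinyBERT's actual objective additionally matches embeddings, attention matrices, and hidden states layer by layer, which is not shown here.

<pre>
# Minimal, generic knowledge-distillation loss (soft targets + hard labels).
# Illustrative sketch only; not TinyBERT's full layer-wise objective.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=2.0, alpha=0.5):
    """Blend soft-target KL divergence with hard-label cross-entropy.

    student_logits, teacher_logits: (batch, num_classes) raw scores.
    labels: (batch,) gold class indices.
    temperature: softens both distributions; the soft term is rescaled
        by T^2 so its gradient magnitude stays comparable.
    alpha: weight on the soft-target term.
    """
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * temperature ** 2
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard

# Toy usage: random logits standing in for the teacher (BERT) and the
# student (a smaller model such as TinyBERT).
teacher_logits = torch.randn(8, 2)
student_logits = torch.randn(8, 2, requires_grad=True)
labels = torch.randint(0, 2, (8,))
loss = distillation_loss(student_logits, teacher_logits, labels)
loss.backward()
</pre>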
<img src="http://miro.medium.com/max/2070/1*IFVX74cEe8U5D1GveL1uZA.png" width="800" height="500">
  
 
<youtube>bDxFvr1gpSU</youtube>
 