Transformer-XL

[http://www.google.com/search?q=Transformer+XL+attention+model+deep+machine+learning+ML ...Google search]
 
* [[Bidirectional Encoder Representations from Transformers (BERT)]]
 
* [http://medium.com/dair-ai/a-light-introduction-to-transformer-xl-be5737feb13 A Light Introduction to Transformer-XL | Elvis - Medium]
 
* [http://towardsdatascience.com/transformer-xl-explained-combining-transformers-and-rnns-into-a-state-of-the-art-language-model-c0cfe9e5a924 Transformer-XL Explained: Combining Transformers and RNNs into a State-of-the-art Language Model | Rani Horev - Towards Data Science]
 