tag : transformers

2 publications
page_white_acrobat Language Models are Unsupervised Multitask Learners (2019) — Alec Radford, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei, Ilya Sutskever
page_white_acrobat Attention Is All You Need (2017) — Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, Illia Polosukhin