Transformer-XL: Attentive Language Models beyond a Fixed-Length Context
Zihang Dai
,
Zhilin Yang
,
Yiming Yang
,
Zihang Dai
,
Zhilin Yang
,
Yiming Yang
,
Jaime Carbonell
,
Quoc V. Le
,
Ruslan Salakhutdinov
2019
3,018 citations