DeFormer: Decomposing Pre-trained Transformers For Faster Question Answering. (arXiv:2005.00697v1 [cs.CL])

[Submitted on 2 May 2020]

Abstract: Transformer-based QA models use input-wide self-attention — i.e. across both
the question and the input passage — at all layers, causing them to be slow
and memory-intensive. It turns out that we can get by without input-wide
self-attention at all layers, especially in the lower layers. We introduce
DeFormer, a decomposed transformer, which substitutes the full self-attention
with question-wide and passage-wide self-attentions in the lower layers. This
allows for question-independent processing of the input text representations,
which in turn enables pre-computing passage representations reducing runtime
compute drastically. Furthermore, because DeFormer is largely similar to the
original model, we can initialize DeFormer with the pre-training weights of a
standard transformer, and directly fine-tune on the target QA dataset. We show
DeFormer versions of BERT and XLNet can be used to speed up QA by over 4.3x and
with simple distillation-based losses they incur only a 1% drop in accuracy. We
open source the code at this https URL.

Submission history

From: Qingqing Cao [view email]
[v1]
Sat, 2 May 2020 04:28:22 UTC (1,075 KB)

Source: http://arxiv.org/abs/2005.00697

Generative Data Intelligence

DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering. (arXiv:2005.00697v1 [cs.CL])

Submission history

Decoding the Black Box: How AI is unveiling the secrets of crypto money laundering

Ethereum Struggles To Reach A High Of $3,239 And Is Stuck In A Range

Latest Intelligence

Gaming Chain Immutable X Hints ‘Biggest Airdrop of the Year’ | BitPinas

Former Solicitor General Launches ‘Basta Bitcoin’ Podcast Exploring Monetary Evolution | BitPinas

In a bull market, everyone thinks they’re a VC

Altcoins Shine Amidst Bitcoin Gloom: $LINK, $DOT, $ADA, $XRP Attract Investors’ Interest

Biden Unlocks $7 Billion Solar Investment On Earth Day – CleanTechnica

Family Nature Expeditions Activity Kits crowdfunding opportunity project pitch by Indiegogo

Chat with us