The State of Transfer Learning in NLP
Sebastian Ruder
AUGUST 18, 2019
2017 ) and pretrained language models ( Peters et al., 2017 ; Peters et al., This goes back to layer-wise training of early deep neural networks ( Hinton et al., 2017 ; Howard and Ruder, 2018 ; Chronopoulou et al., CoNLL 2017 , Kirkpatrick et al., PNAS 2017 ). 2018 ; Akbik et al., 2006 ; Bengio et al.,
Let's personalize your content