State-of-the-art generalisation research in NLP:A taxonomy and review

Hupkes, D.; Giulianelli, M.; Dankers, V.; Artetxe, M.; Elazar, Y.; Pimentel, T.; Christodoulopoulos, C.; Lasri, K.; Saphra, N.; Sinclair, A.; Ulmer, D.; Schottmann, F.; Batsuren, K.; Sun, K.; Sinha, K.; Khalatbari, L.; Ryskina, M.; Frieske, R.; Cotterell, R.; Jin, Z.

Repository landing page

oai:dare.uva.nl:openaire_cris_publications/afe4488c-c94d-4bb8-a221-7b3e8a924bf1

State-of-the-art generalisation research in NLP:A taxonomy and review

Authors: D. Hupkes
M. Giulianelli
V. Dankers
M. Artetxe
Y. Elazar
T. Pimentel
C. Christodoulopoulos
K. Lasri
N. Saphra
A. Sinclair
D. Ulmer
F. Schottmann
K. Batsuren
K. Sun
K. Sinha
L. Khalatbari
M. Ryskina
R. Frieske
R. Cotterell
Z. Jin
Publication date: 6 October 2022
Publisher
Doi

Abstract

The ability to generalise well is one of the primary desiderata of natural language processing (NLP). Yet, what 'good generalisation' entails and how it should be evaluated is not well understood, nor are there any evaluation standards for generalisation. In this paper, we lay the groundwork to address both of these issues. We present a taxonomy for characterising and understanding generalisation research in NLP. Our taxonomy is based on an extensive literature review of generalisation research, and contains five axes along which studies can differ: their main motivation, the type of generalisation they investigate, the type of data shift they consider, the source of this data shift, and the locus of the shift within the modelling pipeline. We use our taxonomy to classify over 400 papers that test generalisation, for a total of more than 600 individual experiments. Considering the results of this review, we present an in-depth analysis that maps out the current state of generalisation research in NLP, and we make recommendations for which areas might deserve attention in the future. Along with this paper, we release a webpage where the results of our review can be dynamically explored, and which we intend to update as new NLP generalisation studies are published. With this work, we aim to take steps towards making state-of-the-art generalisation testing the new status quo in NLP

workingPaper

Similar works

Full text

Open in the Core reader

Download PDF

International Migration, Integration and Social Cohesion online publications

oai:dare.uva.nl:openaire_cris_...

Last time updated on 29/08/2023

This paper was published in International Migration, Integration and Social Cohesion online publications.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.