
The Temporal Persistence of Generative Language Models in Sentiment Analysis

Abstract

Pre-trained transformer-based language models (PLMs) have revolutionised text classification tasks, but their performance tends to deteriorate on data distant in time from the training dataset. Continual supervised re-training helps address this issue, but it is limited by the availability of newly labelled samples. This paper explores the longitudinal generalisation abilities of large generative PLMs, such as GPT-3 and T5, and smaller encoder-only alternatives for sentiment analysis in social media. We investigate the impact of time-related variations in data, model size, and fine-tuning on the classifiers' performance. Through competitive evaluation in the CLEF-2023 LongEval Task 2, we compare results from fine-tuning, few-shot learning, and zero-shot learning. Our analysis reveals the superior performance of large generative models over the benchmark RoBERTa and highlights the benefits of limited exposure to training data in achieving robust predictions on temporally distant test sets. The findings contribute to understanding how to build more temporally robust transformer-based text classifiers, reducing the need for continuous re-training with annotated data.


This paper was published in OPUS.
