Improving Dutch vaccine hesitancy monitoring via multi-label data augmentation with GPT-3.5
Abstract: In this paper, we leverage the GPT-3.5 language model both using the Chat-GPT API interface and the GPT-3.5 API interface to generate realistic examples of anti-vaccination tweets in Dutch with the aim of augmenting an imbalanced multi-label vaccine hesitancy argumentation classification dataset. In line with previous research, we devise a prompt that, on the one hand, instructs the model to generate realistic examples based on the human dataset (gold standard) and, on the other hand, to assign one or multiple labels to the generated instances. We then augment our gold standard data... Mehr ...
Verfasser: | |
---|---|
Dokumenttyp: | conferenceObject |
Erscheinungsdatum: | 2023 |
Schlagwörter: | Linguistics |
Sprache: | Englisch |
Permalink: | https://search.fid-benelux.de/Record/base-27061536 |
Datenquelle: | BASE; Originalkatalog |
Powered By: | BASE |
Link(s) : | https://hdl.handle.net/10067/2032100151162165141 |