Why gender and age prediction from tweets is hard : lessons from a crowdsourcing experiment
There is a growing interest in automatically predicting the gender and age of authors from texts. However, most research so far ignores that language use is related to the social identity of speak- ers, which may be different from their biological identity. In this paper, we combine insights from sociolinguistics with data collected through an online game, to underline the importance of approaching age and gender as social variables rather than static biological variables. In our game, thousands of players guessed the gender and age of Twitter users based on tweets alone. We show that more tha... Mehr ...
Verfasser: | |
---|---|
Dokumenttyp: | conference |
Erscheinungsdatum: | 2014 |
Verlag/Hrsg.: |
Dublin City University and Association for Computational Linguistics
|
Schlagwörter: | Languages and Literatures / social media / Dutch / twitter / gender / crowdsourcing / Lt3 / computational sociolinguistics |
Sprache: | Englisch |
Permalink: | https://search.fid-benelux.de/Record/base-26675649 |
Datenquelle: | BASE; Originalkatalog |
Powered By: | BASE |
Link(s) : | https://biblio.ugent.be/publication/8694786 |