1 Million Captioned Dutch Newspaper Images

Images naturally appear alongside text in a wide variety of media, such as books, magazines, newspapers, and in online articles. This type of multi-modal data offers an interesting basis for vision and language research but most existing datasets use crowdsourced text, which removes the images from their original context. In this paper, we introduce the KBK-1M dataset of 1.6 million images in their original context, with co-occurring texts found in Dutch newspapers from 1922 - 1994. The images are digitally scanned photographs, cartoons, sketches, and weather forecasts; the text is generated f... Mehr ...

Verfasser:	Elliott, Desmond Kleppe, Martijn
Dokumenttyp:	conferencePaper
Erscheinungsdatum:	2016
Verlag/Hrsg.:	European Language Resources Association (ELRA)
Schlagwörter:	Digital Scholarship / Corpus / Tools / Systems / Applications / Multimedia Document Processing
Sprache:	unknown
Permalink:	https://search.fid-benelux.de/Record/base-26689742
Datenquelle:	BASE; Originalkatalog
Powered By:	BASE
Link(s) :	https://zenodo.org/record/844462

Suche in Bibliothekskatalogen:

	Prüfen Sie die Verfügbarkeit in Ihrer Heimatbibliothek
	Suche deutschlandweit und international (KVK – Karlsruher Virtueller Katalog)
	Suche weltweit im Worldcatworldwide_worldcat

Suche via Google:

Suche via Google

Suche in Google Scholar

Suche in Google Books