Scans and transcriptions of the VOC and the Haarlem notarial deeds archives ...
The National Archives of the Netherlands and the Noord-Hollands Archief started a collaboration with the Transkribus HTR (Handwritten Text Recognition) platform in order to semi automatically transcribe 2 million pages of old Dutch texts. The archives are 17th and 18th century material from the Dutch East-Asia Company (VOC) and 19th century notarial deeds from the city of Haarlem. In order to train the HTR software, human made transciptions had to be made. These datasets contain the scans (.jpg images) with the transcriptions in ALTO xml format (word level) that have been made in order to trai... Mehr ...
Verfasser: | |
---|---|
Dokumenttyp: | dataset |
Erscheinungsdatum: | 2020 |
Verlag/Hrsg.: |
Zenodo
|
Schlagwörter: | Transciptions / Verenigde Oost-Indische Compagnie / Notarial deeds / Nationaal Archief / Noord-Hollands Archief / Transkribus |
Sprache: | unknown |
Permalink: | https://search.fid-benelux.de/Record/base-29071472 |
Datenquelle: | BASE; Originalkatalog |
Powered By: | BASE |
Link(s) : | https://dx.doi.org/10.5281/zenodo.3906480 |