WhiteLab 2.0:A web interface for corpus exploitation

The OpenSoNaR-CGN project set out to develop WhiteLab 2.0 for the online exploitation of the SoNaR-500 and CGN corpora. Important changes in comparison to the first version of WhiteLab are the addition of audio support and support for multiple corpora. The web interface has been redeveloped and adapted to accommodate these changes. At the backend, WhiteLab 2.0 comes with a new data importer and plugin for Neo4j, while also remaining compatible with BlackLab. Although performance of the new backend is not yet up to par with BlackLab, the investment in new technology that will likely be further... Mehr ...

Verfasser: van de Camp, Matje
Reynaert, Martin
Oostdijk, Nelleke
Dokumenttyp: bookPart
Erscheinungsdatum: 2017
Verlag/Hrsg.: Ubiquity Press
London
Schlagwörter: Computer Sciences / Computers and the Humanities / Language and literature / Linguistics / Online corpora / Dutch / written language / BlackLab
Sprache: Englisch
Permalink: https://search.fid-benelux.de/Record/base-27060549
Datenquelle: BASE; Originalkatalog
Powered By: BASE
Link(s) : https://research.tilburguniversity.edu/en/publications/8b8f9b41-101e-438c-b6ca-a4198d78cff7

The OpenSoNaR-CGN project set out to develop WhiteLab 2.0 for the online exploitation of the SoNaR-500 and CGN corpora. Important changes in comparison to the first version of WhiteLab are the addition of audio support and support for multiple corpora. The web interface has been redeveloped and adapted to accommodate these changes. At the backend, WhiteLab 2.0 comes with a new data importer and plugin for Neo4j, while also remaining compatible with BlackLab. Although performance of the new backend is not yet up to par with BlackLab, the investment in new technology that will likely be further developed is expected to make the application more future-proof and a great addition to the set of tools available to the humanities.