An Approach to Geotag a Web Sized Corpus of Documents with Addresses in Randstad, Netherlands ...

This paper describes a cluster computing workflow for geotagging a web-sized corpus of documents (3.6 × 10^9 documents, 260 TiB of data) and how semantic similarities between documents geotagged to the same address could be used to verify these tags. ... : Adjunct Proceedings of the 14th International Conference on Location Based Services ...

Author: Czech, Alexander
Document type: Scholarly article
Publication date: 2018
Publisher/Editor: ETH Zurich
Keywords: Geotagging / Data Science / Data Mining / Natural Language Processing
Language: English
Permalink: https://search.fid-benelux.de/Record/base-29161028
Data source: BASE; original catalog
Link(s): https://dx.doi.org/10.3929/ethz-b-000225615