An Approach to Geotag a Web Sized Corpus of Documents with Addresses in Randstad, Netherlands ...
This paper describes a cluster computing workflow for geotagging a web-sized corpus of documents (3.6 × 10^9 documents, 260 TiB of data) and explains how semantic similarities between documents geotagged to the same address could be used to verify these tags. ... : Adjunct Proceedings of the 14th International Conference on Location Based Services ...
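To make the idea in the abstract concrete, the following is a minimal single-machine sketch, not the paper's cluster workflow. It rests on assumptions that do not come from the record: the address pattern `ADDRESS_RE` is a hypothetical, simplified matcher for Dutch-style addresses, bag-of-words cosine similarity stands in for whatever semantic similarity measure the authors use, and the function names `find_addresses` and `peer_similarity` are illustrative only.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical, simplified pattern: street name with a common Dutch suffix,
# a house number, and a postcode such as "1012 JS". Real address matching
# would need a gazetteer; this is only an illustration.
ADDRESS_RE = re.compile(
    r"\b([A-Z][a-z]+(?:straat|weg|laan|plein|gracht|kade))"
    r"\s+(\d+)\s*,?\s*(\d{4}\s?[A-Z]{2})\b"
)


def find_addresses(text):
    """Return normalized (street, number, postcode) tuples found in text."""
    return [
        (street, int(number), postcode.replace(" ", ""))
        for street, number, postcode in ADDRESS_RE.findall(text)
    ]


def cosine(a, b):
    """Cosine similarity of two term-count vectors (Counter objects)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def peer_similarity(docs):
    """Score each (doc_id, address) tag by mean similarity to its peers.

    docs maps a document id to its text. Documents tagged with the same
    address form a group; a low score marks a tag worth re-checking.
    A tag with no peers gets a score of None.
    """
    groups = defaultdict(list)
    for doc_id, text in docs.items():
        vector = Counter(re.findall(r"[a-z]+", text.lower()))
        for address in set(find_addresses(text)):
            groups[address].append((doc_id, vector))

    scores = {}
    for address, members in groups.items():
        for doc_id, vector in members:
            peers = [cosine(vector, other)
                     for other_id, other in members if other_id != doc_id]
            scores[(doc_id, address)] = sum(peers) / len(peers) if peers else None
    return scores
```

In this sketch a document whose text differs strongly from the others sharing its address receives a lower score, which is one plausible way to flag a tag for verification. At web scale the grouping step would have to be distributed by address key rather than held in memory.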
Author(s): | |
---|---|
Document type: | Scholarly article |
Publication date: | 2018 |
Publisher/Editor: | ETH Zurich |
Keywords: | Geotagging / Data Science / Data Mining / Natural Language Processing |
Language: | English |
Permalink: | https://search.fid-benelux.de/Record/base-29161028 |
Data source: | BASE; original catalog |
Link(s): | https://dx.doi.org/10.3929/ethz-b-000225615 |