Evaluation of Named Entity Recognition in Dutch Online Criminal Complaints

The possibility for citizens to submit crime reports and criminal complaints online is becoming ever more common, especially for cyber- and internet-related crimes such as phishing and online trade fraud. Such user-submitted crime reports contain references to entities of interest, such as the complainant, counterparty, items being traded, and locations. Using named entity recognition (NER) algorithms these entities can be identified and used in further eDiscovery and legal reasoning. This paper describes an evaluation of the de facto standard NER algorithm for Dutch on crime reports provided... Mehr ...

Verfasser: Schraagen, M.P.
Brinkhuis, M.J.S.
Bex, F.J.
Dokumenttyp: Part of book
Erscheinungsdatum: 2017
Schlagwörter: named entity recognition / evaluation / crime reports
Sprache: Englisch
Permalink: https://search.fid-benelux.de/Record/base-26682190
Datenquelle: BASE; Originalkatalog
Powered By: BASE
Link(s) : https://dspace.library.uu.nl/handle/1874/429601

The possibility for citizens to submit crime reports and criminal complaints online is becoming ever more common, especially for cyber- and internet-related crimes such as phishing and online trade fraud. Such user-submitted crime reports contain references to entities of interest, such as the complainant, counterparty, items being traded, and locations. Using named entity recognition (NER) algorithms these entities can be identified and used in further eDiscovery and legal reasoning. This paper describes an evaluation of the de facto standard NER algorithm for Dutch on crime reports provided by the Dutch police. An analysis of confusion in entity type assignment and recall errors is presented, as well as suggestions for performance improvement. The paper concludes with a general discussion on the use of NER in eDiscovery.