Contextual property detection in Dutch diagnosis descriptions for uncertainty, laterality and temporality ...
Abstract Background Accurate, coded problem lists are valuable for data reuse, including clinical decision support and research. However, healthcare providers frequently modify coded diagnoses by including or removing common contextual properties in free-text diagnosis descriptions: uncertainty (suspected glaucoma), laterality (left glaucoma) and temporality (glaucoma 2002). These contextual properties could cause a difference in meaning between underlying diagnosis codes and modified descriptions, inhibiting data reuse. We therefore aimed to develop and evaluate an algorithm to identify these... Mehr ...
Verfasser: | |
---|---|
Dokumenttyp: | Datenquelle |
Erscheinungsdatum: | 2021 |
Verlag/Hrsg.: |
figshare
|
Schlagwörter: | Medicine / 39999 Chemical Sciences not elsewhere classified / FOS: Chemical sciences / 69999 Biological Sciences not elsewhere classified / FOS: Biological sciences / 80699 Information Systems not elsewhere classified / FOS: Computer and information sciences / Cancer / Science Policy |
Sprache: | unknown |
Permalink: | https://search.fid-benelux.de/Record/base-28983875 |
Datenquelle: | BASE; Originalkatalog |
Powered By: | BASE |
Link(s) : | https://dx.doi.org/10.6084/m9.figshare.c.5368416 |
Abstract Background Accurate, coded problem lists are valuable for data reuse, including clinical decision support and research. However, healthcare providers frequently modify coded diagnoses by including or removing common contextual properties in free-text diagnosis descriptions: uncertainty (suspected glaucoma), laterality (left glaucoma) and temporality (glaucoma 2002). These contextual properties could cause a difference in meaning between underlying diagnosis codes and modified descriptions, inhibiting data reuse. We therefore aimed to develop and evaluate an algorithm to identify these contextual properties. Methods A rule-based algorithm called UnLaTem (Uncertainty, Laterality, Temporality) was developed using a single-center dataset, including 288,935 diagnosis descriptions, of which 73,280 (25.4%) were modified by healthcare providers. Internal validation of the algorithm was conducted with an independent sample of 980 unique records. A second validation of the algorithm was conducted with 996 ...