TY - GEN
T1 - Ontological attention ensembles for capturing semantic concepts in ICD code prediction from clinical text
AU - Falis, Matúš
AU - Pajak, Maciej
AU - Lisowska, Aneta
AU - Schrempf, Patrick
AU - Deckers, Lucas
AU - Mikhael, Shadia
AU - Tsaftaris, Sotirios A.
AU - O'Neil, Alison Q.
PY - 2019
Y1 - 2019
N2 - We present a semantically interpretable system for automated ICD coding of clinical text documents. Our contribution is an ontological attention mechanism which matches the structure of the ICD ontology, in which shared attention vectors are learned at each level of the hierarchy, and combined into label-dependent ensembles. Analysis of the attention heads shows that shared concepts are learned by the lowest common denominator node. This allows child nodes to focus on the differentiating concepts, leading to efficient learning and memory usage. Visualisation of the multilevel attention on the original text allows explanation of the code predictions according to the semantics of the ICD ontology. On the MIMIC-III dataset we achieve a 2.7% absolute (11% relative) improvement from 0.218 to 0.245 macro-F1 score compared to the previous state of the art across 3,912 codes. Finally, we analyse the labelling inconsistencies arising from different coding practices which limit performance on this task.
AB - We present a semantically interpretable system for automated ICD coding of clinical text documents. Our contribution is an ontological attention mechanism which matches the structure of the ICD ontology, in which shared attention vectors are learned at each level of the hierarchy, and combined into label-dependent ensembles. Analysis of the attention heads shows that shared concepts are learned by the lowest common denominator node. This allows child nodes to focus on the differentiating concepts, leading to efficient learning and memory usage. Visualisation of the multilevel attention on the original text allows explanation of the code predictions according to the semantics of the ICD ontology. On the MIMIC-III dataset we achieve a 2.7% absolute (11% relative) improvement from 0.218 to 0.245 macro-F1 score compared to the previous state of the art across 3,912 codes. Finally, we analyse the labelling inconsistencies arising from different coding practices which limit performance on this task.
UR - https://www.scopus.com/pages/publications/85119384836
UR - https://www.scopus.com/pages/publications/85119384836#tab=citedBy
U2 - 10.18653/v1/D19-6220
DO - 10.18653/v1/D19-6220
M3 - Conference contribution
T3 - LOUHI@EMNLP 2019 - 10th International Workshop on Health Text Mining and Information Analysis, Proceedings
SP - 168
EP - 177
BT - Proceedings of the Tenth International Workshop on Health Text Mining and Information Analysis (LOUHI 2019)
PB - Association for Computational Linguistics (ACL)
T2 - 10th International Workshop on Health Text Mining and Information Analysis, LOUHI@EMNLP 2019
Y2 - 3 November 2019
ER -