Friday, June 23, 2017 - 09:00 to 18:45
Automatic extraction of breast cancer information from clinical reports
The majority of clinical data is only available in unstructured text documents. Thus, their automated usage in data-based clinical application scenarios, like quality assurance and clinical decision support by treatment suggestions, is hindered because it requires high manual annotation efforts. In this work, we introduce a system for the automated processing of clinical reports of mamma carcinoma patients that allows for the automatic extraction and seamless processing of relevant textual features. Its underlying information extraction pipeline employs a rule-based grammar approach that is integrated with semantic technologies to determine the relevant information from the patient record. The accuracy of the system, developed with nine thousand clinical documents, reaches accuracy levels of 90% for lymph node status and 69% for the structurally most complex feature, the hormone status.
Claudia Breischneider's picture
Claudia Breischneider
Sonja Zillner's picture
Sonja Zillner
Siemens AG (DE)
Matthias Hammon's picture
Matthias Hammon
Paul Gass's picture
Paul Gass
Daniel Sonntag's picture
Daniel Sonntag
German Research Center for AI (DE)