{"id":2876,"date":"2025-04-07T10:48:09","date_gmt":"2025-04-07T13:48:09","guid":{"rendered":"https:\/\/icc.fcen.uba.ar\/?p=2876"},"modified":"2025-04-07T10:48:09","modified_gmt":"2025-04-07T13:48:09","slug":"information-extraction-from-electronic-health-records-written-in-spanish-for-epidemic-intelligence","status":"publish","type":"post","link":"https:\/\/icc.fcen.uba.ar\/en\/information-extraction-from-electronic-health-records-written-in-spanish-for-epidemic-intelligence\/","title":{"rendered":"Information Extraction from Electronic Health Records Written in Spanish for Epidemic Intelligence"},"content":{"rendered":"<div class=\"fusion-fullwidth fullwidth-box fusion-builder-row-1 fusion-flex-container has-pattern-background has-mask-background nonhundred-percent-fullwidth non-hundred-percent-height-scrolling\" style=\"--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-flex-wrap:wrap;\" ><div class=\"fusion-builder-row fusion-row fusion-flex-align-items-flex-start fusion-flex-content-wrap\" style=\"max-width:1144px;margin-left: calc(-4% \/ 2 );margin-right: calc(-4% \/ 2 );\"><div class=\"fusion-layout-column fusion_builder_column fusion-builder-column-0 fusion_builder_column_1_1 1_1 fusion-flex-column\" style=\"--awb-bg-size:cover;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:1.92%;--awb-margin-bottom-large:20px;--awb-spacing-left-large:1.92%;--awb-width-medium:100%;--awb-order-medium:0;--awb-spacing-right-medium:1.92%;--awb-spacing-left-medium:1.92%;--awb-width-small:100%;--awb-order-small:0;--awb-spacing-right-small:1.92%;--awb-spacing-left-small:1.92%;\"><div class=\"fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column\"><div class=\"fusion-text fusion-text-1\"><p>Authors: Javier Petri, Pilar Barcena Barbeira and Viviana Cotik.<\/p>\n<p>Abstract:<br \/>\nAutomatic symptom detection from electronic health records is a valuable source for event-based surveillance systems. In this study, we develop tools to automatically detect symptoms associated with febrile illnesses in electronic health records written in Spanish. Therefore, we use a custom corpus, comprising 6228 expertly labeled and approximately 1 million unlabeled health reports. Our approach involved fine-tuning state-of-the-art named entity recognition models, including BiLSTM-CRF and transformer-based models like RoBERTa. We focused on domain-adaptive and task-adaptive models to enhance performance: the former were pretrained on biomedical corpora, while the latter were further pretrained on our unlabeled health reports. Despite computational constraints, our models demonstrated promising results, with RoBERTa-Clinico, a task-adaptive transformer model pretrained in our unlabeled corpus, showing the best micro recall performance (79.30), and 70.83 micro F1 score, which are comparable to results in similar studies. In this way, we contribute to the limited body of work in BioNLP in Spanish.<\/p>\n<p>More information:<br \/>\n<a href=\"http:\/\/dx.doi.org\/10.1007\/978-3-031-80366-6_35\" target=\"_blank\" rel=\"noopener\">http:\/\/dx.doi.org\/10.1007\/978-3-031-80366-6_35<\/a><\/p>\n<\/div><\/div><\/div><\/div><\/div>\n","protected":false},"excerpt":{"rendered":"","protected":false},"author":9,"featured_media":2877,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[98],"tags":[],"class_list":["post-2876","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-papers"],"_links":{"self":[{"href":"https:\/\/icc.fcen.uba.ar\/en\/wp-json\/wp\/v2\/posts\/2876","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/icc.fcen.uba.ar\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/icc.fcen.uba.ar\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/icc.fcen.uba.ar\/en\/wp-json\/wp\/v2\/users\/9"}],"replies":[{"embeddable":true,"href":"https:\/\/icc.fcen.uba.ar\/en\/wp-json\/wp\/v2\/comments?post=2876"}],"version-history":[{"count":1,"href":"https:\/\/icc.fcen.uba.ar\/en\/wp-json\/wp\/v2\/posts\/2876\/revisions"}],"predecessor-version":[{"id":2878,"href":"https:\/\/icc.fcen.uba.ar\/en\/wp-json\/wp\/v2\/posts\/2876\/revisions\/2878"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/icc.fcen.uba.ar\/en\/wp-json\/wp\/v2\/media\/2877"}],"wp:attachment":[{"href":"https:\/\/icc.fcen.uba.ar\/en\/wp-json\/wp\/v2\/media?parent=2876"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/icc.fcen.uba.ar\/en\/wp-json\/wp\/v2\/categories?post=2876"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/icc.fcen.uba.ar\/en\/wp-json\/wp\/v2\/tags?post=2876"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}