Susanne Ibing, Julian Hugo, Florian Borchert, Linea Schmidt, Caroline Benson, Allison Marshall, Colleen Chasteau, Ujunwa Korie, Diana Paguay, Jan Philipp Sachs, Bernhard Y. Renard, Judy H. Cho, Erwin P. Böttinger, Ryan C. Ungaro. Electronic Health Records-based identification of newly diagnosed Crohn’s Disease cases. Artificial Intelligence in Medicine, Volume 159, January 2025, 103032 IF = 6.1
Publications
2025
2024
Florian Borchert, Ignacio Llorca, Matthieu-P. Schapranow. Improving biomedical entity linking for complex entity mentions with LLM-based text simplification. Database, Volume 2024, 2024, baae067 [Code]
Keno K. Bressem, Jens-Michalis Papaioannou, Paul Grundmann, Florian Borchert, Lisa C. Adams, Leonhard Liu, Felix Busch, Lina Xu, Jan P. Loyen, Stefan M. Niehues, Moritz Augustin, Lennart Grosser, Marcus R. Makowski, Hugo JWL. Aerts, Alexander Löser. medBERT.de: A Comprehensive German BERT Model for the Medical Domain. Expert Systems with Applications (2024): 121598 [Hugging Face Model] IF = 8.5
2023
Florian Borchert, Ignacio Llorca, Roland Roller, Bert Arnrich, Matthieu-P. Schapranow xMEN: A Modular Toolkit for Cross-Lingual Medical Entity Normalization. arXiv preprint arXiv:2310.11275 (2023). [Code] [Hugging Face Models]
Smilla Fox, Martin Preiß, Florian Borchert, Aadil Rasheed, Matthieu-P. Schapranow HPIDHC at NTCIR-17 MedNLP-SC: Data Augmentation and Ensemble Learning for Multilingual Adverse Drug Event Detection. NTCIR 17 Conference: Proceedings of the 17th NTCIR Conference on Evaluation of Information Access Technologies. pp. 185–192 (2023)
Florian Borchert and Matthieu-P. Schapranow. HPI-DHC @ BC8 SympTEMIST Track: Detection and Normalization of Symptom Mentions with SpanMarker and xMEN. Proceedings of the BioCreative VIII Challenge and Workshop: Curation and Evaluation in the Era of Generative Models. New Orleans, USA (2023) 🏆 1st place SympTEMIST shared task (entity linking subtrack) [Code]
Danielly de Paula, Florian Borchert, Ariane Sasso, Falk Uebernickel Understanding emotions in the context of IT-based self-monitoring . arXiv preprint arXiv:2311.05449 (2023).
Linea Schmidt, Susanne Ibing, Florian Borchert, Julian Hugo, Allison Marshall, Jellyana Peraza, Judy H. Cho, Erwin P. Böttinger, Ryan C. Ungaro Extraction of Crohn’s Disease Clinical Phenotypes from Clinical Text Using Natural Language Processing. medRxiv 2023.10.16.23297099 (2023)
Nico Steckhan, Raphaela Ring, Florian Borchert, Daniela A. Koppold Triangulation of Questionnaires, Qualitative Data and Natural Language Processing: A Differential Approach to Religious Bahá’í Fasting in Germany. J Relig Health (2023)
Florian Borchert, Ignacio Llorca, Matthieu-P. Schapranow Cross-Lingual Candidate Retrieval and Re-ranking for Biomedical Entity Linking. In: Experimental IR Meets Multilinguality, Multimodality, and Interaction. CLEF 2023. Lecture Notes in Computer Science, vol 14163. Springer, Cham 🏆 Best of Labs (BioASQ, CLEF 2022)
Ignacio Llorca, Florian Borchert, Matthieu-P. Schapranow A Meta-dataset of German Medical Corpora: Harmonization of Annotations and Cross-corpus NER Evaluation. In: Proceedings of the 5th Clinical Natural Language Processing Workshop, pages 171–181, Toronto, Canada. Association for Computational Linguistics
Niklas Kämmer*, Florian Borchert*, Silvia Winkler, Gerard de Melo, and Matthieu-P. Schapranow Resolving Elliptical Compounds in German Medical Text.. In: The 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks, pages 292–305, Toronto, Canada. Association for Computational Linguistics
Matthieu-P. Schapranow, Florian Borchert, Nina Bougatf, Hauke Hund, and Roland Eils. Software-Tool Support for Collaborative, Virtual, Multi-Site Molecular Tumor Boards. SN Computer Science 4, 358, 2023
Nektarios Ladas, Florian Borchert, Stefan Franz, Alina Rehberg, Natalia Strauch, Kim Katrin Sommer, Michael Marschollek, Matthias Gietzelt Programming techniques for improving rule readability for rule-based information extraction natural language processing pipelines of unstructured and semi-structured medical texts. Health Informatics Journal; 29(2) (2023)
Phillip Richter-Pechanski, Philipp Wiesenbach, Dominic M. Schwab, Christina Kiriakou, Mingyang He, Michael M. Allers, Anna S. Tiefenbacher, Nicola Kunz, Anna Martynova, Noemie Spiller, Julian Mierisch, Florian Borchert, Charlotte Schwind, Norbert Frey, Christoph Dieterich & Nicolas A. Geis. A distributable German clinical corpus containing cardiovascular clinical routine doctor’s letters. Scientific Data 10, 207 (2023) [Data Access] IF = 10.8
Julian Hugo, Susanne Ibing, Florian Borchert, Jan Philipp Sachs, Judy Cho, Ryan C. Ungaro and Erwin P. Böttinger. Machine Learning Based Prediction of Incident Cases of Crohn’s Disease Using Electronic Health Records from a Large Integrated Health System. In: Juarez, J.M., Marcos, M., Stiglic, G., Tucker, A. (eds) Artificial Intelligence in Medicine. AIME 2023. Lecture Notes in Computer Science, vol 13897. Springer, Cham 🏆 Best Student Paper
Sandro Steinwand*, Florian Borchert*, Silvia Winkler and Matthieu-P. Schapranow. GGTWEAK: Gene Tagging with Weak Supervision for German Clinical Text. In: Juarez, J.M., Marcos, M., Stiglic, G., Tucker, A. (eds) Artificial Intelligence in Medicine. AIME 2023. Lecture Notes in Computer Science, vol 13897. Springer, Cham [Code]
2022
Florian Borchert and Matthieu-P. Schapranow. HPI-DHC @ BioASQ DisTEMIST: Spanish Biomedical Entity Linking with Pre-trained Transformers and Cross-lingual Candidate Retrieval. Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum, pp. 244-258. Bologna, Italy. 🏆 1st place DisTEMIST shared task (entity linking subtrack) [Link] [Code]
Florian Borchert, Christina Lohr, Luise Modersohn, Jonas Witt, Thomas Langer, Markus Follmann, Matthias Gietzelt, Bert Arnrich, Udo Hahn and Matthieu-P. Schapranow. GGPONC 2.0 - The German Clinical Guideline Corpus for Oncology: Curation Workflow, Annotation Policy, Baseline NER Taggers. LREC 2022 — Proceedings of the Language Resources and Evaluation Conference, pp. 3650‑3660. Marseille, France, European Language Resources Association, 2022 [Data Access] [Code]
2021
Florian Borchert, Laura Meister, Thomas Langer, Markus Follmann, Bert Arnrich, and Matthieu-P. Schapranow. Controversial Trials First: Identifying Disagreement Between Clinical Guidelines and New Evidence, Proceedings of the AMIA Annual Symposium, pp. 237-246, San Diego, USA (2021) 🏆 Distinguished Paper Award [Link]
Aadil Rasheed, Florian Borchert, Lasse Kohlmeyer, Richard Henkenjohann, and Matthieu-P. Schapranow. A Comparison of Concept Embeddings for German Clinical Corpora. IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 2314-2321, Online, 2021
Richard Henkenjohann, Benjamin Bergner, Florian Borchert, Nina Bougatf, Hauke Hund, Roland Eils, and Matthieu-P. Schapranow. An Engineering Approach towards Multi-Site Virtual Molecular Tumor Board Software Support. Proceedings of the 1st Conference on ICT for Health, Accessibility and Wellbeing. Springer International Publishing, 2021
Florian Borchert*, Andreas Mock*, Aurelie Tomczak*, Jonas Hügel, Samer Alkarkoukly, Alexander Knurr, Anna-Lena Volckmar, Albrecht Stenzinger, Peter Schirmacher, Jürgen Debus, Dirk Jäger, Thomas Longerich, Stefan Fröhling, Roland Eils, Nina Bougatf, Ulrich Sax, Matthieu-P Schapranow. Knowledge Bases and Software Support for Variant Interpretation in Precision Oncology, Briefings in Bioinformatics, Volume 22, Issue 6, November 2021, bbab134 (* equal contribution) IF = 11.6
2020
Florian Borchert*, Christina Lohr*, Luise Modersohn*, Thomas Langer, Markus Follmann, Jan Philipp Sachs, Udo Hahn, Matthieu-P. Schapranow. GGPONC: A Corpus of German Medical Text with Rich Metadata Based on Clinical Practice Guidelines. In Proceedings of the 11th International Workshop on Health Text Mining and Information Analysis, pp. 38–48. Online: Association for Computational Linguistics, 2020. (* equal contribution) [Data Access] [Code]