GGPONC 2.0 @ LREC

Published: April 04, 2022

Our paper on the new release of GGPONC 2.0 has been accepted at LREC! The new dataset is currently the largest freely distributable annotated corpus of German medical text (1.87M tokens, 250K annotations). We also created baseline NER models with Hugging Face Transformers.

Get access to the corpus and models here

Florian Borchert