[1] Marisa Delz, Benjamin Layer, Sarah Schulz, and Johannes Wahle. Overgeneralization of verbs — The change of the German verb system. In Proceedings of the 9th International Conference on the Evolution of Language, Evolang IX, 96–103. Kyoto, Japan, 3 2012. URL:

[2] Bart Desmet, Orphée De Clercq, Marjan Van de Kauter, Sarah Schulz, Cynthia Van Hee, and Veronique Hoste. Taaltechnologie 2.0: sentimentanalyse en normalisatie, pages 157–161. Beschouwingen uit een talenhuis : opstellen over onderwijs en onderzoek in de vakgroep Vertalen, Tolken en Communicatie aangeboden aan Rita Godyns. Academia Press, 2014. URL:\_2014\_Desmet.pdf.

[3] Sarah Schulz, Verena Lyding, and Lionel Nicolas. Compiling a diverse web corpus for South Tyrolean German - STirWaC. In Proceedings of the 8th Web as Corpus Workshop, 37–45. Lancaster, UK, 2013.

[4] Orphée De Clercq, Sarah Schulz, Bart Desmet, Els Lefever, and Véronique Hoste. Normalization of Dutch User-Generated Content. In Proceedings of the 9th International Conference on Recent Advances in Natural Language Processing. Hissar, Bulgaria, 2013.

[5] Orphée De Clercq, Schulz Schulz, Bart Desmet, and Véronique Hoste. Towards Shared Datasets for Normalization Research. In Nicoletta Calzolari (Conference Chair), Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis, editors, Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC’14). Reykjavik, Iceland, May 2014. European Language Resources Association (ELRA).

[6] Sarah Schulz. Named-Entity Recognition for User-Generated Content. In Proceedings of European Summer School in Logic Language and Computation 2014 Student Session. Springer, 2014.

[7] Sarah Schulz, Guy De Pauw, Orphée De Clercq, Bart Desmet, Véronique Hoste, Walter Daelemans, and Lieve Macken. Multimodular text normalization of dutch user-generated content. ACM Trans. Intell. Syst. Technol., July 2016. URL:, doi:10.1145/2850422.

[8] Sarah Schulz. The Taming of the Shrew - non-standard text processing in the Digital Humanities. PhD thesis, University of Stuttgart, 2018.

[9] Janis Pagel, Nils Reiter, Ina Rösiger, and Sarah Schulz. A Unified Annotation Workflow for Diverse Goals. In Sandra Kübler and Heike Zinsmeister, editors, Proceedings of the Workshop on Annotation in Digital Humanities, co-located with ESSLLI 2018. August 2018.

[10] Ina Roesiger, Sarah Schulz, and Nils Reiter. Towards coreference for literary text: analyzing domain-specific phenomena. In Proceedings of the Second Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, 129–138. Santa Fe, New Mexico, August 2018. Association for Computational Linguistics. URL:

[11] Sarah Schulz and Jonas Kuhn. Multi-modular domain-tailored OCR post-correction. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2716–2726. Copenhagen, Denmark, September 2017. Association for Computational Linguistics. URL:, doi:10.18653/v1/D17-1288.

[12] Nils Reiter, Sarah Schulz, Gerhard Kremer, Roman Klinger, Gabriel Viehhauser, and Jonas Kuhn. Teaching Computational Aspects in the Digital Humanities Program at University of Stuttgart – Intentions and Experiences. In Proceedings of the Workshop on Teaching NLP for Digital Humanities (Teach4DH 2017) co-located with GSCL 2017, 43–48. Berlin, Germany, September 2017.

[13] D. Doran, S.C. Schulz, and T. R. Besold. What does explainable ai really mean? a new conceptualization of perspectives. CEUR Workshop Proceedings, March 2018. Copyright © 2018 for this paper by its authors. Copying permitted for private and academic purposes. Proceedings of the First International Workshop on Comprehensibility and Explanation in AI and ML 2017 co-located with 16th International Conference of the Italian Association for Artificial Intelligence (AI*IA 2017) Bari, Italy, November 16th and 17th, 2017. URL:

[14] Sarah Schulz and Jonas Kuhn. Learning from within? comparing PoS tagging approaches for historical text. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC’16), 4316–4322. Portorož, Slovenia, May 2016. European Language Resources Association (ELRA). URL:

[15] Sarah Schulz and Mareike Keller. Code-switching ubique est - language identification and part-of-speech tagging for historical mixed text. In Proceedings of the 10th SIGHUM Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, 43–51. Berlin, Germany, August 2016. Association for Computational Linguistics. URL:, doi:10.18653/v1/W16-2105.

[16] Sarah Schulz and Nora Ketschik. From 0 to 10 million annotated words: part-of-speech tagging for Middle High German. Language Resources and Evaluation, 53(4):837–863, Dec 2019. URL:, doi:10.1007/s10579-019-09462-8.

[17] Schulz, Sarah and Seva, Jurica and Rodriguez, Samuel and Ostendorff, Malte and Rehm, Georg. Named Entities in Medical Case Reports: Corpus and Experiments. Proceedings of The 12th Language Resources and Evaluation Conference. URL: