Recognising non-named spatial entities in literary texts: a novel spatial entities classifier

Daniel Kababgi*, Giulia Grisot, Federico Pennino, J. Berenike Herrmann

*Corresponding author for this work

Research output: Chapter in Book/Conference proceedingConference contributionpeer-review

Abstract

Predicting spatial representations in literature is a challenging task that requires advanced machine learning methods and manual annotations. In this paper, we present a study that leverages manual annotations and a BERT language model to automatically detect and recognise non-named spatial entities in a historical corpus of Swiss novels. The annotated data, consisting of Swiss narrative texts in German from the period of 1840 to 1950, was used to train the machine learning model and fine-tune a deep learning model specifically for literary German. The annotation process, facilitated by the use of Prodigy, enabled iterative improvement of the model’s predictions by selecting informative instances from the unlabelled data. Our evaluation metrics (F1 score) demonstrate the model’s ability to predict various categories of spatial entities in our corpus. This new method enables researchers to explore spatial representations in literary text, contributing both to digital humanities and literary studies. While our study shows promising results, we acknowledge challenges such as representativeness of the annotated data, biases in manual annotations, and domain-specific language. By addressing these limitations and discussing the implications of our findings, we provide a foundation for future research in sentiment and spatial analysis in literature. Our findings not only contribute to the understanding of literary narratives but also demonstrate the potential of automated spatial analysis in historical and literary research.
Original languageEnglish
Title of host publicationProceedings of the Computational Humanities Research Conference 2024
EditorsWouter Haverals, Marijn Koolen, Laure Thompson
Place of PublicationAahrus, Denmark
Publication statusPublished - 20 Nov 2024
EventCHR 2024 - Computational Humanities Research 2024 - Aarhus University, Aarhus, Denmark
Duration: 3 Dec 20246 Dec 2024
https://2024.computational-humanities-research.org/

Conference

ConferenceCHR 2024 - Computational Humanities Research 2024
Country/TerritoryDenmark
CityAarhus
Period3/12/246/12/24
Internet address

Fingerprint

Dive into the research topics of 'Recognising non-named spatial entities in literary texts: a novel spatial entities classifier'. Together they form a unique fingerprint.

Cite this