Third Austrian Meeting on Digital Linguistics
When: Friday, 5. December 2025
This workshop is co-located with the 49. Österreichische Linguistiktagung (ÖLT), taking place in Klagenfurt, Austria from 5. - 8. December 2025.
Where: University of Klagenfurt
ÖLT Room number 9, Room number (Uni) N.0.27 (30)
Registration: closed
Organisation: Tanja Wissik - ( Austrian Academy of Sciences ), Andreas Baumann ( University of Vienna ), Julia Neidhardt (TU Vienna), Claudia Posch ( University of Innsbruck ), Gerhard Rampl ( University of Innsbruck )
Digital linguistics is a growing interdisciplinary field at the intersection of linguistics, information technology, and the social sciences. This is reflected by a growing number of new projects, publication series, and university courses. A central focus of digital linguistics is language data, i.e., digital artifacts that use human language as a form of expression. The range of these language data includes social media content, parliamentary transcripts, newspapers, and medieval manuscripts, among others. Such data are processed, annotated, analyzed, curated, shared, archived, and reused, among other activities. Also new technologies such as large language models (LLMs) and generative AI play a growing role in digital linguistics. Therefore, the topics covered in this workshop span from the creation of digital language resources (corpora, dictionaries, etc.), new methods (application of LLMs and generative AI), analysis of language data (e.g., semantic change detection, emotion and sentiment analysis), to the use of standards and research infrastructures, as well as methods for long-term archiving or reuse of language data.
The variety of research in this field in Austria was shown during the first Austrian Meeting on Digital Linguistics and the second Austrian meeting on Digital Linguistics , as well as within the context of the previous Austrian Meeting on Sentiment Inferenz (ÖTSI 2021, 2023) , where 37 researchers from different Austrian and international research institutions presented their projects.
This year’s workshop “Third Austrian Meeting on Digital Linguistics” is a continuation of this workshop series, organized in the framework of CLARIAH-AT. Again, the aim of the workshop is to highlight recent developments in the Austrian research landscape and to connect different projects working with or on methods in digital linguistics, as well as the researchers involved. The workshop aims to facilitate the exchange of methodological insights and the creation of synergies through the mutual sharing of digital language resources, also within the framework of the research infrastructure CLARIAH-AT. Furthermore, the workshop also addresses international researchers, who are working in the field of digital linguistics and who want to present their research and exchange and connect with the Austrian research community.
Programme
The workshop programme is composed of 14 presentations, which were peer-reviewed. The book of abstracts is available below.
| Time | Room number ÖLT: 9 / room number (Uni) N.0.27 (30) |
|---|---|
| 9:00 - 9:05 | ”Introduction” - Tanja Wissik, Julia Neidhardt, Andreas Baumann |
| 9:05 - 9:30 | ”Interoperable Corpora of Historical Newspapers: the PressMint Project” - Tanja Wissik, Maciej Ogrodniczuk, Petya Osenova |
| 9:30 - 10:00 | ”Processing Digitized Text on an Example of Job Advertisements from Austrian Periodicals from 1850-1950” - Klara Venglerova |
| 10:00 - 10:30 | ”Potential of Generative AI for Text Transcription” - Jona Marie Hassenbach, Magdalena Miteva |
| 10:30 - 11:00 | Pause |
| 11:00 - 11:30 | ”Large language models as synthetic participants in psycholinguistic experiments: the case of German noun plural formation.” - Teodor Petrič |
| 11:30 - 12:00 | ”Analysis of Word Order Biases in Language Models: A Controlled Investigation Using Artificial and Natural Languages” - Varvara Arzt, Allan Hanbury, Terra Blevins |
| 12:00 - 12:15 | ”The reliability of detecting and exploring basic emotions in short social media texts using the BEMDI-metre” (short paper) - Natalia Borza |
| 12:15 - 12:30 | ”Phylogenetic analysis of folktale evolution: Reconstructing cultural transmission through computational Methods” (short paper) - Edlira Gugu |
| 12:30 - 14:30 | Pause |
| 14:30 - 15:00 | ”The rise of present participial -ing in Middle English: Reducing ambiguity in word structure Signals” - Michelle van de Bilt, Florian Jung, Matthis Hupertz, Irene Böhm, Nikolaus Ritt |
| 15:00 - 15:30 | ”Kommunikationsverben als Indikatoren für situativen Kontext – Ein Forschungsprojekt zu Indexikalität in Korpusanalyse und Sprachwissen” - Cordula Meißner, Janina Deilke, Anna-Lena Randermann |
| 15:30 - 16:00 | ”Philosophical considerations of digital text analysis: Core assumptions and methodological challenges” - Rashid Mustafin |
| 16:00 - 16:30 | Pause |
| 16:30 - 16:45 | ”Vergleichende Analyse von nominierten Texten des Ingeborg-Bachmann-Preises” (short paper) - Katharina Horn, Fabian Navarro, Jasmin Bettstein |
| 16:45 - 17:00 | ”Heatmap-based visualisation of the linguistic variation (on the material of East Slavic small territorial lects)” (sort paper) - Ilia Afanasev |
| 17:00 - 17:30 | ”Regional Semantic Change and Variation in Ukrainian with LLMs” - Nataliia Cheilytko |
| 17:30 - 18:00 | ”Linguistic diversity and digitalization: a progress report” - Juliane Benson, Julia Neidhardt, Katharina Zeh, Andreas Baumann, Hannes Essfores, Hannes Fellner |
related links: