Uncategorized

Out now: Our new demonstrator tool TWONderland

In the past weeks, our TWON researcher Fabio Sartori (KIT) and his colleagues worked on a new demonstrator tool to make the dynamics of Online Social Networks tangible for the broad public. The result is: TWONderland!

In our simulation TWONderland, we assign the user the job as the lead designer of a new Online Social Network. In a playful and interactive way, users explore how as the platform designer, they influence the interaction on the platform and how even the tiniest design choices can ripple out to shape behavior, sentiments and relationships between the users – and potentially spark fragmentation and fuel polarization.

Unique about this demonstrator is the step-by-step walkthrough of the functionalities of Online Social Networks (OSNs). The user starts by assigning moods – from aggressive to calm – to fictive platform users. We then visualize how their fictive users are connected to each other on the platform, and how their moods adapt as they are confronted with posts of each other. In TWONderland, every OSN user participates within a specific sentiment corridor, meaning that they will interact with and adapt to other users as long as their differences in sentiment are not too significant. Here, for instance, a very calm user would not immediately interact with somebody who is very aggressive. However, in our demonstrator, we visualize that the sentiment on a platform can still shift in positive and negative directions gradually. These network dynamics were modelled based on the Axelrod model (for further information and technicalities please refer to our Deliverable).

After getting an understanding of network dynamics, the user is asked to experiment with alternative platform mechanisms that determine what users (and their moods) influence their own fictive platform user. Based on the ranking algorithms the user sets, posts with different moods – again, aggressive to calm – will become visible to their fictive character, which influence their mood. From this individual level, the demonstrator then moves on to visualizing bigger networks in which many users influence each other based on the designated platform mechanics. To understand how users influence each other’s mood on OSNs, the user can run comparative simulations and experiment how polarization is fueled or minimized only through the ranking mechanics.

New paper by TWON researcher Simon Münker: Fingerprinting LLMs through Survey Item Factor Correlation: A Case Study on Humor Style Questionnaire

We are proud to announce that our researcher Simon Münker published a new paper with the title: Fingerprinting LLMs through Survey Item Factor Correlation: A Case Study on Humor Style Questionnaire. It is published in the Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing and the results will be presented in Shanghai on 5 November.

LLMs increasingly engage with psychological instruments, yet how they represent constructs internally remains poorly understood. Simon Münker introduces a novel approach to “fingerprinting” LLMs through their factor correlation patterns on standardized psychological assessments to deepen the understanding of LLMs constructs representation. Using the Humor Style Questionnaire as a case study, he analyzes how six LLMs represent and correlate humor-related constructs to survey participants. His results show that they exhibit little similarity to human response patterns. In contrast, participants’ subsamples demonstrate remarkably high internal consistency. Exploratory graph analysis further confirms that no LLM successfully recovers the four constructs of the Humor Style Questionnaire. These findings suggest that despite advances in natural language capabilities, current LLMs represent psychological constructs in fundamentally different ways than humans, questioning the validity of application as human simulacra.

It’s a wrap: CitizenLab 2025 in Chemnitz

On 8 October, we hosted another CitizenLab in the Stadthallenpark in Chemnitz, where we got to speak with citizens about our research on Online Social Networks.

We presented our demonstrators MicroTWONY, MacroTWONY, and TWONderland to interested citizens and participants, had inspiring conversations about the impact of Online Social Networks on society and democracy, as well as possibilities for regulation and ethical design. We are glad to see how many participants enjoyed experimenting with the demonstrators and exploring how digital dynamics become tangible!

In the evening, we joined an interesting event on memory culture in digital spaces at the NSU Documentation Center with TWON researcher Jonas Fegert, journalist Nhi Le and Susanne Siegert from the channel @keineerinnerungskultur, moderated by Benjamin Fischer. The discussion focused on the opportunities social networks offer for democratic education, especially for younger audiences, and on the limitations imposed by platform mechanisms that tend to amplify hate speech and misinformation.

A day full of dialogue, reflection, and future perspectives – thank you for everybody who was a part of it, and we’re looking forward to the next CitizenLab!

Damian Trilling Discusses Platform Data Access at ERC Workshop

TWON researcher Damian Trilling was invited to an ERC workshop on Article 40 of the Digital Services Act (DSA), held in October 2025 in Brussels. At the workshop, he shared insights from the Twin of Online Social Networks project with EU policy makers, focusing on the regulatory conditions needed to support research on platform dynamics.

Drawing on TWON’s experiences, Damian Trilling highlighted the importance of reliable data access for studying online social networks and developing digital twins. In particular, he addressed the challenges posed by limited access to platform data, which can affect the calibration and evaluation of digital twin models.

The exchange contributed to ongoing discussions about how regulatory frameworks such as the DSA can support independent research, transparency and accountability in the study of digital platforms.

TWON x ZKM Karlsruhe

On 24 October, TWON researchers Achim Rettinger and Simon Münker joined the symposium “The Fabrication of Truth” at ZKM Karlsruhe, which focused on fake news, deepfakes, and the post-factual society. The event offered an opportunity to engage with visitors and present macroTWONy, a TWON demonstrator that explores how network topology and algorithmic sorting strategies influence collective opinion formation and emotional polarization.

By connecting current debates on misinformation and digital public spheres with interactive research tools, the symposium highlighted the importance of understanding how online environments shape social dynamics and public discourse. Visitors were invited to explore macroTWONy and gain insights into the mechanisms that contribute to opinion formation and polarization in online social networks.

New publication: Can we use automated approaches to measure the quality of online political discussion?

We’re proud to announce that our consortium members Sjoerd Stolwijk, Damian Trilling (both University of Amsterdam) and Simon Münker (Trier University) contributed to a freshly published paper on measuring the debate quality of online political discussions. The paper was released in the “Communication Methods and Measures” journal by Routledge and is open access.

Our researchers review how debate quality has been measured in communication science, and systematically compare 50 automated metrics against numerous manually coded comments. Based on their experiments, they were able to give clear recommendations for how to (not) measure debate quality in terms of interactivity, diversity, rationality, and (in)civility according to Habermas.

Their results show that transformer models and generative AI (like Llama and GPT-models) outperform older methods, yet there is variance and the success depends on the measured concept, as some (e.g. rationality) remain difficult to capture also by human coding. Which measure should be preferred for future empirical applications is likely dependent on the
objective of the study in question. For some genres, language and communication style (e.g. satire), it is strongly advised to test the accuracy of automated methods against the human interpretation beforehand, even if methods are widely used. Some approaches and implementations performed so poorly that they are not suitable for studying debate quality.

Event Recap

TWON researcher Ljubisa Bojic participated in the DGPuK & ICA conference on Digital Communication and Human-Machine Communication, held from 15 to 17 September at Technische Universität Dresden. Under the theme “Machines as (new) actors in digital communication: challenges and opportunities for science and society,” the conference explored how AI systems shape communication, society and knowledge.

As part of the conference programme, Ljubisa Bojic presented insights from his research in the TWON project on social stereotypes and attitudes in large language models. His presentation examined how LLMs evaluate different social groups, including women, men, refugees, asylum seekers and economic migrants, along the stereotype dimensions of warmth and competence.

The findings indicate significant differences in how LLMs assess these groups. Statistical analysis confirms that these differences are highly significant, with particularly large variations in competence scores for refugees and economic migrants. The presentation also highlighted interaction effects between model and language, showing that both the choice of LLM and the language used can shape stereotype-related outcomes.

By addressing social stereotypes in LLMs, the presentation contributes to ongoing discussions on the societal implications of AI systems and the challenges of ensuring fairness, accountability and transparency in digital communication.

Zero-shot prompt-based classification @ACL Vienna

Simon Münker recently presented his research on the use of zero-shot, prompt-based classification for analysing political discourse on German Twitter during the European energy crisis at the 2025 Association for Computational Linguistics Conference in Vienna. He gave a poster presentation and a talk about his newly published paper.

In their paper, Dr. Achim Rettinger, Kai Kugler and Simon Münker assess advancements in NLP, specifically large foundation models, for automating annotation processes on German Twitter data concerning European crises.

The study explores how recent advances in large language models (LLMs) can reduce the need for long manual work when labeling and categorizing social media content. Instead of training models with thousands of examples, LLMs can follow written prompts to classify tweets in a zero-shot setting, meaning without prior training on the specific task.

The dataset used was collected from a German Twitter dataset based on survey questions from the SOSEC project about the energy crisis in winter 2022/23. Two domain experts and native speakers annotated a random sample of around 7,000 tweets.

The models that were evaluated included: a baseline Naive Bayes classifier using token counts; a fine-tuned German-specific BERT transformer (“gbert-base”)- a model further adapted with additional pretraining on domain-specific tweets to improve domain relevance; and instruction-tuned models based on T5, which follow prompts to classify texts without domain-specific fine-tuning using zero-shot prompting techniques.

The results show that prompt-based approaches perform almost as well as fine-tuned BERT models. The study therefore concludes that a prompt-based approach can achieve comparable performance to fine-tuned BERT without requiring annotated training data.

However, the study also emphasizes limitations such as the inherited and potentially amplified biases present in the training data and differences in outcomes related to the language used (German/English), as well as cultural nuances.

Automating the analysis of political and social debates raises questions about the role AI can and should play in interpreting sensitive public discourse.

TWON Research Presented in Tirana

TWON researcher Sjoerd Stolwijk presented work from the Twin of Online Social Networks project during a visit to Tirana, Albania. The visit took place in the context of academic cooperation between Vrije Universiteit Amsterdam and the University of Elbasan “Aleksandër Xhuvani.”

The exchange highlighted the importance of international research collaboration and knowledge diplomacy. Research and education can serve as bridges between cultures and institutions, fostering dialogue, cooperation and mutual understanding across Europe.

Damian Trilling Presents TWON Research at ICA Denver

TWON researcher Damian Trilling represented the Twin of Online Social Networks project at the International Communication Association (ICA) conference in Denver, Colorado. His talk focused on the use of LLM-based agents in simulations and contributed to the wider panel discussion on Generative AI for Computational Communication Research.

The session attracted strong interest, with around 100 attendees joining the talk and discussion. Damian Trilling’s contribution highlighted how generative AI and agent-based approaches can open new pathways for computational communication research, while also raising important methodological questions for the field.