GermEval@KONVENS 2025

Candy speech detection is an official GermEval shared task co-located with the KONVENS 2025 conference.

Background

The task is to identify expressions of candy speech („Flausch“) in online posts (YouTube comments). We define candy speech as expression of positive attitudes on social media toward individuals or their output (videos, comments, etc.). The purpose of candy speech is to encourage, cheer up, support and empower others. It can be viewed as the counterpart to hate speech, as it also aims to influence the self-image of the target person or group, but in a positive way.

Motivation

Numerous methods have been developed for detecting and censoring negative speech (e.g., hate speech or offensive or harmful language) on social media platforms. However, there is much less focus on identifying and promoting positive supportive discourse in online communities. Our shared task aims to address this gap and encourage researchers to focus on such positive expressions.

Task Details

Candy speech detection is the task of identifying the presence of candy speech (on the span level) in a given YouTube comment and classifying each such expression in one of the predefined categories. This shared task focuses on German speaking YouTube communities. Participants will be provided with a dataset of YouTube comments manually annotated for different types of candy speech.

The shared task includes the following two subtasks:

Subtask 1: Coarse-Grained Classification

The goal of this task is to identify whether the given comment contains candy speech or not. The dataset is manually annotated for the presence of candy speech (binary classification task).

Subtask 2: Fine-Grained Classification

The goal of this subtask is to identify the span of each candy speech expression in a given comment and classify it into one of the predefined categories. The dataset is manually annotated for ten different types of candy speech expressions, such as “positive feedback”, “compliment”, “group membership” etc.

Approaches and Evaluation

Please note that we see the shared task not primarily as a pure (machine learning) engineering task, but as an opportunity for computational linguistic exploration. We therefore encourage participants to focus on the creative use of (linguistic) analysis, approaches, tools, and external ressources.

In principle, all model types and approaches are allowed, but we prefer the use of open-source systems and models and freely available ressources to facilitate reproducibility.

All submissions will be evaluated by the shared task organizers. We provide evaluation scripts based on precision, recall and f1 score (available with the data as specified below). The primary evaluation metric will be f1 score for Subtask 1 and a strict span-based f1 score for Subtask 2, though other metrics will be provided and analyzed to judge submissions and for development.

Data

We will provide the participants with the annotated training (and development) and unlabeled test datasets containing complete written, German language comment threads under YouTube videos posted by different content creators. The content creators and communities vary in topic, style, age group, etc. The training and test datasets do not overlap in terms of YouTube videos. Furthermore, the test dataset mostly contains (comments on) videos from content creators that differ from those in the training dataset. The communities commenting on these videos can therefore be expected to vary.

The data for the shared task is hosted on OSF.

Sample Data

YouTube comment	Candy speech (Subtask 1)	Candy speech type (Subtask 2)
hahahahah voll cool . der lehrer hat auswehrsehen das video angeklickt war voll geil	yes	positive feedback [hahahahah voll cool],[war voll geil]
lehrer neven ganz übel ich beebde den unterricht ( facepalm )	no
cool wie immer . Macht weiter so :)	yes	positive feedback [cool wie immer]; encouragement [Macht weiter so :)]
+ Lu Spindler ran an die Sportklamotten ! 😁	no
Omg 😍 omg 😍 omg 😍 das video ist einfach so shönn geworden aww deine augen sind so sc Hönn 😍 😍 ich liebe diich und deine videos yoh sehr ❤ love you LuNa 😍 😘	yes	positive feedback [Omg 😍 omg 😍 omg 😍], [das video ist einfach so shönn geworden]; compliment [aww deine augen sind so sc Hönn 😍 😍]; positive feedback/affection declaration [ich liebe diich und deine videos yoh sehr ❤]; affection declaration [love you LuNa 😍 😘]

Important dates

Trial data available: February 15, 2025
Training data available: March 3, 2025
Test data available: June 10, 2025
Evaluation start: June 16, 2025
Evaluation end: ~~June 27, 2025~~ June 28, 2025
Survey submission due: July 7, 2025
Paper submission due: July 11, 2025
Camera ready due: August 15, 2025
GermEval workshop: September 10, 2025 (co-located with KONVENS)

All deadlines are 23:59 UTC+0.

Participation

You can register on CodaBench for the subtask you want to participate in: Subtask 1/Subtask 2

Participants’ responses to the survey can be found here

Presentation slides: AIxcellentVibes, HHUflauschig

Posters: Coling-UniA, Die SuperGLEBer, NLP_Augsburg_04, TUM NLP Group

Workshop - September 10, 2025

Schedule

Time	Activity
10:30 AM – 11:30 AM	Oral Session
10:30 AM – 10:45 AM	Welcome + Overview by the Organizers
10:45 AM – 11:00 AM	AIxcellent Vibes: Improving Model Performance by Span-Level Training
11:00 AM – 11:15 AM	HHUflauschig: Hybrid Approaches for Binary Classification and Span Typing
11:15 AM – 11:30 AM	Open Discussion / Q&A
11:30 AM – 12:30 PM	GermEval Joint Poster Session

Poster Format

Please prepare your poster in one of the following formats:

A1: portrait or landscape orientation
A0: landscape orientation only

Organizers

Yulia Clausen, Ruhr-Universität Bochum, Germany
Tatjana Scheffler, Ruhr-Universität Bochum, Germany
Michael Wiegand, Universität Wien, Austria

Contact: germeval-2025-candy-speech@ruhr-uni-bochum.de

This shared task is conducted as part of the project D03, CRC 1567 “Virtual Lifeworlds”.