GermEval@KONVENS 2025

Candy speech detection (Flausch-Erkennung)

GermEval 2025 features a task on candy speech detection, co-located with KONVENS.

Data
Important dates

Background

The task is to identify expressions of candy speech („Flausch“) in online posts (YouTube comments). We define candy speech as an expression of positive attitudes in social media toward individuals or their output (videos, comments, etc.). The purpose of candy speech is to encourage, cheer up, support and empower others. It can be viewed as the counterpart to hate speech, as it also aims to influence the self-image of the target person or group, but in a positive way.

Motivation

Numerous methods have been developed for detecting and censoring negative speech (e.g., hate speech or offensive or harmful language) on social media platforms. However, there is much less focus on identifying and promoting positive supportive discourse in online communities. Our shared task aims to address this gap and encourage researchers to focus on such positive expressions.

Task Details

Candy speech detection is the task of identifying the presence of candy speech (at the span level) in a given YouTube comment thread and classifying each expression in one of the predefined categories. This shared task focuses on German speaking YouTube communities. Participants will be provided with a dataset of YouTube comments manually annotated for different types of candy speech expressions.

The shared task includes the two following subtasks:

Subtask 1: Coarse-Grained Classification

The goal of this task is to identify whether the given comment contains candy speech or not. The dataset is manually annotated for the presence of candy speech (binary classification task).

Subtask 2: Fine-Grained Classification

The goal of this subtask is to identify the span of each candy speech expression in a given text and classify it in one of the predefined categories. The dataset is manually annotated for ten different types of candy speech expressions, such as “positive feedback”, “compliment”, “group membership” etc.

Data

We will provide the participants with annotated training (and development) and unlabeled test datasets containing complete written, German language comment threads under YouTube videos posted by different content creators. The content creators and communities vary in topic, style, age group, etc. The test data and training data do not overlap wrt. to the original content creator of the video – the communities commenting on the videos can therefore be expected to differ.

Sample Data

YouTube comment Candy speech (Subtask 1) Candy speech type (Subtask 2)
hahahahah voll cool . der lehrer hat auswehrsehen das video angeklickt war voll geil yes positive feedback [hahahahah voll cool],[war voll geil]
lehrer neven ganz übel ich beebde den unterricht ( facepalm ) no
cool wie immer . Macht weiter so :) yes positive feedback [cool wie immer]; encouragement [Macht weiter so :)]
+ Lu Spindler ran an die Sportklamotten ! 😁 no
Omg 😍 omg 😍 omg 😍 das video ist einfach so shönn geworden aww deine augen sind so sc Hönn 😍 😍 ich liebe diich und deine videos yoh sehr ❤ love you LuNa 😍 😘 yes positive feedback [Omg 😍 omg 😍 omg 😍], [das video ist einfach so shönn geworden]; compliment [aww deine augen sind so sc Hönn 😍 😍]; positive feedback/affection declaration [ich liebe diich und deine videos yoh sehr ❤]; affection declaration [love you LuNa 😍 😘]

Important dates

All deadlines are 23:59 UTC-12 (“anywhere on Earth”)

Organizers

Contact: germeval-2025-candy-speech@ruhr-uni-bochum.de