Franziska Scheurer – DFKI – Interactive Machine Learning Lab

Human-Centred Active Learning through Visual Analytics

Machine learning models are powerful, yet they often sideline the domain experts who understand the data best. In conventional active learning pipelines, the model drives the process while the user simply responds, leaving domain knowledge underutilised and treating the annotator as a passive responder rather than an active contributor. This Read more

By Franziska Scheurer, 4 weeksJune 30, 2026 ago

Computational Sustainability

Interactive Weak Supervision for Transferring Sound Libraries to Passive Acoustic Monitoring

Passive Acoustic Monitoring (PAM) enables continuous and non-invasive biodiversity monitoring, but analysing large acoustic datasets remains difficult because sound event detectors usually require temporally precise annotations. Creating such instance-level labels is expensive and requires expert knowledge. At the same time, museum and community-run sound libraries provide large numbers of animal Read more

By Franziska Scheurer, 1 monthJune 24, 2026 ago

Machine Learning

4D reasoning from demonstration data for VLA

Visual-Language-Action (VLA) models are typically trained through imitation learning, which teaches policies to reproduce demonstrated actions but provides limited supervision about the conditions that define task success. We propose a framework that automatically extracts executable 3D task verifiers from demonstrations and uses them to improve policy learning beyond imitation. Given Read more

By Franziska Scheurer, 1 monthJune 22, 2026 ago

Computational Sustainability

Machine Learning for Passive Acoustic Wildlife Monitoring: Methods for Semi-Automated Population and Species Assessment

Passive acoustic monitoring (PAM) has become a powerful tool for studying wildlife by continuously recording environmental soundscapes. However, analysing large acoustic datasets remains highly time-consuming, as recordings are often annotated manually by domain experts. In this work, we investigate how machine learning can support scalable biodiversity monitoring by enabling efficient Read more

By Franziska Scheurer, 2 monthsJune 5, 2026 ago

Computational Sustainability Machine Learning Natural Language Processing

Grounded Label Space Engineering for Knowledge-Centric Annotation Workflows

Building reliable AI models depends not only on how much data is annotated, but on the quality and meaning of the labels used during annotation. In many workflows, labels are flat, task-specific class names. They are easy to apply, but lack explicit semantic structure, provenance, and links to shared domain Read more

By Franziska Scheurer, 2 monthsJune 3, 2026 ago

Computational Sustainability

IQUANA: Efficient Image Annotation and Quantification

Image annotation remains a significant bottleneck in image analysis pipelines. In research settings especially, annotating large image corpora demands substantial effort, often forcing teams to compromise on quality by resorting to coarser methods like point annotations rather than full outlines. To address this, we collaborated with the Helmholtz Institute for Read more

By Franziska Scheurer, 2 monthsJune 2, 2026 ago

Natural Language Processing

Explainable Biomedical Claim Verification (Accenture)

In the Autoprompt project funded by a grant from Accenture, one of the world’s leading consulting, technology and outsourcing companies, we focus on developing automated biomedical claim verification systems designed to assist clinicians and researchers in addressing the risks posed by misinformation in the healthcare domain. By providing accurate, evidence-based Read more

By Franziska Scheurer, 2 yearsJanuary 20, 2025 ago

Natural Language Processing

Investigating Natural Language Inference Capabilities of Large Language Modes in Biomedical Claim Verification

Left: Examples from HealthVer [1]; Right: Example of a claim that is supported and refuted by different evidence [2] With the rapid growth of biomedical research and the concurrent rise in misinformation, ensuring the accuracy of claims about treatment effectiveness is increasingly critical. Inaccurate or misleading information can have profound Read more

By Franziska Scheurer, 2 yearsDecember 6, 2024 ago

Natural Language Processing

Optimizing Relation Extraction in Medical Texts through Active Learning: A Comparative Analysis of Trade-offs

Example from n2c2 of relation extraction [1] This work explores the effectiveness of employing Clinical BERT for Relation Extraction (RE) tasks in medical texts within an Active Learning (AL) framework. Our main objective is to optimize RE in medical texts through AL while examining the trade-offs between performance and computation Read more

By Franziska Scheurer, 2 yearsDecember 6, 2024 ago

Natural Language Processing

Building A German Clinical Named Entity Recognition System without In-domain Training Data

Clinical Named Entity Recognition (NER) is essential for extracting important medical insights from clinical narratives. Given the challenges in obtaining expert training datasets for real-world clinical applications related to data protection regulations and the lack of standardised entity types, this work represents a collaborative initiative aimed at building a German Read more

By Franziska Scheurer, 2 yearsDecember 6, 2024 ago