NeuroXVocal: detection and explanation of alzheimer’s disease through non-invasive analysis of picture-prompted speech

Nikolaos Ntampakis, Konstantinos Diamantaras, Ioanna Chouvarda, Magda Tsolaki, Panagiotis Sarigianndis, Vasileios Argyriou

Research output: Contribution to conferencePaperpeer-review

Abstract

The early diagnosis of Alzheimer’s Disease (AD) through non-invasive methods remains a significant healthcare challenge. We present NeuroXVocal, the first end-to-end explainable AD classification system that achieves state-of-the-art performance while providing clinically interpretable explanations. Our novel dual-component architecture consists of: (1) Neuro, a multimodal classifier implementing a unique transformer based fusion strategy that projects acoustic, textual, and speech embeddings into a common dimensional space for complex cross-modal interactions; and (2) XVocal, a specialized RAG-based explainer that retrieves relevant clinical literature to generate evidence-based explanations. Unlike previous approaches using late fusion or simple concatenation, our architecture enables both robust classification and meaningful clinical insights. Using the IS2021 ADReSSo Challenge benchmark dataset, NeuroXVocal achieved 95.77% accuracy, significantly outperforming previous state-of-the-art. Medical professionals validated the clinical relevance of XVocal’s explanations through structured evaluation. This work advances beyond pure classification to bridge the gap between machine learning predictions and clinical decision-making.
Original languageEnglish
Number of pages10
Publication statusAccepted/In press - 17 Jun 2025
EventInternational Conference on Medical Image Computing and Computer Assisted Intervention - Daejeon, Korea, Democratic People's Republic of
Duration: 23 Sept 202527 Sept 2025
Conference number: 28
https://conferences.miccai.org/2025/en/

Conference

ConferenceInternational Conference on Medical Image Computing and Computer Assisted Intervention
Abbreviated titleMICCAI 2025
Country/TerritoryKorea, Democratic People's Republic of
CityDaejeon
Period23/09/2527/09/25
Internet address

Fingerprint

Dive into the research topics of 'NeuroXVocal: detection and explanation of alzheimer’s disease through non-invasive analysis of picture-prompted speech'. Together they form a unique fingerprint.

Cite this