NeuroXVocal: detection and explanation of alzheimer’s disease through non-invasive analysis of picture-prompted speech

  • Nikolaos Ntampakis
  • , Konstantinos Diamantaras
  • , Ioanna Chouvarda
  • , Magda Tsolaki
  • , Panagiotis Sarigianndis
  • , Vasileios Argyriou

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The early diagnosis of Alzheimer’s Disease (AD) through non-invasive methods remains a significant healthcare challenge. We present NeuroXVocal, the first end-to-end explainable AD classification system that achieves state-of-the-art performance while providing clinically interpretable explanations. Our novel dual-component architecture consists of: (1) Neuro, a multimodal classifier implementing a unique transformer based fusion strategy that projects acoustic, textual, and speech embeddings into a common dimensional space for complex cross-modal interactions; and (2) XVocal, a specialized RAG-based explainer that retrieves relevant clinical literature to generate evidence-based explanations. Unlike previous approaches using late fusion or simple concatenation, our architecture enables both robust classification and meaningful clinical insights. Using the IS2021 ADReSSo Challenge benchmark dataset, NeuroXVocal achieved 95.77% accuracy, significantly outperforming previous state-of-the-art. Medical professionals validated the clinical relevance of XVocal’s explanations through structured evaluation. This work advances beyond pure classification to bridge the gap between machine learning predictions and clinical decision-making.
Original languageEnglish
Title of host publicationMedical Image Computing and Computer Assisted Intervention – MICCAI 2025
EditorsJames C. Gee, Daniel C. Alexander, Jaesung Hong, Juan Eugenio Iglesias, Carole H. Sudre, Archana Venkataraman, Polina Golland, Jong Hyo Kim, Jinah Park
PublisherSpringer Cham
Pages410-419
Number of pages10
Volume15973
ISBN (Electronic)9783032051851
ISBN (Print)9783032051844
DOIs
Publication statusPublished - 20 Sept 2025

Publication series

NameMICCAI : International Conference on Medical Image Computing and Computer-Assisted Intervention
PublisherSpringer Cham
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Fingerprint

Dive into the research topics of 'NeuroXVocal: detection and explanation of alzheimer’s disease through non-invasive analysis of picture-prompted speech'. Together they form a unique fingerprint.

Cite this