Skip to main navigation Skip to search Skip to main content

NeuroXVocal: detection and explanation of alzheimer’s disease through non-invasive analysis of picture-prompted speech

  • Nikolaos Ntampakis
  • , Konstantinos Diamantaras
  • , Ioanna Chouvarda
  • , Magda Tsolaki
  • , Panagiotis Sarigianndis
  • , Vasileios Argyriou
  • International Hellenic University
  • MetaMind Innovations P.C.
  • Greek Association of Alzheimer’s Disease & Related Disorders
  • University of Western Macedonia
  • Aristotle University of Thessaloniki

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The early diagnosis of Alzheimer’s Disease (AD) through non-invasive methods remains a significant healthcare challenge. We present NeuroXVocal, the first end-to-end explainable AD classification system that achieves state-of-the-art performance while providing clinically interpretable explanations. Our novel dual-component architecture consists of: (1) Neuro, a multimodal classifier implementing a unique transformer-based fusion strategy that projects acoustic, textual, and speech embeddings into a common dimensional space for complex cross-modal interactions; and (2) XVocal, a specialized RAG-based explainer that retrieves relevant clinical literature to generate evidence-based explanations. Unlike previous approaches using late fusion or simple concatenation, our architecture enables both robust classification and meaningful clinical insights. Using the IS2021 ADReSSo Challenge benchmark dataset, NeuroXVocal achieved 95.77% accuracy, significantly outperforming previous state-of-the-art. Medical professionals validated the clinical relevance of XVocal’s explanations through structured evaluation. This work advances beyond pure classification to bridge the gap between machine learning predictions and clinical decision-making. Code available at: https://github.com/NNtamp/NeuroXVocal.

Original languageEnglish
Title of host publicationMedical Image Computing and Computer Assisted Intervention, MICCAI 2025 - 28th International Conference, 2025, Proceedings
EditorsJames C. Gee, Jaesung Hong, Carole H. Sudre, Polina Golland, Jinah Park, Daniel C. Alexander, Juan Eugenio Iglesias, Archana Venkataraman, Jong Hyo Kim
PublisherSpringer Cham
Pages410-419
Number of pages10
Volume15973
ISBN (Electronic)9783032051851
ISBN (Print)9783032051844
DOIs
Publication statusPublished - 20 Sept 2025

Publication series

NameLecture Notes in Computer Science
Volume15973 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Keywords

  • Alzheimer
  • Explainable Healthcare AI
  • Multimodal

Fingerprint

Dive into the research topics of 'NeuroXVocal: detection and explanation of alzheimer’s disease through non-invasive analysis of picture-prompted speech'. Together they form a unique fingerprint.

Cite this