
ePrints (Open Access)

A novel audio-visual model with real-time gradient modulation for mental disorder detection

NU author(s): Dr Yichun Li, Dr Mohsen Naqvi


Licence

This work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0).


Abstract

Mental disorders have recently emerged as one of the major contributors to global healthcare challenges. Deep learning methods based on fMRI and EEG have improved the efficiency and accuracy of detecting certain mental disorders. However, these methods often entail substantial costs for equipment and trained staff. Furthermore, most models are designed for a specific mental disorder rather than serving as potential tools for widespread screening. This paper focuses on the emotional expression features of mental disorders and introduces a diagnosis model based on audio-visual data. The proposed model incorporates a spatio-temporal (S-T) attention mechanism combined with Convolutional Neural Networks (CNNs) and employs Real-Time Gradient Modulation (RTGM). The model captures audio-visual features while dynamically adjusting the contributions of the two modalities during training to optimize performance for two mental disorders. Additionally, we introduce dynamically varying Gaussian noise to prevent the potential degradation of generalization ability caused by modulation. The effectiveness and feasibility of the proposed model are validated through comparative analyses of various networks, fusion strategies, and modulation methods across three datasets focused on the diagnosis and analysis of two mental disorders: ADHD and depression. The proposed model demonstrates state-of-the-art performance, achieving over 90% accuracy for ADHD classification and improving depression score estimation on AVEC 2013 and AVEC 2014.
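The abstract does not give the RTGM update rule, so the following is only a generic sketch of the idea it describes: damping the gradients of the currently dominant modality and adding Gaussian noise whose scale follows the current gradients. The function names, the tanh-based coefficient, the per-modality confidence scores, and the parameters `alpha` and `noise_scale` are all assumptions made for illustration and are not taken from the paper.

```python
import math
import random


def modulation_coefficients(score_audio, score_visual, alpha=1.0):
    """Return (k_audio, k_visual), each in [0, 1].

    Assumption: each score is a positive per-batch confidence for one
    modality. The modality with the higher score is damped; the other
    keeps a coefficient of 1.0.
    """
    ratio = score_audio / score_visual
    if ratio > 1.0:
        return 1.0 - math.tanh(alpha * (ratio - 1.0)), 1.0
    return 1.0, 1.0 - math.tanh(alpha * (1.0 / ratio - 1.0))


def modulate(grads, coeff, rng, noise_scale=1.0):
    """Scale gradients by coeff and add zero-mean Gaussian noise.

    The noise standard deviation follows the spread of the current
    gradients, so it varies from step to step (hypothetical choice).
    """
    mean = sum(grads) / len(grads)
    std = math.sqrt(sum((g - mean) ** 2 for g in grads) / len(grads))
    return [coeff * g + rng.gauss(0.0, noise_scale * std) for g in grads]


# Toy example: the audio branch is currently more confident than the visual one.
rng = random.Random(0)
k_a, k_v = modulation_coefficients(score_audio=0.9, score_visual=0.6)
audio_grads = modulate([0.4, -0.2, 0.1], k_a, rng)
visual_grads = modulate([0.05, -0.01, 0.02], k_v, rng)
```

In this toy setting the audio gradients are scaled down while the visual gradients keep their full magnitude, which is the balancing effect the abstract attributes to modulation. In a real training loop the same scaling would be applied to each encoder's parameter gradients between the backward pass and the optimizer step.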


Publication metadata

Author(s): Li Y, Li Y, Nair R, Naqvi SM

Publication type: Article

Publication status: Published

Journal: Biomedical Signal Processing and Control

Year: 2026

Volume: 120

Issue: B

Print publication date: 01/07/2026

Online publication date: 23/03/2026

Acceptance date: 18/03/2026

Date deposited: 24/03/2026

ISSN (print): 1746-8094

ISSN (electronic): 1746-8108

Publisher: Elsevier

URL: https://doi.org/10.1016/j.bspc.2026.110164

DOI: 10.1016/j.bspc.2026.110164

Data Access Statement: Data will be made available on request

