Regularized bagged canonical component analysis for multiclass learning in brain imaging

Sevilla Salcedo, CarlosGómez Verdejo, VanessaTohka, JussiAlzheimer’s Disease Neuroimaging Initiative (ADNI)2022-01-102022-01-102020-10Sevilla-Salcedo, C., Gómez-Verdejo, V. & Tohka, J. (2020). Regularized Bagged Canonical Component Analysis for Multiclass Learning in Brain Imaging. Neuroinformatics, 18(4), 641–659.1539-2791https://hdl.handle.net/10016/33848Alzheimer’s Disease Neuroimaging Initiative (ADNI) is a Group/Institutional Author. Data used in preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report.A fundamental problem of supervised learning algorithms for brain imaging applications is that the number of features far exceeds the number of subjects. In this paper, we propose a combined feature selection and extraction approach for multiclass problems. This method starts with a bagging procedure which calculates the sign consistency of the multivariate analysis (MVA) projection matrix feature-wise to determine the relevance of each feature. This relevance measure provides a parsimonious matrix, which is combined with a hypothesis test to automatically determine the number of selected features. Then, a novel MVA regularized with the sign and magnitude consistency of the features is used to generate a reduced set of summary components providing a compact data description. We evaluated the proposed method with two multiclass brain imaging problems: 1) the classification of the elderly subjects in four classes (cognitively normal, stable mild cognitive impairment (MCI), MCI converting to AD in 3 years, and Alzheimer’s disease) based on structural brain imaging data from the ADNI cohort; 2) the classification of children in 3 classes (typically developing, and 2 types of Attention Deficit/Hyperactivity Disorder (ADHD)) based on functional connectivity. Experimental results confirmed that each brain image (defined by 29.852 features in the ADNI database and 61.425 in the ADHD) could be represented with only 30 − 45% of the original features. Furthermore, this information could be redefined into two or three summary components, providing not only a gain of interpretability but also classification rate improvements when compared to state-of-art reference methods.19eng© Springer Science+Business Media, LLC, part of Springer Nature 2020.Brain imagingCanonical correlation analysisFeature selectionMulticlass classificationRegularized bagged canonical component analysis for multiclass learning in brain imagingresearch articleTelecomunicacioneshttps://doi.org/10.1007/s12021-020-09470-yopen access6414659Neuroinformatics18AR/0000027574