Publication:
A saliency-based attention LSTM model for cognitive load classification from speech

Loading...
Thumbnail Image
Identifiers
Publication date
2019
Defense date
Advisors
Tutors
Journal Title
Journal ISSN
Volume Title
Publisher
International Speech Communication Association (ISCA)
Impact
Google Scholar
Export
Research Projects
Organizational Units
Journal Issue
Abstract
Cognitive Load (CL) refers to the amount of mental demand that a given task imposes on an individual's cognitive system and it can affect his/her productivity in very high load situations. In this paper, we propose an automatic system capable of classifying the CL level of a speaker by analyzing his/her voice. Our research on this topic goes into two main directions. In the first one, we focus on the use of Long Short-Term Memory (LSTM) networks with different weighted pooling strategies for CL level classification. In the second contribution, for overcoming the need of a large amount of training data, we propose a novel attention mechanism that uses the Kalinli's auditory saliency model. Experiments show that our proposal outperforms significantly both, a baseline system based on Support Vector Machines (SVM) and a LSTM-based system with logistic regression attention model.
Description
Proceeding of: Interspeech 2019, 20th Annual Conference of the International Speech Communication Association, Graz, Austria, 15-19 September 2019
Keywords
Cognitive load, Speech, LSTM, Weigthed pooling, Auditory saliency, Attention model
Bibliographic citation
Interspeech 2019, 20th Annual Conference of the International Speech Communication Association, Graz, Austria, 15-19 September 2019 [Proceedings]. ISCA, 2019, Pp. 216-220