Recognizing Emotional States Using Speech Information

Book Chapter
Papakostas, M., Siantikos, G., Giannakopoulos, T., Spyrou, E., & Sgouropoulos, D.
Emotion recognition plays an important role in several applications, such as human-computer interaction and understanding the affective state of users in certain tasks, e.g., within a learning process, monitoring of the elderly, interactive entertainment, etc. It may be based on several modalities, e.g., analyzing facial expressions and/or speech, or using electroencephalograms, electrocardiograms, etc. In certain applications the only available modality is the user’s (speaker’s) voice. In this paper we aim to analyze speakers’ emotions based solely on paralinguistic information, i.e., without depending on the linguistic aspect of speech. We compare two machine learning approaches, namely a Convolutional Neural Network and a Support Vector Machine. The former is trained using raw speech information, while the latter is trained on a set of extracted low-level features. Aiming to provide a multilingual approach, the training and testing datasets contain speech from different languages.
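As a rough illustration of what "extracted low-level features" can mean for a speech signal, the sketch below computes two common frame-level descriptors, short-time energy and zero-crossing rate, in plain Python. The choice of these two features, the frame length, the hop size, and all function names are assumptions made for illustration; the abstract does not list the chapter's actual feature set, and no classifier training is shown.

```python
import math

def frame_signal(samples, frame_len, hop):
    """Split a sample list into overlapping frames (trailing partial frame is dropped)."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]

def short_time_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)

def extract_features(samples, frame_len=400, hop=200):
    """Return one (energy, zcr) pair per frame; parameter values are illustrative assumptions."""
    return [(short_time_energy(f), zero_crossing_rate(f))
            for f in frame_signal(samples, frame_len, hop)]

# Synthetic stand-in for speech: a 1-second, 440 Hz tone sampled at 16 kHz.
sample_rate = 16000
tone = [math.sin(2 * math.pi * 440 * n / sample_rate) for n in range(sample_rate)]
features = extract_features(tone)
```

In a pipeline of the kind the abstract describes, per-frame vectors like these would typically be summarized per utterance (for example by their mean and standard deviation) before being passed to a classifier such as a Support Vector Machine.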
Computational Intelligence Laboratory (CIL)
Software and Knowledge Engineering Laboratory (SKEL)
Publication Name: 
GeNeDis 2016
Springer, Cham
Publication Series: 
Advances in Experimental Medicine and Biology (AEMB)
Page Start: 
Page End: 
ISBN Code: 
978-3-319-57348-9 (online) & 978-3-319-57347-2 (print)

© 2018 - Institute of Informatics and Telecommunications | National Centre for Scientific Research "Demokritos"