Artificial Intelligence; Cognitive simulation; Natural language processing; Speech recognition and synthesis
Issue Date:
2008
Publisher:
Institute of Information Theories and Applications FOI ITHEA
Abstract:
In this report we summarize the state of the art of speech emotion recognition from the signal-processing
point of view. On the basis of multi-corpus experiments with machine-learning classifiers, we observe that
existing supervised machine-learning approaches lead to database-dependent classifiers which cannot be applied
to multi-language speech emotion recognition without additional training, because they discriminate the emotion
classes according to the training language. As there are experimental results showing that humans can perform
language-independent categorisation, we drew a parallel between machine recognition and the cognitive process
and tried to discover the sources of these divergent results. The analysis suggests that the main difference is
that human speech perception allows the extraction of language-independent features, even though
language-dependent features are present at all levels of the speech signal and play a strongly discriminative
role in human perception. Based on several results in related domains, we have further suggested that the
cognitive process of emotion recognition rests on categorisation, assisted by a hierarchical structure of
emotional categories present in the cognitive space of all humans. We propose a strategy for developing
language-independent machine emotion recognition, based on the identification of language-independent speech
features and the use of additional information from visual (facial expression) features.
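The database dependence described above can be illustrated with a minimal cross-corpus evaluation sketch. The data, feature dimensions, and distribution shift below are entirely synthetic assumptions (no real speech corpora or acoustic features are used); the point is only to show how a classifier trained on one corpus can degrade on another corpus whose feature distribution differs, e.g. because of language or recording conditions.

```python
# Cross-corpus evaluation sketch (synthetic data, hypothetical setup):
# train an "emotion" classifier on corpus A, then test it on corpus B,
# whose feature distribution is shifted to mimic a different language
# or recording condition. Accuracy drops on the unseen corpus.
import numpy as np
from sklearn.svm import SVC
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(0)

def make_corpus(shift, n=200, dim=8):
    """Two toy emotion classes; `shift` mimics corpus/language differences."""
    X0 = rng.normal(0.0 + shift, 1.0, size=(n, dim))  # class 0 features
    X1 = rng.normal(2.0 + shift, 1.0, size=(n, dim))  # class 1 features
    X = np.vstack([X0, X1])
    y = np.array([0] * n + [1] * n)
    return X, y

X_a, y_a = make_corpus(shift=0.0)   # training corpus ("language A")
X_b, y_b = make_corpus(shift=1.5)   # unseen corpus  ("language B")

clf = SVC().fit(X_a, y_a)
acc_within = accuracy_score(y_a, clf.predict(X_a))
acc_cross = accuracy_score(y_b, clf.predict(X_b))
print(f"within-corpus accuracy: {acc_within:.2f}")
print(f"cross-corpus accuracy:  {acc_cross:.2f}")
```

In this toy setting the classifier separates the classes almost perfectly on its own corpus but misclassifies much of the shifted corpus, mirroring the report's observation that such classifiers require additional training before they transfer across languages.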