Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Curriculum learning for speech emotion recognition from crowdsourced labels
Lotfian R., Busso C.  IEEE/ACM Transactions on Audio, Speech and Language Processing 27 (4): 815-826, 2019. Type: Article
Date Reviewed: Aug 20 2021

In computer applications such as synergistic games, gratifying robots, and speech recognition systems, the ability to identify emotions is invaluable. But how should effective algorithms and systems be designed for discerning emotions from diverse speech? Lotfian and Busso present a machine learning program for effectively boosting the training procedure of deep neural networks (DNNs) in speech sentiment detection systems.

The authors present extensive yet concise reviews of research efforts and solutions related to understanding speech emotion. Without a doubt, new systems must be equipped with (1) models that capitalize on using artificial intelligence (AI) techniques to delve into the inadequate training datasets for recognizing speech emotion, and (2) algorithms for identifying known uncertainties in the emotional speech by humans and computers with imperfect training.

Considering the challenges that speech sentence evaluators of abstruse emotional content face, how should reliable metrics be created for reconciling the differences among evaluators to construct reliable classifications of emotional perceptions? The authors present statistical models for identifying, clustering, and categorizing emotional attributes. Specifically, a regression model is proposed for categorizing the dimensions of emotions for speech evaluation, a dichotomous model is used to identify the boundaries of emotional dimensions, and an algorithm is used to estimate the accuracy of speech emotional ratings by alternative evaluators.

The paper presents numerous experiments performed with datasets from various sources. Compared to similar research, the results tend to indicate the reliability of the multifaceted approach outlined. The authors clearly recognize the impact of being able to accurately identify training samples. They offer new techniques for determining reliability among speech evaluators.

Reviewer:  Amos Olagunju Review #: CR147337
Bookmark and Share
  Reviewer Selected
Featured Reviewer
General (K.3.0 )
General (H.5.0 )
Would you recommend this review?
Other reviews under "General": Date
Jupyter Notebook in CS1: an experience report
Zastre M.  WCCCE 2019 (Proceedings of the Western Canadian Conference on Computing Education, Calgary, AB, Canada,  May 3-4, 2019) 1-6, 2019. Type: Proceedings
Jun 17 2020
On the acceptance and usefulness of personalized learning objectives in MOOCs
Rohloff T., Sauer D., Meinel C.  [email protected] (Proceedings of the Sixth ACM Conference on Learning @ Scale, Chicago, IL,  Jun 24-25, 2019) 1-10, 2019. Type: Proceedings
Jun 5 2020
Predicting students’ final degree classification using an extended profile
Al-Sudani S., Palaniappan R.  Education and Information Technologies 24(4): 2357-2369, 2019. Type: Article
Jan 21 2020

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright © 2000-2021 ThinkLoud, Inc.
Terms of Use
| Privacy Policy