Speech based Emotion Recognition and Gender Identification using FNN and CNN Models

Susithra, N (2022) Speech based Emotion Recognition and Gender Identification using FNN and CNN Models. In: 2022 3rd International Conference for Emerging Technology (INCET), Belgaum, India.

Full text not available from this repository.

Abstract

Detection of human emotions by analyzing the speech patterns in the audio signals has various applications including effective human-computer interaction, evaluation or automation of the customer care call quality assurance process, and medical applications such as assisting a doctor in understanding the emotional state of a patient. For instance, speech produced in a state of anger, fear, or happiness has higher and wider pitch and it is louder and faster whereas slow and low-pitched speech is produced when a person is sad. Recognition of emotion, gender and age from speech signals is one of the active research areas in speech processing where there is need for improvement in accuracy. The work proposed in this paper makes use of machine learning neural networks to recognize the gender and emotion of a speaker. The model is designed to classify four different emotions (neutral, happy, sad, angry). The work involves the integration of two modules namely, Gender identification block and Emotion detection block. A simple Feed forward neural network, tested and trained by a large dataset forms the gender identification block, while Convolutional Neural Network (CNN) with adequate datasets were used to train and test the emotion detection block. The proposed model results in detection of gender with 91.46 percent accuracy while the emotion detection module provides 86 percent accuracy.

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: Convolutional neural network; Detection blocks; Emotion detection; Emotion recognition; Feed forward neural net works; Gender identification; Machine learning convolutional neural network; Machine-learning; Neural network model; Speech emotion recognition
Subjects: A Artificial Intelligence and Data Science > Text and Speech Analysis
C Computer Science and Engineering > Human-Computer Interaction
C Computer Science and Engineering > Neural Networks
C Computer Science and Engineering > Health Care, Disease
C Computer Science and Engineering > Machine Learning
E Electronics and Communication Engineering > Intelligent Instrumentaion
Divisions: Electronics and Communication Engineering
Depositing User: Dr Krishnamurthy V
Date Deposited: 18 Sep 2024 04:22
Last Modified: 18 Sep 2024 04:23
URI: https://ir.psgitech.ac.in/id/eprint/1142

Actions (login required)

View Item
View Item