Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Tibetan speech emotion recognition method based on CNN and LSTM

A speech emotion recognition, Tibetan language technology, applied in speech analysis, instruments, etc.

Active Publication Date: 2021-12-17
TIBET UNIV
View PDF10 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Aiming at the above-mentioned deficiencies in the prior art, a kind of Tibetan speech emotion recognition method based on CNN and LSTM provided by the present invention solves the problem for Tibetan speech emotion recognition question

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Tibetan speech emotion recognition method based on CNN and LSTM
  • Tibetan speech emotion recognition method based on CNN and LSTM
  • Tibetan speech emotion recognition method based on CNN and LSTM

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0070] The specific embodiments of the present invention are described below so that those skilled in the art can understand the present invention, but it should be clear that the present invention is not limited to the scope of the specific embodiments. For those of ordinary skill in the art, as long as various changes Within the spirit and scope of the present invention defined and determined by the appended claims, these changes are obvious, and all inventions and creations using the concept of the present invention are included in the protection list.

[0071] Such as figure 1 As shown, in one embodiment of the present invention, the present invention provides a kind of Tibetan speech emotion recognition method based on CNN and LSTM, comprises the steps:

[0072] S1, establish the Tibetan speech emotion corpus;

[0073] The specific steps of the step S1 are as follows:

[0074] S11, recording Tibetan voice data;

[0075] S12. Perform emotion labeling on the Tibetan spee...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a Tibetan speech emotion recognition method based on CNN and LSTM, belonging to the technical field of speech emotion recognition. The method comprises the following steps: building a Tibetan speech emotion corpus; preprocessing Tibetan speech data in the Tibetan speech emotion corpus; performing feature extraction on the Tibetan speech data in the preprocessed Tibetan speech emotion corpus to obtain a Tibetan speech spectrum; training a Tibetan speech emotion recognition network according to the Tibetan speech spectrum to obtain a trained Tibetan speech emotion recognition network; and inputting Tibetan speech data needing to be recognized into the trained Tibetan speech emotion recognition network after being subjected to preprocessing and feature extraction so as to obtain a Tibetan speech emotion classification result corresponding to the Tibetan speech data. According to the Tibetan speech emotion recognition method based on the CNN and the LSTM, the problem of Tibetan speech emotion recognition is solved.

Description

technical field [0001] The invention belongs to the technical field of speech emotion recognition, in particular to a method for recognizing Tibetan speech emotion based on CNN and LSTM. Background technique [0002] Speech emotion recognition is a computer simulation of the process of human emotion perception and understanding. Its task is to extract the acoustic features expressing emotion from the collected voice signals, and find out the mapping relationship between these acoustic features and human emotions, and then through this Mapping relationship, identifying the input voice signal, to achieve the purpose of human-computer interaction, and Tibetan speech emotion recognition is a special case of speech emotion recognition, that is, using Tibetan speech with emotion as input, so that the computer can build the basis of the mapping relationship It recognizes Tibetan speech with emotion on the Internet to realize human-computer interaction. [0003] In recent years, wi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L25/63G10L25/30G10L25/03
CPCG10L25/63G10L25/30G10L25/03
Inventor 边巴旺堆王希王君堡卓嘎云登努布
Owner TIBET UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products