Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Extraction and modeling method for Chinese speech sensibility information

A technology of extraction method and modeling method, which is applied in the extraction and modeling of Chinese speech emotion information, and can solve the problems of long time for establishment and training of hidden Markov models, high computational complexity, and small sample learning.

Inactive Publication Date: 2011-05-25
BEIHANG UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Before the present invention, the emotional features mainly used by researchers included prosodic features and their derived parameters, such as time, energy, gene frequency, formant, etc. However, the relationship between these parameters and emotional expression is very complicated, and different emotional divisions often It is suitable to adopt a specific combination of features, and there is little research on it in the past literature
The speech signal is a typical non-stationary signal, and the linear models adopted by the K-nearest neighbor method and the principal component analysis method are too simple to obtain a good recognition rate; the establishment and training of the hidden Markov model (HMM) takes too long, In practice, it is necessary to solve the problem of high computational complexity; the artificial neural network has a high degree of nonlinear modeling and strong classification ability, but it cannot overcome the local minimum problem
In addition, the appeal recognition methods all belong to the category of statistical machine learning. According to theoretical analysis, the recognition performance can only be guaranteed when the number of training samples tends to infinity, but there is always the problem of small sample learning in practical applications.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Extraction and modeling method for Chinese speech sensibility information
  • Extraction and modeling method for Chinese speech sensibility information
  • Extraction and modeling method for Chinese speech sensibility information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] The technical solution of the present invention will be further elaborated below in conjunction with the accompanying drawings.

[0036] Such as figure 1 As shown, it is a flow chart of the extraction and modeling method of Chinese speech emotion information, which is mainly divided into two parts: the extraction method of Chinese speech emotion information and the modeling method of Chinese speech emotion information.

[0037] One, the extraction method of Chinese speech emotion information, this method step is as follows:

[0038] Step 1. Develop the Emotional Speech Database Specification

[0039] Every step in the entire production process of the voice library should comply with specific specifications, including speaker specifications, recording script design specifications, recording specifications, audio file naming specifications, and experiment recording specifications. According to the application requirements of speech emotion research, the production speci...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a method for extracting and modeling the emotional information of a Chinese sound; the extracting method for the emotional information of the Chinese sound is that: formulate the specification of a emotional speech database, which includes the pronouncer specification, the recording play book design specification and the naming specification of audio files and so on; collect the emotional speech data; evaluate the validity of the emotional speech, namely, at least ten evaluators apart from a speaker carry out a subjective listen evaluation experiment on the emotional speech data. The modeling method of the emotional information of the Chinese sound is that: extract the emotional characteristics of the sound, define and distinguish the characteristic combination of each emotion type; adopt different characteristic combinations to train the SVM model of a multilevel sound emotion recognition system; verify the identification effect of the classifying models, namely, verify the classification effect of the multilevel classification models of sound emotion in a situation unrelated to the speaker by adopting a cross leave-one-out method. The method solves the problems that the domestic emotional speech databases are less in emotion type and the number of the domestic emotional speech database is very limited; at the same time, the method realizes an efficientspeech emotion identification system.

Description

(1) Technical field: [0001] The invention relates to a method for extracting and modeling Chinese speech emotion information, belonging to the field of information technology. (two) background technology: [0002] The research on speech emotion automatic recognition technology mainly involves two aspects: one is to establish a high-quality emotional speech database to provide necessary data for speech emotion research; the other is to establish an efficient speech emotion recognition model to quickly and effectively recognize the emotion of speech signals state. [0003] To establish an emotional speech database, the categories and classification methods of emotions must first be established. In most research methods, researchers use everyday language labels to identify and classify emotions, that is, describe emotion classification as a discrete model. Emotional speech data can be divided into three types according to the naturalness of expression: natural type, performanc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/00G10L15/08G10L15/28G10L15/06G10L25/63
Inventor 毛峡陈立江
Owner BEIHANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products