Extraction and modeling method for Chinese speech sensibility information

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of extraction method and modeling method, which is applied in the extraction and modeling of Chinese speech emotion information, and can solve the problems of long time for establishment and training of hidden Markov models, high computational complexity, and small sample learning.

Inactive Publication Date: 2011-05-25

BEIHANG UNIV

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Before the present invention, the emotional features mainly used by researchers included prosodic features and their derived parameters, such as time, energy, gene frequency, formant, etc. However, the relationship between these parameters and emotional expression is very complicated, and different emotional divisions often It is suitable to adopt a specific combination of features, and there is little research on it in the past literature

The speech signal is a typical non-stationary signal, and the linear models adopted by the K-nearest neighbor method and the principal component analysis method are too simple to obtain a good recognition rate; the establishment and training of the hidden Markov model (HMM) takes too long, In practice, it is necessary to solve the problem of high computational complexity; the artificial neural network has a high degree of nonlinear modeling and strong classification ability, but it cannot overcome the local minimum problem

In addition, the appeal recognition methods all belong to the category of statistical machine learning. According to theoretical analysis, the recognition performance can only be guaranteed when the number of training samples tends to infinity, but there is always the problem of small sample learning in practical applications.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0035] The technical solution of the present invention will be further elaborated below in conjunction with the accompanying drawings.

[0036] Such as figure 1 As shown, it is a flow chart of the extraction and modeling method of Chinese speech emotion information, which is mainly divided into two parts: the extraction method of Chinese speech emotion information and the modeling method of Chinese speech emotion information.

[0037] One, the extraction method of Chinese speech emotion information, this method step is as follows:

[0038] Step 1. Develop the Emotional Speech Database Specification

[0039] Every step in the entire production process of the voice library should comply with specific specifications, including speaker specifications, recording script design specifications, recording specifications, audio file naming specifications, and experiment recording specifications. According to the application requirements of speech emotion research, the production speci...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a method for extracting and modeling the emotional information of a Chinese sound; the extracting method for the emotional information of the Chinese sound is that: formulate the specification of a emotional speech database, which includes the pronouncer specification, the recording play book design specification and the naming specification of audio files and so on; collect the emotional speech data; evaluate the validity of the emotional speech, namely, at least ten evaluators apart from a speaker carry out a subjective listen evaluation experiment on the emotional speech data. The modeling method of the emotional information of the Chinese sound is that: extract the emotional characteristics of the sound, define and distinguish the characteristic combination of each emotion type; adopt different characteristic combinations to train the SVM model of a multilevel sound emotion recognition system; verify the identification effect of the classifying models, namely, verify the classification effect of the multilevel classification models of sound emotion in a situation unrelated to the speaker by adopting a cross leave-one-out method. The method solves the problems that the domestic emotional speech databases are less in emotion type and the number of the domestic emotional speech database is very limited; at the same time, the method realizes an efficientspeech emotion identification system.

Description

(1) Technical field: [0001] The invention relates to a method for extracting and modeling Chinese speech emotion information, belonging to the field of information technology. (two) background technology: [0002] The research on speech emotion automatic recognition technology mainly involves two aspects: one is to establish a high-quality emotional speech database to provide necessary data for speech emotion research; the other is to establish an efficient speech emotion recognition model to quickly and effectively recognize the emotion of speech signals state. [0003] To establish an emotional speech database, the categories and classification methods of emotions must first be established. In most research methods, researchers use everyday language labels to identify and classify emotions, that is, describe emotion classification as a discrete model. Emotional speech data can be divided into three types according to the naturalness of expression: natural type, performanc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G10L15/00G10L15/08G10L15/28G10L15/06G10L25/63

Inventor 毛峡陈立江

Owner BEIHANG UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Extraction and modeling method for Chinese speech sensibility information

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology