Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speaking man recognizing method using base frequency envelope to eliminate emotion voice

A technology of speaker recognition and fundamental frequency envelope, applied in the field of biometric recognition, can solve the problems of inconvenience of use, the decline of the recognition rate of emotional difference speech, and the lack of consideration of the speaker, so as to improve the recognition performance and overcome the inconvenience. Effect

Inactive Publication Date: 2008-05-14
ZHEJIANG UNIV
View PDF0 Cites 32 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In the traditional ASR (Automatic Speaker Recognition) system, the influence of the speaker's emotion is not considered, which leads to a sharp drop in the speech recognition rate of emotional differences
For how to improve the performance of the speaker recognition system under the influence of emotion, the existing methods generally need to provide the emotional speech of the test speaker during training or need to provide the emotional state information of the test speech during testing. The use of this system brings some inconvenience

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speaking man recognizing method using base frequency envelope to eliminate emotion voice
  • Speaking man recognizing method using base frequency envelope to eliminate emotion voice
  • Speaking man recognizing method using base frequency envelope to eliminate emotion voice

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0010] The present invention will be further introduced below in conjunction with accompanying drawing and embodiment: the method of the present invention is divided into five steps altogether.

[0011] The first step: speech signal preprocessing

[0012] 1. Sampling and quantization

[0013] A), filter the voice signal with a sharp cut-off filter to make its Nyquist frequency F N 4KHz;

[0014] B), setting voice sampling rate F=2F N ;

[0015] C), for the voice signal s a (t) Sampling by cycle to obtain the amplitude sequence of the digital voice signal s ( n ) = s a ( n F ) ;

[0016] D) Perform quantization coding on s(n) by pulse code modulation (PCM), and obtain the quantized value representation s'(n) of the amplitude sequence.

[0017] 2. Pre-emphasis processing

[0018...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a speaker recognition method for filtering the emotional tone by using pitch envelop. In the test for speaker recognition, the mutually corresponding cepstrum features and pitch frequency which are firstly extracted from a segment of tone; the gender information is obtained by testing on the gender model which is trained in advance according to the cepstrum features; the thresholds adopted in the method for filtering the emotional tone are determined by gender information; the pitch envelope is picked out according to the thresholds, and then the cepstrum features are filtered according to the serial number of each frame in pitch envelope, thus acquiring the processed cepstrum features; finally, the GMM system test is carried out on the processed cepstrum features.The beneficial effects of the invention are as follows: the inconvenience to the system which needs providing the emotional tone of the speaker in the training or the emotion information of the speech in the test in a traditional method is eliminated, and the recognition performance is increased by 8% compared with traditional ASR system.

Description

technical field [0001] The invention relates to biometric feature recognition technology, and mainly relates to a speaker recognition method using fundamental frequency envelope to eliminate emotional speech. Background technique [0002] Biometric identification technology refers to a technology that uses human's own physiological or behavioral characteristics for identity authentication through computers. Based on behavioral characteristics (voice, keystroke, gait, signature, etc.), the powerful functions of computers and network technology are used for image processing and pattern recognition to identify people's identities. Voiceprint recognition or speaker recognition is one of them. It is a technology that automatically identifies the speaker's identity based on the voice parameters in the voice waveform that reflect the speaker's physiological and behavioral characteristics. [0003] Human speech contains not only text information, but also people's emotional informa...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L17/00G10L15/02G10L15/06G10L15/08G10L15/28G10L17/04G10L25/24
Inventor 吴朝晖杨莹春黄挺
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products