Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Phonetic feature processing method of voiceprint identification in noise environment

A voiceprint recognition and voice feature technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as reducing the robustness of voiceprint recognition, reduce instability, ensure sensitivity and accuracy, and ensure system performance effect

Active Publication Date: 2016-06-15
CHONGQING UNIV OF POSTS & TELECOMM
View PDF6 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The acoustic model only uses the data in the voice sample library for training, and the voice collection is usually in a low-noise environment, which is often difficult to match with a variety of noise environments. The feature distortion caused by environmental noise reduces the robustness of voiceprint recognition. sex

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Phonetic feature processing method of voiceprint identification in noise environment
  • Phonetic feature processing method of voiceprint identification in noise environment
  • Phonetic feature processing method of voiceprint identification in noise environment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] Below in conjunction with accompanying drawing, the present invention will be further described:

[0036] Such as figure 1 As shown, suppose the speech signal is x(n).

[0037] Step 1: The pre-emphasis filter x'(n)=x(n)-ax(n-1) adopted, wherein a takes a constant of 0.95 to preprocess the speech signal; the speech signal is windowed by Hamming window processing; first select a larger threshold T according to the short-term energy envelope 1 (According to the speech signal energy statistics, it is set to 9.58) to make a rough judgment, and it is determined to be a speech signal higher than the threshold, and the start and end points of the speech signal are located outside the corresponding time point of the intersection of the threshold and the short-term energy envelope. Determine a lower threshold T on the average energy 2 (according to speech signal energy statistics, set to 5.56), and from T 1 The intersection points search for both sides of the signal respectiv...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a phonetic signal feature processing method of voiceprint identification in a noise environment, which includes the steps of: (1) carrying out early stage processing on signals according to a phonetic signal characteristic, the processing including signal pre-emphasis, endpoint detection, and selection of window functions; (2) estimating a fundamental tone period of a sounding individual, carrying out spectrum smoothing processing on the phonetic signal based on the fundamental tone period, obtaining a new spectrum envelope, calculating the energy passing through a Mel filter, and finally obtaining a Mel smoothing coefficient (SFCC) through Discrete Cosine Transform (DCT) calculation; and (3) carrying out post-processing on the SFCC by combination of a mean value reduction method, variance normalization, a time sequence filter method, and a weight autoregression moving average filter method, and obtaining a regression balance parameter (MVDA). The purpose is to remove individual sounding unstable factors by smoothing the spectrum envelope and to remove the ambient noise influence through a post-processing algorithm, and the false identification rate of the voiceprint identification is finally reduced.

Description

technical field [0001] The invention relates to the field of speech signal processing, and proposes a speech feature extraction method based on pitch characteristics and noise characteristics. Background technique [0002] With the development of speech science and information communication technology, as a more convenient identity verification technology, voiceprint recognition technology has made remarkable progress. As one of the most basic natural attributes of human beings, language is the most direct and convenient way for information transmission between human beings. As an individual, the vocal organs are not only related to innate factors, but also greatly affected by factors such as the acquired developmental environment, so the voice has a very significant individual color. This individual characteristic also gave rise to a scientific research hotspot - voiceprint recognition. When an individual is speaking, the voice produced is related to the individual's voca...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/02G10L19/02G10L21/0332
Inventor 张毅谢延义徐晓东萧红罗久飞黄超王可佳倪雷
Owner CHONGQING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products