Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Noise spectrum estimation and voice mobility detection method based on unsupervised learning

An unsupervised learning, voice activity technology, applied in voice analysis, instruments, etc., can solve problems such as enhancing the adaptability of voice application systems, and achieve the effect of strong practicability, strong practicability, and enhanced adaptability

Active Publication Date: 2010-10-06
INST OF ACOUSTICS CHINESE ACAD OF SCI
View PDF0 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Aiming at the shortcomings of the previous voice activity detector and noise power spectrum estimator, the present invention proposes a tightly coupled solution, so that voice activity detection and noise power spectrum estimation are unified under an unsupervised learning framework, thereby enhancing Adaptability of Speech Application System to Noise Environment

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Noise spectrum estimation and voice mobility detection method based on unsupervised learning
  • Noise spectrum estimation and voice mobility detection method based on unsupervised learning
  • Noise spectrum estimation and voice mobility detection method based on unsupervised learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a noise spectrum estimation and voice mobility detection method based on unsupervised learning, which comprises the steps of: 1, establishing a GMM (Gaussian Mixture Model) model aiming at logarithm amplitude features of a voice signal on each frequency point; 2, setting M frames of buffers for one section of voice data, storing the former M frames of input signals into the buffers, extracting a logarithm amplitude spectrum of M frames in the buffers, and substituting into the GMM mode of the step 1 for initializing to obtain an initialized model Lambda0,k; and 3, updating the GMM model by frames by adopting an incremental learning mode from the (M+1)th frame after the initialize model Lambda0,k is obtained, and carrying out sequential recursion to obtain chances of occurrence of the noise value and the voice signals on the kth frequency point of the ith frame. The invention is a tight coupling solution of spectrum estimation and voice mobility detection, which can enhance the adaptability of the voice application system to the noise environment. The invention is independent from the hypothesis of the noise initialization, and can provide the description of the voice mobility on the time frequency two-dimensional space.

Description

technical field The present invention relates to the technical field of speech signal processing, in particular, the present invention relates to a noise power spectrum estimation and speech activity detection method based on unsupervised learning. Among them, speech activity detection is an algorithm for judging the presence or absence of speech in the time dimension. It can not only answer the existence of speech in the form of "yes" or "no", but also describe the presence of speech with the probability of speech occurrence. Background technique Most speech application systems have to deal with environmental noise interference. Predecessors have proposed many methods to remove the interference of noise on the speech system, almost all of which rely on speech activity detection and noise power spectrum estimation. There is a close relationship between these two modules, and their accuracy directly affects the overall noise immunity of the system. Traditional solutions hav...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L19/00G10L21/02G10L19/038G10L19/18
Inventor 应冬文颜永红付强潘接林
Owner INST OF ACOUSTICS CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products