Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

80 results about "Speech enhancement algorithm" patented technology

Prior signal-to-noise ratio estimating method based on MMSE error criterion

The invention discloses a prior signal-to-noise ratio estimating method based on an MMSE error criterion and used for voice enhancement, and belongs to the technical field of voice signal processing. Aimed at the prior signal-to-noise ratio estimating problem in the voice enhancement technology, the method comprises the steps of: firstly carrying out preliminary estimation on a prior signal-to-noise ratio of noised voices based on the MMSE error criterion, carrying out Wiener filtering calculation on an obtained prior signal-to-noise ratio estimated value to obtain a first system gain factor, carrying out calculation on the first system gain factor an amplitude spectrum value of the noised voices to obtain a voice power spectrum estimated value, then utilizing the obtained voice power spectrum estimated value and a power spectrum estimated value of noise to carry out estimation once again, and obtaining a final prior signal-to-noise ratio estimated value. The prior signal-to-noise ratio estimated value is substituted into a subsequent voice enhancing step for processing, and de-noised estimated voice clearing signals are obtained. The prior signal-to-noise ratio estimating method based on the MMSE error criterion can effectively inhibit background noise components in estimated cleared voices, and excessive damages to the cleared voice components are avoided, so that the hearing quality of the estimated cleared voices is improved, and the performance of a voice enhancement algorithm is improved.
Owner:SYSU CMU SHUNDE INT JOINT RES INST +1

Directional speech enhancement method based on small microphone array

The invention provides a directional speech enhancement method based on a small microphone array, which comprises the following steps of: 1) utilizing two fully directional microphones to acquire a sound signal; 2) utilizing a self-adaptive null-forming algorithm to perform data processing on the acquired sound signal to obtain a delay substraction signal x(t) and a signal z(t) after the self-adaptive filtering; 3) processing the x(t) and the z(t) to form speech spectrums, namely X (omega) and Z(omega) respectively; 4) calculating preliminary gain G'(omega) by utilizing a single-channel speech enhancement method according to the X (omega) and the Z(omega), and calculating the existence probability P(omega) of a target signal according to the X (omega) and the Z(omega); 5) utilizing the existence probability P(omega) of the target signal to amend the preliminary gain G'(omega) to obtain final gain G(omega), wherein the G(omega) is equal to (the G'(omega))Gm, and Gm is a preset minimum gain value; and 6) utilizing the final gain G(omega) to enhance the signal z(t) after the self-adaptive filtering to obtain a final enhanced speech signal r(t). The method can realize a directional speech enhancement algorithm in small volume, can obtain noise suppression to a greater extent, and improve the signal-to-noise ratio.
Owner:INST OF ACOUSTICS CHINESE ACAD OF SCI +1

Method for enhancing voice based on signal to noise ratio soft masking

The invention discloses a method for enhancing voice based on signal to noise ratio soft masking. The method comprises: establishing noise power spectrum updates of a sub-band time varying coefficient, using different threshold update smooth spectrums for different frequency points, enhancing voice and restraining noise; determining a posterior signal to noise ratio from a noise power spectrum, performing iterative computation to obtain a prior signal to noise ratio of a present frame according to the posterior signal to noise ratio and the prior signal to noise ratio of a previous frame, and obtaining that each frequency point is in a masking region or in a target signal region according to values of the prior signal to noise ratio. A masking value is calculated by probability distribution obtained by hypothesis testing. The method uses correlation of adjacent frames to extract information to realize enhancement of voice spectrum smooth iteration estimation. For non-stationary noise and strong background noise, a voice enhancement algorithm based on signal to noise ratio soft masking is provided. A rapid tracking noise algorithm performs smooth update on the non-stationary noise frame by frame, preferably estimating a noise spectrum. The algorithm can effectively restrain background noise and improve voice quality and speech intelligibility after denoising.
Owner:HUNAN INT ECONOMICS UNIV

Adversarial sample attack defense method and device based on speech enhancement algorithm

The embodiment of the invention provides an adversarial sample attack defense method and device based on a voice enhancement algorithm. The method comprises the steps of obtaining a to-be-identified voice sample and spectrum features of the to-be-identified voice sample; according to the spectrum characteristics of the to-be-identified voice sample, calculating a noise spectrum of the to-be-identified voice sample through a preset algorithm, and denoising the to-be-identified voice sample by using the estimated noise spectrum obtained through calculation to obtain a denoised voice sample, wherein the algorithm comprises a spectral subtraction method based on continuous minimum tracking and a logarithm minimum mean square error algorithm MMSE algorithm combined with speech existence probability; and recognizing the denoised voice sample through a pre-trained voice recognition model to obtain a recognition result. Therefore, after the to-be-identified voice sample is obtained and de-noising processing is carried out on the to-be-identified voice sample, the de-noised voice sample is identified, so that the voice identification accuracy is improved and the efficiency of defending against the attack of the countermeasure sample is improved.
Owner:BEIJING UNIV OF POSTS & TELECOMM

Short-wave channel speech anti-fading auxiliary enhancement method based on convolutional neural network

The invention discloses a short-wave channel speech anti-fading auxiliary enhancement method based on a convolutional neural network, belongs to the technical field of communication, and particularly relates to a short-wave fading resistant speech enhancement auxiliary method. Firstly, an applicable short-wave speech communication model is defined; after a transmitting terminal obtains a speech signal sample, background environment noise is eliminated by using an existing speech enhancement technology, then SSB modulation is carried out, up-conversion is carried out to a short-wave frequency band for transmission, a transmitting signal reaches a receiver at a far end through a short-wave channel, and speech enhancement is carried out on the received signal after down-conversion and SSB demodulation, so that the purpose of the invention is achieved. The anti-fading convolutional neural network can be used for assisting most speech enhancement algorithms based on speech feature extraction and further improving the quality of short-wave received speech signals, for example, the anti-fading convolutional neural network can be combined with a spectral subtraction method, a method based on a statistical model, an NMF algorithm and the like described in the background technology.
Owner:UNIV OF ELECTRONICS SCI & TECH OF CHINA

Single-channel speech enhancement algorithm based on compensation phase spectrum

The invention provides a single-channel speech enhancement algorithm based on a compensation phase spectrum. The single-channel speech enhancement algorithm comprises the following steps: preprocessing a noisy speech signal, and framing and windowing the noisy speech signal; carrying out fourier transform; dividing a critical frequency band by using an ERB scale; calculating the value of the segmented signal-to-noise ratio; calculating a new compensation factor, and obtaining a primarily enhanced voice complex frequency spectrum through power spectrum subtraction; performing additive calculation on the phase spectrum compensation function and the primarily enhanced voice complex frequency spectrum to obtain a compensated complex frequency spectrum; solving a phase angle of the compensatedcomplex frequency spectrum to obtain a compensated phase spectrum; and overlapping and adding the speech amplitude spectrum after basic spectrum subtraction and the compensation phase spectrum obtained in the seventh step, and then carrying out inverse Fourier transform to obtain an enhanced speech signal. The invention verifies that the improved compensation factor not only has an effect on steady-state noise, but also is more beneficial to the denoising effect of unsteady-state noise, and the speech enhancement algorithm provided by the invention is relatively wide and effective in applicable noise environment.
Owner:JIANGSU UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products