Speech enhancement post-processing method and device based on harmonic structure prediction

A speech enhancement post-processing method and device, applied in the field of information processing. It addresses the problems that existing methods lose harmonics, cannot recover weakened high-frequency harmonics, and therefore degrade communication quality. The stated aims are better communication quality and improved perceived speech quality.

Pending Publication Date: 2022-04-15
SUIRUI TECH CO LTD

AI Technical Summary

Problems solved by technology

[0004] At present, prior-art methods for estimating time-frequency masking values have two main disadvantages: 1. Existing time-frequency masking methods tend to ignore the harmonic structure of speech, so some harmonics are lost and communication quality suffers; 2. In far-field (remote speaking) scenarios, reverberation weakens the high-frequency harmonics, and existing time-frequency masking methods cannot recover them.
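To make the notion of harmonic structure concrete, the sketch below lists the STFT bins where the harmonics of a voiced sound would be expected to fall. It is only an illustration: the function name `harmonic_bins` and the values chosen for the pitch, sample rate, and FFT size are assumptions and do not come from the patent text.

```python
def harmonic_bins(f0_hz, sample_rate, n_fft, max_freq_hz):
    """Return the STFT bin index nearest to each harmonic h * f0_hz.

    All parameters are illustrative assumptions, not values from the patent.
    """
    bin_spacing = sample_rate / n_fft  # width of one frequency bin in Hz
    bins = []
    h = 1
    while h * f0_hz <= max_freq_hz:
        bins.append(round(h * f0_hz / bin_spacing))
        h += 1
    return bins

# Usage: a 200 Hz pitch, 16 kHz sampling, 512-point FFT, harmonics up to 4 kHz.
bins = harmonic_bins(200.0, 16000, 512, 4000.0)
print(bins[:5])
```

A mask that is estimated independently for each bin has no knowledge that these bins belong together, which is one way to see why weak harmonics can be suppressed along with the noise.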




Detailed Description of Embodiments

[0066] In order to enable those skilled in the art to better understand the solutions of the present invention, the present invention will be further described in detail below in conjunction with specific embodiments.

[0067] As shown in Figure 1, one embodiment of the present invention is a speech enhancement post-processing method based on harmonic structure prediction.

[0068] Specifically, it includes the following four implementation steps:

[0069] S1: Short-time Fourier transform is performed on the voice signal of the microphone to obtain a time-frequency domain expression.
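Step S1 can be sketched as follows. This is a minimal illustration of a short-time Fourier transform, not the patent's implementation: the Hann window, the frame length `n_fft`, the hop size `hop`, and the naive DFT are all assumptions chosen to keep the example self-contained.

```python
import cmath
import math

def hann(n_fft):
    """Periodic Hann window of length n_fft (an assumed window choice)."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * n / n_fft) for n in range(n_fft)]

def stft(x, n_fft=64, hop=32):
    """Return a list of frames; each frame holds n_fft // 2 + 1 complex bins."""
    window = hann(n_fft)
    frames = []
    for start in range(0, len(x) - n_fft + 1, hop):
        segment = [x[start + n] * window[n] for n in range(n_fft)]
        # Naive DFT, kept for clarity; a practical system would use an FFT.
        spectrum = [
            sum(segment[n] * cmath.exp(-2j * math.pi * k * n / n_fft)
                for n in range(n_fft))
            for k in range(n_fft // 2 + 1)
        ]
        frames.append(spectrum)
    return frames

# Usage: a 1 kHz tone sampled at 8 kHz falls in bin 1000 / (8000 / 64) = 8.
fs = 8000
tone = [math.sin(2.0 * math.pi * 1000.0 * n / fs) for n in range(256)]
X = stft(tone)
peak_bin = max(range(len(X[0])), key=lambda k: abs(X[0][k]))
```

Each entry `X[t][k]` is the complex value at frame `t` and frequency bin `k`, which is the time-frequency representation the later steps would operate on.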

[0070] Here, the voice signal of the microphone is the digital signal obtained after the sound pressure picked up by the microphone passes through an analog-to-digital converter (ADC).

[0071] Before step S1, the method also includes acquiring the voice signal of the microphone. The acquired voice signal is described as follows: let x(n) denote the original time-domain signal picked up by the microphone array element in real time...


Abstract

The invention discloses a speech enhancement post-processing method and device based on harmonic structure prediction, belonging to the field of information processing. The method comprises the following steps: S1, performing a short-time Fourier transform on the voice signal of a microphone to obtain a time-frequency domain representation; S2, performing harmonic loss estimation and correction on the time-frequency domain signal to obtain an estimated power spectral density; S3, estimating a time-frequency masking value from the power spectral density; and S4, obtaining a frequency-domain estimate of the target speech from the estimated time-frequency masking value, and from it a time-domain estimate of the target speech. The method can predict the lost harmonic structure to a certain extent, so that the recovered speech better matches the characteristics of near-field speech and has higher intelligibility and perceived speech quality.
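The flow of steps S2 to S4 on a single frame can be sketched as below. This is a toy illustration under loudly stated assumptions: the listing does not give the patent's harmonic loss estimation or its masking formula, so `correct_harmonics` is a hypothetical stand-in that simply raises each weak harmonic to a fraction of the previous one, and the mask is a standard Wiener-style gain. The noise power spectral density and the pitch bin `f0_bin` are assumed to be known.

```python
def estimate_speech_psd(noisy_psd, noise_psd):
    """Spectral-subtraction estimate of the speech PSD, floored at zero."""
    return [max(p - n, 0.0) for p, n in zip(noisy_psd, noise_psd)]

def correct_harmonics(speech_psd, f0_bin, decay=0.1):
    """Hypothetical stand-in for S2, NOT the patent's method.

    Raises each harmonic bin (2*f0_bin, 3*f0_bin, ...) to at least
    decay times the corrected PSD of the previous harmonic.
    """
    if f0_bin <= 0:
        raise ValueError("f0_bin must be positive")
    out = list(speech_psd)
    k = 2 * f0_bin
    while k < len(out):
        out[k] = max(out[k], decay * out[k - f0_bin])
        k += f0_bin
    return out

def wiener_mask(speech_psd, noise_psd, eps=1e-12):
    """S3 sketch: Wiener-style gain in [0, 1) for every bin."""
    return [s / (s + n + eps) for s, n in zip(speech_psd, noise_psd)]

def apply_mask(spectrum, mask):
    """S4 sketch: frequency-domain estimate of the target speech."""
    return [m * x for m, x in zip(mask, spectrum)]

# Toy frame: harmonics at bins 2, 4, 6; the one at bin 6 is at the noise level.
spectrum = [0j, 0j, 4 + 0j, 0j, 2 + 0j, 0j, 1 + 0j, 0j, 0j]
noise_psd = [1.0] * len(spectrum)
noisy_psd = [abs(x) ** 2 for x in spectrum]

speech_psd = estimate_speech_psd(noisy_psd, noise_psd)
plain_mask = wiener_mask(speech_psd, noise_psd)
corrected_mask = wiener_mask(correct_harmonics(speech_psd, f0_bin=2), noise_psd)
enhanced = apply_mask(spectrum, corrected_mask)
```

In this toy frame the uncorrected mask removes bin 6 entirely, while the corrected mask keeps part of it and leaves non-harmonic bins untouched. The final part of S4, returning to the time domain, would need an inverse short-time Fourier transform with overlap-add, which is omitted here.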

Description

Technical field

[0001] The invention belongs to the field of information processing, and in particular relates to a speech enhancement post-processing method and device based on harmonic structure prediction.

Background

[0002] In many applications, such as voice conferencing systems, background noise can degrade the communication quality of the intercom system. Suppressing the noise picked up by the microphone algorithmically is one of the key technologies required by conference-system applications. However, when a noise suppression method suppresses the noise, it also damages the speech signal. Therefore, while suppressing noise, it is also necessary to consider how to enhance the speech signal, especially the harmonic structure of speech.

[0003] In the prior art, noise suppression and speech enhancement are key technologies for voice communication quality in conference systems or conference equipment. The traditional signal proce...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L21/02, G10L21/0224, G10L21/0232, G10L21/0316
CPC: G10L21/02, G10L21/0224, G10L21/0232, G10L21/0316
Inventor: 何平, 蒋升
Owner: SUIRUI TECH CO LTD