Speech enhancement post-processing method and device based on harmonic structure prediction

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A post-processing device and voice enhancement technology, applied in the field of information processing, can solve the problems of affecting communication quality, unable to recover high-frequency harmonics, harmonic loss, etc., to achieve good communication quality and improve the effect of voice perception quality

Pending Publication Date: 2022-04-15

SUIRUI TECH CO LTD

View PDF0 Cites 1 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] At present, the main disadvantages of the method for estimating the time-frequency masking value in the prior art are as follows: 1. The existing time-frequency masking method is easy to ignore the harmonic structure of the voice, resulting in the loss of some harmonics and affecting the communication quality; 2. The remote speaking scene Under the influence of reverberation, the high-frequency harmonics will be weakened, and the existing time-frequency masking method cannot recover the weakened high-frequency harmonics.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0066] In order to enable those skilled in the art to better understand the solutions of the present invention, the present invention will be further described in detail below in conjunction with specific embodiments.

[0067] Such as figure 1 As shown, one embodiment of the present invention is a speech enhancement post-processing method based on harmonic structure prediction.

[0068] Specifically, it includes the following four implementation steps:

[0069] S1: Short-time Fourier transform is performed on the voice signal of the microphone to obtain a time-frequency domain expression.

[0070] Wherein, the voice signal of the microphone is a digital signal after the sound pressure collected by the microphone passes through the ADC.

[0071] Before step S1, it also includes acquiring the voice signal of the microphone, and the acquired voice signal is as follows: Suppose x(n) represents the original time domain signal picked up by the microphone array element in real time...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a voice enhancement post-processing method and device based on harmonic structure prediction, and belongs to the field of information processing, and the method comprises the following steps: S1, carrying out the short-time Fourier transform of a voice signal of a microphone, and obtaining a time-frequency domain expression; s2, carrying out harmonic loss estimation and correction on the time-frequency domain signal to obtain an estimated power spectrum density; s3, estimating a time-frequency masking value according to the power spectrum density; and S4, according to the estimated time-frequency masking value, obtaining the frequency domain estimation of the target voice, and further obtaining the time domain estimation of the target voice. According to the method, the lost harmonic structure can be predicted to a certain extent, the recovered voice better conforms to the characteristics of the near-speaking voice, and the intelligibility and the voice perception quality are higher.

Description

technical field [0001] The invention belongs to the field of information processing, and in particular relates to a speech enhancement post-processing method and device based on harmonic structure prediction. Background technique [0002] In many applications such as voice conferencing systems, background noise can degrade the communication quality of the intercom system. It is one of the key technologies necessary for conference system related applications to suppress the noise signal collected by the microphone through an algorithm. However, when the noise suppression method suppresses the noise, the speech signal will also be damaged. Therefore, while suppressing noise, it is also necessary to consider how to enhance speech signals, especially the harmonic structure of speech. [0003] In the prior art, noise suppression and voice enhancement are key technologies for voice communication quality in conference systems or conference equipment. The traditional signal proce...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L21/02G10L21/0224G10L21/0232G10L21/0316

CPCG10L21/02G10L21/0224G10L21/0232G10L21/0316

Inventor 何平蒋升

Owner SUIRUI TECH CO LTD

Speech enhancement post-processing method and device based on harmonic structure prediction

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology