Method, device, equipment and storage medium for short-term speech signal processing

A speech signal processing technique operating on time-domain signals, applied to short-term speech signal processing. It addresses incomplete echo cancellation, which degrades voice quality and recognition rate, by suppressing residual echo and environmental noise and thereby improving the clarity of the speech signal.

Active Publication Date: 2021-08-24
SHANGHAI XIAODU TECHNOLOGY CO LTD

AI Technical Summary

Problems solved by technology

[0004] The existing technology has the following defect: when a terminal uses its audio input and audio output functions at the same time, for example when the speaker and the microphone of a smart device work simultaneously, the echo in the preprocessed sound signal is not fully cancelled, and the signal still contains residual echo and ambient noise.
In the terminal's short-term speech signal processing system, this residual echo and ambient noise reduce the clarity of the speech signal and affect the normal operation of the system.
For example, in a voice message scenario, residual echo and ambient noise degrade the voice quality; in a small-vocabulary speech recognition system, they lower the recognition rate.



Examples


Embodiment 1

[0029] Figure 1 is a flow chart of a short-term speech signal processing method provided by Embodiment 1 of the present invention. This embodiment is applicable to processing speech signals, and the method can be performed by a speech signal processing apparatus. The apparatus is implemented in software and/or hardware and can generally be integrated in a speech signal processing device. Such devices include, but are not limited to, computers. For example, the device may be a terminal with a speaker-microphone circuit, such as a smartphone, a smart bracelet, a smart speaker, a smart TV, or another audio collection device. In particular, for the short-term speech signal processing system of such a device, the method can effectively suppress the residual echo and environmental noise in the short-term speech signal, improve the clarity of the short-term speech signal, and ens...

Embodiment 2

[0053] Figure 2 is a flow chart of a short-term speech signal processing method provided by Embodiment 2 of the present invention. This embodiment refines step 102 of the above embodiments: determining, according to the frequency-domain signals corresponding to the near-end time-domain signal, the far-end time-domain signal and the error time-domain signal respectively, the audio acquisition state that matches the near-end time-domain signal, where the audio acquisition state is either a single-speak state or a double-speak state. This step includes: acquiring the near-end frequency-domain signal and the far-end frequency-domain signal of the current frame, and determining the error frequency-domain signal according to the near-end frequency-domain signal and the far-end frequency-domain signal, wherein the near-end frequency-domain signal, the far-end frequency-domain signal and the error frequency-domain signal are related to the near-end time-domain signal, far-end time-domain voice signal and error time-domain signal...
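The decision rule itself is cut off in the paragraph above, so the following is only an illustrative sketch of how a per-frame state could be derived from the three frequency-domain signals. Everything in it is an assumption rather than the patent's criterion: the function names `band_energy` and `classify_frame`, the energy-ratio test, and the values of `ratio_threshold` and `far_floor` are all hypothetical. The idea is that if the echo canceller removed most of the near-end energy, the microphone signal was mostly echo (single-speak); if much energy survives in the error signal, a local talker is probably present (double-speak).

```python
def band_energy(spectrum):
    """Total energy of one frame's complex spectrum."""
    return sum(abs(c) ** 2 for c in spectrum)


def classify_frame(near_spec, far_spec, err_spec,
                   ratio_threshold=0.25, far_floor=1e-6):
    """Return "single-speak" or "double-speak" for one frame.

    Hypothetical rule (not taken from the patent text):
    - if the far end is silent, at most one side is talking;
    - otherwise compare error energy with near-end energy.
    """
    near_e = band_energy(near_spec)
    far_e = band_energy(far_spec)
    err_e = band_energy(err_spec)

    if far_e < far_floor or near_e == 0.0:
        return "single-speak"

    # Fraction of the microphone energy that echo cancellation left behind.
    residual_ratio = err_e / near_e
    return "double-speak" if residual_ratio > ratio_threshold else "single-speak"


# Example frames with two frequency bins each (made-up values).
state_echo_only = classify_frame([1 + 0j, 1 + 0j], [1 + 0j, 1 + 0j], [0.1 + 0j, 0.1 + 0j])
state_both_talking = classify_frame([1 + 0j, 1 + 0j], [1 + 0j, 1 + 0j], [0.9 + 0j, 0.9 + 0j])
```

A practical detector would usually smooth the energies over several frames and work per frequency band; this sketch leaves that out to keep the comparison visible.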

Embodiment 3

[0083] Figure 3 is a flow chart of a short-term speech signal processing method provided by Embodiment 3 of the present invention. This embodiment refines step 103 of the above embodiments: determining, according to the far-end time-domain signal, the error time-domain signal and the audio acquisition state, the residual echo amplitude spectrum and the ambient noise amplitude spectrum corresponding to the near-end time-domain signal. This step includes: determining the noise threshold of the error time-domain signal according to the error time-domain signal and the audio acquisition state, wherein the noise includes the residual echo and the ambient noise; determining the residual echo amplitude spectrum according to the error time-domain signal, the far-end time-domain signal, the audio acquisition state and the noise threshold; determining the ambient noise amplitude spectrum according to the error time-domain signal, the audio acquisition state and th...



Abstract

The embodiments of the invention disclose a short-term speech signal processing method, apparatus, device and storage medium. The method includes: acquiring a near-end time-domain signal, and determining the far-end time-domain signal and the error time-domain signal that match the near-end time-domain signal; determining the audio acquisition state that matches the near-end time-domain signal, where the audio acquisition state is either a single-speak state or a double-speak state; determining the residual echo amplitude spectrum and the ambient noise amplitude spectrum corresponding to the near-end time-domain signal according to the far-end time-domain signal, the error time-domain signal and the audio acquisition state; and generating an output time-domain signal that matches the near-end time-domain signal according to the residual echo amplitude spectrum, the ambient noise amplitude spectrum and the error time-domain signal. The technical solution of the embodiments can effectively suppress the residual echo and ambient noise in a speech signal in an echo scenario, and improve the clarity of the speech signal.
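The abstract does not say how the two amplitude spectra are combined with the error signal to form the output. One common way to do this kind of suppression is magnitude spectral subtraction, sketched below as an assumption and not as the patent's method. The function names `dft`, `idft` and `suppress_frame`, the gain formula, and the `floor` parameter are all hypothetical; a naive DFT is used only to keep the example free of external libraries.

```python
import cmath


def dft(frame):
    """Naive discrete Fourier transform of a real frame."""
    n = len(frame)
    return [sum(frame[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spectrum):
    """Naive inverse DFT; returns the real part of each sample."""
    n = len(spectrum)
    return [(sum(spectrum[k] * cmath.exp(2j * cmath.pi * k * t / n) for k in range(n)) / n).real
            for t in range(n)]


def suppress_frame(error_frame, echo_mag, noise_mag, floor=0.1):
    """Attenuate each bin of the error frame by a spectral-subtraction gain.

    echo_mag and noise_mag are per-bin magnitude estimates (assumed given).
    The gain is (|E| - echo - noise) / |E|, limited from below by `floor`
    to avoid negative gains and musical-noise artifacts.
    """
    spectrum = dft(error_frame)
    cleaned = []
    for coeff, echo, noise in zip(spectrum, echo_mag, noise_mag):
        mag = abs(coeff)
        gain = floor if mag == 0.0 else max((mag - echo - noise) / mag, floor)
        cleaned.append(gain * coeff)
    return idft(cleaned)


# Example: with zero echo and noise estimates the frame passes through unchanged.
frame = [0.0, 1.0, 0.0, -1.0, 0.5, 0.25, -0.5, 0.0]
passthrough = suppress_frame(frame, [0.0] * 8, [0.0] * 8)
```

In a real system the frames would be windowed and overlap-added, and the magnitude estimates would come from the residual echo and ambient noise estimation steps described in the embodiments.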

Description

Technical field

[0001] Embodiments of the present invention relate to audio processing technologies, and in particular to a short-term speech signal processing method, apparatus, device and storage medium.

Background

[0002] With the continuous development of terminals, more and more terminals have both audio input and audio output functions, for example a smart device with a speaker and a microphone. When the output audio is picked up again by the audio input device, an echo is formed. The presence of an echo signal degrades the quality of the audio signal.

[0003] In the prior art, terminal echo is generally handled with an echo canceller built from an adaptive filter. The adaptive filter outputs an estimated echo signal, which is subtracted from the near-end audio signal picked up by the microphone; the result of the subtraction is called the error signal. Ideally, the error signal is considered to be the effective speech signal...
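The relationship between the far-end signal, the estimated echo and the error signal described in the background can be sketched as follows. The text only says "adaptive filter"; the normalized least-mean-squares (NLMS) update used here is one common choice and is an assumption, as are the function name `nlms_echo_cancel`, the parameters `taps`, `mu` and `eps`, and the two-tap echo path in the demo.

```python
import random


def nlms_echo_cancel(far_end, near_end, taps=4, mu=0.5, eps=1e-8):
    """Return the error signal e[n] = d[n] - y[n].

    far_end:  samples sent to the loudspeaker (x[n])
    near_end: samples picked up by the microphone (d[n])
    y[n] is the adaptive filter's estimate of the echo.
    """
    weights = [0.0] * taps
    history = [0.0] * taps  # most recent far-end samples, newest first
    error = []
    for x, d in zip(far_end, near_end):
        history = [x] + history[:-1]
        y = sum(w * h for w, h in zip(weights, history))  # estimated echo
        e = d - y                                         # error signal
        norm = sum(h * h for h in history) + eps
        # NLMS update: step scaled by the far-end energy in the filter window.
        weights = [w + mu * e * h / norm for w, h in zip(weights, history)]
        error.append(e)
    return error


# Demo: the microphone hears only an echo of the far end (no local speech).
rng = random.Random(0)
far = [rng.uniform(-1.0, 1.0) for _ in range(2000)]
echo_path = [0.5, -0.3]  # hypothetical room response
near = [sum(echo_path[k] * far[n - k] for k in range(len(echo_path)) if n - k >= 0)
        for n in range(len(far))]
err = nlms_echo_cancel(far, near)
```

In this noise-free demo the error decays toward zero once the filter has converged. With a real room, a changing echo path, or a local talker, the filter does not converge perfectly, which is the source of the residual echo that the method aims to suppress.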


Application Information

Patent Type & Authority: Patents (China)
IPC(8): H04M9/08
CPC: H04M9/08
Inventor: 陈超, 邓滨, 宋晨枫
Owner: SHANGHAI XIAODU TECHNOLOGY CO LTD