Voice increasing method, system and device and storage medium

A speech enhancement, speech technology, applied in speech analysis, instruments, etc., can solve the problems of inaccurate sound source localization, high computational complexity, affecting speech signal processing, etc., to achieve the effect of reducing computational overhead

Pending Publication Date: 2020-08-28
苏州奇梦者科技有限公司
View PDF6 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, in the actual application scenario with strong noise, since it is impossible to determine in advance which one is the target sound source, it may lead to inaccurate sound source localization and affect subsequent speech signal processing; and the relatively complex an...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice increasing method, system and device and storage medium
  • Voice increasing method, system and device and storage medium
  • Voice increasing method, system and device and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0061] Such as figure 1 As shown, in step S10, the multi-channel audio signal is continuously collected by the audio collection device. At this time, the audio signal collected by the audio collection device is original and complex, and may contain various noises and environmental sounds. Therefore, it is impossible to determine which segment belongs to the target speech segment.

[0062] Therefore, if sound source location is performed and then speech enhancement is performed at this stage, not only noise may lead to inaccurate positioning, but also the speech location and speech enhancement algorithms need to be continuously run for a long time, and the computational overhead will be particularly large, even in some cases where computing resources are very limited. does not work at all on local devices.

[0063] Therefore, the present invention introduces a system that can effectively reduce the computational complexity while targeting speech signals in a strong noise envir...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a voice signal processing method, in particular to a voice enhancement method, which comprises the following steps of S10, audio acquisition, S20, screening effective voice signals, S30, preliminarily enhancing the voice, S40, screening target voice, S50, enhancing the voice signal again, S60, carrying out wake-up word detection: sending the re-enhanced voice into a high-precision wake-up word detection model, carrying out wake-up word detection, and entering S70 when a wake-up word is detected, and otherwise, returning to S20, and S70, continuously enhancing the voicedetected by the wake-up word, then sending the enhanced voice to a recognition end and carrying out recognition. According to the speech enhancement method provided by the invention, the calculationoverhead can be effectively reduced, and the recognition task can be accurately carried out even in a strong noise scene. The method is suitable for being applied to a local end with complex environment and limited computing resources.

Description

technical field [0001] The invention relates to a voice signal processing method, in particular to a voice enhancement method, system, device and storage medium. Background technique [0002] Speech enhancement refers to the technical means of extracting effective target speech signals from the received complex speech signals and reducing or suppressing interference from non-target speech signals. At present, the speech enhancement algorithm usually needs to know the orientation of the target sound source or the prior distribution of the noise in advance, and then perform speech enhancement through a certain algorithm. [0003] However, in the actual application scenario with strong noise, since it is impossible to determine in advance which one is the target sound source, it may lead to inaccurate sound source localization and affect subsequent speech signal processing; and the relatively complex and accurate sound source localization algorithm and Speech enhancement algor...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L21/02G10L25/30G10L25/51G10L17/24
CPCG10L21/02G10L25/30G10L25/51G10L17/24Y02D30/70
Inventor 鄢戈王飞唐浩元王佳珺王欢良
Owner 苏州奇梦者科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products