Multi-channel anti-crosstalk dynamic planning strategy based on feedforward memory network

A dynamic programming and multi-channel technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as difficulty in being picked up and recognized by speakers, word loss noise filtering effect, and a large number of interfering sound sources, so as to improve the recording effect, Adjusting parameters is easy to use and reduces the effect of crosstalk

Pending Publication Date: 2022-05-31
杭州云嘉云计算有限公司
View PDF1 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] The present invention is to overcome the following phenomena in the prior art that are prone to occur in the recording environment of the applied speech recognition system: 1) it is difficult for a speaker with a small voice to be picked up and recognized; 2) a speaker with a loud voice is easy to be strung into other acquisition devices Cause interference; 3) Multiple speakers speaking at the same time are prone to crosstalk and drop words. 4) The on-site environment has poor noise filtering effect, resulting in a large number of interference sound sources, which leads to problems such as confusion in the recognition results. A feed-forward memory network based on The multi-channel anti-crosstalk dynamic planning strategy greatly solves the problem of low sound source loss and crosstalk of each channel, effectively improving the effect of speech recognition

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-channel anti-crosstalk dynamic planning strategy based on feedforward memory network
  • Multi-channel anti-crosstalk dynamic planning strategy based on feedforward memory network
  • Multi-channel anti-crosstalk dynamic planning strategy based on feedforward memory network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] The present invention will be further specifically described below in conjunction with the accompanying drawings and specific embodiments.

[0045] Such as figure 1 Shown, the present invention comprises the following steps:

[0046] Step S1: By collecting the voice signals of each channel collected by the sound collection microphone and related sound pickup components, the digital signal and acoustic information of the sound source are respectively extracted; in practical applications, since the collection devices have been differentiated, each The speaker's real-time voice digital signal.

[0047] This step requires the following specific instructions:

[0048] First, the voice signal recorded by a single sound pickup device cannot actually provide sufficient information to tell whether there is crosstalk at the moment. , only different sound sources can be separated from the mixed voice recorded by the sound pickup device, but it is impossible to determine which s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multi-channel anti-crosstalk dynamic planning strategy based on a feedforward memory network. The multi-channel anti-crosstalk dynamic planning strategy specifically comprises the following steps of S1, collecting sound source information; s2, self-adaption of sound source gain is carried out; s3, a crosstalk filtering process; s4, outputting the processed sound, and obtaining a multi-channel recognition result; the self-adaptive sound source gain process provided by the invention is not limited by a specific field environment structure, hardware sound collection equipment and behaviors of a spokesman, and sound source gain which contributes to improving the recognition effect can be dynamically carried out on the acoustic signal of the spokesman in real time, so that the tuning cost of a traditional method is avoided, and the real recording process is guaranteed; through effective acoustic feature extraction and a crosstalk identification strategy model, crosstalk channels are identified and filtered in real time, and occurrence of crosstalk phenomena is significantly reduced. For an extreme field environment, adjustment parameters provided by the invention are simple and easy to use, targeted adjustment can be quickly and timely made, and the real recording effect is improved.

Description

technical field [0001] The invention relates to the field of sound processing, in particular to a multi-channel anti-crosstalk dynamic programming strategy based on a feedforward memory network. Background technique [0002] In the multi-person recording system based on speech recognition, the indoor environment structure, microphone hardware, speaker position and voice all significantly affect the on-site recognition effect. The recognition rate drops and the recognition results are wrong, which affects the user experience. Also do not have a kind of method, device or equipment that effectively solves the above problems simultaneously on the market. The self-adaptive sound source gain and anti-crosstalk method provides simple and general configuration parameters, realizes adaptation to different recording sites, comprehensively considers factors such as environment, hardware, software, and speaker mode, analyzes the acoustic information of each speaker channel in real time...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/02G10L15/16G10L15/20G10L15/22G10L15/26G10L15/28G10L21/0208H04R1/08H04R3/04
CPCH04R1/08H04R3/04G10L15/22G10L15/20G10L15/16G10L15/02G10L15/28G10L15/26G10L21/0208G10L2021/02087
Inventor 麦联韬唐海江朱宇袁宇豪
Owner 杭州云嘉云计算有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products