Method and system for generating multi-channel noisy speech

Noisy-speech and multi-channel technology, applied in speech analysis, speech recognition, instruments, etc. It addresses the low efficiency of noisy speech collection, the low efficiency of wake-up word training, and the long collection cycle, with the effect of reducing configuration requirements, improving customization efficiency, and shortening the collection cycle.

Active Publication Date: 2021-02-26
AISPEECH CO LTD

AI Technical Summary

Problems solved by technology

[0007] The invention aims to solve at least the following problem in the prior art: because the noisy speech used to train wake-up words must meet certain sound quality and parameter requirements, a large number of speakers have to gather at a specific recording site to record. This makes noisy speech collection inefficient, and the long collection cycle in turn makes wake-up word training inefficient.


Examples

Embodiment approach

[0045] In one implementation, mixing the multi-channel noise-only audio set recorded by the second recording device with the far-field multi-channel speech-only audio set includes:

[0046] adjusting the amplitude of each noise audio in the multi-channel noise-only audio set according to the signal-to-noise ratio;

[0047] mixing, according to the adjusted amplitude of each noise audio, the multi-channel noise-only audio set recorded by the second recording device with the far-field multi-channel speech-only audio set.

[0048] In this embodiment, because microphone recordings superimpose linearly, the multi-channel noise-only audio set recorded by the second recording device can be mixed with the far-field multi-channel speech-only audio set, with the amplitude of the data adjusted according to the signal-to-noise ratio. Far-field noisy multi-channel speech data is thus obtained in batches.
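The amplitude adjustment and linear superposition described in [0046] to [0048] can be sketched roughly as follows. This is an illustrative sketch, not the patent's implementation: the function names `rms` and `mix_at_snr`, the use of RMS level as the amplitude measure, and the choice of a single gain shared by all noise channels (so that the level differences between microphones are preserved) are all assumptions. Audio is represented as plain lists of float samples, one list per channel.

```python
import math


def rms(channels):
    """Root-mean-square level over all samples of all channels."""
    total = sum(s * s for ch in channels for s in ch)
    count = sum(len(ch) for ch in channels)
    return math.sqrt(total / count)


def mix_at_snr(speech, noise, snr_db):
    """Scale multi-channel noise to a target SNR, then add it to the speech.

    speech, noise: lists of channels, each channel a list of float samples.
    snr_db: desired speech-to-noise ratio in decibels (hypothetical parameter).
    """
    if len(speech) != len(noise):
        raise ValueError("speech and noise need the same channel count")
    speech_rms = rms(speech)
    noise_rms = rms(noise)
    if noise_rms == 0.0:
        raise ValueError("noise is silent; it cannot be scaled to an SNR")
    # SNR_dB = 20 * log10(speech_rms / (gain * noise_rms)), solved for gain.
    gain = speech_rms / (noise_rms * 10 ** (snr_db / 20.0))
    # Linear superposition, sample by sample; zip stops at the shorter signal.
    return [
        [s + gain * n for s, n in zip(sp_ch, no_ch)]
        for sp_ch, no_ch in zip(speech, noise)
    ]


if __name__ == "__main__":
    speech = [[1.0, -1.0, 1.0, -1.0], [1.0, -1.0, 1.0, -1.0]]
    noise = [[0.5, 0.5, -0.5, -0.5], [0.5, -0.5, 0.5, -0.5]]
    print(mix_at_snr(speech, noise, snr_db=20.0))
```

Calling `mix_at_snr` repeatedly over every speech and noise pairing, and over a range of `snr_db` values, is one way the batch generation mentioned in [0048] might be organized.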

[0049] It can be seen from this embodiment ...

Abstract

An embodiment of the present invention provides a method for generating multi-channel noisy speech. The method includes: receiving a near-field single-channel speech-only audio set recorded in a quiet environment by a first recording device used to collect wake-up words; superimposing the resulting direct audio to determine a far-field single-channel speech-only audio set in a reverberant environment; simulating, according to the phase delay function of a second recording device used to collect noise, the far-field single-channel speech-only audio set as a far-field multi-channel speech-only audio set recorded by the second recording device; and mixing that set with noise to generate far-field multi-channel noisy speech audio in batches. An embodiment of the present invention also provides a system for generating multi-channel noisy speech. The embodiments adapt audio recorded by ordinary equipment, which lowers the configuration requirements for recording equipment in wake-up word training. Speakers do not need to travel to a recording site, which improves the collection efficiency of multi-channel noisy speech.
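The step that turns single-channel audio into multi-channel audio can be illustrated with a simple stand-in. The abstract names a phase delay function for the second recording device but does not give its form, so the sketch below assumes the simplest possible case: each microphone receives the same signal shifted by a whole number of samples. The function name `simulate_multichannel` and the `delays_in_samples` parameter are hypothetical, and a real array model would likely need fractional delays and per-channel gains.

```python
def simulate_multichannel(mono, delays_in_samples):
    """Return one delayed copy of `mono` per microphone.

    mono: list of float samples from a single-channel recording.
    delays_in_samples: non-negative integer delay for each simulated
        microphone (assumed values, not taken from the patent).
    Every output channel has the same length as the input.
    """
    channels = []
    for delay in delays_in_samples:
        if delay < 0:
            raise ValueError("delays must be non-negative")
        # Pad the front with silence and drop the samples pushed past the end.
        padding = [0.0] * min(delay, len(mono))
        kept = list(mono[: max(len(mono) - delay, 0)])
        channels.append(padding + kept)
    return channels


if __name__ == "__main__":
    mono = [1.0, 2.0, 3.0, 4.0]
    for channel in simulate_multichannel(mono, [0, 1, 2]):
        print(channel)
```

The output of such a function has the channel layout expected by the mixing step, so the simulated speech-only channels can be combined directly with noise recorded on the real multi-microphone device.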

Description

Technical field

[0001] The invention relates to the field of wake-up word customization, and in particular to a method and system for generating multi-channel noisy speech.

Background

[0002] Wake-up word customization requires a large amount of noisy speech audio. The recording device has to be placed in a noisy environment while people speak the wake-up word at a certain distance from it, and the device records multi-channel noisy speech data.

[0003] In the improved version of wake-up word customization, an environmental noise source is first placed next to the recording device, and the device records multi-channel noise-only audio data. The device then records the wake-up word spoken by a person at a certain distance in a quiet environment as multi-channel speech-only data. Finally, the multi-channel noise-only audio data and the multi-channel speech-only data are mixed in a certain wa...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L15/22, G10L21/0216, G10L21/0224
CPC: G10L15/22, G10L21/0216, G10L21/0224, G10L2015/223, G10L2021/02082
Inventor: 孙海涛
Owner: AISPEECH CO LTD