Method and system for generating multi-channel noisy speech

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A noisy voice, multi-channel technology, applied in voice analysis, voice recognition, instruments, etc., can solve the problems of low efficiency of noisy voice collection, low efficiency of wake-up word training, long collection cycle, etc., to reduce configuration requirements, The effect of improving customization efficiency and shortening the collection cycle

Active Publication Date: 2021-02-26

AISPEECH CO LTD

View PDF6 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0007] In order to at least solve the need for certain sound quality and parameters of the noisy voice training wake-up words in the prior art, a large number of recording personnel can only be unified in a specific recording site for recording, making the efficiency of noisy voice collection low. The long collection period makes the wake-up word training less efficient

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment approach

[0045] As an implementation manner, the performing mixing processing on the multi-channel noise-only audio set recorded by the second recording device and the far-field multi-channel pure speech audio set includes:

[0046] adjusting the amplitude of each noise audio in the multi-channel pure noise audio set according to the signal-to-noise ratio;

[0047] According to the amplitude of each noise audio, the multi-channel pure noise audio set recorded by the second recording device is mixed with the far-field multi-channel pure speech audio set.

[0048] In this embodiment, since the microphone recordings are linearly superimposed, the multi-channel pure noise audio set recorded by the second recording device can be mixed with the far-field multi-channel pure speech audio set, and adjusted according to the signal-to-noise ratio. The amplitude of the data, and then the far-field noisy multi-channel voice data is obtained in batches.

[0049] It can be seen from this embodiment ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

An embodiment of the present invention provides a method for generating multi-channel noisy speech. The method includes: receiving a near-field single-channel pure voice audio set recorded in a quiet environment by the first recording device for collecting wake-up words; The final direct audio is superimposed to determine the far-field single-channel pure voice audio set in the reverberation environment; according to the phase delay function of the second recording device that collects noise, the far-field single-channel pure voice audio set is simulated as the second recording device for recording A collection of far-field multi-channel voice-only audio; it is mixed to generate far-field multi-channel noisy voice audio in batches. The embodiment of the present invention also provides a system for generating multi-channel noisy speech. The embodiment of the present invention adapts and adjusts the audio recorded by ordinary equipment, which reduces the configuration requirements of the recording equipment in the wake-up word training, and the personnel do not need to go to the recording site to record, which improves the collection efficiency of multi-channel noisy voice.

Description

technical field [0001] The invention relates to the field of wake-up word customization, in particular to a method and system for generating multi-channel noisy speech. Background technique [0002] Wake-up word customization needs to obtain a large amount of noisy voice and audio, and the recording device needs to be placed in a noisy environment. At the same time, people need to speak the wake-up word at a certain distance, and record multi-channel noisy voice data through the recording device. [0003] In the improved version of the wake-up word customization, it is first necessary to place an environmental noise source next to the recording device, the recording device records multi-channel audio data of pure noise, and then records the wake-up word spoken by a person in a quiet environment at a certain distance as a multi-channel The pure voice data, and finally, the multi-channel audio data with pure noise and the multi-channel pure voice data are mixed in a certain wa...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G10L15/22G10L21/0216G10L21/0224

CPCG10L15/22G10L21/0216G10L21/0224G10L2015/223G10L2021/02082

Inventor 孙海涛

Owner AISPEECH CO LTD

Features

Generate Ideas
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Method and system for generating multi-channel noisy speech

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment approach

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology