Unlock instant, AI-driven research and patent intelligence for your innovation.

Audio data enhancement method and device, electronic equipment and storage medium

An audio data and audio technology, applied in the field of audio data enhancement methods, devices, electronic equipment and storage media, can solve the problems of not being able to know the baby crying in time, and achieve the effect of improving user experience

Active Publication Date: 2022-07-15
SHENZHEN MICROBT ELECTRONICS TECH CO LTD
View PDF15 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For example, for the event detection of a baby crying, the edge smart voice device and the baby are in one room, and the user is in another room for some reason, and needs to use the edge smart voice device to detect whether the baby is crying and enter the room where the baby is in time Take care of the baby. In this case, if the baby's crying sound is detected after a long time after the baby's crying, the user may not be able to learn the baby's crying in time and take corresponding measures in time

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio data enhancement method and device, electronic equipment and storage medium
  • Audio data enhancement method and device, electronic equipment and storage medium
  • Audio data enhancement method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0060] In order to make the objectives, technical solutions and advantages of the present disclosure more clear, the present disclosure will be described in further detail below with reference to the accompanying drawings and embodiments.

[0061] It should be noted that the terms "first", "second" and the like in the description and claims of the present disclosure and the above drawings are used to distinguish similar objects, and are not necessarily used to describe a specific sequence or sequence. It is to be understood that the data so used may be interchanged under appropriate circumstances such that the embodiments of the disclosure described herein can be practiced in sequences other than those illustrated or described herein. The implementations described in the illustrative examples below are not intended to represent all implementations consistent with this disclosure. Rather, they are merely examples of apparatus and methods consistent with some aspects of the pres...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to an audio data enhancement method and device, electronic equipment and a storage medium, and the method comprises the steps: determining an audio recognition task which is a keyword detection task and / or a sound event detection task; receiving audio data associated with the audio recognition task; splitting and recombining the audio data according to the audio recognition task to obtain enhanced sample data for the audio recognition task; and obtaining an audio training sample for the audio recognition task according to the enhanced sample data and the audio recognition task. According to the invention, the audio data is split and recombined, and the obtained audio training sample has more prominent keyword features for a keyword detection task or more prominent sound features for a sound event detection task. The voice recognition accuracy of the keyword detection task can be improved, the detection response duration of the sound event detection task can be shortened, and the user experience of the keyword detection task and / or the sound event detection task can be improved.

Description

technical field [0001] The present disclosure relates to the field of computers, and in particular, to an audio data enhancement method, apparatus, electronic device, and storage medium. Background technique [0002] Currently, Key Word Spotting (KWS, Key Word Spotting) and Sound Event Detection (SED, Sound Event Detection) are two common speech tasks for edge smart speech devices. [0003] The keyword detection task needs to reduce the false wake-up rate while ensuring the detection rate. The sound event detection task requires as short a detection delay as possible, that is, the closer the detection time point is to the sound event occurrence time point, the better. [0004] Existing keyword detection tasks and / or sound event detection tasks are usually implemented using deep learning methods. Data augmentation is an important speech data processing method used in deep learning methods for keyword detection tasks and / or sound event detection tasks. Currently, data enhanc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L21/02G10L25/84G10L15/06
CPCG10L21/02G10L25/84G10L15/063
Inventor 郑鑫江凌明杨作兴艾国
Owner SHENZHEN MICROBT ELECTRONICS TECH CO LTD