Audio data labeling method, device, electronic equipment and storage medium

An audio data labeling technology, applied in the field of speech, that addresses the low accuracy, high cost and low efficiency of existing labeling methods, and achieves improved accuracy, accurate recognition and reduced cost.

Active Publication Date: 2020-11-03
BEIJING ORION STAR TECH CO LTD
4 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0006] Embodiments of the present invention provide an audio data labeling method, device, electronic equipment, and storage medium that address the low accuracy, high cost, and low efficiency of prior-art audio data labeling methods.

Embodiment Construction

[0026] To make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below in conjunction with the drawings in the embodiments of the present invention.

[0027] Any number of elements in the drawings is for illustration rather than limitation, and any designation is for distinction only and has no limiting meaning.

[0028] In practice, there are two main existing methods for labeling audio data. One is to record the audio data of a designated person through a recording device, and the other is to manually label collected audio data. The first method requires hiring specialized personnel to record audio data, which is costly and inefficient, and it cannot obtain enough audio data from different users to meet the needs of model training. The second method also needs to hire a large number...

Abstract

The invention relates to the field of voice technology and discloses an audio data labeling method, device, electronic equipment and storage medium. The method includes: acquiring audio data; extracting voiceprint features of the audio data; clustering the audio data according to the voiceprint features to divide the audio data into at least one cluster; and labeling the audio data in the same cluster with the same user ID. The technical solution provided by the embodiments of the present invention requires no manual intervention in the labeling process and thus labels audio data automatically. This avoids the high error rate of manual labeling, improves labeling accuracy, and does not require a large investment of manpower. Labeling can also run 24 hours a day, which improves efficiency and reduces cost.
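The steps named in the abstract (acquire audio, extract voiceprint features, cluster, assign one user ID per cluster) can be illustrated with a minimal Python sketch. Several things here are assumptions and not taken from the patent: the voiceprint features are assumed to be already extracted as fixed-length vectors by some model, the clustering is a simple greedy cosine-similarity scheme chosen only for illustration (the visible text does not say which clustering algorithm is used), and the names `label_by_voiceprint`, `threshold` and the `user_0000` ID format are hypothetical.

```python
import math
from typing import Dict, List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def label_by_voiceprint(
    features: Dict[str, Sequence[float]],
    threshold: float = 0.8,  # assumed similarity cut-off, not from the patent
) -> Dict[str, str]:
    """Cluster clips by voiceprint vector and give each cluster one user ID.

    `features` maps a clip identifier to its (pre-extracted) voiceprint vector.
    Greedy scheme: a clip joins the most similar existing cluster if the
    similarity to that cluster's centroid reaches `threshold`; otherwise it
    starts a new cluster.
    """
    centroids: List[List[float]] = []
    counts: List[int] = []
    labels: Dict[str, str] = {}

    for clip_id, vec in features.items():
        best_idx, best_sim = -1, threshold
        for idx, centroid in enumerate(centroids):
            sim = cosine_similarity(vec, centroid)
            if sim >= best_sim:
                best_idx, best_sim = idx, sim

        if best_idx < 0:
            # No cluster is similar enough: open a new one.
            centroids.append(list(vec))
            counts.append(1)
            best_idx = len(centroids) - 1
        else:
            # Update the running mean of the matched cluster.
            n = counts[best_idx]
            centroids[best_idx] = [
                (c * n + v) / (n + 1) for c, v in zip(centroids[best_idx], vec)
            ]
            counts[best_idx] = n + 1

        # Every clip in the same cluster receives the same (hypothetical) user ID.
        labels[clip_id] = f"user_{best_idx:04d}"

    return labels


if __name__ == "__main__":
    toy = {"a.wav": [1.0, 0.0], "b.wav": [0.9, 0.1], "c.wav": [0.0, 1.0]}
    print(label_by_voiceprint(toy))
```

In a real system the toy vectors would be replaced by embeddings from a speaker-recognition model, and the greedy pass could be swapped for any clustering method that does not need the number of speakers in advance.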

Description

Technical field

[0001] The invention relates to the technical field of speech, in particular to an audio data labeling method, device, electronic equipment and storage medium.

Background technique

[0002] The training of the voiceprint deep learning model requires a large amount of audio data marked with user identification. At present, there are mainly two methods for labeling audio data, one is to record the audio data of a designated person through a recording device, and the other is to manually label the collected audio data.

[0003] The first method requires hiring specialized personnel to record audio data, which is costly and inefficient, and cannot obtain enough audio data from different users to meet the needs of model training.

[0004] The second method also needs to hire a large number of people to label the audio data, which is costly and inefficient, and because each person has a different ability to distinguish sounds, manual labeling cannot guarantee the ...

Application Information

Patent Type & Authority: Patents (China)
IPC (8): G10L17/04, G10L17/02, G10L17/08, G06K9/62
CPC: G10L17/02, G10L17/04, G10L17/08, G06F18/23
Inventor: 乔登科
Owner: BEIJING ORION STAR TECH CO LTD