Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Efficient voice keyword detector training sample use method

A technology of training samples and training methods, which is applied in the field of data processing, can solve problems such as underfitting and discarding, and achieve the effects of accelerated training, low cost, and improved detection performance

Pending Publication Date: 2021-12-21
SOUTH CHINA UNIV OF TECH
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, if samples are discarded at will, samples that are "important" to training may be discarded, resulting in underfitting

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Efficient voice keyword detector training sample use method
  • Efficient voice keyword detector training sample use method
  • Efficient voice keyword detector training sample use method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0050] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with specific embodiments. It is only stated here that the words for directions such as up, down, left, right, front, back, inside, and outside that appear or will appear in the text of the present invention are only based on the accompanying drawings of the present invention, and are not specific to the present invention. limited.

[0051] In this embodiment, the samples in the AISHELL-2 Chinese corpus are used as the experimental data set, wherein the ratio of positive and negative samples in the training set is 10107:101070=1:10, that is, there are a total of 111177 samples in the training set, and the ratio of positive and negative samples in the test set 2018:4036=1:2; in the experimental data set, keywords only appear in a part of a speech sample, and at the same time, there may be multiple ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an efficient voice keyword detector training sample use method. The method comprises the following steps: training a detector for n rounds by using all samples; in the preparation stage of the kth round of training, acquiring a target score S (T) and a competitor score S (C) of a sample based on a detector Dk-1 obtained in the (k-1) th round of training, and constructing the probability that the sample participates in the kth round of training; selecting a sample set Zk used in the kth round according to the probability that the samples participate in training; and using the Zk for training in the kth round, acquiring a new detector Dk, repeating the processes of obtaining the probability that the samples participate in training, selecting the samples and training, and ending the training till the model converges or the training round is larger than a preset value. According to the method, the samples capable of providing more distinguishing information for model training are selected for training, so that the training pays more attention to the important samples, and the training efficiency is improved while the system performance is improved; the problem of class sample imbalance in keyword detection training can be relieved; and the method is simple and efficient, and has a wide application prospect.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to a method for using training samples of an efficient speech keyword detector. Background technique [0002] Voice is an important medium for information exchange. Speech keyword spotting (Keyword Spotting, KWS) refers to the detection of predefined words from a continuous speech stream. Compared with Automatic Speech Recognition (ASR), KWS only pays attention to the defined keywords, and does not pay attention to words other than keywords, which greatly reduces the difficulty of system development. Due to its fast and flexible features, KWS has a wide range of applications in the fields of audio monitoring, voice retrieval, and device control. [0003] For KWS training, the collection cost of positive samples is high, while the acquisition of negative samples is relatively easy. At the same time, the training of KWS does require a large number of rich and diverse negati...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L25/51G10L15/26G10L25/24G06K9/62
CPCG10L25/51G10L15/26G10L25/24G06F18/214G06F18/2415
Inventor 贺前华兰小添田颖慧郑若伟
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products