Training data screening method, system and device, and medium
A technology for training data and screening methods, applied in neural learning methods, instruments, biological neural network models, etc., can solve problems such as instability, reduce consumption, reduce costs, and improve model recognition effects
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0049] In order to make the object, technical solution and advantages of the present invention clearer, the implementation of the method of the present invention will be further described in detail below in conjunction with specific embodiments and with reference to the accompanying drawings.
[0050] here will combine figure 1 A flowchart illustrating the main steps of one embodiment of the method of the present invention is shown. The method is mainly based on a semi-supervised learning method, through a speech recognition model trained by a small amount of manual labeling labels, or a speech recognition model that initializes text labels, and directly inputs multiple unlabeled audio data in the data set to the model. Perform speech recognition in the system, and output corresponding multiple decoding results, so that when one or more decoding results with high accuracy are screened out, the unlabeled audio data corresponding to the decoding results are directly marked with ...
Embodiment 2
[0070] In order to make the object, technical solution and advantages of the present invention clearer, the implementation of the system of the present invention will be further described in detail below in conjunction with specific embodiments and with reference to the accompanying drawings.
[0071] here will combine figure 2 A block diagram showing the main structure of an embodiment of the method of the present invention will be described. The system is also mainly based on the semi-supervised learning method, through the speech recognition model trained by a small number of manual labeling labels, or the speech recognition model set by initializing text labels, directly input multiple unlabeled audio data in the data set into the Perform speech recognition in the above model, and output multiple corresponding decoding results, so that when one or more high-accuracy decoding results are selected, the unlabeled audio data corresponding to the decoding results can be direct...
Embodiment 3
[0091] An overall application scenario is described below in conjunction with Embodiments 1 and 2 to further illustrate the implementation process of the present invention:
[0092] Speech recognition using the Kaldi toolkit. A model for speech recognition in Kaldi based on semi-supervised learning training configurations. The training can be carried out directly using unlabeled audio. The training combines independent labeling, screening training data (audio), and retraining to obtain labeled training data in a simple, high-efficiency, low-cost, and low-resource consumption method, which is scalable Adapt to the model in various scenarios and train the model to improve the performance of the model and enhance its recognition effect and quality.
[0093] The speech recognition model of the Kaldi toolkit can use sound models such as HMM combined with language models to recognize the input audio. For example, the process is: the input audio is divided into frames and then its s...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


