Corpus acquisition method and device
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- 广州欢城文化传媒有限公司
- Publication Date
- 2021-05-28
Smart Images

Figure 1 
Figure 2 
Figure 3
Abstract
Description
technical field
[0001] The present application relates to the technical field of speech recognition, in particular to a method and device for acquiring corpus. Background technique
[0002] With the rapid development of artificial intelligence, there are more and more data training tasks based on deep learning. In order to achieve better model quality, it is particularly important to obtain high-quality data sets in the early stage. In order to achieve the effect of human communication with the accuracy of human-computer interaction, it is necessary to collect vertical field corpus as a data set for supervised learning of the recognition engine to obtain a high-quality recognition model. In actual project development, voice data collection accounts for one-third of the entire project development cycle. In order to speed up the project development progress, it is necessary to improve the efficiency of data labeling.
[0003] Looking at the voice research and development depa...