Data labeling method and device
A technology for data and labeling data, which is applied in electrical digital data processing, special data processing applications, digital data information retrieval, etc., and can solve the problems of low labeling accuracy and long time consumption.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0056] figure 1 A schematic flowchart of a data labeling method provided by the first embodiment of the present invention is shown.
[0057] The data labeling method includes the following steps:
[0058] In step S110, each data in the dataset to be labeled is input into K labeling models, and K labels are obtained for each data.
[0059] Specifically, the K labeled models are obtained by training K sub-training sets, and the K sub-training sets are obtained by performing K random sampling with replacement on samples in the total training set, and K is an integer greater than 1.
[0060] Since K labeling models are trained in this embodiment, if the sampling is not replaced, the similarity between the labeling models will be relatively small, and each labeling model will run independently, and the verification between the K labeling models The results of the comparison are relatively poor, and the generalization ability of each annotation model is poor, so the final annotati...
Embodiment 2
[0211] Figure 5 A schematic structural diagram of a data tagging device provided by the second embodiment of the present invention is shown. The data labeling device 500 corresponds to the data labeling method in Embodiment 1, and the data labeling method in Embodiment 1 is also applicable to the data labeling device 500 , and details are not repeated here.
[0212] The data labeling device 500 includes an input module 510 , a sample determination module 520 and a labeling module 530 .
[0213] The input module 510 is used to input each data in the data set to be labeled into K labeling models respectively, and K labels are obtained for each data, wherein the K labeling models are respectively obtained by training K sub-training sets, so The K sub-training sets are obtained by performing K random sampling with replacement on samples in the total training set, and K is an integer greater than 1.
[0214] The sample determination module 520 is configured to classify the label...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com