Generation method and device for training samples
A technology for training samples and samples, applied in the field of information processing, can solve the problems of unguaranteed accuracy, long labeling time period, and difficulty in obtaining training samples for manual labeling, so as to reduce workload and improve labeling efficiency.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
no. 1 example
[0029] figure 1 It is a flow chart of a method for generating training samples provided in the first embodiment of the present invention. The method of this embodiment can be executed by a device for generating training samples, which can be implemented by means of hardware and / or software, and generally Can be integrated in the server. The method of this embodiment specifically includes:
[0030] 110. Use marked samples to train the benchmark scoring model to generate an adjusted training model, wherein the marked samples are pre-marked with sample scores.
[0031] In this embodiment, a semi-supervised sample labeling method is proposed, that is, firstly, the benchmark score model is trained using labeled samples manually marked with sample scores to generate an adjusted training model, and then based on the adjusted training model , to complete the process of generating sample scores for unlabeled samples.
[0032] As described in the background technology, the method of ...
no. 2 example
[0052] Figure 2a is a flowchart of a method for generating training samples according to the second embodiment of the present invention. This embodiment is optimized based on the above embodiments. In this embodiment, according to the correlation between the sample score of the labeled sample and the model score of the labeled sample, and the relationship between the sample to be labeled The corresponding model score, determining the labeled sample score corresponding to the sample to be labeled is specifically optimized as follows: according to the sample score and model score corresponding to each labeled sample, obtain the labeled sample corresponding to the same target sample score target model score; according to the proportion of each target model score in all target model scores, determine the model score frequency distribution curve corresponding to the target sample score; according to the model score frequency distribution curve, obtain the The high-frequency model...
no. 3 example
[0080] Figure 3a is a flowchart of a method for generating training samples according to the third embodiment of the present invention. This embodiment is optimized on the basis of the above embodiments. In this embodiment, it also preferably includes: merging the new marked samples generated after marking with the existing marked samples, and adjusting the training The model is used as a new benchmark scoring model; return to the operation of using the labeled samples to train the benchmark scoring model, generate and adjust the training model, until the end labeling condition is met.
[0081] Correspondingly, the method in this embodiment specifically includes:
[0082] 310. Use marked samples to train the benchmark scoring model to generate an adjusted training model, wherein the marked samples are pre-marked with sample scores.
[0083] 320. Input the sample to be labeled into the adjusted training model, and generate a model score corresponding to the sample to be labe...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com