Unlock instant, AI-driven research and patent intelligence for your innovation.

Classification result correction method, system, equipment and medium

A technology of classification results and correction methods, which is applied in text database clustering/classification, unstructured text data retrieval, instruments, etc. It can solve the problems of low model output accuracy, biased prediction results, and inability to correct input sample bias, etc. problem, to achieve the effect of eliminating bias

Active Publication Date: 2022-03-15
SUZHOU LANGCHAO INTELLIGENT TECH CO LTD
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to the difference in the frequency of labels in the pre-training corpus, the model will have a preference for the prediction results, that is, the model output accuracy is low
Therefore, the existing correction methods can only correct the bias of the model to the label, and cannot correct the bias caused by the input sample.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Classification result correction method, system, equipment and medium
  • Classification result correction method, system, equipment and medium
  • Classification result correction method, system, equipment and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] In order to make the object, technical solution and advantages of the present invention clearer, the embodiments of the present invention will be further described in detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0048] It should be noted that all expressions using "first" and "second" in the embodiments of the present invention are to distinguish two entities with the same name but different parameters or parameters that are not the same, see "first" and "second" It is only for the convenience of expression, and should not be construed as a limitation on the embodiments of the present invention, which will not be described one by one in the subsequent embodiments.

[0049] According to one aspect of the present invention, an embodiment of the present invention proposes a classification result correction method, such as figure 1 As shown, it may include the steps of:

[0050] S1, constructing a data set and mar...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a classification result correction method. The method comprises the following steps: constructing a data set and labeling a classification label of a corresponding category for each piece of data in the data set; inputting each piece of data in the data set into the trained model to obtain the probability of the corresponding classification label, and calculating a correction matrix by using the classification label probability corresponding to each piece of data; expanding the classification labels of each category into a plurality of sub-labels; adjusting the output of the trained model to the probability of a plurality of sub-tags corresponding to each category; inputting to-be-classified data into the trained model to obtain the probability of a plurality of sub-tags corresponding to each category; and determining the final category of the to-be-classified data by using the probability of the plurality of sub-tags corresponding to each category and the correction matrix. The invention further discloses a system, computer equipment and a readable storage medium. According to the scheme provided by the invention, the labels are expanded, so that bias caused by different occurrence frequencies of the labels is eliminated.

Description

technical field [0001] The invention relates to the field of classification, in particular to a classification result correction method, system, equipment and storage medium. Background technique [0002] The core capabilities of the massive model are zero-sample learning and small-sample learning. That is, when faced with different application tasks, there is no need to retrain the model. However, the huge amount of models will bring bias from the corpus during pre-training, resulting in low accuracy or unstable performance of downstream tasks. The current solution is to compensate the biased label words through text-free input, calibrate them to an unbiased state, and reduce the difference between different prompt choices. However, due to the difference in the frequency of tags appearing in the pre-training corpus, the model will have a preference for the prediction results, that is, the model output accuracy is low. Therefore, the existing correction methods can only c...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/35G06F40/247G06K9/62
CPCG06F16/35G06F40/247G06F18/214
Inventor 刘红丽李峰于彤周镇镇
Owner SUZHOU LANGCHAO INTELLIGENT TECH CO LTD