Unlock instant, AI-driven research and patent intelligence for your innovation.

Multi-label text classification method and system based on dynamic weight contrastive learning

A text classification and dynamic weight technology, applied in the field of information detection, can solve the problems of poor low-frequency label prediction performance, long-tail problem, neglect of label relevance, etc., to prevent label semantic confusion, solve long-tail problems, and improve model performance. Effect

Active Publication Date: 2022-08-02
PEKING UNIV
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Existing multi-label text classification methods generally directly predict the probability of text corresponding to each label when dealing with multi-label classification tasks. This method ignores the correlation between labels, and has poor prediction performance for low-frequency labels, that is, there is a long tail problem.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-label text classification method and system based on dynamic weight contrastive learning
  • Multi-label text classification method and system based on dynamic weight contrastive learning
  • Multi-label text classification method and system based on dynamic weight contrastive learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046]The present invention will now be discussed with reference to exemplary embodiments. It should be understood that the discussed embodiments are only provided to enable those of ordinary skill in the art to better understand and thus implement the content of the present invention, and are not intended to imply any limitation on the scope of the present invention.

[0047] As used herein, the term "including" and variations thereof are to be read as open-ended terms meaning "including, but not limited to." The term "based on" is to be read as "based at least in part on". The terms "one embodiment" and "one embodiment" are to be read as "at least one embodiment."

[0048] figure 1 Schematically represents a flow chart of a multi-label text classification method based on dynamic weight contrastive learning according to an embodiment of the present invention. like figure 1 As shown, in this embodiment, the multi-label text classification method based on dynamic weight com...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the technical field of information detection, and proposes a multi-label text classification method and system based on dynamic weight comparison learning, wherein the method includes: preprocessing to obtain training set word vectors and label sequences; The classification model of the long-term memory network encodes the multi-label text of the training set to obtain a vectorized representation containing semantic information; decodes the label sequence and the vectorized representation containing semantic information to obtain the predicted label sequence; through the joint cross-entropy probability distribution Loss and Contrastive Learning Loss Calculate the loss between the predicted label sequence and the label sequence, and optimize the classification model as a multi-label text classification model according to the loss; input the multi-label text of the test set to be classified into the multi-label text classification model, and the output corresponds to the final tag sequence. According to this method, the multi-label semantic confusion phenomenon and the long-tail problem in multi-label classification datasets are effectively solved.

Description

technical field [0001] The invention relates to the technical field of information detection, in particular to a method, system, electronic device and computer-readable storage medium for multi-label text classification based on dynamic weight comparison learning. Background technique [0002] Multi-label classification tasks have a wide range of application scenarios in the NLP field, including but not limited to text classification, label recommendation, and information retrieval. Multi-label classification scenarios are different from multi-classification scenarios: in multi-classification scenarios, a piece of data has only one label, but this label may have multiple categories. For example, in sentiment classification tasks, judging the sentimental tendency of a text is usually classified as One of "positive", "negative" or "neutral"; while in a multi-label classification scenario, a piece of data may have multiple labels, for example, a piece of news may be classified ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/30G06K9/62G06N3/04G06N3/08
CPCG06F40/30G06N3/08G06N3/044G06N3/045G06F18/2431
Inventor 叶蔚张世琨谢睿俞鼎耀
Owner PEKING UNIV