Extreme multi-label classification data enhancement method based on label and text block attention mechanism
A technology for classifying data and text blocks, which is applied in digital data processing, natural language data processing, semantic analysis, etc. It can solve the problems of poor classification performance of "long-tail" tags, and the inability to obtain better classification results from tags. boosted effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0028] The data enhancement method for extreme multi-label classification based on the label and text block attention mechanism provided by the present invention will be described in detail below with reference to the drawings and specific embodiments.
[0029] The present invention mainly adopts theories and methods related to natural language processing. In order to ensure the normal operation of the method, in the specific implementation, it is required that the computer platform used is equipped with a memory of not less than 16G, the number of CPU cores is not less than 4 and the main frequency Not lower than 2.6GHz, Linux operating system, and install Python 3.6 and above, pytorch framework and other necessary software environments.
[0030] In steps 1, 2): the original data set can be expressed as X N :
[0031]
[0032] Where N represents the number of data in the dataset, x i represents a piece of text, y i ∈ {0, 1} L , the label set corresponding to this piece...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


