Multi-label text classification method and system based on attention mechanism
A text classification and attention technology, applied in neural learning methods, computer components, instruments, etc., can solve the problems of low classification accuracy, ignore the correlation between labels and text, and achieve the effect of improving accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0062] Considering that most of the current multi-label text classification methods ignore the correlation between labels and texts, and when the scale of labels is large and the category distribution is unbalanced, the classification accuracy is low. In this embodiment, a multi-label based attention mechanism is proposed. Text classification method, the flowchart of the method can be found in figure 1 , the method includes the following steps:
[0063] S1. Obtain a text training set containing labels;
[0064] In this embodiment, before obtaining the text training set containing the label, it also includes: obtaining the text data set to be classified, and performing a preprocessing operation on the text to be classified in the data set; The dataset is obtained centrally.
[0065] Among them, the preprocessing operations on the text to be classified in the dataset include:
[0066] Use regular expressions to perform text filtering on the text to be classified, and then per...
Embodiment 2
[0103] Such as image 3 As shown, the present invention also proposes a multi-label text classification system based on an attention mechanism, which is used to implement the multi-label classification method proposed in Embodiment 1, and the system includes:
[0104] Training set obtaining module 11, is used for obtaining the text training set that comprises label;
[0105] The word vector conversion module 12 is used to carry out word vectorization to the text in the text training set, and convert the text in the text training set into a multidimensional text feature vector;
[0106] The tag structure matrix acquisition module 13 constructs a tag coexistence graph according to the coexistence of tags in the text training set, introduces a graph embedding algorithm to optimize the similarity between tags in the tag coexistence graph, and obtains a tag structure matrix;
[0107] Multi-label text classification model construction module 14, for constructing the multi-label tex...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com