Text classification method and system based on Attention graph attention network
A technology of text classification and attention, applied in text database clustering/classification, biological neural network model, unstructured text data retrieval, etc., can solve inaccurate, difficult data acquisition and classification, unstructured text obscure and imprecise And other issues
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
specific Embodiment approach 1
[0048] Embodiment 1: A text classification system based on an Attention-based graph attention network, the system includes a text collection module, a data preprocessing module, a text construction module, a feature node module and a text classification module, and the modules are connected in a progressive logical order ;
[0049] Firstly, the text collection module is responsible for data collection, labeling and segmentation. Secondly, the data preprocessing module is responsible for preprocessing the data obtained by the text collection module. Then, the text construction module is responsible for combining the sentences in the text with the words or words in the data set. As a node, a graph is formed after establishing an edge and an attention mechanism is introduced. Again, the feature node module extracts and updates the feature vectors of adjacent nodes, and finally the text classification module performs geographical text classification according to the existing tag da...
specific Embodiment approach 2
[0050] Embodiment 2: A text classification method based on the Attention-based graph attention network. By introducing the attention mechanism, the ordinary graph convolution formula is improved, so that the geographical information text can aggregate the characteristics of the context, so that the geographical information in the text is information is more discernible.
[0051] The overall steps of this embodiment are as follows figure 2 As shown, it is realized through the following method steps:
[0052] S101: Collect text, label part of the data, and complete the segmentation of training data and test data;
[0053] S102: Perform word segmentation on the data, remove stop words and difficult-to-recognize special characters, and complete data preprocessing.
[0054] S103: Construct the text as graph structure data, use each sentence and the words or words in the data set as nodes, and establish edges with the relationship between words;
[0055] S104: Construct a graph ...
specific Embodiment approach 3
[0089] In addition to the system and method steps described in the specific embodiment one and two, such as image 3 As shown, this embodiment is realized in the following way:
[0090] Collect text data in network circulation, select part of the data from the total data for labeling, and then select 80% as the training set and 20% as the data set.
[0091] The graph data construction module S201 constructs the preprocessed text serialized data into graph data with a topological structure.
[0092] The graph attention network module S202 is used to train and test the entire graph data set, so that the initial features of each text are aggregated to the features of adjacent nodes to be updated.
[0093] The classification module S203 uses the fully connected layer and the softmax function to classify the updated feature vectors. There are two methods for text segmentation, word-level word segmentation and word-level word segmentation. Therefore, when a text sequence is converte...
PUM

Abstract
Description
Claims
Application Information

- R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com