Text graph construction method based on text content characteristics
A technology of content features and construction methods, applied in the field of text graph construction, can solve problems such as the inability to prepare and express text semantic features, and achieve the effect of improving flexibility
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0038] In this embodiment, a method for constructing a text graph based on text content features is disclosed, which is used to convert the text to be converted into a text graph.
[0039] The text graph construction method provided in this embodiment retains the semantic relationship of word nodes while detaching from dependence on co-occurrence relationships when constructing edges.
[0040] The described method is carried out according to the following steps:
[0041] Step 1, obtain the text to be converted;
[0042] General text is enough, it can be a sentence or an article. Both Chinese and English are acceptable, and the corresponding text processing methods are as follows;
[0043] Step 2, performing text preprocessing on the text to be converted to obtain the preprocessed text; the text preprocessing includes sequential word segmentation, cleaning and standardization;
[0044] Wherein said preprocessed text includes multiple words;
[0045] In this embodiment, the ...
Embodiment 2
[0082] In this embodiment, the method provided by the present invention is verified experimentally, taking classification as an example, using the method provided by the present invention to construct a text graph, and then using the graph attention network (GAN) to learn and classify. GAN is a graph neural network based on attention mechanism, refer to the paper "Graph Attention Networks". Text-GAN(1) is the first method used when constructing graphs (the method of step 4.1-step 4.3 in Example 1), pre-trained word vectors; Text-GAN(2) is used when constructing graphs The second method (the method of step I-step III in embodiment one), pre-trained the word vector; Text-GAN (2)-rand also used the second graph construction method (step I-step in embodiment one III method), randomly initialize word vectors. Use Text-GCN as a comparison algorithm, which comes from the paper "Graph Convolutional Networks for TextClassification". Similarly, the first is to construct a graph based o...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com