Check patentability & draft patents in minutes with Patsnap Eureka AI!

Hypergraph-based text classification method

A text classification and document technology, applied in the field of machine learning, can solve the problems of poor model effect and difficulty in capturing text meaning, and achieve the effect of accurate classification

Pending Publication Date: 2020-05-08
BEIJING TECHNOLOGY AND BUSINESS UNIVERSITY
View PDF3 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Existing text classification methods usually regard the texts in the corpus as independent samples, use models such as recurrent neural networks and support vector machines to predict the category for each piece of text independently, and do not model the relationship between texts
In addition, when using a cyclic neural network for text classification, when the text length is too long, it is difficult for the cyclic neural network to capture the meaning expressed by the text, and the model effect is poor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Hypergraph-based text classification method
  • Hypergraph-based text classification method
  • Hypergraph-based text classification method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0033] In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0034] The hypergraph-based text classification method of the present invention can be applied to multiple fields such as comment classification, news classification, and fraud detection. Figure 1-2 As shown, the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a hypergraph-based text classification method, which comprises the following steps of: 1, constructing a corpus and a keyword library of the corpus, and generating a hypergraphbased on the corpus to obtain hyperedges and nodes of the hypergraph; 2, calculating an adjacency matrix of the keywords based on a co-occurrence window; 3, expressing each document in the corpus through a hyperedge vector to form a hyperedge matrix; 4, calculating the similarity between the hyperedges, and constructing a similarity matrix of the hyperedges; step 5, constructing a hypergraph nodefeature matrix composed of word vectors; 6, classifying the hyperedges by using a graph neural network model to obtain a first prediction probability of each document category in the corpus; 7, updating a parameter matrix of the graph neural network model by adopting a stochastic gradient descent algorithm based on the real label of the document, and finishing classification of the label-free text in the corpus. According to the method, the label-free texts in the corpus are accurately classified.

Description

technical field [0001] The invention relates to the technical field of machine learning, in particular to a hypergraph-based text classification method. Background technique [0002] Graphs representing objects and their relationships exist everywhere in real life, such as social networks, e-commerce networks, biological networks, and transportation networks. At the same time, due to the rich potential information contained, the graph is also recognized as a structure that can be deeply understood. In the past decade, graph deep learning has become an extremely important part of artificial intelligence and machine learning. It has shown superior performance in audio, image and natural language processing, and has obvious effects in extracting potential complex patterns in data. [0003] The essential feature of a hypergraph lies in its hyperedges. The degree of a hyperedge can be greater than 2, and it can connect multiple nodes at the same time. Ordinary graphs are a speci...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/35G06N3/04
CPCG06F16/353G06F16/355G06N3/04Y02D10/00
Inventor 韩忠明周朋飞段大高张珣
Owner BEIJING TECHNOLOGY AND BUSINESS UNIVERSITY
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More