Text classification method and device, equipment and storage medium

A text classification technology, applied in the field of artificial intelligence, that achieves more accurate text classification.

Pending Publication Date: 2022-02-25
华润数字科技有限公司

AI Technical Summary

Problems solved by technology

[0004] The purpose of the embodiments of the present invention is to provide a text classification method, device, equipment and storage medium that address the prior art's reliance on capturing local features for text classification, extract semantic features from multiple dimensions, and improve the accuracy of text classification.



Examples


Embodiment Construction

[0033] Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by those skilled in the technical field of the present invention. The terms used in the description of the application are intended only to describe specific embodiments and not to limit the present invention. The terms "comprising" and "having", and any variations thereof, in the specification and claims of the present invention and in the description of the above drawings are intended to cover non-exclusive inclusion. The terms "first", "second" and the like in the description and claims of the present invention or in the above drawings are used to distinguish different objects, rather than to describe a specific order.

[0034] Reference herein to an "embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one embodiment of the present invention. The occurrences...


Abstract

The embodiments of the invention belong to the field of artificial intelligence and relate in particular to a text classification method, device, equipment and storage medium. The method comprises the following steps: obtaining a text to be analyzed, performing word segmentation on the text to form a segmented word set, and obtaining the subject terms of the text according to a subject model to form a subject term set; obtaining a word embedding vector for each word in the subject term set, reducing the dimension of the word embedding vectors, mapping them to a plane, and constructing a Voronoi diagram from the mapped points on the plane; calculating the semantic distance between the non-subject terms and the subject terms, and adding the non-subject terms to the Voronoi diagram; identifying the word node type of each word in the Voronoi diagram, and calculating the semantic distance between word nodes with the algorithm corresponding to the word node type; and inputting the semantic distances between the word nodes into a pre-constructed graph convolutional neural network to output a graph implicit vector, and classifying the text according to the graph implicit vector. The invention thereby improves the accuracy of text classification.
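The geometric part of these steps can be illustrated with a minimal Python sketch. It is not the patented implementation, and everything in it is an assumption made for illustration: the 2-D coordinates stand in for dimension-reduced word embeddings, Euclidean distance stands in for the semantic distance, the edge weight 1 / (1 + distance) and the mean-pooling readout are arbitrary choices, and the function names are hypothetical. It relies on one geometric fact: a point lies in the Voronoi cell of its nearest site, so placing a non-subject term in the diagram reduces to a nearest-site lookup.

```python
import math

def voronoi_cell(point, sites):
    """Index of the site nearest to `point`, i.e. the Voronoi cell containing it."""
    return min(range(len(sites)), key=lambda i: math.dist(point, sites[i]))

def edge_weight(p, q):
    """Assumed similarity: closer points get a larger weight in (0, 1]."""
    return 1.0 / (1.0 + math.dist(p, q))

def build_adjacency(sites, others):
    """Weighted adjacency over subject terms (sites) and non-subject terms (others).

    Simplification: subject terms are fully connected to each other, and each
    non-subject term is linked only to the subject term whose cell contains it.
    """
    n = len(sites) + len(others)
    adj = [[0.0] * n for _ in range(n)]
    for i in range(len(sites)):
        for j in range(i + 1, len(sites)):
            adj[i][j] = adj[j][i] = edge_weight(sites[i], sites[j])
    for k, p in enumerate(others):
        node = len(sites) + k
        cell = voronoi_cell(p, sites)
        adj[node][cell] = adj[cell][node] = edge_weight(p, sites[cell])
    return adj

def matmul(a, b):
    """Plain-Python matrix product of two row-major matrices."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def gcn_layer(adj, feats, weights):
    """One graph convolution: ReLU(D^-1/2 (A + I) D^-1/2 H W)."""
    n = len(adj)
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    deg = [sum(row) for row in a_hat]
    norm = [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)] for i in range(n)]
    hidden = matmul(matmul(norm, feats), weights)
    return [[max(0.0, v) for v in row] for row in hidden]

def graph_vector(hidden):
    """Mean-pool node vectors into a single graph-level vector."""
    return [sum(col) / len(hidden) for col in zip(*hidden)]

# Hypothetical 2-D positions: two subject terms and two non-subject terms.
sites = [(0.0, 0.0), (4.0, 0.0)]
others = [(1.0, 0.0), (3.0, 1.0)]
adj = build_adjacency(sites, others)

# Node features are the coordinates themselves; the 2x3 weights are arbitrary.
feats = [list(p) for p in sites + others]
weights = [[0.5, -0.2, 0.1], [0.3, 0.4, -0.6]]
vec = graph_vector(gcn_layer(adj, feats, weights))
```

In a real system the resulting graph vector would be passed to a trained classifier, and the weights would be learned rather than fixed by hand.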

Description

Technical field

[0001] The present invention relates to the technical field of artificial intelligence, in particular to a text classification method, device, equipment and storage medium.

Background

[0002] Text classification is a common task in the field of natural language processing. In recent years, the field has moved from traditional classification methods based on feature engineering plus machine learning, such as extracting features with TF-IDF (term frequency-inverse document frequency) and then training a machine learning classifier, to deep learning methods based on CNNs (convolutional neural networks) and RNNs (recurrent neural networks), such as fastText, TextCNN and TextRNN. More recently, pre-trained language models based on deep neural networks have emerged, such as BERT (Bidirectional Encoder Representations from Transformers) and ELMo (Embeddings from Language Models), which have greatly improved the ...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F40/289, G06F40/30, G06N3/04, G06N3/08
CPC: G06F40/289, G06F40/30, G06N3/08, G06N3/045
Inventor: 王伟黄勇其于翠翠张黔
Owner: 华润数字科技有限公司