Text classification method and system fusing self-attention mechanism and deep learning

A technology of deep learning and text classification, applied in neural learning methods, semantic analysis, computer components, etc., can solve the problems that text information affects the recognition effect of text classification

Pending Publication Date: 2021-06-08
HENAN UNIVERSITY
View PDF0 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] To this end, the present invention provides a text classification method and system that integrates self-attention mechanism and deep learning, which overcomes the shortcomings of obtaining semantic associations between contexts and the inability of text information to penetrate well in the existing technology Into the deep learning neural network and then affect the text classification recognition effect

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text classification method and system fusing self-attention mechanism and deep learning
  • Text classification method and system fusing self-attention mechanism and deep learning
  • Text classification method and system fusing self-attention mechanism and deep learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] In order to make the purpose, technical solution and advantages of the present invention more clear and understandable, the present invention will be further described in detail below in conjunction with the accompanying drawings and technical solutions.

[0030] An embodiment of the present invention provides a text classification method that integrates self-attention mechanism and deep learning, see figure 1 As shown, it includes the following content: obtain and preprocess the text data set to be classified; use the trained deep learning model to classify the preprocessed text data set to be classified, wherein the deep learning model includes: sequentially connected to The ERNIE pre-training module that extracts the sentence-level word vector representation from the text dataset to be classified, the BiLSTM module that extracts the context information of each word based on the sentence-level word vector representation, and the BiLSTM module that extracts the sentence...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of text classification, and particularly relates to a text classification method and system fusing a self-attention mechanism and deep learning, and the method comprises the steps: obtaining a to-be-classified text data set, and carrying out the preprocessing; and carrying out classification processing on the preprocessed to-be-classified text data set by utilizing a trained deep learning model, wherein the deep learning model comprises an ERNIE pre-training module used for extracting sentence-level word vector representation in a to-be-classified text data set, a BiLSTM module used for extracting context information of each word according to the sentence-level word vector representation, a DPCNN module which is used for deeply extracting the context information of each word according to the sentence-level word vector representation and the context information of each word, an attention mechanism module which is used for extracting a text depth information distribution weight according to the sentence-level side vector representation and the context information of each word, and a Softmax module which is used for carrying out classified output according to the text depth information distribution weight. The text classification recognition effect is improved by combining an attention mechanism and each model.

Description

technical field [0001] The invention belongs to the technical field of text classification, in particular to a text classification method and system integrating self-attention mechanism and deep learning. Background technique [0002] With the continuous development of artificial intelligence, more and more researchers pay attention to natural language processing. More and more researchers apply machine learning and deep learning methods to text classification, which has gradually become a popular trend in text classification. The application of text classification in machine learning is mainly reflected in the following four aspects: (1) Logistic regression algorithm, which is used in most cases to judge the probability of its category; (2) Naive Bayesian algorithm, which uses advanced mathematics (3) random forest algorithm, which selects some decision trees in a random way, and then builds them into the forest for classification, thereby improving the accuracy of classif...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/289G06F40/211G06F40/30G06K9/62G06N3/04G06N3/08
CPCG06F40/289G06F40/211G06F40/30G06N3/084G06N3/044G06F18/2415
Inventor 徐树维高旭洋李兆可司高飞
Owner HENAN UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products