Interpretability analysis method based on CNN text classification model

A text classification analysis method, applied to interpretability analysis of CNN text classification models. It addresses the difficulty of quantifying keyword importance and the inability to visualize that importance, and it yields intuitive results with good universality.

Pending Publication Date: 2021-02-09
JILIN UNIV

AI Technical Summary

Problems solved by technology

[0004] At present, there are relatively few studies on the interpretability of text classification models, especially those based on CNN models, and the existing ones have certain limitations, notably in their degree of visualization.

Examples

Embodiment Construction

[0047] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0048] As shown in Figure 1, the embodiment of the present invention discloses an interpretability analysis method based on a CNN text classification model, including:

[0049] S1. Obtain original text data, and preprocess the original text data;
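Step S1 can be illustrated with a minimal preprocessing sketch. The excerpt does not say which preprocessing operations the invention uses, so everything here is an assumption made for illustration: lowercase word tokenization, a vocabulary with hypothetical `<pad>` and `<unk>` entries, and padding to a fixed length. The names `tokenize`, `build_vocab`, and `encode` are hypothetical.

```python
import re
from collections import Counter

PAD, UNK = "<pad>", "<unk>"  # hypothetical special tokens, not from the patent


def tokenize(text):
    """Lowercase the text and split it into word tokens (an assumed scheme)."""
    return re.findall(r"[a-z0-9']+", text.lower())


def build_vocab(texts, min_count=1):
    """Map each sufficiently frequent token to an integer identifier."""
    counts = Counter(tok for t in texts for tok in tokenize(t))
    vocab = {PAD: 0, UNK: 1}
    for tok, count in sorted(counts.items()):
        if count >= min_count:
            vocab[tok] = len(vocab)
    return vocab


def encode(text, vocab, max_len):
    """Convert text to a fixed-length list of identifiers, truncating or padding."""
    ids = [vocab.get(tok, vocab[UNK]) for tok in tokenize(text)][:max_len]
    return ids + [vocab[PAD]] * (max_len - len(ids))


texts = ["Great movie, loved it!", "Terrible plot."]
vocab = build_vocab(texts)
print(encode("Great plot twist", vocab, max_len=5))  # [2, 6, 1, 0, 0]
```

A fixed length is chosen here only because a convolutional model typically consumes batches of equal-sized inputs; unseen words such as "twist" fall back to the `<unk>` identifier.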

[0050] S2. Construct a text classification model based on a convolutional neural network, use the text classification model to convert the preprocessed original text data into a distributed matrix, and perform c...
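The first part of step S2 can be sketched as a tiny forward pass: identifiers are looked up in an embedding table to form the distributed matrix (one row per token), and a convolutional classifier predicts a category from that matrix. The architecture below (one convolution layer with ReLU, max-over-time pooling, a linear output layer, softmax) is a common CNN text classifier layout assumed for illustration; the excerpt does not confirm that the invention uses exactly this structure, and all function names and weights are hypothetical.

```python
import math


def embed(token_ids, embedding):
    """Look up one vector per identifier: a (seq_len x dim) distributed matrix."""
    return [embedding[i] for i in token_ids]


def conv1d(matrix, kernel):
    """Slide a (width x dim) kernel over the rows; one ReLU activation per window."""
    width = len(kernel)
    out = []
    for start in range(len(matrix) - width + 1):
        s = sum(kernel[j][d] * matrix[start + j][d]
                for j in range(width) for d in range(len(kernel[j])))
        out.append(max(0.0, s))
    return out


def softmax(logits):
    """Convert logits to probabilities, shifting by the max for stability."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify(token_ids, embedding, kernels, weights):
    """Embed, convolve, max-pool over time, then apply a linear layer and softmax."""
    matrix = embed(token_ids, embedding)
    pooled = [max(conv1d(matrix, k)) for k in kernels]
    logits = [sum(w * p for w, p in zip(row, pooled)) for row in weights]
    return softmax(logits)


# Toy parameters (hypothetical): 3 identifiers, 2 dimensions, 2 kernels, 2 classes.
embedding = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
kernels = [[[1.0, 0.0], [1.0, 0.0]], [[0.0, 1.0], [0.0, 1.0]]]
weights = [[1.0, 0.0], [0.0, 1.0]]
probs = classify([1, 1, 2], embedding, kernels, weights)
print(probs.index(max(probs)))  # predicted class index
```

In practice the embedding table, kernels, and output weights would be learned during training; they are fixed by hand here so the sketch stays self-contained.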

Abstract

The invention discloses an interpretability analysis method based on a CNN text classification model. The method comprises the steps of: obtaining one or more pieces of original text data and preprocessing the original text data; constructing a text classification model based on a convolutional neural network, converting the preprocessed original text data into a distributed matrix by using the text classification model, and performing classification prediction based on the distributed matrix to obtain a text classification result; backtracking and analyzing the importance, in each dimension, of each identifier influencing the text classification result to generate an importance vector matrix; and generating a visual analysis graph based on the importance vector matrix. The method builds on the interpretability of the CNN text classification model and traces backward the reasons a text classification result was produced, so that the contribution of each dimension of each identifier in the text to the prediction result can be determined quantitatively, and the analysis result is presented through the visual graph.
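The backtracking and visualization steps can be illustrated with a simplified stand-in. The excerpt does not give the invention's actual backtracking rule, so the sketch below assumes a deliberately simple model (linear convolution without bias or activation, max-over-time pooling, linear output without bias). Under those assumptions a class logit decomposes exactly into one contribution per identifier per dimension, which yields an importance matrix of the same shape as the distributed matrix. The function names and the text-based rendering are hypothetical.

```python
def conv_scores(matrix, kernel):
    """Score every window of the (seq_len x dim) matrix with a (width x dim) kernel."""
    width, dim = len(kernel), len(matrix[0])
    return [sum(kernel[j][d] * matrix[s + j][d]
                for j in range(width) for d in range(dim))
            for s in range(len(matrix) - width + 1)]


def importance_matrix(matrix, kernels, class_weights):
    """Trace one class logit back to a (seq_len x dim) matrix of contributions."""
    seq_len, dim = len(matrix), len(matrix[0])
    imp = [[0.0] * dim for _ in range(seq_len)]
    for kernel, w in zip(kernels, class_weights):
        scores = conv_scores(matrix, kernel)
        # Max pooling keeps a single window, so only its tokens receive credit.
        best = max(range(len(scores)), key=scores.__getitem__)
        for j, row in enumerate(kernel):
            for d in range(dim):
                imp[best + j][d] += w * row[d] * matrix[best + j][d]
    return imp


def render(tokens, imp):
    """Plain-text stand-in for a visual graph: one signed bar per token."""
    lines = []
    for tok, row in zip(tokens, imp):
        total = sum(row)
        bar = ("+" if total >= 0 else "-") * round(abs(total) * 4)
        lines.append(f"{tok:<10}{total:+.2f} {bar}")
    return "\n".join(lines)


# Toy inputs (hypothetical): 3 tokens, 2 dimensions, 2 kernels of width 2.
tokens = ["not", "good", "film"]
matrix = [[1.0, 0.0], [0.5, 1.0], [0.0, 0.2]]
kernels = [[[1.0, 0.0], [0.0, 1.0]], [[0.0, -1.0], [0.5, 0.0]]]
class_weights = [2.0, 1.0]
imp = importance_matrix(matrix, kernels, class_weights)
print(render(tokens, imp))
```

Because the assumed model is linear apart from pooling, the entries of the importance matrix sum to the class logit, which gives a quick sanity check. A model with biases or nonlinear activations would need a different attribution rule.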

Description

Technical field

[0001] The present invention relates to the technical field of text classification, and more specifically to an interpretability analysis method based on a CNN text classification model.

Background

[0002] Convolutional neural networks were built to imitate biological visual perception mechanisms. They first made great progress in the field of computer vision, and in recent years they have developed rapidly in the field of natural language processing. One very important application in natural language processing is text classification, which predicts the category of a text from its content, for example: judging whether the sentiment of a comment is praise or criticism (binary classification), or judging whether a news item belongs to finance, sports, education, politics, etc. (multi-class classification). If the text belongs to only one category, it is called single-label. If it...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/35
CPC: G06F16/35, G06F16/358
Inventor: 包铁, 孙铭, 彭策, 刘露, 孟宪全, 孙岩
Owner: JILIN UNIV