Chinese question classification method based on text error correction and neural network

A classification method and a technology of questions, which are applied in the field of intelligent information processing and computers, can solve problems such as prediction errors, the classification effect of a single classification method cannot fully meet the requirements, few cyclic neural networks and convolutional neural networks, etc., to achieve accurate classification Effect

Inactive Publication Date: 2019-09-13
SHANGHAI MARITIME UNIVERSITY
View PDF0 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Current research and applications have proved that based on recurrent neural networks such as long-term short-term memory and bidirectional gated recurrent units, it is better able to learn the upper and lower semantic information of Chinese questions, and based on convolutional neural networks, it is better able to learn local features in sentences and extract sentences in sentences. However, there are many kinds of classifications of Chinese questions, and the classification effect of a single classification method still cannot fully meet the requirements in practi

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese question classification method based on text error correction and neural network
  • Chinese question classification method based on text error correction and neural network
  • Chinese question classification method based on text error correction and neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0027] The present embodiment provides a kind of Chinese question classification method based on text error correction and neural network, described method comprises the following steps:

[0028] Step 1: Obtain the text data of Chinese question sentences;

[0029] Step 2: Preprocessing the Chinese questions;

[0030] Step 3: Use the language model to correct the preprocessed Chinese questions;

[0031] Step 4: Use the word vector tool to vectorize the corrected Chinese questions to generate a Chinese question word vector matrix;

[0032] Step 5: Use the bidirectional gated recurrent unit layer to transform the Chinese question word vector matrix into a fixed-dimensional intermediate semantic matrix vector containing context and semantic information;

[0033] Step 6: Use the self-attention mechanism to assign attention weights to the intermediate semantic matrix vectors to generate attention matrix vectors;

[0034] Step 7: Use the attention matrix vector as the input of the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Chinese question classification method. The invention aims to solve the problem that the classification accuracy is not high enough due to the fact that faulty wording, wrongly written characters, needless characters and the like exist in input questions and the inherent defect that an existing classification method is single. The Chinese question classification method comprises the steps: 1, obtaining Chinese question text data; 2, preprocessing the Chinese questions; 3, carrying out error correction by utilizing a language model; 4, vectorizing the Chinese questionsby using a word vector tool; 5, obtaining an intermediate semantic matrix vector containing semantic information by utilizing the bidirectional gating circulation unit layer; 6, generating an attention matrix vector by using a self-attention mechanism; 7, extracting a plurality of local features by utilizing a plurality of convolution kernels with different sizes, and obtaining a global feature matrix vector through pooling and splicing; and 8, outputting probability distribution of a corresponding category by utilizing a full connection layer and a normalization exponential function, and taking the category with the maximum probability value as a predicted category, namely, a result of Chinese question classification. The Chinese question classification method is applied to the field ofnatural language processing.

Description

technical field [0001] The invention belongs to the field of intelligent information processing and computer technology, and in particular relates to a method for classifying Chinese question sentences. Background technique [0002] With the rapid development of technology in the Internet age and the influx of a large amount of data, searching for keywords through search engines requires manual screening of the returned results, which is time-consuming and labor-intensive for users. The question answering system can quickly obtain the user's intention, and can return the most accurate answer to the user among hundreds or thousands of candidate answers. [0003] The classification of Chinese questions is the first step of the question answering system, and it is one of the key technologies for the question answering system to achieve accurate answers. By classifying Chinese questions, the question answering system can effectively narrow the range of answers and determine the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/35G06K9/62G06N3/04G06N3/08
CPCG06F16/35G06N3/08G06N3/044G06N3/045G06F18/2411
Inventor 刘晋杨一何林芯玥
Owner SHANGHAI MARITIME UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products