Sentence classification method based on LSTM and combining part-of-speech and multi-attention mechanism

A classification method and attention technology, applied in neural learning methods, text database clustering/classification, semantic analysis, etc., can solve the problem of not considering word part-of-speech information, etc., and achieve the effect of strong versatility and high accuracy.

Active Publication Date: 2019-04-16
SOUTH CHINA UNIV OF TECH
View PDF6 Cites 36 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, these studies are mainly focused on the attention at the content

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sentence classification method based on LSTM and combining part-of-speech and multi-attention mechanism
  • Sentence classification method based on LSTM and combining part-of-speech and multi-attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0021] This embodiment provides a sentence classification method based on LSTM combined with part-of-speech and multi-attention mechanism. The words in the sentence are marked, and combined with the simplified part-of-speech tag set (mainly including nouns, verbs, adjectives, adverbs, ending tags UNK, etc.) to convert the part-of-speech into the form of serial numbers, and then map and learn through the embedding layer; then, A shared bidirectional LSTM is used to learn the context information of semantic word vectors and part-of-speech word vectors, and the forward and reverse learning results of each time step are concatenated and combined to output, so as to obtain the contextual relationship of words and parts of speech respectively; here On the basis, a self-attention layer is used to learn the position information in the sentence for the semantic word vector sequence and the part-of-speech word vector sequence output by the LSTM layer, and construct the corresponding atte...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a sentence classification method based on LSTM and combining part-of-speech and a multi-attention mechanism. The method comprises steps: each sentence is converted into a semantic word vector matrix and a part-of-speech word vector matrix which are based on continuity and density in an input layer; learning context information of words or part-of-speech in the sentences ina shared bidirectional LSTM layer respectively, connecting learning results of each step in series, and outputting the learning results; a self-attention mechanism and a point multiplication functionare adopted in the self-attention layer to learn important local features at all positions in the sentence from the semantic word vector sequence and the part-of-speech word vector sequence respectively, corresponding semantic attention vectors and part-of-speech attention vectors are obtained, and the semantic attention vectors and the part-of-speech attention vectors are constrained through theKL distance; carrying out weighted summation on the output sequence of the bidirectional LSTM layer in the merging layer by utilizing the obtained semantic attention vector and part-of-speech attention vector to obtain semantic representation and part-of-speech representation of the sentence, and obtaining final sentence semantic representation; And finally, prediction and classified output are carried out through an MLP output layer.

Description

technical field [0001] The invention relates to the field of natural language processing, in particular to a sentence classification method based on LSTM combined with part-of-speech and multi-attention mechanism. Background technique [0002] Sentence classification has always been a research hotspot in the field of Natural Language Processing (NLP). In recent years, with the wide application of deep learning in NLP, many scholars have successively proposed various sentence classification methods based on the Long Short-Term Model (LSTM), and in many sentence classification corpora such as Stanford Twitter Sentiment (STS), Stanford Sentiment Treebank binary classification (SSTb2) and quintuple classification (SSTb5), TREC, IMDB, etc. have all achieved better results than traditional machine learning methods. Compared with the convolutional neural network CNN, LSTM can better describe the context information and long-term dependencies of text sequence data, and effectively ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/35G06F17/27G06K9/62G06N3/04G06N3/08
CPCG06N3/084G06F40/253G06F40/211G06F40/289G06F40/30G06N3/045G06F18/2414
Inventor 苏锦钿周炀朱展东
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products