Mongolian text sentiment analysis method based on multi-size CNN and LSTM model

A sentiment analysis and multi-scale technology, applied in the field of artificial intelligence, can solve the problems that the text sentiment analysis is not real-time, the Mongolian corpus resources are few, and the local and global information of the text cannot be extracted at the same time.

Active Publication Date: 2021-09-10
INNER MONGOLIA UNIV OF TECH
View PDF14 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, there are relatively few Mongolian corpus resources that can be collected, so it cannot meet the requirements of deep neural network model training
Third, the current single neural network model does not have good real-time performance when solving text sentiment analysis, and cannot extract local and global information of the text at the same time, resulting in poor classification results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Mongolian text sentiment analysis method based on multi-size CNN and LSTM model
  • Mongolian text sentiment analysis method based on multi-size CNN and LSTM model
  • Mongolian text sentiment analysis method based on multi-size CNN and LSTM model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] The implementation of the present invention will be described in detail below in conjunction with the drawings and examples.

[0047] Such as figure 1 Shown, a kind of Mongolian text emotion analysis method based on multi-size CNN and LSTM model of the present invention, process is as follows:

[0048] Step 1: Preprocessing the Chinese and Mongolian emotional text corpora.

[0049] Before model training, the sentiment text corpus should be preprocessed. The present invention uses byte pair encoding technology (BPE) to carry out segmentation operation to corpus, because BPE technology is to use a character that does not appear in this character string to replace the most common pair of characters in the character string in the layer-by-layer iterative process , so by segmenting Mongolian vocabulary into stems and affixes, high-frequency words can be retained in the dictionary, while low-frequency words can be divided into smaller granular subunits, thereby alleviating ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Mongolian text sentiment analysis method based on a multi-size CNN and an model. The method comprises the steps of: preprocessing a Chinese and Mongolian sentiment text corpus; converting the words obtained through preprocessing into dynamic word vectors; connecting a multi-size CNN network and an mLSTM network in parallel to form a Mongolian text sentiment analysis model; splicing features extracted by the two to serve as emotion features finally extracted by the model; using a transfer learning strategy to take a large-scale Chinese emotion text corpus as a training set, transferring a neural network parameter weight obtained by training to a Mongolian text emotion analysis model as an initial parameter, and training by using the preprocessed Mongolian emotion text corpus to obtain a Mongolian text emotion analysis model based on a multi-size CNN and an LSTM model; and comparing and evaluating the analysis result of the model and the analysis result of a single network analysis method according to the accuracy rate, the recall rate and the F1 value, and therefore, the purpose of improving the Mongolian text sentiment analysis performance is achieved.

Description

technical field [0001] The invention belongs to the technical field of artificial intelligence, and in particular relates to a Mongolian text sentiment analysis method based on multi-size CNN and LSTM models. Background technique [0002] With the rapid development of Internet technology, more and more people begin to share their feelings, views and opinions by expressing various opinions on platforms such as Weibo, forums, film and television websites, and shopping websites. And the content published by users may contain different emotional colors: positive or negative; support or opposition. The core of sentiment analysis is to divide the sentiment expressed in a piece of text into positive and negative categories. Its research value has been fully reflected in product reviews and recommendations, public opinion monitoring, and information prediction. [0003] With the rise of artificial intelligence, deep learning methods have received widespread attention. Because of th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/33G06F16/36G06F40/284G06N3/04G06N3/08
CPCG06F16/3344G06F16/36G06F40/284G06N3/08G06N3/044G06N3/045
Inventor 仁庆道尔吉尹玉娟麻泽蕊李媛程坤苏依拉李雷孝
Owner INNER MONGOLIA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products