Text feature extraction method, device, chat robot and storage medium based on fusion model

A technology that integrates models and feature extraction. It is used in chat robots and storage media. It can solve problems such as low accuracy and achieve the effect of improving accuracy and avoiding gradient disappearance or gradient dispersion.

Pending Publication Date: 2019-03-22
南京云思创智信息科技有限公司
View PDF3 Cites 41 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] In the identification of chat robot stock price query intentions, traditional SVM methods (such as image 3 Shown) uses pattern matching or basic features to train the model. The classification is simple but the accuracy is low. CNN makes better use of the chat context to extract features better than describe local features, but CNN has the ability to read text Further optimization of time series requirements

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text feature extraction method, device, chat robot and storage medium based on fusion model
  • Text feature extraction method, device, chat robot and storage medium based on fusion model
  • Text feature extraction method, device, chat robot and storage medium based on fusion model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0063] The text mentioned in this embodiment includes words or sentences. Words are the collective designation of words and phrases, including words, phrases and entire vocabulary. Words form the minimum group word structure form of sentence articles. A sentence is the basic unit of language use. It is composed of words and phrases (phrases) and can express a complete meaning.

[0064] This embodiment provides a text feature extraction method based on the CNN-BLSTM-Attention fusion model, the method is applicable to the situation of text feature extraction, and the method can be executed by an extraction device following the text feature rules, and the text feature rule extraction device can be implemented by software and / or hardware, such as Figure 5-Figure 8 As shown, the text feature extraction method based on the CNN-BLSTM-Attention fusion model of this embodiment includes the following steps:

[0065] S1. The construction of the word vector, using the Word2vec model, us...

Embodiment 2

[0093] This embodiment provides a text feature extraction device based on a fusion model, including:

[0094] The word vector construction module is used to adopt the Word2vec model and utilize the Skip-gram model to construct the word vector;

[0095] The convolutional neural network module is used to extract the local features of the word vector using the convolutional neural network;

[0096] The BLSTM model module is used to extract global features related to local feature contexts using the BLSTM model;

[0097] The Attention mechanism module extracts deeper information features of global features through the Attention mechanism, and fuses the extracted features;

[0098] The text classification module is used to classify the text feature vectors obtained by extracting the network layer by layer using a soft-max classifier.

[0099] The text feature extraction device of this embodiment further includes a mapping module, configured to map sentences composed of words into...

Embodiment 3

[0102] refer to Figure 11 As shown, a frame diagram of a chat robot of this embodiment is provided, Figure 11 Only one embodiment is provided, but this embodiment should not bring any limitation to the function and scope of application of the present invention.

[0103] Such as Figure 11 As shown, chatbots are generally displayed on computer devices, and chatbots may include but are not limited to:

[0104] One or more processors 100 are mainly used to execute one or more programs stored in the storage device 20 .

[0105] The storage device 200 is used for storing one or more programs.

[0106] When one or more programs are executed by one or more processors 100, the one or more processors 100 implement the fusion model-based text feature extraction method described in the first embodiment.

[0107] The interactive interface 300 is used for providing human-computer interaction when the processor 100 executes programs.

[0108] The chat robot provided by this embodimen...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a text feature extraction method based on a fusion model, a device, a chat robot and a storage medium. The extraction method comprises the following steps: S1, adopting Word2vec model to extract words, utilizing Skip-gram model to construct a word vector, and then maps the sentences composed of words into a sentence matrix; S2, using convolution neural network to extract the local features of the word vector; S3, extracting the global features related to the context of the word vector by using the BLSTM model; S4, extracting the deeper information features of the localfeatures and the global features through Attention mechanism, and fusing the extracted features; S5, extracting the text feature vector extracted from the network layer by layer by using soft-Max classifier for text classification. The invention not only solves the problem that the single convolution neural network ignores the semantic information of the words in the context, but also effectivelyavoids the problem that the gradient of the traditional circulating neural network disappears or the gradient is dispersed.

Description

technical field [0001] The invention belongs to the technical field of text processing, and in particular relates to a text feature extraction method, device, chat robot and storage medium based on a CNN-BLSTM-Attention fusion model. Background technique [0002] At present, artificial feature engineering and shallow classification models are still used for text classification in text feature extraction. The process of training a text classifier is as follows figure 1 shown. The process of converting data into information and then refining knowledge in machine learning problems determines the upper limit of the result, while models and algorithms are used to approach this upper limit. Feature engineering is different from classifier models, which is very time-consuming and does not have strong Versatility often requires a combination of understanding of feature tasks. [0003] Therefore, deep learning methods are generally used for text feature extraction, mainly includin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06F17/27
CPCG06F40/205G06F40/30G06F40/289
Inventor 张帆
Owner 南京云思创智信息科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products