Supercharge Your Innovation With Domain-Expert AI Agents!

Newly-developing hot topic detection system based on attention mechanism

A hot topic and detection system technology, applied in unstructured text data retrieval, special data processing applications, instruments, etc., can solve problems such as insufficient accuracy, lack of topic text context semantic analysis, poor topic tracking effect, etc., to improve The effect of detection ability

Inactive Publication Date: 2018-08-28
FUZHOU UNIV
View PDF3 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, although these method models have achieved corresponding results to a certain extent, they have also promoted the development of topic detection tasks; but there are also shortcomings. Methods based on static characteristics of topic content are certainly accurate in predicting emerging hot topics. rate, but it lacks the contextual semantic analysis of the topic text, so the tracking effect of the topic is poor
Those based on propagation features (dynamic features) also take into account the contextual semantic information of the text in the topic, and there is a certain delay in the prediction time of emerging hot topics, so its accuracy is not enough, but it has better performance in topic tracking

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Newly-developing hot topic detection system based on attention mechanism
  • Newly-developing hot topic detection system based on attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] The technical solution of the present invention will be specifically described below in conjunction with the accompanying drawings.

[0022] A new hot topic detection system based on the attention mechanism of the present invention, such as figure 1 shown, including:

[0023] The data preprocessing module is used to preprocess the microblog text to provide highly available and high-quality data for the calculation in the later stage;

[0024] Hierarchical sequence model, used to train bidirectional cyclic neural network model, use bidirectional LSTM network to train input microblog text, obtain high-quality topic vector representation, and improve prediction accuracy;

[0025] The word sequence encoding layer is used to vectorize each word in the sentence to form a preliminary vector representation; use word2vec to perform a preliminary vectorization of sentence segmentation;

[0026] The word-level attention layer is used to consider the attention mechanism to form a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a newly-developing hot topic detection system based on an attention mechanism. The newly-developing hot topic detection system comprises a data preprocessing module, a hierarchical sequence model, a word sequence coding layer, a word level attention layer, a sentence level coding layer, a sentence level attention layer and a topic prediction module. By use of the newly-developing hot topic detection system based on the attention mechanism, on the basis of a bidirectional recurrent neural network, two layers of attention mechanisms are added to reinforce the vector representation of a topic, a hierarchical recurrent neural network model based on the attention mechanism is put forward, each dimension of data in social media can be used as characteristics to train high-quality topic vector characteristics, so that the newly-developing hot topic is detected, and the detection ability of the newly-developing hot topic is improved.

Description

technical field [0001] The invention relates to the field of natural language processing, in particular to an emerging hot topic detection system based on an attention mechanism. Background technique [0002] At present, there are some emerging hot topic detection methods that are biased towards topic content features (static features). The growth rate, etc., are used as real features, and then use a judgment function (such as a classification algorithm) to determine whether it is an emerging hot topic. [0003] At present, there are still some biases that use the propagation characteristics of topics to detect emerging hot topics. The basic idea is to use related specific data structures (such as: trees, graphs, particle swarms, neural networks, etc.) to calculate or train topics. Features, here the features tend to be disseminated, that is, there is a connection between the data, not static. Then use the classification algorithm to find out whether the topic is an emergi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F17/30
CPCG06F16/35G06F40/289G06F40/30
Inventor 廖祥文陈国龙殷明刚杨定达
Owner FUZHOU UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More