Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Emerging hot topic detection system on basis of multiclass feature fusion

A hot topic and feature fusion technology, applied in network data retrieval, other database retrieval, special data processing applications, etc., can solve the problems of lack of topic text context semantic analysis, insufficient accuracy, poor topic tracking effect, etc.

Active Publication Date: 2018-08-14
FUZHOU UNIV
View PDF4 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, although these method models have achieved corresponding results to a certain extent, they have also promoted the development of topic detection tasks; but there are also shortcomings. Methods based on static characteristics of topic content are certainly accurate in predicting emerging hot topics. rate, but it lacks the contextual semantic analysis of the topic text, so the tracking effect of the topic is poor
Those based on propagation features (dynamic features) also take into account the contextual semantic information of the text in the topic, and there is a certain delay in the prediction time of emerging hot topics, so its accuracy is not enough, but it has better performance in topic tracking

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Emerging hot topic detection system on basis of multiclass feature fusion
  • Emerging hot topic detection system on basis of multiclass feature fusion
  • Emerging hot topic detection system on basis of multiclass feature fusion

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] The technical solution of the present invention will be specifically described below in conjunction with the accompanying drawings.

[0052] The present invention is an emerging hot topic detection system based on multi-category feature fusion, such as figure 1 shown, including:

[0053] The data pre-processing module is used to pre-process the data to remove links, special characters, emoticons, punctuation marks, etc. in the text, so as to provide highly available and high-quality data for later stage operations;

[0054] Hierarchical sequence model, used to train bidirectional cyclic neural network model, use bidirectional LSTM network to train input microblog text, obtain high-quality topic vector representation, improve prediction accuracy, and deliver high-quality words for subsequent prediction tasks vector, sentence vector and topic vector;

[0055] The word sequence encoding layer is used to vectorize each word in the sentence to form a preliminary vector rep...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to an emerging hot topic detection system on the basis of multiclass feature fusion. The emerging hot topic detection system comprises a data preprocessing module, a hierarchicalsequence model, a word encoder layer, a sentence level feature solving layer, a topic level feature solving layer and a topic predicting module. The data preprocessing module is used for preprocessing microblog texts; the hierarchical sequence model is used for training bidirectional recurrent neural network models and training inputted microblog texts by the aid of bidirectional LSTM (long shortterm memory) networks; the word encoder layer is used for vectorizing various words in sentences and forming preliminary vector representation; the sentence level feature solving layer is used for constructing static feature vectors for microblog sentences, linking the static feature vectors of the microblog sentences with neural network dynamic features of the sentence level feature solving layer and forming microblog sentence vector representation; the topic level feature solving layer is used for constructing static feature vectors for topics, linking the static feature vectors of the topics with neural network dynamic features of the topic level feature solving layer and forming topic vector representation; the topic predicting module is used for predicting the topics. The emerging hot topic detection system has the advantages that the emerging hot topic detection system is based on bidirectional long short term memory network architecture, the corresponding dynamic features and the corresponding static features are added, and accordingly the emerging hot topic detection capability can be improved.

Description

technical field [0001] The invention relates to the field of natural language processing, in particular to an emerging hot topic detection system based on multi-category feature fusion. Background technique [0002] At present, there are some emerging hot topic detection methods that are biased towards topic content features (static features). The growth rate, etc., are used as real features, and then use a judgment function (such as a classification algorithm) to determine whether it is an emerging hot topic. [0003] At present, there are still some biases that use the propagation characteristics of topics to detect emerging hot topics. The basic idea is to use related specific data structures (such as: trees, graphs, particle swarms, neural networks, etc.) to calculate or train topics. Features, here the features tend to be disseminated, that is, there is a connection between the data, not static. Then use the classification algorithm to find out whether the topic is an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06N3/04G06F17/27G06Q50/00G06K9/62
CPCG06F16/951G06Q50/01G06F40/211G06F40/284G06N3/045G06F18/253
Inventor 廖祥文陈国龙殷明刚杨定达
Owner FUZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products