Real-time news content-oriented stream topic evolution tracking method

A topic and content technology, applied to streaming topic evolution tracking, that addresses three problems of existing methods: they cannot make full use of the differences among topics, they cannot measure the correlation between successive topic-mining results, and their results lack rationality and accuracy.

Active Publication Date: 2018-09-07
SOUTHEAST UNIV
Cites: 4. Cited by: 8.

AI Technical Summary

Problems solved by technology

However, the traditional LDA method cannot measure the relationship between earlier and later topic-mining results, and the OLDA method, when describing that relationship, cannot make full use of the differences among topics. As a result, its output lacks rationality and accuracy.



Embodiment Construction

[0045] The present invention is further illustrated below with reference to specific embodiments. It should be understood that these embodiments only illustrate the invention and are not intended to limit its scope. After reading the present disclosure, modifications of equivalent form made by those skilled in the art all fall within the scope defined by the appended claims of this application.

[0046] A streaming topic evolution tracking method for real-time news content (referred to as the dELDA method) is implemented; its overall workflow is shown in Figure 1. The method first divides the news content collected in real time from the Internet into batches by time period and uses the LDA method to mine preliminary topic results for each batch. It then performs named entity recognition within the batch and calculates the topic Ass...
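The batching step in [0046] can be pictured with a small Python sketch. It only illustrates "divide by time period" and is not the patent's implementation: the function name `batch_by_period`, the `(timestamp, text)` item shape, and the fixed-length window are all assumptions made for this example. Each resulting batch would then be handed to an LDA implementation of the reader's choice.

```python
from collections import defaultdict
from datetime import datetime, timedelta


def batch_by_period(items, period, origin):
    """Group (timestamp, text) pairs into consecutive fixed-length time windows.

    `items`  : iterable of (datetime, str) pairs (hypothetical input shape)
    `period` : timedelta giving the window length (an assumed parameter)
    `origin` : datetime marking the start of window 0
    Returns the non-empty batches as lists of texts, ordered by window.
    """
    batches = defaultdict(list)
    for timestamp, text in items:
        # timedelta // timedelta yields an integer window index
        window = (timestamp - origin) // period
        batches[window].append(text)
    return [batches[w] for w in sorted(batches)]


if __name__ == "__main__":
    start = datetime(2018, 1, 1)
    news = [
        (datetime(2018, 1, 1, 0, 10), "story one"),
        (datetime(2018, 1, 1, 1, 5), "story two"),
        (datetime(2018, 1, 1, 0, 50), "story three"),
    ]
    for number, batch in enumerate(batch_by_period(news, timedelta(hours=1), start)):
        print(number, batch)
```

Windows that contain no news are skipped here; a production system might prefer to emit empty batches so that downstream bookkeeping stays aligned with wall-clock time.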


Abstract

The invention discloses a stream topic evolution tracking method for real-time news content. First, news content collected in real time is divided into batches by time period, and the LDA method is used to mine a preliminary topic result for each batch. Second, named entity recognition is performed within the batch, and the correlation between topics and entities is calculated to update the entity link relationships in an entity library. Third, the terms within each topic are clustered to obtain the correspondence between each topic and its intra-topic clusters, and the topic results are stored in a topic library. Finally, popularity information is calculated for the topics and their intra-topic clusters, and the LDA topic-mining parameters are dynamically updated according to that information for topic evolution tracking of the next batch. The method can mine both the topic features and the cluster features of terms within topics in real-time news content, makes full use of the differences among topics and among the clusters within a topic, and dynamically updates the LDA topic-mining parameters.
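The final step, feeding popularity back into the LDA parameters for the next batch, can be sketched as follows. The abstract does not give the popularity formula or the update rule, so everything here is an assumption chosen for illustration: popularity is taken as each topic's share of the total document-topic mass, and the next batch's topic-word prior is a base value plus the previous topic-word distribution scaled by that popularity. The names `topic_popularity`, `next_topic_word_prior`, `base_beta`, and `strength` are hypothetical.

```python
def topic_popularity(doc_topic):
    """Share of total topic mass each topic receives across a batch.

    `doc_topic` is a list of per-document topic distributions
    (rows sum to 1). This definition of popularity is an assumption.
    """
    num_topics = len(doc_topic[0])
    totals = [sum(row[k] for row in doc_topic) for k in range(num_topics)]
    grand_total = sum(totals)
    return [t / grand_total for t in totals]


def next_topic_word_prior(topic_word, popularity, base_beta=0.01, strength=1.0):
    """Build a topic-word prior for the next batch (assumed update rule).

    Each topic's previous word distribution is scaled by its popularity,
    so popular topics carry more weight into the next round of mining.
    """
    return [
        [base_beta + strength * pop * weight for weight in row]
        for row, pop in zip(topic_word, popularity)
    ]


if __name__ == "__main__":
    doc_topic = [[0.5, 0.5], [1.0, 0.0]]   # two documents, two topics
    topic_word = [[0.5, 0.5], [1.0, 0.0]]  # two topics, two vocabulary terms
    pop = topic_popularity(doc_topic)
    print(pop)
    print(next_topic_word_prior(topic_word, pop))
```

The same weighting idea could be applied at the level of intra-topic clusters, which the abstract also mentions, by computing a popularity value per cluster and scaling only the terms that belong to it.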

Description

Technical field

[0001] The present invention relates to a streaming topic evolution tracking method for real-time news content. By dynamically updating the topic-mining parameters, the method performs period-by-period, streaming topic mining and evolution tracking on news content collected in real time from the Internet. It belongs to the field of Internet and natural language processing technology.

Background

[0002] In recent years, with the vigorous development of information technology, the Internet has become the most convenient channel for people to obtain information and follow the news. However, while Internet news resources are extremely rich, they also bring the huge challenge of "information overload": the continuous emergence of massive news content makes it difficult for users to find the useful parts, which in turn hinders the effective utilization of news content. Personalized recommendation technology ...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06F17/27
CPC: G06F40/295, G06F40/30
Inventor: 杨鹏, 张成帅, 李幼平, 张长江
Owner: SOUTHEAST UNIV