Detection method of network sudden hot events based on topic model

A hot event and topic model technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as weakening, no original data optimization processing, etc., and achieve the effect of eliminating interference
CN102289487AInactive Publication Date: 2011-12-21ZHEJIANG UNIV

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
ZHEJIANG UNIV
Publication Date
2011-12-21
Estimated Expiration
Not applicable · inactive patent

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention discloses a network burst hotspot event detection method based on a topic model, which comprises the following steps of: 1, firstly, carrying out participle treatment on a file data set to obtain a word list, a file word relation matrix, a word file distribution matrix and a word date distribution matrix; 2, screening the data set according to relevant words in an emerging process of network hotspot events and burst characteristics of a file; 3, obtaining characteristic words and characteristic texts of the burst hotspot events through topic modeling; and 4, figuring out attention date distribution of the hotspot events. Compared with the prior art, the invention has the advantages that the topic modeling is carried out by using the topic model, thus a topic event can be more accurately described; and a burst characteristic computing method of words is introduced and then the data set is screened, thus time-unrelated topics are removed through filtering, and an actual burst hotspot event is obtained.
Need to check novelty before this filing date? Find Prior Art

Description

Technical field

[0001] The invention relates to the field of topic models and event detection, and in particular to a method for detecting network hotspot events based on topic models. Background technique

[0002] With the rapid development and wide application of network technology, the Internet has gradually become an important channel for people to obtain information. There are hundreds of millions of network information emerging worldwide every day. How to detect sudden hot events in massive network information has become An emerging research topic.

[0003] Traditional topic models, such as PLSA (Probabilistic Latent Semantic Analysis), LDA (Latent Dirichlet Allocation), etc., can be used to perform topic mining on a document set. They approximate each topic in the document set through iterative calculations. However, these topic models are based on the BOW (Bag Of Words) model, which only considers the affiliation of words and documents, ignoring the time information of wor...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More