Method and system for automatic analysis of hotspot subject propagation process in the internet

A technology for automatic analysis and dissemination process, applied in the direction of instrument, calculation, electrical and digital data processing, etc., can solve the problem of inability to analyze the dissemination process of subject information, and achieve the effect of great practical value

Active Publication Date: 2008-07-30
PEKING UNIV
View PDF0 Cites 37 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The above models all describe the characteristics of Internet information dissemination from a macro perspective, and cannot analyze the information dissemination process of a specific topic. Users often need to monitor and track the information dissemination process of hot topics or sensitive topics, and then make decisions

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for automatic analysis of hotspot subject propagation process in the internet
  • Method and system for automatic analysis of hotspot subject propagation process in the internet
  • Method and system for automatic analysis of hotspot subject propagation process in the internet

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0050] Further illustrate the technical scheme of the present invention below in conjunction with embodiment and accompanying drawing:

[0051] In order to meet the needs of users to track and monitor the information dissemination process of a specific topic, the present invention uses the pattern matching method and the similarity comparison method to find the reprint source and the corresponding source document one by one for the documents belonging to the topic, and finally draws the information dissemination process diagram. Specifically, for document b on site B, the reprint source A of document b and the corresponding source document a can be obtained by using the method of the present invention, recorded as site A (document a) -> site B (document b), site A and B become the publishing sites (PublishSite) of documents a and b respectively, site A is the reprint source (SourceSite) of document b, and document a is the source document (SourceDoc) of document b. This metho...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method which can automatically analyze the propagation process of an internet hot subject, as well as a system thereof, and belongs to the intelligent information processing technology. As the textual information on the internet gradually increases, an important subject in the text mining and information retrieval field is to automatically detect and analyze the hot or sensitive subject from large text database, the subject has great use value. The invention utilizes the natural language processing approach to automatically analyze the propagation process of the text document in the given hot or sensitive subject; after the text documents in the subject are arranged in a time order, the reference origin of the current text document is searched by utilizing the pattern matching method from the first text document, if the reference origin isn't found, the reference origin is further judged by utilizing the text document similarity comparative method, at the same time, the corresponding source text document is obtained. At last, the reference relation is intuitively presented to the user in a graphic mode. The method is widely applicable to internet intelligent information processing, public opinion analyzing and monitoring, etc.

Description

technical field [0001] The invention belongs to the technical field of intelligent information processing, and in particular relates to a method and system for automatically analyzing the dissemination process of hot topics on the Internet. Background technique [0002] In recent years, text information on the Internet has grown explosively, including news, forums, blogs and other forms. A characteristic of text information on the Internet is that not all text information is original, and a lot of text information is reproduced from other websites. For example, most of the news on Sina.com is reproduced from other websites or media. And it may be processed with simple editing. Many popular posts on the forum are reproduced from other websites or media. This phenomenon of mass reprinting of text information on the Internet is called Internet information dissemination. People can find hot topics and sensitive topics through topic detection and full-text search, and by analy...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 万小军王栋黄小江余军杨建武吴於茜
Owner PEKING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products