Method and system for automatically computing subject evolution trend in the internet

An automatic computing and Internet technology, applied in computing, instrumentation, electrical and digital data processing, etc., can solve the problem that the theme detection system cannot analyze the evolution trend of computing theme, and achieve the effect of high computing and storage efficiency and strong practicability

Active Publication Date: 2008-07-30
NEW FOUNDER HLDG DEV LLC +2
View PDF1 Cites 25 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] In view of the defect that the existing topic detection system cannot analyze and calculate the topic evolution trend, the purpose of the present invention is to analyze the topic evolution trend over time by calculating the similarity relationship between topics in different time periods in real time, and draw the topic evolution Trend

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for automatically computing subject evolution trend in the internet
  • Method and system for automatically computing subject evolution trend in the internet
  • Method and system for automatically computing subject evolution trend in the internet

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] Further illustrate the method of the present invention below in conjunction with embodiment and accompanying drawing:

[0052] Such as figure 1 As shown, a method for automatically calculating the evolution trend of topics on the Internet includes the following steps:

[0053] (1) Collect Internet text information and preprocess it;

[0054] In this embodiment, the Founder Radar webpage collection tool is used to collect news text information on the Internet in real time. The text sources collected include more than a dozen major news websites such as Sina, Sohu, and NetEase. Because the webpage text contains a lot of HTML tags, as well as irrelevant information such as advertisements and navigation bars, the downloaded webpage needs to be preprocessed by HTML tag filtering, text extraction, and time extraction to obtain important text content and time stamps of the webpage. The time stamp refers to the publication time of the text. If the publication time of the text...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method and a system for automatically calculating the evolutional trend of a topic on the internet. The prior art only can simply analyze the topic (or event) from a document in a centralization way, and give out the document information contained by the topic. In fact, each topic changes with the variation of the time, and the topic evolutes constantly on the time dimension. On the basis of the prior topic detection system, the invention periodically calculates the relation between the topic in the current period and the topic in the last period and stores the relations. The system takes out the relations between the topic information which is corresponded to a plurality of periods and the topics, and can visually display the evolutional trend of the topic over time at a client in a graphic mode according to the time range input by the user. By adopting the method of the invention, a more three-dimensional topic analysis result can be provided for the user, and the understanding and the recognition of the user to the topic are deepened, thereby helping the user to make a decision. The method is widely applicable to the intelligent information processing.

Description

technical field [0001] The invention belongs to the technical field of intelligent information processing, and in particular relates to a method and system for automatically calculating the evolution trend of topics on the Internet. Background technique [0002] With the explosive growth of text information on the Internet, it is becoming more and more difficult for people to obtain interesting topic (event) information in time from massive text information. Topic Detection technology (Topic Detection, also known as topic detection technology, event detection technology) is committed to automatically detecting topics from massive texts in real time, providing topic information to users, and users can understand the importance of massive texts by browsing topics. content. [0003] According to the definition of the International Topic Detection and Tracking Group (see The 2002 topic detection and tracking (TDT2002) task definition and evaluation plan, version 1.1, prepared b...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 万小军冯涛黄小江杨霙杨建武吴於茜路斌
Owner NEW FOUNDER HLDG DEV LLC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products