
Method, system and device for discovering and tracking hot topics based on network media data stream

A hot-topic and network-media technology, applied in digital data processing, special-purpose data processing applications, natural language data processing, and related fields. It addresses problems such as incomplete hot-topic coverage, inability to track topics, and low algorithm efficiency.

Pending Publication Date: 2018-11-13
WISERS INFORMATION LTD
Cites: 5; Cited by: 15

AI Technical Summary

Problems solved by technology

[0003] However, existing technical solutions for discovering and tracking hot topics have, to varying degrees, the following defects: 1) they rely on a single data source, so the hot topics discovered are not comprehensive; 2) the brevity and irregularity of social media data lead to low accuracy; 3) methods that simply define a topic as frequently used keywords, phrases, hashtags, or articles at a specific time and place, depending on the type of source data, do not adequately analyze or describe the topic at a rich, semantic level, and do not track it; 4) methods that discover hot topics by segmenting a word co-occurrence graph (word graph for short) with graph search (such as breadth-first search) are inefficient, because the word graph is large and the graph search algorithm has high complexity.



Examples


Embodiment Construction

[0054] Specific implementations of the present invention are described below as embodiments, with reference to the accompanying drawings, so that those skilled in the art can understand its purpose, technical solutions, and advantages. Those skilled in the art will appreciate that these embodiments are only exemplary, and that the concept of the present invention is not limited to the specific examples shown.

[0055] Figure 1 shows an exemplary flowchart of a method 100, provided by the present invention, for discovering hot topics in network media data streams.

[0056] First, in step 101, multiple pieces of text data of different types within the current time window t are obtained from the network media platform at a preset time interval, and the obtained data are preprocessed. In the present invention, the network media data stream that contains text data can be obtained from various fo...
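As a rough illustration of step 101, the sketch below groups incoming items into fixed time windows and applies a simple preprocessing pass. It is only a sketch under stated assumptions: the function names (`window_index`, `preprocess`, `group_by_window`), the `(timestamp, source_type, text)` item layout, and the tokenization rules are all hypothetical and are not taken from the patent, which does not specify its preprocessing in the text available here.

```python
import re
from collections import defaultdict
from datetime import datetime, timedelta


def window_index(ts, start, interval):
    """Return the index t of the time window that contains timestamp ts."""
    return int((ts - start) // interval)


def preprocess(text):
    """Hypothetical preprocessing: lowercase, drop URLs, keep word tokens."""
    text = re.sub(r"https?://\S+", " ", text.lower())
    return re.findall(r"[a-z0-9]+", text)


def group_by_window(items, start, interval):
    """Group (timestamp, source_type, text) items by time window index."""
    windows = defaultdict(list)
    for ts, source_type, text in items:
        t = window_index(ts, start, interval)
        windows[t].append((source_type, preprocess(text)))
    return dict(windows)


if __name__ == "__main__":
    start = datetime(2018, 1, 1)
    interval = timedelta(hours=1)  # assumed "preset time interval"
    items = [
        (datetime(2018, 1, 1, 0, 10), "news", "Storm hits http://x.example/a coast"),
        (datetime(2018, 1, 1, 1, 5), "forum", "Storm warning"),
    ]
    print(group_by_window(items, start, interval))
```

Keeping the source type alongside each token list preserves the "different types" of text data mentioned in the step, so later stages could treat news, forum, and social posts differently if needed.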


Abstract

The invention provides a method for discovering hot topics based on a network media data stream. The method comprises the following steps: obtaining multiple pieces of text data of different types within the current time window t from a network media platform at a preset time interval, and preprocessing the obtained data; recognizing the category of each piece of preprocessed text data by means of a classification algorithm, and filtering the text data according to the recognized categories; computing keyword co-occurrence statistics, using each piece of preprocessed and filtered text data as input; constructing or updating a corresponding keyword graph according to the keyword co-occurrence statistics; segmenting the keyword graph step by step according to a predetermined rule to obtain a series of sub-graphs as a candidate topic set; and clustering and merging the candidate topics in the candidate topic set, based on their corresponding sub-graphs, to obtain the resulting hot topics. The invention also provides a system and a device for discovering hot topics based on a network media data stream.
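The co-occurrence and segmentation steps of the abstract can be sketched as follows. This is a minimal illustration, not the patented method: the abstract only says the keyword graph is segmented "according to a predetermined rule", so the rule used here (raise an edge-weight threshold step by step until every connected component has at most `max_size` keywords) is an assumption, as are all function names. The classification, filtering, and final clustering and merging steps are omitted.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(docs):
    """Count the documents in which each unordered keyword pair co-occurs."""
    counts = Counter()
    for tokens in docs:
        for a, b in combinations(sorted(set(tokens)), 2):
            counts[(a, b)] += 1
    return counts


def components(nodes, edges):
    """Connected components of an undirected graph, using union-find."""
    parent = {n: n for n in nodes}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in edges:
        parent[find(a)] = find(b)
    groups = {}
    for n in nodes:
        groups.setdefault(find(n), set()).add(n)
    return list(groups.values())


def segment(counts, max_size=3):
    """Assumed rule: split oversized sub-graphs by raising the minimum
    edge weight one step at a time; return sub-graphs with 2+ keywords."""
    nodes = {n for pair in counts for n in pair}
    pending, done, threshold = [nodes], [], 1
    while pending:
        next_pending = []
        for group in pending:
            edges = [
                (a, b)
                for (a, b), w in counts.items()
                if w >= threshold and a in group and b in group
            ]
            for comp in components(group, edges):
                (done if len(comp) <= max_size else next_pending).append(comp)
        pending = next_pending
        threshold += 1
    return [c for c in done if len(c) > 1]


if __name__ == "__main__":
    docs = [
        ["storm", "coast", "flood"],
        ["storm", "coast", "flood"],
        ["storm", "flood", "election"],
        ["election", "vote"],
        ["election", "vote"],
    ]
    for topic in segment(cooccurrence_counts(docs)):
        print(sorted(topic))
```

Each pass only recomputes components inside the sub-graphs that are still too large, so the work shrinks as the graph is split. Whether this matches the patent's actual segmentation rule cannot be determined from the text shown here.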

Description

Technical field

[0001] The invention belongs to the technical field of Internet data mining, and in particular relates to a method, system and device for discovering and tracking hot topics based on network media data streams.

Background

[0002] The rapid development of computer, communication, and network technologies has continuously improved the performance of terminal equipment, including PCs, tablet computers, smartphones, and Internet TVs. Correspondingly, Internet media, and Internet social media in particular, has gradually become one of the main ways for the public to obtain news and information, owing to its diversity, speed, interactivity, ease of replication, and multimedia nature. As a communication tool, Internet social media is increasingly used to disseminate news reports, update personal status, publish eyewitness accounts, and exchange ideas. The amount of data on social media is growing rapidly, by millions every day. How to dis...


Application Information

IPC(8): G06F17/30, G06F17/27
CPC: G06F40/289
Inventors: 唐晓丽, 梁颖琪
Owner: WISERS INFORMATION LTD