Unlock instant, AI-driven research and patent intelligence for your innovation.

Microblog text level subject finding method and system based on seed words

A technology of hierarchical topics and discovery methods, applied in the field of microblog text hierarchical topic discovery methods and systems, can solve the problems of difficulty in collecting, analyzing and sorting microblogs, time-consuming and laborious, and low efficiency.

Active Publication Date: 2014-08-06
TSINGHUA UNIV
View PDF4 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, as the number of microblog texts increases, it becomes extremely difficult, time-consuming, laborious, and inefficient to manually collect and analyze relevant microblogs.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Microblog text level subject finding method and system based on seed words
  • Microblog text level subject finding method and system based on seed words
  • Microblog text level subject finding method and system based on seed words

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention.

[0031] The method and system for discovering topics of microblog text levels based on seed words according to the embodiments of the present invention will be described below with reference to the accompanying drawings.

[0032] figure 1 is a flow chart of a seed word-based microblog text hierarchical topic discovery method according to an embodiment of the present invention. Such as figure 1 As shown, the microblog text hierarchy topic discovery method based on the seed word according to an embodiment of the present invention comprise...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a microblog text level subject finding method based on seed words. The method comprises the following steps: acquiring data information from the internet, wherein the data information comprises microblog texts; analyzing the microblog texts to acquire a seed word cluster serving as priori knowledge; conducting level subject clustering on the microblog texts to generate a level subject model; integrating the priori knowledge to the level subject model to find level subjects of the microblog texts. By means of the microblog text level subject finding method based on seed words, the level subjects and subject distribution of the texts can be fast extracted from the microblog texts, and the level granulation relation between the released subjects is conveniently found. The invention further provides a microblog text level subject finding system based on the seed words.

Description

technical field [0001] The present invention relates to the fields of computer application technology and Internet technology, in particular to a method and system for discovering topics of microblog text levels based on seed words. Background technique [0002] With the continuous popularization of the Internet and the rapid development of web2.0, the public's comments on social events, hot people and e-commerce products conveyed by the Internet have received special attention from all parties. Based on the characteristics of information dissemination, the Internet has the interactivity of multi-modal information, which can quickly and effectively disseminate the opinions of netizens, thus forming a certain orientation of social public opinion. Orientation and other aspects have great advantages compared with traditional media. Users are now not only acting as a simple information browser, but more often than not, they are also a publisher of information. For example, for...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/35
Inventor 徐华王玮
Owner TSINGHUA UNIV