A method for microblog topic detection and popularity evaluation based on semantic expansion

A semantic expansion and topic detection technology, applied in special data processing applications, instruments, electronic digital data processing, etc. It addresses the problems that microblog texts are short and their data sparse, which makes microblog text information processing less effective than desired.

Active Publication Date: 2017-09-29
GOONIE INT SOFTWARE BEIJING
4 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0004] Because Weibo posts are short and carry little information, they suffer from a serious data sparsity problem, which leads to unsatisfactory results in Weibo text information processing tasks such as Weibo topic detection.
Researchers have made some attempts to address microblog data sparsity and improve topic detection, but the problem has not been completely solved.

Examples

Embodiment Construction

[0044] The specific embodiments of the present invention are described in further detail below with reference to the accompanying drawings and examples. The following examples illustrate the present invention but are not intended to limit its scope.

[0045] As shown in Figure 1, the method proposed by the present invention is carried out through the following steps in sequence:

[0046] Step 1: Filter out low-information microblogs through the following microblog noise data filtering method.

[0047] Step 1.1: Segment the microblog text into words, remove stop words, select effective words, apply feature weighting, and build the text representation;

[0048] Step 1.2:

[0049] Calculate the information content index A:

[0050] (1) Obtain core words: Calculate the document frequency of each word in the microblog set, set the frequency threshold η, filter out words whose document frequency is less than the threshold η, and obtain the core word set...
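Steps 1.1 and (1) above can be illustrated with a minimal Python sketch. It is not the patent's implementation: whitespace splitting stands in for a real word segmenter, the stop-word list and the sample microblogs are invented for illustration, the function names `preprocess` and `core_word_set` are hypothetical, and document frequency is assumed to be a raw count of microblogs containing the word (the source does not say whether η is a count or a ratio).

```python
from collections import Counter

# Hypothetical stop-word list; the source does not specify one.
STOP_WORDS = {"the", "a", "is", "of", "to"}


def preprocess(text, stop_words=STOP_WORDS):
    """Simplified Step 1.1: segment the text and drop stop words.

    Whitespace splitting is an assumption standing in for a real
    word segmenter; feature weighting is omitted here.
    """
    tokens = text.lower().split()
    return [t for t in tokens if t not in stop_words]


def core_word_set(docs, eta):
    """Return the words whose document frequency is at least eta.

    Document frequency is assumed to be the number of microblogs
    in which the word appears at least once.
    """
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))  # count each word once per microblog
    # Words with document frequency below eta are filtered out.
    return {word for word, count in df.items() if count >= eta}


# Invented sample data, for illustration only.
microblogs = [
    "the earthquake hit the city",
    "earthquake relief is arriving",
    "a sunny day in the city",
]
docs = [preprocess(m) for m in microblogs]
core = core_word_set(docs, eta=2)
print(sorted(core))
```

With η = 2, only words that occur in at least two of the three sample microblogs survive. Raising η shrinks the core word set, and lowering it admits rarer words.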

Abstract

A semantic expansion-based microblog topic detection and popularity evaluation method belongs to the field of text information processing, and specifically relates to microblog noise data filtering and to a semantic expansion-based method and system for microblog topic detection and topic popularity evaluation. The present invention first provides a microblog noise data filtering method for filtering out low-information microblogs, then supplements the microblog semantics with the effective semantic information in microblog comments to improve microblog topic detection, and finally evaluates the popularity of the detected Weibo topics to obtain the hot topics.

Description

Technical field

[0001] The invention belongs to the field of text information processing, and in particular relates to microblog noise data filtering and to a method and system for microblog topic detection and topic popularity evaluation based on semantic expansion.

Background technique

[0002] Weibo is an information sharing medium based on user relationships. Users can update and share information of up to 140 characters through the web and various apps. Information is transmitted through "following and being followed" relationships, and the forwarding function of the Weibo platform enables Weibo posts to spread rapidly among users.

[0003] With its rapid development, Weibo has come into wide use and has become a new type of media with strong influence. Weibo has the 4A characteristics (any time, any place, any method, anyone): anyone can become a disseminator of information anytime, anywhere. Weibo has positive meanings to the government, in...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F17/27, G06F17/30
Inventor: 刘磊, 许志刚, 李静
Owner: GOONIE INT SOFTWARE BEIJING