Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Microblog topic detection and hotspot evaluation method based on semantic expansion

A technology of semantic expansion and topic detection, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of short microblog text, less information content, sparse data, etc.

Active Publication Date: 2015-08-12
GOONIE INT SOFTWARE BEIJING
View PDF4 Cites 25 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Due to the short text length and low information content of Weibo, there will be serious data sparse problems, resulting in unsatisfactory effects of Weibo text information processing such as Weibo topic detection.
Researchers have made some attempts to solve the problem of microblog data sparsity and improve the effect of topic detection, but such problems have not been completely solved

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Microblog topic detection and hotspot evaluation method based on semantic expansion
  • Microblog topic detection and hotspot evaluation method based on semantic expansion
  • Microblog topic detection and hotspot evaluation method based on semantic expansion

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044]The specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings and examples. The following examples are used to illustrate the present invention, but are not intended to limit the scope of the present invention.

[0045] according to figure 1 Shown, the method that the present invention proposes is to realize by following steps successively:

[0046] Step 1: Filter out low-information microblogs through the following microblog noise data filtering method.

[0047] Step 1.1: Segment the microblog text, remove stop words, select effective words, feature weighting and text representation;

[0048] Step 1.2:

[0049] Calculate the information content index A:

[0050] (1) Obtain core words: Calculate the document frequency of each word in the microblog set, set the frequency threshold η, filter out words whose document frequency is less than the threshold η, and obtain the core word set...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a microblog topic detection and hotspot evaluation method based on semantic expansion, belongs to the field of text information processing, and particularly relates to microblog noisy data filtering and a microblog topic detection and topic hotspot evaluation method and system based on semantic expansion. The method comprises the steps of firstly giving out a microblog noisy data filtering method for filtering microblogs with low information content, then supplementing effective semantic information in microblog comments into microblog semanteme so as to improve the detection effect of microblog topics, finally carrying out microblog topic hotspot evaluation, and thus obtaining hot topics.

Description

technical field [0001] The invention belongs to the field of text information processing, and in particular relates to microblog noise data filtering, a method and system for microblog topic detection and topic popularity evaluation based on semantic expansion. Background technique [0002] Weibo is an information sharing carrier based on user relationships. Users can update and share information within 140 characters through the WEB and various APPs. Users realize the transmission of information through the way of "following and being followed". The forwarding function of the Weibo platform promotes and realizes the rapid dissemination of Weibo among users. [0003] With the rapid development of Weibo, it has been widely used and has become a new type of media with strong influence. Weibo has 4A characteristics (any time, any place, any method, anyone), and anyone can become a disseminator of information anytime, anywhere. Weibo has positive meanings to the government, in...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/27G06F17/30
Inventor 刘磊许志刚李静
Owner GOONIE INT SOFTWARE BEIJING
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products