Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A method for mine hot events on microblog

A microblogging hotspot, microblogging technology, applied in the field of data processing, to achieve the effect of improving efficiency

Inactive Publication Date: 2019-02-12
KUNMING UNIV OF SCI & TECH
View PDF4 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The technical problem to be solved by the present invention is to provide a microblog hot event mining method for the limitations and deficiencies of the existing technology, which mainly solves the lack of a large amount of complete training corpus in the preprocessing process of microblog and the The irregularity of microblog data leads to large errors in the process of identifying entities, which makes the accuracy of microblog event extraction low, so as to improve the efficiency of microblog hot event mining

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method for mine hot events on microblog
  • A method for mine hot events on microblog
  • A method for mine hot events on microblog

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0031] Example 1: Such as Figure 1-4 As shown, a microblog hot event mining method first crawls microblog data to build a microblog database; then preprocesses the crawled microblog data; then performs named entity recognition on the preprocessed microblog data; Then, according to the results of preprocessing and named entity recognition, the entities and event triggers of the microblog data are extracted to determine the events expressed by each microblog, and finally the similarity between microblogs is calculated, and the similarity results and publisher are analyzed. Information and release time, get hot events on Weibo.

[0032] The specific steps are:

[0033] ① Crawling Weibo data and establishing a Weibo database.

[0034] ②Preprocess the crawled microblog data.

[0035] ③ Perform named entity recognition on the pre-processed Weibo data.

[0036] ④According to the results of preprocessing and named entity recognition, extract entities and event trigger words of Weibo data.

...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a micro-blog hot event mining method, belonging to the technical field of data processing. Firstly, the micro-blog data is crawled and the micro-blog database is established.Then preprocessing the crawled micro-blog data; Then the pre-processed micro-blog data is identified by named entity recognition. Then, according to the results of preprocessing and named entity recognition, the entities and event triggers of micro-blog data are extracted to determine the events expressed by each micro-blog, and finally, similarity between micro-blogs is calculated, similarity results are analyzed, publisher information and publishing time are obtained, and micro-blog hot events are obtained. As compare with that prior art, This method mainly solves the problem of lacking a large number of complete training corpus in the pre-processing process of micro-blog and the non-standardization of the naming entity recognition link, which will lead to a large error in the process ofentity recognition, so that the accuracy of micro-blog event extraction is low, in order to improve the efficiency of micro-blog hotspot event mining.

Description

Technical field [0001] The invention relates to a method for mining microblog hot events, belonging to the technical field of data processing. Background technique [0002] In recent years, a large number of social media platforms such as Weibo have emerged. As a representative new type of communication media, Weibo has now become the most popular network tool for people to express ideas, share information, and exchange opinions. Formal news text, Weibo is conducive to more accurate and timely extraction of richer event information. Through the mining of hot events in Weibo, we can timely understand the large and small events that occurred at home and abroad, and understand people’s views on various events. Responses and opinions, screening out useful information, have a very good auxiliary role for real-time monitoring, risk assessment and analysis, and decision support. [0003] Generally, due to the fast information update speed of microblog data, traditional microblog preproce...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/951G06F16/2458G06F17/27
CPCG06F40/295
Inventor 龙华吴睿熊新邵玉斌杜庆治
Owner KUNMING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products