Unlock instant, AI-driven research and patent intelligence for your innovation.

Method of collecting and processing microblog data in designated regions

A processing method and data collection technology, applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., can solve the problem of not being able to obtain a large amount of microblog data

Active Publication Date: 2018-05-04
HEFEI UNIV OF TECH
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to provide a method for collecting and processing microblog data in a designated area, so as to solve the problem that the prior art crawler method or third-party API call cannot obtain a large amount of microblog data in a designated area

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method of collecting and processing microblog data in designated regions
  • Method of collecting and processing microblog data in designated regions

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] The method of collecting and processing microblog data in a specified area. The area is the area where microblog users publish microblogs, and the geographical boundaries are divided by administrative boundaries; regional microblogs are all microblogs sent by microblog users appearing in the specified area. Include the following steps:

[0019] (1), GEO geographic information seed point selection:

[0020] Set the number of target seed points as N, use rectangular cuts for the specified urban area to determine the city edge; make diagonal lines in the rectangular area, and make parallel lines with a map scale length of 10 kilometers to divide the rectangular area; on each divided parallel line, Use the map scale length of 5 kilometers as the radius to make a circular area to cover the rectangular area in turn, and the circular areas do not overlap; the area on the dividing line less than 5 kilometers is covered with a suitable circular area according to the actual situa...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a designated area microblog data collecting and processing method. According to the method, firstly, GEO geographic information seed point selection is carried out; then, microblog data is obtained; and finally, the microblog data is processed. The designated area microblog data collecting and processing method has the advantages that a parallel multi-user calling mode is adopted for increasing the data collecting flow rate; and multi-information-point coverage is adopted for searching and collecting the microblog data, and the requirements of designated area microblog data collection and processing can be met.

Description

technical field [0001] The invention relates to the field of microblog data processing methods, in particular to a method for collecting and processing microblog data in a designated area. Background technique [0002] With the rise of Weibo, short texts containing a large number of micro-viewpoints and emotional tendencies are rapidly enriched, and Weibo text analysis has become a popular research direction. [0003] In the process of microblog data collection, a large number of microblog data collection strategies usually adopt crawler crawling method. This method has fast crawling speed and high efficiency, but the captured data is noisy. Although it reduces the time of data collection, it is It doubles the preprocessing time for obtaining accurate data; and the crawler is unstable and often faces the danger of being banned by Sina. A small amount of Weibo data is generally collected by calling the third-party API of Sina Weibo. The data collected by this method has less...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
CPCG06F16/9537
Inventor 任福继刘宁全昌勤华磊
Owner HEFEI UNIV OF TECH