A distributed classification device and method for massive microblog data

A classification method and classification device, applied to the distributed classification of massive microblog data, that address problems such as the difficulty of large-scale data analysis.

Active Publication Date: 2015-10-28
BEIJING INSTITUTE OF TECHNOLOGY
Cites: 2; Cited by: 0

AI Technical Summary

Problems solved by technology

[0003] In existing research on microblog data, the amount of data processed is often relatively small and can be handled in a centralized environment. However, with the rapid growth of microblog data on the Internet, the amount of microblog data now far exceeds the processing power of a single computer, and large-scale data analysis is difficult to achieve with existing methods.


Embodiment Construction

[0052] Embodiments of the present invention will be described in further detail below in conjunction with the accompanying drawings.

[0053] Today's microblog data contains a large amount of emotional information from microblog users, which reflects their views and opinions on events, products, people, and so on; this emotional information has high research and application value. As a result, sentiment analysis of microblog data has gained widespread attention and has broad application prospects, such as opinion analysis, commodity evaluation, and public opinion detection. Therefore, in a specific embodiment of the present invention, microblog data is classified according to its emotional tendency.

[0054] The present invention analyzes massive microblog data in a distributed environment, where the structure of the distributed system is shown in Figure 1. It includes a master node n_0 and multiple slave nodes n_1, n_2, ..., n ...

Abstract

The invention discloses a distributed classification device and a distributed classification method for massive microblog data, and belongs to the field of data mining technology. The classification device has a distributed structure. In the method, each secondary controller uses ELM (extreme learning machine) processing to generate an intermediate result, which is needed to build the final microblog data classifier, and transmits it to a master controller. After receiving the intermediate results from all secondary controllers, the master controller derives the final microblog data classifier according to the ELM principle, and the resulting classifier is then used to classify the microblog data. The device and method overcome the shortcoming that existing extreme learning machine methods can only be applied in a centralized environment and cannot handle ELM classification over large-scale training sample sets. As a result, massive microblog data can be processed and analyzed, the value of the microblog data accumulated during application can be fully exploited, and an effective application service is achieved.
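The division of work described in the abstract can be illustrated with a minimal sketch. The source does not say what the intermediate results are; the sketch below assumes they are the partial products H_i^T H_i and H_i^T T_i of a regularized ELM, a common way to distribute ELM training that is used here only for illustration. The function names, the sigmoid activation, the `ridge` parameter, and the toy data are all hypothetical and do not come from the patent.

```python
import numpy as np

def hidden_output(X, W, b):
    """Sigmoid hidden-layer output H for inputs X (n_samples x n_features)."""
    return 1.0 / (1.0 + np.exp(-(X @ W + b)))

def slave_intermediate(X_part, T_part, W, b):
    """Assumed work of one secondary controller: the partial sums
    U_i = H_i^T H_i and V_i = H_i^T T_i over its local data."""
    H = hidden_output(X_part, W, b)
    return H.T @ H, H.T @ T_part

def master_solve(partials, ridge=1e-3):
    """Assumed work of the master controller: add up the partial sums and
    solve (ridge * I + U) beta = V for the output weights beta."""
    U = sum(u for u, _ in partials)
    V = sum(v for _, v in partials)
    return np.linalg.solve(U + ridge * np.eye(U.shape[0]), V)

def classify(X, W, b, beta):
    """Predict the class with the largest output score."""
    return np.argmax(hidden_output(X, W, b) @ beta, axis=1)

# Toy stand-in for vectorized microblog data with two sentiment classes.
rng = np.random.default_rng(0)
X = rng.normal(size=(300, 5))
labels = (X[:, 0] + X[:, 1] > 0).astype(int)
T = np.eye(2)[labels]                      # one-hot targets

# Random hidden-layer parameters, shared by every node.
W = rng.normal(size=(5, 20))
b = rng.normal(size=20)

# Split the training set across three hypothetical secondary controllers.
parts = np.array_split(np.arange(300), 3)
partials = [slave_intermediate(X[idx], T[idx], W, b) for idx in parts]
beta = master_solve(partials)

# Reference: the same solve on the full data set in one place.
beta_central = master_solve([slave_intermediate(X, T, W, b)])
predictions = classify(X, W, b, beta)
```

Under this assumption, each node only sends two small matrices whose size depends on the number of hidden nodes, not on the number of local samples, and the summed result matches what a single machine would compute on the full data set up to floating-point rounding.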

Description

Technical field

[0001] The invention belongs to the technical field of data mining, and relates to an extreme learning machine classification device and method based on distributed processing technology, in particular to a distributed classification device and method for massive microblog data.

Background technique

[0002] At present, a large amount of information is generated on the Internet all the time, and this information can be expressed in various forms. The amount of information generated by the Weibo platform is also increasing rapidly. A microblog (Micro-Blog) is a form of blog that lets users post timely updates and publicly publish short texts (usually around 140 characters). The rapid development of Weibo allows anyone to become a Weibo user, to publish and read information at any time on any client that supports Weibo, and to interact and express their own emotional information. Weibo has become a powerful information carrier of the Internet, and...

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06N5/00
Inventors: 王国仁, 信俊昌, 聂铁铮, 赵相国, 丁琳琳
Owner: BEIJING INSTITUTE OF TECHNOLOGY