Unlock instant, AI-driven research and patent intelligence for your innovation.

Microblog Classification Method and Device

A classification method and microblog technology, applied in the field of computer networks, can solve the problems of low recall rate, short microblog data, large data sparsity, etc., and achieve the effect of improving recall rate and accuracy rate

Active Publication Date: 2017-12-19
NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the microblog data is short, a single microblog can be up to 140 characters, the data is sparse, and the recall rate of small-scale annotation is low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Microblog Classification Method and Device
  • Microblog Classification Method and Device
  • Microblog Classification Method and Device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0039] The present invention provides a method and device for classifying microblogs. The present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0040] method embodiment

[0041] According to an embodiment of the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a microblog classifying method and a microblog classifying device. The method comprises the following steps: step1, pre-processing a training corpus collection, segmenting words from the preprocessed training corpus to obtain candidate features, carrying out weight calculation on the candidate features, and selecting features according to weight calculation results so as to obtain final classifying features; step 2, adopting a Bayes classifier to carry out model training according to the final classifying features so as to obtain a classifying model; step 3, classifying microblog files by the Bayes classifier according to the classifying model. By the technical scheme of the invention, the classifying recall rate and accuracy are improved.

Description

technical field [0001] The invention relates to the field of computer networks, in particular to a microblog classification method and device. Background technique [0002] The microblog user base is huge. CNNIC announced in January 2014 that the scale of microblog users in my country is 281 million, and the utilization rate of microblog among netizens is 45.5%. And the number of active users is huge. In December 2013, the number of monthly active users of Sina Weibo reached 129.1 million. Weibo generates massive amounts of data, but users feel that information is scarce and cannot find relevant information. Classification is an effective means of information organization, which can assist users to find the information they need. And classification is the basis of information recommendation and data analysis. [0003] The Weibo data is short, the amount of information is large, the information fragmentation is high, and the content is colloquial, so the traditional classi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30G06F17/27
CPCG06F16/35G06F40/211
Inventor 杜翠兰李鹏霄孙旷怡刘晓辉赵淳璐翟羽佳段东圣杨博钮艳
Owner NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT