Method for sorting and processing internet public feelings information

A processing method and technology of public opinion information text, which is applied in the field of classification and processing of Internet public opinion information, can solve problems such as unsatisfactory text classification effect, huge amount of similarity calculation, and reduced problem complexity

Inactive Publication Date: 2009-04-22
UNIV OF ELECTRONICS SCI & TECH OF CHINA
View PDF0 Cites 58 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Due to the simplification of text classification into space vector operations, the complexity of the problem is greatly reduced
[0006] The traditional text classification processing method based on the vector space model has the following shortcomings: First, the "item" in the model is simply taken as the feature words in the text, and there is a certain correlation between the feature words, so the distance between the vectors will be caused. The calculation is not accurate enough, resulting in unsatisfactory text classification results; second, it is limited to the classification mode of the usual similarity measure, simply corresponding the text to a feature vector in a high-dimensional space, and the amount of similarity calculation is huge

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for sorting and processing internet public feelings information
  • Method for sorting and processing internet public feelings information
  • Method for sorting and processing internet public feelings information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] Specific embodiments of the present invention are described below, and it should be noted that in the following description, when detailed descriptions using known functions and designs may dilute the main content of the present invention, these descriptions will be ignored here .

[0038] figure 1 It is a flow chart of a specific embodiment of the method for classifying and processing Internet public opinion information of the present invention.

[0039] In this embodiment, the method for classifying and processing Internet public opinion information includes the following steps:

[0040] (1), the Internet public opinion information is divided into M categories, download and extract the public opinion information from Internet sites, manually classify it into one of the M category public opinion information, and store it in the corresponding file directory in the form of a text file, Select f public opinion information texts for each category as training texts. The ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a classified processing method of internet public information. The method comprises the following steps: selecting a classified public information text as a training text, and parsing words; selecting and screening nouns and verbs, acquiring feature words by extraction, vectorizing the training text, then acquiring a PCA transformation feature matrix, a BP neural network model, and a decision tree rule; performing dimension reduction on vectors of the vector matrix of the public information text to be classified by the PCA transformation feature matrix, and transforming the vectors by the BP neural network model to obtain an output vector which has the same number of dimensions as the classified number, and then performing matching by the decision tree rule, and determining that the public information text to be classified belongs to the public information category marked by the rule if the matching is successful. As the PCA transformation converts a feature word space related to a high dimension into a low-dimensional orthogonal feature space, the disadvantage of inaccurate classification is solved; meanwhile, the decision tree rule is used for classification without data similarity comparison so that a plurality of data sources can be processed in a short time.

Description

technical field [0001] The invention belongs to the technical field of Internet information release monitoring, and specifically relates to a method for classifying and processing Internet public opinion information. Background technique [0002] With the rapid development of Internet technology, people can more conveniently browse the web online, read news, post posts and comments, and edit personal web pages. The generation, dissemination and consumption of information by users plays an important role in the development of the Internet. [0003] Due to the virtuality, concealment, divergence, penetration and randomness of Internet communication, Internet public opinion gradually poses a threat to social public security in the form of "content threat". Public opinion refers to the socio-political attitudes that the public generates and holds toward social managers around the occurrence, development, and changes of intermediary social events within a certain social space. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06N3/06
Inventor 高辉傅彦陈旭
Owner UNIV OF ELECTRONICS SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products