Internet information classification method and system

A technology of Internet information and classification methods, applied in the field of Internet information classification methods and systems, can solve problems such as low efficiency and inaccurate statistical results

Active Publication Date: 2012-03-21
SHENZHEN SHI JI GUANG SU INFORMATION TECH
View PDF1 Cites 38 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] In order to solve the problem of inaccurate and inefficient statistical results caused by manually counting user comments in the prior art, ...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Internet information classification method and system
  • Internet information classification method and system
  • Internet information classification method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0061] The first embodiment of the present invention proposes a method for classifying Internet information, the process of which is as follows figure 1 shown, including:

[0062] Step 101, obtaining comments input by users, and performing word segmentation on the comments to obtain keywords;

[0063] Step 102, matching the keywords with a preset keyword library to obtain the emotional value corresponding to each keyword; the preset keyword library stores keywords of at least two categories, and each category At least one keyword and the sentiment value corresponding to the keyword are respectively pre-stored in ;

[0064] Step 103. Obtain the emotional value of the evaluation according to the emotional value corresponding to each keyword.

[0065] The Internet information classification method proposed by the embodiment of the present invention can obtain the emotional value corresponding to the keyword by matching the keyword, and obtain the emotional value of the user eva...

Embodiment 2

[0067] The second embodiment of the present invention proposes a method for classifying Internet information, which is improved on the basis of the first embodiment, including:

[0068] Step 201. Acquire comments input by users, and perform word segmentation on the comments to obtain keywords.

[0069] Wherein, the keywords may include nouns, verbs, adjectives, and adverbs in the comments. This is because adverbs are used for modification, which can indicate the intensity of tone, or indicate negation or affirmation; while the words expressing emotion in the prior art can be nouns, verbs, and adjectives. Wherein, performing word segmentation on an article is a prior art, which will not be repeated here.

[0070] Since the comment entered by the user can be a word, a sentence, or a piece of text. Therefore, when evaluating a sentence or a piece of text, n keywords will be obtained during word segmentation. For example, in the report on the Wenchuan mother in the Wenchuan ear...

Embodiment 3

[0113] The third embodiment of the present invention proposes a kind of Internet information classification system, its structure is as follows Figure 5 shown, including:

[0114] The word segmentation module 1 is used to obtain the comments input by the user, and perform word segmentation on the comments to obtain keywords;

[0115] The preset keyword library module 2 is used to store keywords of at least two categories, each of which is pre-stored with at least one keyword and the corresponding emotional value of the keyword;

[0116] Matching module 3, is used for the keyword that described participle module obtains and the keyword preset in the preset keyword storehouse module are matched, to obtain the emotion value corresponding to each keyword; And calculate the emotion of comment with this value.

[0117] The Internet information classification system proposed by the embodiment of the present invention can obtain the emotional value corresponding to the keyword by m...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an Internet information classification method and system, and belongs to the technical field of computers. The system provided by the embodiment of the invention comprises a word segmentation module, a preset keyword library module and a matching module. The method comprises the following steps of: obtaining comments input by users, and segmenting the comments to obtain keywords; matching the keywords with a preset keyword library to obtain an emotion value corresponding to each keyword; storing at least two classes of keywords in the preset keyword library, pre-storing at least one keyword and the emotion value corresponding to the keyword in each class respectively; and obtaining the emotion value of the comment according to the emotion value corresponding to each keyword. In the embodiment of the invention, by the preset keyword library and the emotion values corresponding to the keywords, the emotion values corresponding to the comments of the users obtained by a word segmentation and matching mode are obtained. Accordingly, compared with an artificial statistic method in the prior art, the statistical results are more accurate and efficient.

Description

technical field [0001] The invention relates to the field of computer technology, in particular to a method and system for classifying Internet information. Background technique [0002] With the development of Internet technology and the popularity of the Internet, more and more Internet users are not only satisfied with simply obtaining information, but also hope to participate in it. Therefore, the function of commenting on articles published on the Internet has emerged as the times require. Commenting on an article means that users who browse articles sent on the Internet such as news and blogs can express their own opinions on the article by inputting a piece of text. Chinese is extensive and profound. According to Xu Xiaoying's paper "Research on Emotional Division in Chinese Emotional System" published in the first volume of "The First Chinese Affective Computing and Intelligent Interaction Academic Conference" in 2003, Chinese is divided into 8 types and 33 subtypes ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 张鹏马尧
Owner SHENZHEN SHI JI GUANG SU INFORMATION TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products