Chinese short text sentiment classification method based on fields

A sentiment classification technique for short text, applied in the field of machine learning. It addresses problems of dictionary-based approaches, such as sentiment dictionaries that easily omit words or contain ambiguous or extreme entries, and achieves the effect of improving classification accuracy.

Status: Inactive; Publication Date: 2015-11-18
GUANGDONG UNIV OF PETROCHEMICAL TECH

AI Technical Summary

Problems solved by technology

Both technical solutions have advantages and disadvantages. The algorithm of the former is often simpler, has lower complexity, and does not require a large labeled corpus; however, sentiment dictionaries are prone to omissions and to ambiguous or extreme entries, and the differences in sentiment that the same emotional word produces in different scenarios are often difficult to capture.



Embodiment Construction

[0041] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0042] Figure 1 is a schematic flow diagram of an embodiment of the field-based Chinese short text sentiment classification method provided by the present invention, which includes the following steps:

[0043] S101. Perform data preprocessing on the short text, including sentence segmentation, word segmentation, stop word filtering, and domain division.

[0044] Specifically, as shown in Figure 2, step S101 includes the following steps:

[0045] S1011. Using punctua...
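
The preprocessing in step S101 can be illustrated with a minimal sketch. This is not the patented implementation: the stop-word list, vocabulary, and domain keyword sets are hypothetical, word segmentation is swapped for a simple forward maximum matching against that small vocabulary, and domain division is approximated by keyword overlap, since the excerpt does not specify how either step is performed.

```python
import re

# Hypothetical resources: the excerpt does not list the actual stop words,
# segmentation vocabulary, or domain keywords.
STOP_WORDS = {"的", "了", "很"}
VOCAB = {"手机", "屏幕", "清晰", "电池", "续航", "电影", "剧情", "精彩"}
DOMAIN_KEYWORDS = {
    "electronics": {"手机", "屏幕", "电池"},
    "film": {"电影", "剧情"},
}
MAX_WORD_LEN = 4


def split_sentences(text):
    """Split text into sentences on Chinese and ASCII end punctuation."""
    parts = re.split(r"[。！？!?；;]+", text)
    return [p.strip() for p in parts if p.strip()]


def segment(sentence):
    """Forward maximum matching; unknown characters become single tokens."""
    tokens, i = [], 0
    while i < len(sentence):
        for size in range(min(MAX_WORD_LEN, len(sentence) - i), 0, -1):
            word = sentence[i:i + size]
            if size == 1 or word in VOCAB:
                tokens.append(word)
                i += size
                break
    return tokens


def remove_stop_words(tokens):
    """Drop tokens that appear in the stop-word list."""
    return [t for t in tokens if t not in STOP_WORDS]


def assign_domain(tokens):
    """Return the domain with the most keyword hits, or None if there are none."""
    best, best_hits = None, 0
    for domain, keywords in DOMAIN_KEYWORDS.items():
        hits = sum(1 for t in tokens if t in keywords)
        if hits > best_hits:
            best, best_hits = domain, hits
    return best


def preprocess(text):
    """Run sentence splitting, segmentation, stop-word filtering, domain division."""
    sentences = [remove_stop_words(segment(s)) for s in split_sentences(text)]
    all_tokens = [t for s in sentences for t in s]
    return {"sentences": sentences, "domain": assign_domain(all_tokens)}


result = preprocess("手机的屏幕很清晰。电池续航差！")
```

In practice a dedicated Chinese word segmenter would replace `segment`; the maximum-matching version is used here only to keep the sketch self-contained.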


Abstract

The present invention discloses a field-based Chinese short text sentiment classification method, which includes: preprocessing the short text, including sentence segmentation, word segmentation, stop-word filtering, and field division; constructing a field-oriented sentiment dictionary; extracting and matching sentiment paths, extracting candidate words and determining their polarity, and calculating TF-IDF weights of sentiment words, using the field-oriented sentiment dictionary and a corpus as the data set; extracting sentiment features of the short text; and training on the corpus, or determining unknown sentiment classes, with a random forest algorithm. Experiments show that the scheme provided by the present invention achieves a high accuracy rate.
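
As an illustration of the TF-IDF weighting mentioned above, the following sketch computes weights for sentiment words over a small tokenized corpus. It assumes the common textbook form (term frequency times the logarithm of the inverse document frequency); the excerpt does not state which TF-IDF variant the invention uses, and the function name, corpus, and sentiment word set are hypothetical.

```python
import math
from collections import Counter


def tf_idf_weights(documents, sentiment_words):
    """Return one {word: weight} dict per document, for sentiment words only.

    tf  = occurrences of the word in the document / document length
    idf = log(N / df), where df is the number of documents containing the word
    """
    n_docs = len(documents)

    # Document frequency: count each sentiment word once per document.
    doc_freq = Counter()
    for doc in documents:
        doc_freq.update(set(doc) & sentiment_words)

    weights = []
    for doc in documents:
        counts = Counter(doc)
        row = {}
        for word in sentiment_words:
            if counts[word]:
                tf = counts[word] / len(doc)
                idf = math.log(n_docs / doc_freq[word])
                row[word] = tf * idf
        weights.append(row)
    return weights


# Hypothetical tokenized corpus and sentiment word set.
corpus = [["屏幕", "清晰", "清晰"], ["电池", "差"], ["屏幕", "差"]]
weights = tf_idf_weights(corpus, {"清晰", "差"})
```

The resulting per-document weights could serve as part of a feature vector for a classifier such as a random forest; how the invention combines them with other sentiment features is not shown in this excerpt.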

Description

Technical field

[0001] The invention relates to the technical field of machine learning, and in particular to a field-based Chinese short text sentiment classification method.

Background technique

[0002] The rapid development of the Internet has made social networking and e-commerce platforms increasingly popular with users, including domestic and foreign platforms such as Facebook, Twitter, Sina Weibo, Douban, Jingdong, and Taobao. The volume of data on these platforms has exploded, including product reviews, opinions on current events, and records of interesting events in life or of changes in mood. Short text is a common form for this data and often carries emotional coloring or subjective opinion. Mining the emotions that users express in this kind of short text helps different users make better decisions or provide better services, such as providing more pertinent recommendations to users when they choose, and helping e-commerce com...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
CPC: G06F16/35, G06F16/374
Inventor: 舒磊, 牛建伟, 毛凯莉, 傅树霞, 赵晓轲
Owner: GUANGDONG UNIV OF PETROCHEMICAL TECH