Establishment method of emotion dictionary based on linguistic data

A construction method and technology of emotion dictionary, applied in the field of corpus-based emotion dictionary construction, can solve the problem of poor utilization of emotion words, save time and labor cost, have strong versatility, and save time and energy.

Inactive Publication Date: 2015-01-28
NANJING UNIV OF SCI & TECH
View PDF2 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But they have a poor utilization rate of the emotional words obtained during the execution of the algorithm

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Establishment method of emotion dictionary based on linguistic data
  • Establishment method of emotion dictionary based on linguistic data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0015] The inventive method comprises the following steps:

[0016] Step 1: Use the corpus that has been subjected to word segmentation processing. These corpora are generally subjective comments. Sort the adjectives that appear in order of frequency from high to low, and extract the top 5%~10% of adjectives that have a certain emotional polarity. According to HowNet (Hownet) mark its emotional polarity as a seed word to form an emotional lexicon;

[0017] Step 2: Break sentences according to punctuation marks to generate short sentences one by one;

[0018] Step 3: Scan short sentences one by one to extract adjectives. Build two temporary lists for storing adjectives of the two sentiment polarities. For the adjective, judge whether it has a negative word modification, and if so, reverse the polarity and store it in the corresponding temporary list; then judge whether the short sentence starts with a transition word, if so, reverse the polarity and store it in the correspond...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an establishment method of an emotion dictionary based on linguistic data, and the establishment method comprises the following steps of: in advance acquiring partial known emotional tendency adjectives, including positive and negative adjectives, extracting and analyzing the unknown emotional tendency adjectives by using adversatives and privatives, constantly extending seed lexicons and finally judging them. The method doesn't need manual intervention and belongs to a non-supervision learning method so as to greatly improve the operating efficiency. The emotion dictionary established by the method can be applied for critical analysis so as to quickly obtain emotional tendency, thereby achieving the purpose of rapid analysis.

Description

technical field [0001] The invention belongs to artificial intelligence invention technology, and specifically relates to a method for constructing an emotional dictionary based on corpus. Background technique [0002] Some of the existing Chinese emotional dictionaries are constructed by artificially summarizing some commonly used adjectives, which is inefficient and non-territorial. However, Chinese does not have a dictionary similar to English wordnet, and it is impossible to construct a dictionary of emotional words through existing dictionaries. The corpus-based sentiment dictionary construction method applies people's language habits to the analysis of texts, and constructs two types of dictionaries, positive and negative. That is to say, labor cost is saved, and it also has the ability to judge the emotion of new words. [0003] Hazivassiloglou and McKeown were the first to analyze the corpus and construct the sentiment lexicon according to the language rules. They ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/21G06F16/2462
Inventor 夏睿王科周清清刘超
Owner NANJING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products