Microblog-based neologism emotional tendency judgment method

A judgment method and a technology for emotional tendencies, which are applied in the fields of instruments, computing, and electrical and digital data processing, and can solve the problem that new emotional words cannot be automatically recognized.

Active Publication Date: 2015-12-09
KUNMING UNIV OF SCI & TECH
View PDF4 Cites 43 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The invention provides a method for judging the emotional tendency of new words based on microblog, which can solve the problem that the new emotional words in the microblog corpus cannot be automatically identified in the existing situation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Microblog-based neologism emotional tendency judgment method
  • Microblog-based neologism emotional tendency judgment method
  • Microblog-based neologism emotional tendency judgment method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0045] Embodiment 1: as shown in Figure 1, a kind of new word emotion tendency judgment method based on micro-blog, carry out word segmentation to micro-blog corpus by Chinese word segmentation tool, and take the stop word in the word segmentation result as segmentation point to the word segmentation The corpus is divided into blocks, and the adjacent word strings in each block are combined in pairs, and the frequency of the combined word strings is counted, and the word strings with a frequency higher than the threshold are used as new word candidate strings; according to the word formation of Chinese linguistics The rules and the rules of the number of adjacent changes filter the new word candidate strings to obtain new words; then use HowNet’s sentiment dictionary to calculate the word similarity between co-occurrence words and HowNet emotional words; calculate the correlation between new words and co-occurrence words; Take the new word and its co-occurrence word as the node...

Embodiment 2

[0076] Embodiment 2: as shown in Figure 1, a kind of new word emotion tendency judgment method based on micro-blog, carry out word segmentation to micro-blog corpus by Chinese word segmentation tool, and take the stop word in the word segmentation result as segmentation point to the word segmentation The corpus is divided into blocks, and the adjacent word strings in each block are combined in pairs, and the frequency of the combined word strings is counted, and the word strings with a frequency higher than the threshold are used as new word candidate strings; according to the word formation of Chinese linguistics The rules and the rules of the number of adjacent changes filter the new word candidate strings to obtain new words; then use HowNet’s sentiment dictionary to calculate the word similarity between co-occurrence words and HowNet emotional words; calculate the correlation between new words and co-occurrence words; Take the new word and its co-occurrence word as the node...

Embodiment 3

[0077] Embodiment 3: as shown in Figure 1, a kind of new word emotion tendency judgment method based on micro-blog, carry out word segmentation to micro-blog corpus by Chinese word segmentation tool, and take the stop word in the word segmentation result as segmentation point to the word segmentation The corpus is divided into blocks, and the adjacent word strings in each block are combined in pairs, and the frequency of the combined word strings is counted, and the word strings with a frequency higher than the threshold are used as new word candidate strings; according to the word formation of Chinese linguistics The rules and the rules of the number of adjacent changes filter the new word candidate strings to obtain new words; then use HowNet’s sentiment dictionary to calculate the word similarity between co-occurrence words and HowNet emotional words; calculate the correlation between new words and co-occurrence words; Take the new word and its co-occurrence word as the node...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a microblog-based neologism emotional tendency judgment method, belonging to the field of natural language processing. The microblog-based neologism emotional tendency judgment method disclosed by the invention comprises the following steps: dividing words of microblog corpuses through a Chinese word division tool, blocking the corpuses, the words in which are divided, by taking stop words in a word division result as a division point, pairwise combining adjacent word strings in each block, calculating the combined word string frequency, and taking the word strings, the frequencies of which are higher than a threshold value, as neologism candidate strings; filtering the neologism candidate strings according to a word formation rule of Chinese linguistics and an adjacent change number rule so as to obtain neologisms; calculating the similarity between co-occurrence words and hownet emotional words by utilizing an emotional dictionary of a hownet; calculating the relevancy between the neologisms and the co-occurrence words; constructing an image model; and obtaining the emotional polarity distribution of the neologisms by utilizing a label propagation algorithm, and obtaining the emotional tendency of the neologisms by constructing a linear classifier. By means of judgement of the emotional tendency of the neologisms, a blogger can express views better; and furthermore, the emotional tendency of the blogger can be accurately known by users.

Description

technical field [0001] The invention relates to a microblog-based method for judging the emotional tendency of new words, which belongs to the field of natural language processing. Background technique [0002] A large number of emotional new words have emerged in Weibo. The appearance of these new words plays an important role in people's daily communication. It can express people's views and emotions more comprehensively, and it is also a reflection of social trends and news events. In the process of natural language processing, the recognition of new emotional words has always been a difficult problem, and it has very important applications in Chinese word segmentation, information retrieval, and question answering systems. [0003] The current vocabulary emotion polarity recognition method first selects a word with a strong emotional tendency as a reference word, and then determines the emotional polarity of the target word by calculating the correlation strength with th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F17/30
Inventor 严馨周超余正涛洪旭东伏云发
Owner KUNMING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products