Cross-domain text sentiment classification method based on domain confrontation self-adaption

A sentiment classification and cross-domain technology, applied in the field of text analysis, can solve the problems of inability to accurately predict the emotional tendency of new comment data and low efficiency

Active Publication Date: 2019-03-19
廊坊嘉杨鸣科技有限公司
View PDF3 Cites 29 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, with the development of social media, the increasing number of new corpus gradually expands the scope of domains, and the amount of data in each domain is very large. Traditional text sentiment classification methods need to manually label a large amount of data for each newly added domain. Completing the training of the sentiment classifier, the process of manually labeling samples is inefficient
At the same time, with the passage of time and the development of society, the new feature words in the known field will gradually increase. Because there are certain differences in the feature distribution between the original sample and the new sample, the original sentiment classifier in this field will not be able to accurately predict the new comment data. emotional tendency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cross-domain text sentiment classification method based on domain confrontation self-adaption
  • Cross-domain text sentiment classification method based on domain confrontation self-adaption
  • Cross-domain text sentiment classification method based on domain confrontation self-adaption

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] The present invention will be described in further detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0021] The method model structure of the present invention is as figure 1 As shown, the flow chart of the method is as figure 2 As shown, it specifically includes the following steps:

[0022] Step 1, input the word vector matrix, sentiment category label and domain label of the source domain and target domain samples.

[0023] Since the computer cannot directly process text data, it is necessary to convert the text input data into a data type recognizable by the computer. Let the number of rows n of the matrix represent the total number of words in the paragraph, and the number of columns of the matrix k represent the dimension of the word vector. First, convert each word in the input text into a 1×k word vector, and then follow the order in which the words appear in the text , concatenate the word vectors into ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a cross-domain text sentiment classification method based on domain countermeasure self-adaption. The method comprises the following steps: inputting a word vector matrix, a category label and a domain label of a source domain sample and a target domain sample; Utilizing a feature extraction module based on a convolutional neural network to extract low-level features of thesample; constructing a constraint based on distribution consistency of a source domain and a target domain in a main task module, mapping a low-layer sample to a regeneration kernel Hilbert space, and learning a high-layer feature with transferability; inputting the high-level features of the source domain into a class classifier, and ensuring that the classifier has class discrimination on samples on the basis of reducing domain difference; a domain invariance constraint based on adversarial learning is constructed in an auxiliary task module, and low-level features are input into a domain classifier with adversarial properties, so that the classifier cannot judge the domain to which a sample belongs as much as possible, high-level features with domain invariance are extracted, and the migration problem of a source domain classifier to a target domain is effectively solved.

Description

technical field [0001] The invention belongs to the technical field of text analysis, and in particular relates to a cross-domain text sentiment classification method based on domain confrontation self-adaptation. Background technique [0002] In recent years, with the vigorous development of artificial intelligence and machine learning technology, text sentiment classification technology has emerged as the times require. This technology can automatically classify the sentiment trend of text data, effectively solving the time-consuming and laborious problem of manual judgment. Traditional text sentiment classification methods usually use calibration data to train specific sentiment classifiers for a certain field to complete sentiment classification tasks. However, with the development of social media, the increasing number of new corpus gradually expands the scope of domains, and the amount of data in each domain is very large. Traditional text sentiment classification meth...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/35G06K9/62
CPCG06F18/24G06F18/214
Inventor 贾熹滨曾檬史佳帅刘洋苏醒郭黎敏
Owner 廊坊嘉杨鸣科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products