Supercharge Your Innovation With Domain-Expert AI Agents!

A Cross-Domain Text Sentiment Classification Method Based on Domain Adversarial Adaptation

A sentiment classification, cross-domain technology, applied in the field of text analysis, can solve the problem of inability to accurately predict the sentiment tendency of new comment data, low efficiency and so on

Active Publication Date: 2022-03-15
廊坊嘉杨鸣科技有限公司
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, with the development of social media, the increasing number of new corpus gradually expands the scope of domains, and the amount of data in each domain is very large. Traditional text sentiment classification methods need to manually label a large amount of data for each newly added domain. Completing the training of the sentiment classifier, the process of manually labeling samples is inefficient
At the same time, with the passage of time and the development of society, the new feature words in the known field will gradually increase. Because there are certain differences in the feature distribution between the original sample and the new sample, the original sentiment classifier in this field will not be able to accurately predict the new comment data. emotional tendency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Cross-Domain Text Sentiment Classification Method Based on Domain Adversarial Adaptation
  • A Cross-Domain Text Sentiment Classification Method Based on Domain Adversarial Adaptation
  • A Cross-Domain Text Sentiment Classification Method Based on Domain Adversarial Adaptation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] The present invention will be described in further detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0021] The method model structure of the present invention is as figure 1 As shown, the flow chart of the method is as figure 2 As shown, it specifically includes the following steps:

[0022] Step 1, input the word vector matrix, sentiment category label and domain label of the source domain and target domain samples.

[0023] Since the computer cannot directly process text data, it is necessary to convert the text input data into a data type recognizable by the computer. Let the number of rows n of the matrix represent the total number of words in the paragraph, and the number of columns of the matrix k represent the dimension of the word vector. First, convert each word in the input text into a 1×k word vector, and then follow the order in which the words appear in the text , concatenate the word vectors into ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a cross-domain text sentiment classification method based on domain confrontation self-adaptation, the method includes: inputting word vector matrix, category label and domain label of samples in source domain and target domain; using feature extraction based on convolutional neural network module, which extracts the low-level features of the sample; in the main task module, a constraint based on the distribution consistency of the source domain and the target domain is constructed, and the low-level samples are mapped to the regenerated kernel Hilbert space to learn high-level features with transferability; the source domain The high-level features are input into the category classifier, and on the basis of reducing the difference in the field, the classifier is guaranteed to have category discrimination for the samples; in the auxiliary task module, the domain invariance constraint based on adversarial learning is constructed, and the low-level features are input into the field with adversarial properties The classifier makes the classifier unable to distinguish the domain of the sample as much as possible, thereby extracting high-level features with domain invariance, which effectively solves the migration problem of the source domain classifier to the target domain.

Description

technical field [0001] The invention belongs to the technical field of text analysis, and in particular relates to a cross-domain text sentiment classification method based on domain confrontation self-adaptation. Background technique [0002] In recent years, with the vigorous development of artificial intelligence and machine learning technology, text sentiment classification technology has emerged as the times require. This technology can automatically classify the sentiment trend of text data, effectively solving the time-consuming and laborious problem of manual judgment. Traditional text sentiment classification methods usually use calibration data to train specific sentiment classifiers for a certain field to complete sentiment classification tasks. However, with the development of social media, the increasing number of new corpus gradually expands the scope of domains, and the amount of data in each domain is very large. Traditional text sentiment classification meth...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/35G06K9/62
CPCG06F18/24G06F18/214
Inventor 贾熹滨曾檬史佳帅刘洋苏醒郭黎敏
Owner 廊坊嘉杨鸣科技有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More