Multilingual text data sorting treatment method
A technology of data processing and classification methods, applied in the field of data processing, can solve problems such as information loss, great differences in emotional expression, and unsatisfactory performance of multilingual sentiment analysis, etc., to achieve small resource dependence, reduce information loss, and avoid mistakes.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0052] In order to achieve the above object, the present invention proposes a self-learning classification method involving multilingual data processing, including:
[0053] See figure 1 Is the flow chart of sentiment classification algorithm.
[0054] Step 1, extract candidate emotional words by "very" and then perform stop word filtering. The stop word list is automatically obtained from the target language;
[0055] Step 2. Simultaneous clustering of sentiment words and sentiment texts by "good" and "bad" (for or against);
[0056] Step 3, build an emotion classifier through semi-supervised learning, first select confident samples from the clustering results in step 2 to train the initial classifier, and then combine the sentiment score of the text and the posterior probability of the classifier to select new samples to add to the training set .
[0057] Said step 1 includes:
[0058] In addition to extracting English emotional words through "very (very)", it also inclu...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com