A text classification algorithm that combines statistical features and an Attention mechanism
A technology combining statistical features with text classification, applied in the field of text datasets. It addresses the inability of existing methods to learn the statistical features of text, and it reduces training time while improving classification quality and accuracy.
Embodiment
[0038] A piece of input text is first segmented into words; stop-word processing is then applied and synonyms are replaced. The word2vec tool is used to train a word vector for each word, the tf-idf value of each word is calculated, and a weight is assigned according to the word's part of speech and tf-idf value to obtain the word's statistical feature value. The Attention weight of each event is calculated, and the statistical feature weight of the event is calculated at the same time. The two weights are fused, so the resulting feature vector contains more semantic information. The specific algorithm logic steps are as follows:
[0039] (1) For a document set, first perform word segmentation, part-of-speech tagging, and stop-word processing.
[0040] (2) Record the word frequency of each word, and replace the synonyms in the document at the same time.
[0041] (3) Extract the events in each document.
[0042] (4) Calculate the statistical feature va...
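The weighting and fusion idea in paragraph [0038] can be sketched as follows. This is only an illustration under stated assumptions, not the patented procedure: the source does not give the tf-idf variant, the part-of-speech weight values, the Attention scoring function, or the fusion rule. The names `POS_WEIGHT`, `DEFAULT_POS_WEIGHT`, `statistical_weights`, `fuse`, and the mixing factor `alpha` are all hypothetical. For simplicity the sketch treats each token as the unit being weighted, whereas the embodiment weights extracted events.

```python
import math
from collections import Counter

# Hypothetical part-of-speech weights; the source does not specify values.
POS_WEIGHT = {"NOUN": 1.0, "VERB": 0.8, "ADJ": 0.5}
DEFAULT_POS_WEIGHT = 0.3


def tf_idf(docs):
    """Return one {word: tf-idf} dict per document (each doc is a token list).

    Uses tf = count / doc length and idf = log(N / df), one common variant
    chosen here as an assumption.
    """
    n_docs = len(docs)
    doc_freq = Counter(word for doc in docs for word in set(doc))
    scores = []
    for doc in docs:
        counts = Counter(doc)
        scores.append({
            word: (count / len(doc)) * math.log(n_docs / doc_freq[word])
            for word, count in counts.items()
        })
    return scores


def statistical_weights(tokens, pos_tags, tfidf):
    """Scale each token's tf-idf by its POS weight, then normalize to sum to 1."""
    raw = [tfidf.get(tok, 0.0) * POS_WEIGHT.get(pos, DEFAULT_POS_WEIGHT)
           for tok, pos in zip(tokens, pos_tags)]
    total = sum(raw)
    if total == 0:
        # No informative token: fall back to uniform weights.
        return [1.0 / len(raw)] * len(raw)
    return [r / total for r in raw]


def softmax(scores):
    """Turn raw Attention scores into weights that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse(attention, statistical, alpha=0.5):
    """Assumed fusion rule: a convex combination of the two weight vectors."""
    return [alpha * a + (1 - alpha) * s for a, s in zip(attention, statistical)]


def weighted_sum(vectors, weights):
    """Combine per-token vectors into one feature vector using the weights."""
    dim = len(vectors[0])
    return [sum(w * v[i] for v, w in zip(vectors, weights)) for i in range(dim)]


if __name__ == "__main__":
    docs = [["cat", "sat"], ["cat", "ran"]]
    stat = statistical_weights(docs[0], ["NOUN", "VERB"], tf_idf(docs)[0])
    attn = softmax([0.0, 0.0])  # placeholder Attention scores
    fused = fuse(attn, stat)
    # Toy 2-dimensional vectors stand in for trained word2vec embeddings.
    print(weighted_sum([[1.0, 0.0], [0.0, 1.0]], fused))
```

In practice the placeholder Attention scores would come from a trained model and the toy vectors from word2vec. The convex combination is just one simple way to keep the fused weights normalized.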