A text representation method based on WT-GloVe word vector construction
A text representation and word vector technology, applied in the fields of natural language processing, data mining and text classification, that addresses the problems of complex calculation and insufficient representation of text information.
Embodiment Construction
[0056] The present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.
[0057] The flow chart of the text representation method based on WT-GloVe word vector construction of the present invention is shown in Figure 1. The specific steps are as follows:
[0058] Step 1. Evaluate the importance of a feature within the network text itself by calculating the feature's word distance, and judge the feature's contribution to a category according to its inter-class distribution. The two are combined into a feature weighting model based on word distance and inter-class distribution, called WDID-TFIDF. Step 1 is implemented according to the following steps:
[0059] Load the 20NewsGroups data set, import the required modules, specify the GloVe model, and set the training data storage path and the encoding format; define functions, introduce the general English stop-word list, perform word segmentation o...
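The preprocessing and weighting idea described in Step 1 can be illustrated with a minimal Python sketch. The text above is truncated and does not give the WDID-TFIDF formulas, so everything below is an assumption made for illustration only: the tiny stop-word set, the toy labeled corpus standing in for 20NewsGroups, the definition of word distance as the normalized span between a term's first and last occurrence, the definition of inter-class distribution as the standard deviation of per-class document rates, and the way the factors are multiplied together. The function names `tokenize`, `word_distance`, `interclass_spread` and `wdid_tfidf_sketch` are hypothetical.

```python
import math
import re
from collections import Counter, defaultdict

# Assumption: a tiny stand-in for a general English stop-word list.
STOP_WORDS = {"the", "a", "an", "is", "of", "and", "to", "in", "on", "for"}


def tokenize(text):
    """Lowercase the text, keep alphabetic tokens, and drop stop words."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOP_WORDS]


def word_distance(tokens, term):
    """Assumed definition: span from first to last occurrence, divided by document length."""
    positions = [i for i, t in enumerate(tokens) if t == term]
    if not positions:
        return 0.0
    return (positions[-1] - positions[0] + 1) / len(tokens)


def interclass_spread(term, labeled_docs):
    """Assumed definition: standard deviation, across classes, of the share of
    documents in each class that contain the term."""
    per_class = defaultdict(list)
    for label, doc_tokens in labeled_docs:
        per_class[label].append(1.0 if term in doc_tokens else 0.0)
    rates = [sum(flags) / len(flags) for flags in per_class.values()]
    mean = sum(rates) / len(rates)
    return math.sqrt(sum((r - mean) ** 2 for r in rates) / len(rates))


def wdid_tfidf_sketch(tokens, labeled_docs):
    """Assumed combination: TF * IDF * (1 + word distance) * (1 + inter-class spread)."""
    n_docs = len(labeled_docs)
    weights = {}
    for term, count in Counter(tokens).items():
        tf = count / len(tokens)
        df = sum(1 for _, doc_tokens in labeled_docs if term in doc_tokens)
        idf = math.log(n_docs / max(df, 1)) + 1.0
        weights[term] = (
            tf
            * idf
            * (1.0 + word_distance(tokens, term))
            * (1.0 + interclass_spread(term, labeled_docs))
        )
    return weights


# Toy labeled corpus (hypothetical data, not taken from 20NewsGroups).
raw_docs = [
    ("sport", "The team won the game and the team celebrated"),
    ("sport", "A game of football is played by a team"),
    ("tech", "The new computer runs the game"),
    ("tech", "A computer is a machine for computing"),
]
corpus = [(label, tokenize(text)) for label, text in raw_docs]

# Weights for the first document; terms concentrated in one class score higher.
weights = wdid_tfidf_sketch(corpus[0][1], corpus)
for term, weight in sorted(weights.items(), key=lambda kv: -kv[1]):
    print(f"{term}: {weight:.3f}")
```

In this toy corpus, "team" appears only in the sport class while "game" appears in both classes, so under the assumed definitions "team" receives the larger weight in the first document. For a real run, the 20NewsGroups data could be loaded with scikit-learn's `fetch_20newsgroups` in place of the hand-written corpus; the weighting formulas would still need to be replaced with those defined in the full patent text.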