A text data stream classification method based on word vectors and an integrated SVM
A text data and integrated classifier technology, applied in text database clustering/classification, unstructured text data retrieval, etc., can solve problems such as complex construction scheme, low classification accuracy of weak classifiers, and high time complexity
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0054] In this example, if figure 1 As shown, a text data stream classification method based on word vector and integrated SVM is carried out as follows:
[0055] 1, a kind of text data flow classification method based on word vector and integrated SVM, it is characterized in that carry out as follows:
[0056] Step 1. Obtain a text data set, and mark part of the text in the text data set to obtain a labeled text set and use it as a seed text set; the seed text is obtained by randomly selecting about 10% of the total text data set.
[0057] Step 2. Perform word vector expansion processing on the seed text set to obtain the corresponding feature dictionary and noise dictionary; the word vector algorithm is obtained by training the deep learning word vector algorithm proposed by Google from Wikipedia corpus.
[0058] Step 2.1, segment the seed text in the seed text set into words;
[0059] Step 2.2, sort the words after the segmentation according to the word frequency, and fil...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com