Text classification method based on correlation analysis and KNN
A technology of correlation analysis and text classification, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problem of further improvement of efficiency and accuracy, and achieve the effect of improving efficiency and accuracy.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0035] For the convenience of description, we assume the following application examples: collect news from the Internet and store them in categories for data analysis. To determine the category of the document, the text classification method based on association analysis and KNN proposed by the present invention can be applied.
[0036] The specific embodiment of the present invention is:
[0037] (1) Use web crawlers or related network information grabbing tools to grab a certain number of representative articles in various fields from the Internet as a training sample set for the text classification system.
[0038] (2) Preprocess these texts, remove stop words after word segmentation, obtain feature words, count word frequency and reverse document frequency, and calculate the weight of a feature word relative to each category according to the χ2 feature evaluation method And sum to get the feature evaluation value. Set the final weight of each feature word as: TF-IDF*feat...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com