KNN text classification method based on improved K-Medoids
A text classification and text technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as classification performance impact, inapplicability, and huge amount of similarity calculations
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0038] The present invention is realized by adopting the following technical means:
[0039] A KNN text classification method based on improved K-Medoids. Firstly, the training text set and the test text set are preprocessed, including word segmentation, stop word processing, DF feature selection, and both the training text and the test text are expressed as vectors; then the training text is processed by the improved K-Medoids method Crop to get a new training text set S new ;Finally, the representative degree function is defined and introduced into the category attribute function of the original KNN algorithm for KNN classification.
[0040] The above-mentioned improved KNN text classification method comprises the following steps:
[0041] Step 1, download the publicly released Chinese corpus from the Internet - the training text set and the test text set;
[0042] Step 2, using the word segmentation software ICTCLAS to perform word segmentation and stop word removal prep...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com