Chinese text classification method
A text classification technique based on Chinese word segmentation, applicable to semantic analysis, special-purpose data processing, instruments, and related fields. It addresses interference in classification results, unsuitability for practical applications, and declining accuracy, and achieves good classification results.
Detailed Description of the Embodiments
[0026] The present invention will be further described below in conjunction with the accompanying drawings.
[0027] As shown in Figure 1, a Chinese text classification method includes the following steps:
[0028] (1) Text preprocessing, including corpus selection, text word segmentation, word frequency statistics and text representation;
[0029] (2) Feature representation and feature extraction
[0030] Feature representation is the modeling of the text: using the vector space model, each text is reduced to a vector whose components are the weights of its feature items.
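The vector representation described in [0030] can be sketched as follows. This is a minimal illustration, not the patent's implementation: it assumes the texts are already segmented into words, and it uses raw term frequency as the feature weight, which is an assumption because the text does not state the weighting scheme here. The names `build_vocabulary` and `to_vector` and the sample documents are hypothetical.

```python
from collections import Counter


def build_vocabulary(docs):
    """Map each distinct term to a vector index (terms sorted for stability)."""
    terms = sorted({term for doc in docs for term in doc})
    return {term: index for index, term in enumerate(terms)}


def to_vector(doc, vocab):
    """Represent one segmented document as a term-frequency vector.

    Assumption: the weight of a feature item is its raw count in the text.
    """
    counts = Counter(doc)
    vector = [0] * len(vocab)
    for term, count in counts.items():
        if term in vocab:  # ignore terms outside the vocabulary
            vector[vocab[term]] = count
    return vector


# Hypothetical, already-segmented documents.
docs = [
    ["足球", "比赛", "足球"],
    ["股票", "市场"],
    ["足球", "市场"],
]
vocab = build_vocabulary(docs)
vectors = [to_vector(doc, vocab) for doc in docs]
```

Each document becomes a fixed-length list with one component per vocabulary term, so any two documents can be compared component by component.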
[0031] Feature extraction removes words that carry no useful information, which improves classification efficiency and reduces computational complexity. This method uses information gain, a measure from information theory. It indicates how much the presence or absence of a feature in a text contributes to determining the type of the text. The size of the amount o...
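Because the paragraph above is cut off before any formula appears, the following is only a sketch of information gain under its standard information-theoretic definition: the class entropy minus the entropy that remains after splitting the texts by whether the term is present. That definition, the function names, and the sample data are assumptions, not taken from the patent.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())


def information_gain(docs, labels, term):
    """H(C) minus the weighted entropy of the classes given term presence/absence."""
    present = [label for doc, label in zip(docs, labels) if term in doc]
    absent = [label for doc, label in zip(docs, labels) if term not in doc]
    total = len(labels)
    conditional = (len(present) / total) * entropy(present) \
        + (len(absent) / total) * entropy(absent)
    return entropy(labels) - conditional


def select_features(docs, labels, k):
    """Keep the k terms with the highest information gain (ties broken by term)."""
    vocab = {term for doc in docs for term in doc}
    ranked = sorted(vocab, key=lambda t: (-information_gain(docs, labels, t), t))
    return ranked[:k]


# Hypothetical segmented documents with hypothetical category labels.
docs = [["足球", "比赛"], ["足球", "进球"], ["股票", "市场"], ["股票", "比赛"]]
labels = ["sports", "sports", "finance", "finance"]
top_terms = select_features(docs, labels, 2)
```

In this toy data, "足球" and "股票" each separate the two categories perfectly, while "比赛" appears once in each category and so gives no information about the class; a term like that is the kind of word feature extraction would remove.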