Feature dimension reduction method for automatic classification of Chinese text
A technology of automatic classification and feature dimensionality reduction, applied to instruments, character and pattern recognition, computer components, etc., can solve problems such as high-dimensional problem obstacles and arduous tasks of dimensionality reduction
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0092] A feature dimensionality reduction method for automatic classification of Chinese text, including the following steps:
[0093] In the learning phase, the following steps are involved:
[0094] (1). Determine the feature selection method (statistics), feature vector weight calculation method and the value of related parameters;
[0095] (2). Preprocessing the learning text set;
[0096] (3). Perform one-element, two-element, and three-element string indexing (Indexing) on the learning text set respectively to obtain the original feature set of one-element string, original feature set of two-element string and original feature set of three-element string. According to the original feature set of binary strings, the feature frequency vectors of each learning text are generated, as shown in formula 1.
[0097] d=(tf(T 1d ), tf(T 2d ),...,tf(T nd )) (1)
[0098] d is any learning text; n is the total number of features contained in the binary string ori...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 