Term selection method for filtering harmful text information
A feature selection method and text information technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of high calculation results, difficult to filter, and incorrect retention, and achieve the effect of improving the effect.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0047] The invention provides a feature selection method for filtering bad text information, which is applied to the filtering process of bad text information, and is a feature selection method used for extracting feature items of bad categories when classifying bad categories. This method takes the traditional χ 2 Based on the statistical feature selection method, the CTW value of the classification feature weight value is used as the basis for feature selection. The factor for calculating the CTW value in this method includes the traditional χ 2 Including statistics, three additional factors are added, which are the improved inverse document frequency IDF value, inverse category frequency ICF value and inverse bad document frequency IHDF value; after the feature weight value (CTW) is calculated, the The feature weight values of the feature items are sorted from large to small, and then the optimal number of feature items is selected to form a new feature item set. At this ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


