Overlapped-between-clusters-oriented method for classifying two types of texts
Patent Information
- Authority / Receiving Office
- CN Β· China
- Current Assignee / Owner
- THE PLA INFORMATION ENG UNIV
- Publication Date
- 2010-11-03
- Estimated Expiration
- Not applicable Β· inactive patent
Smart Images
Figure 1 Figure 2 Figure 3
Abstract
Description
technical field
[0001] The invention relates to the technical field of text information analysis and processing, in particular to a two-class text classification method oriented to class overlap. Background technique
[0002] With the popularization and rapid development of the Internet, a large number of text data, which is the main form of network data, has emerged, and text classification has become an effective way to organize and manage massive data. Text classification is to establish a mapping between the sample set to be classified and the pre-specified category set. According to the number of pre-specified categories, it is divided into two-class classification and multi-class classification. Among them, the two-class classification is aimed at the classification of positive and negative classes, and usually requires a manually labeled training set, including positive and negative samples. On this basis, the classifier learns, adjusts parameters, and establishes a ...