A text feature selection method based on full-coverage granular computing
A feature selection method and full coverage technology, applied in the field of text mining, can solve problems such as poor accuracy and weak feature representation
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0041] In order to clarify the purpose, technical solutions and advantages of the present invention, the present invention will be further described in detail below with practical examples.
[0042] Use web crawlers to obtain a certain amount of news in different fields from Sohu News, analyze and organize these articles, remove the same news and non-text symbols in the news, and use it as a sample set.
[0043] In order to select a representative set of feature words from the text, the title and body of the sample set are segmented, stop words removed and part-of-speech tagged.
[0044] The improved TFIDF method is used to calculate the probability of feature words, and words with different positions and parts of speech are given different weight coefficients. For example, a news article can be expressed as: d i ={t i |t i1 ,t i2 ,t i3 ,t i4 ,...,t im}, where t i Represents the set of news words, t i1 ,t i2 ,t i3 represent the words in the title, and the rest repre...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


