A text keyword weight calculation method integrating a word position factor and a word frequency factor
Patent Information
- Authority / Receiving Office
- CN ยท China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- SHANGHAI UNIV
- Publication Date
- 2019-05-17
Smart Images

Figure 1 
Figure 2 
Figure 3
Abstract
Description
technical field
[0001] The present invention relates to a kind of text key word weight calculation method of comprehensive word position factor and word frequency factor, specifically relate to adopting harmonic progression comprehensive word position factor and word frequency factor to calculate the weight of word, improve title and first and last two paragraphs of words Weight, and make each word as the word frequency increases, the weight of the position where the word appears decreases. Background technique
[0002] The most widely used keyword extraction algorithm is vector space model. The vector space model represents the text as a weight vector, each item in the vector is composed of a word, and the weight of each word is determined by the TFIDF method. Among them, the TFIDF method uses the word weight formula to calculate the importance of a word to a single text in the corpus. The word weight of the TFIDF method is the product of the term frequency TF (Term Frequ...