TF-IDF keyword extraction-based improvement method
A TF-IDF, keyword technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of time-consuming and laborious keywords, lack of keyword tags, etc., and achieve the effect of accurate keyword extraction results
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0026] The improved method of the present invention based on TF-IDF keyword extraction specifically comprises the following steps, as figure 1 :
[0027] S1, respectively counting the number of occurrences of all words in each document in the document collection;
[0028] S2, using the improved TF-IDF formula to calculate the weight of words;
[0029] S3. The words are sorted according to the weights from large to small, and the sorted results are used as the basis for keyword retrieval in the database.
[0030] This method first considers that for short text corpus, the importance of each part in the text is different. Generally speaking, verbs, nouns, and adjectives are the main part of a sentence, and are also important for keyword extraction technology; numerals, pronouns, etc. are only modifiers, which further improve the integrity of the sentence, but there is almost no classification of sentences. effect. Therefore, this method assigns different important coefficien...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 

