Target keyword extraction system
A keyword and keyword library technology, applied in the computer field, can solve problems such as the inability to guarantee accuracy, achieve the effects of reducing the amount of calculation, widely using value, and improving efficiency and accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment approach 1
[0047] The step S5 may specifically include: directly sorting the second candidate keywords in the second candidate keyword set according to the distance from the center point from near to far, and determining the first M second candidate keywords as target keywords word.
Embodiment approach 2
[0049] The step S5 may specifically include: acquiring the word frequency of each second candidate keyword in the document to be processed in the second candidate keyword set, and determining the second candidate keywords whose word frequencies are in the top M as target keywords.
[0050] It should be noted that in step S5, the target keyword is further determined through word frequency based on the second candidate keyword set. On the one hand, the second keywords are already keywords in the professional field and have a certain degree of accuracy; on the other hand, Compared with counting the word frequency of all word segments in the prior art, performing word frequency statistics only based on the second candidate keyword set can greatly reduce the calculation amount of target keyword extraction, and can improve accuracy.
Embodiment approach 3
[0052] Vocabulary in some professional fields may occupy an important position, but often the corresponding word frequency is not too high. Therefore, it can be adjusted by further setting the weight to improve the accuracy of keyword extraction results. On the basis of the second embodiment, the system also Including the keyword weight configuration list, the weight of each keyword in the keyword bank is configured, and the step S5 includes:
[0053] Step S51, acquiring the word frequency in the document to be processed of each second candidate keyword in the second candidate keyword set;
[0054] Specifically, the TF-IDF algorithm may be used to obtain the word frequency of each second candidate keyword in the document to be processed in the second candidate keyword set. The TF-IDF algorithm is an existing algorithm, and will not be repeated here.
[0055] Step S52, multiplying the word frequency of each second candidate keyword in the document to be processed by the weight...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com