Method and system for searching non-structural electronic document with obvious category classification
A category division and unstructured technology, applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., can solve problems such as loss of proper functions, large search of enterprise-level electronic documents, and poor differentiation of IDF, etc., to achieve easy The effect of implementation
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0072] As mentioned in the background technology, because the TF-IDF algorithm does not consider the type of the electronic document and the relationship between the search term and the type, two problems arise. In severe cases, the IDF algorithm in the TF-IDF algorithm will Some of them are almost completely ineffective, and the correlation between electronic documents and keywords can only be determined by the frequency of keywords appearing in the document (TF algorithm).
[0073] Therefore, the present invention considers from the type correlation, and improves the TF-IDF algorithm, such as figure 1 As shown, the system of the present invention is made up of following several modules:
[0074] Document classification module: classify the documents of a specific collection according to the relationship between the contents of each document;
[0075] Type keyword identification module: identify all types of keywords;
[0076] Real-time search module: According to the searc...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com