Rapid hierarchical document querying method
A Query Method, Hierarchical Technology
Embodiment Construction
[0070] An embodiment of the present invention and its implementation process are as follows:
[0071] This specific implementation further illustrates the technical scheme of the present invention using the Reuters-21578 (Reuters) document data set and schematic diagrams. Reuters is a subset of the Newswire news document collection and can be obtained openly from the Internet. It comprises 65 subjects and 8293 documents: 2347 documents are used for testing, and 5946 training documents are used to build the hash table.
[0072] Document format processing stage
[0073] Step 1: Establish the data model of each document. The data model of a document is mainly composed of three parts: words, word vectors, and word weights. The words are the valid words left after the document is preprocessed; the word vectors are taken from the publicly available Google News Word2Vec model, which can be obtained from the Internet; and the word weight is the TF-IDF value corresponding to each word.
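The three-part data model of Step 1 can be sketched as follows. This is only an illustrative sketch under stated assumptions, not the patented implementation: the stopword list, the `preprocess` and `build_models` helpers, the `DocumentModel` class, the toy two-dimensional `embeddings` table (standing in for the Google News Word2Vec vectors), and the particular TF-IDF variant (term frequency times log(N / document frequency)) are all hypothetical choices, since the text does not specify them.

```python
import math
from collections import Counter
from dataclasses import dataclass

# Hypothetical, deliberately tiny stopword list; the source does not say
# which preprocessing is applied.
STOPWORDS = {"the", "a", "of", "and", "to", "in", "is"}


def preprocess(text):
    """Lowercase, strip surrounding punctuation, drop stopwords and non-words."""
    tokens = [t.strip(".,;:!?()\"'").lower() for t in text.split()]
    return [t for t in tokens if t.isalpha() and t not in STOPWORDS]


@dataclass
class DocumentModel:
    words: list    # valid words left after preprocessing (distinct, sorted)
    vectors: list  # one word vector per entry in `words`
    weights: list  # one TF-IDF weight per entry in `words`


def build_models(corpus, embeddings):
    """Build one DocumentModel per document in `corpus`."""
    tokenized = [preprocess(doc) for doc in corpus]
    n_docs = len(tokenized)

    # Document frequency: number of documents containing each word.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    models = []
    for tokens in tokenized:
        # Keep only words that have a vector in the embedding table.
        tf = Counter(t for t in tokens if t in embeddings)
        total = sum(tf.values())
        words = sorted(tf)
        vectors = [embeddings[w] for w in words]
        # Assumed TF-IDF variant: (count / doc length) * log(N / df).
        weights = [(tf[w] / total) * math.log(n_docs / df[w]) for w in words]
        models.append(DocumentModel(words, vectors, weights))
    return models


# Toy stand-in for the Word2Vec table (hypothetical 2-D vectors).
embeddings = {
    "oil": [0.9, 0.1], "prices": [0.5, 0.5], "rise": [0.2, 0.8],
    "market": [0.6, 0.4], "grain": [0.1, 0.9], "fall": [0.3, 0.7],
    "exports": [0.4, 0.6],
}
corpus = [
    "Oil prices rise in the market.",
    "The market prices of grain fall.",
    "Grain exports rise.",
]
models = build_models(corpus, embeddings)
for m in models:
    print(m.words, [round(w, 3) for w in m.weights])
```

In this sketch a word that has no entry in the embedding table is dropped from the model; whether the original method does the same is not stated in the text.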
[0074] Step 2: Format...