Document similarity scoring and ranking method, device and computer program product
A similarity scoring and ranking technology, applied in the field of document similarity scoring and ranking methods, devices and computer program products. It addresses the problems that the goal is unattainable for a large document set, that a non-conventional step is required, and that the resulting computational burden cannot be handled. The stated effect is to avoid wasting computational effort on small similarity scores.
Embodiment Construction
Calculating Similarity Scores Among a Set of Documents.
[0042] The method of the present invention begins with calculating similarity scores among a set of documents. This includes the following steps:
[0043] 1. Constructing a word corpus from the document set.
[0044] 2. Constructing an inverted index, based on the corpus and on the document set.
[0045] 3. For each word in the index:
[0046]    a. Calculating a word similarity score between the index word and each of the documents in which the word appears.
[0047]    b. Sorting the document IDs in decreasing order of word similarity.
[0048]    c. Truncating the sorted list of document IDs. This is accomplished by enforcing a word threshold τword, that is, by discarding documents with word similarity scores less than the word threshold. The resulting truncated, sorted list is termed the index-word document list.
[0049]    d. Performing the following operations on the documents in the index-word document list, in order of decreasing word similarit...
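Steps 1 through 3c above can be sketched in Python as follows. This is only an illustrative sketch under stated assumptions, not the patented implementation: the excerpt does not define how the word similarity score is computed, so a simple TF-IDF-style weight is assumed here, and the function name `build_index_word_document_lists`, the whitespace tokenizer and the parameter name `tau_word` are hypothetical. Step 3d is cut off in the excerpt and is not covered.

```python
import math
from collections import defaultdict


def build_index_word_document_lists(docs, tau_word):
    """Return {word: [(doc_id, score), ...]} sorted by decreasing score.

    docs     -- dict mapping document ID to its text
    tau_word -- word threshold; entries scoring below it are discarded
    """
    # Step 1: word corpus. Assumption: lowercase whitespace tokens.
    tokenized = {doc_id: text.lower().split() for doc_id, text in docs.items()}
    corpus = sorted({w for tokens in tokenized.values() for w in tokens})

    # Step 2: inverted index mapping word -> {doc_id: occurrence count}.
    inverted = defaultdict(dict)
    for doc_id, tokens in tokenized.items():
        for w in tokens:
            inverted[w][doc_id] = inverted[w].get(doc_id, 0) + 1

    n_docs = len(docs)
    index_word_document_lists = {}
    for word in corpus:  # Step 3: for each word in the index
        postings = inverted[word]

        # Step 3a: word similarity score between the word and each document
        # containing it. ASSUMPTION: smoothed TF-IDF; the source does not
        # specify the scoring formula.
        idf = math.log((1 + n_docs) / (1 + len(postings))) + 1.0
        scored = [
            (doc_id, count / len(tokenized[doc_id]) * idf)
            for doc_id, count in postings.items()
        ]

        # Step 3b: sort document IDs in decreasing order of word similarity
        # (ties broken by document ID for determinism).
        scored.sort(key=lambda pair: (-pair[1], str(pair[0])))

        # Step 3c: truncate by discarding scores less than the threshold.
        index_word_document_lists[word] = [
            (doc_id, score) for doc_id, score in scored if score >= tau_word
        ]
    return index_word_document_lists
```

Raising `tau_word` shortens every index-word document list, so any later per-word processing touches fewer documents; that is presumably where the saving in computational effort on small similarity scores comes from.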