Method and device for obtaining lexical item paragraph association weights
A technique for any paragraph or paragraph, applied in the field of obtaining the associated weights of terms and paragraphs, which can solve the problem of not considering the difference of document representation.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
specific Embodiment 1
[0060] See attached figure 1 , the method for obtaining term-paragraph association weights in the first embodiment, comprising the steps of:
[0061] A1. Based on a plurality of pre-set terms, the number of the document structure position where the term is located, the number of the paragraph in the document structure position where the term is located, and the weight of the term, obtain and The number of terms in any paragraph in the document structure position corresponding to the number of the document structure position and the total weight of all terms in the paragraph.
[0062] Wherein, the numbers of the paragraphs correspond to the order of the paragraphs in the document structure position where the paragraphs are located.
[0063] A2. Based on the number of terms in any paragraph in the document structure position corresponding to the number of the document structure position and the total weight of all terms in the paragraph, obtain the preset multiple terms Paragr...
specific Embodiment 2
[0121] In order to better explain the present invention, refer to the appended figure 2 , in this embodiment, the term document paragraph position table is input into the computer in advance, and the table will be described first.
[0122] In this embodiment, the input is the term document paragraph position table words_list of a specific document, which is a database table containing all terms extracted from a specific document and its document paragraph position information, and the term of each specific number in the table is in the document There may be multiple records in different paragraphs of the same structure, or different sentences in the same paragraph. See Table 1 for specific field definitions.
[0123] Table 1 Definition of term document paragraph position table
[0124] Field Name field meaning Field Type field description word_id term number INTEGER A unique number for a particular term word_weight term basic weight DECIMA...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com