Method for automatically generating abbreviations of English paper titles in computer
A technology for automatically generating and acronyms, applied in the field of text analysis, which can solve the problems of many candidates and it is difficult for users to pick out abbreviations.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0106] Suppose the title to be abbreviated is "A Second Generation RDF Query Language".
[0107] First, the title is syntactically analyzed to obtain a dependency parsing tree. syntax tree such as image 3 shown.
[0108] After the dependency tree is obtained, the words are scored according to their position on the tree. It is assumed here that the weights related to the syntax tree are set to 0.99368, 0.95529, 0.44995, and 0.15046. Words in the title are scored as follows:
[0109] Table I
[0110] word
A
second
Generation
Query
Language
scoring
1.91059
1.91059
1.91059
1.91059
1.91059
1.98736
[0111] After the syntax tree analysis, the title needs to be semantically analyzed. The word vector has been trained before, and here it can be read directly from the saved file. Because the word vector is a 300-dimensional vector, the word vectors of the words in the title are not listed here. After calculation,...
Embodiment 2
[0125] All codes of the present invention are written by Java, and the model adopted is an Intel Xeon X7550 processor with a main frequency of 2.00GHZ and a memory of 40G. The standFord parser and word2vec used in the present invention are common open source syntax analysis and word vector training tools at present.
[0126] More specifically, as figure 1 As shown, the present invention operates as follows:
[0127] 1. Description content analysis: use standford parser and word2vec to analyze the title, and get the score of each word in the title, that is, the importance of the word;
[0128] 2. Use beamsearch to generate candidate acronyms: Use beamsearch to search the candidate abbreviation space, and calculate the score of the current acronym every time the status is updated.
[0129] 3. Adjust the score of each candidate abbreviation: use the language model and the length of the abbreviation to adjust the score of the abbreviation, and output the score in descending order....
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com