Processing method of unregistered words in Chinese dependency tree bank
A technology of unregistered words and processing methods, which is applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., and can solve problems such as coarse information granularity and sparse tree bank data
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0025] The specific embodiments of the present invention will be described in further detail below in conjunction with the drawings and embodiments. The following examples are used to illustrate the present invention, but not to limit the scope of the present invention.
[0026] S10. Use the synonym word forest to find all synonyms of unregistered words.
[0027] Search for unregistered words in the dependency tree. According to the 5-layer encoding method of the "Synonyms Cilin" expansion board, obtain all words with the same 5-layer encoding as unregistered words and the eighth tag bit as "=". Synonym for login term.
[0028] S20. Calculate the font similarity between the unregistered word and the synonyms according to the features of the Chinese characters.
[0029] Use (sw 1 ,sw 2 ,...,Sw n ) Means that each word can be represented by a Chinese character vector consisting of 0 or the frequency of the word contained. Use uw for unregistered words in the tree library 1 ,uw 2 ,...,...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com