Method for automatically extracting terms from Chinese electronic document
An automatic extraction technology for electronic documents, applied in the fields of electronic digital data processing, special data processing applications, instruments, etc. It addresses the problems of limited accuracy and low performance in term extraction, improving accuracy and promoting automation.
Embodiment Construction
[0032] The present invention will be further described below with reference to the accompanying drawings and embodiments.
[0033] As shown in Figure 1, the present embodiment provides a method for automatically extracting words from a Chinese electronic document, comprising the following steps:
Step S01: process the electronic document into a group of word strings consisting of atomic words of a specific part of speech;
Step S02: count the frequencies of these atomic word strings and their substrings, and take the atomic word strings that appear more than N times as candidate words, where N is a configurable parameter, preferably 2;
Step S03: delete from the candidate word set the words that appear only as substrings, obtaining the set of words appearing in the document and thereby automatically extracting the words in the Chinese electronic document.
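Steps S02 and S03 can be illustrated with a minimal Python sketch. It is not the patented implementation: the function name `extract_terms` and the sample data are hypothetical, the output of step S01 (part-of-speech filtering into atomic word strings) is assumed to be already available as tuples of atomic words, and "appears only as a substring" is interpreted here as "every occurrence lies inside an occurrence of a longer candidate".

```python
from collections import Counter


def extract_terms(word_strings, n=2):
    """Sketch of steps S02 and S03 (hypothetical helper).

    word_strings: iterable of sequences of atomic words, assumed to be
    the output of step S01. n: frequency threshold from step S02.
    """
    strings = [tuple(ws) for ws in word_strings]

    # S02: count every atomic word string and each contiguous substring.
    counts = Counter()
    for ws in strings:
        for i in range(len(ws)):
            for j in range(i + 1, len(ws) + 1):
                counts[ws[i:j]] += 1
    # Candidates are the strings that appear more than n times.
    candidates = {s for s, c in counts.items() if c > n}

    # S03 (assumed interpretation): keep a candidate only if at least one
    # of its occurrences is not contained in a longer candidate occurrence.
    kept = set()
    for ws in strings:
        spans = [
            (i, j)
            for i in range(len(ws))
            for j in range(i + 1, len(ws) + 1)
            if ws[i:j] in candidates
        ]
        for i, j in spans:
            nested = any(
                a <= i and j <= b and (a, b) != (i, j) for a, b in spans
            )
            if not nested:
                kept.add(ws[i:j])
    return kept


# Illustrative data only: three occurrences of one long string,
# one standalone occurrence of its prefix, and a rare word.
sample = (
    [("信息", "检索", "系统")] * 3
    + [("信息", "检索")]
    + [("数据",)] * 2
)
print(sorted("".join(t) for t in extract_terms(sample, n=2)))
# prints ['信息检索', '信息检索系统']
```

In this example "数据" is dropped in S02 because it appears only twice, while "检索系统" and the single atomic words pass the frequency threshold but are dropped in S03 because they never occur outside a longer candidate.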
[0034] Specifically, referring to Figure 2, the automatic word pr...