New word discovery method and system, electronic equipment and medium
A new word discovery and new word technology, applied in the field of data capabilities, can solve the problems of dependence on existing, low accuracy rate of new word discovery, low logic of new word discovery methods, etc., to improve accuracy, purpose and advantages Concise and easy Understand the effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0065] This embodiment provides a new word discovery method. Please refer to figure 1 , figure 1 is a flow chart of a new word discovery method according to an embodiment of the present application, such as figure 1 As shown, the new word discovery method includes the following steps:
[0066] Candidate word cohesion calculation step S1: after calculating the candidate word frequency and split word frequency, calculate the candidate word cohesion degree according to the candidate word frequency and the split word frequency;
[0067] Candidate word degree of freedom calculation step S2: calculate the information entropy of the left adjacent word and the information entropy of the right adjacent word of the candidate word, and select the information entropy with a small information entropy value from the information entropy of the left adjacent word and the information entropy of the right adjacent word Information entropy is used as the degree of freedom of candidate words; ...
Embodiment 2
[0084] Please refer to figure 2 , figure 2 It is a structural schematic diagram of the new word discovery system of the present invention. like figure 2 Shown, the new word discovery of invention is applicable to above-mentioned new word discovery method, new word discovery system, comprises:
[0085] Candidate word cohesion calculation unit 51: after calculating the candidate word frequency and split word frequency, calculate the candidate word cohesion degree according to the candidate word frequency and the split word frequency;
[0086] Candidate word degree of freedom calculation unit 52: calculate the information entropy of the left adjacent word and the information entropy of the right adjacent word of the candidate word, and select the information entropy value from the information entropy of the left adjacent word and the information entropy of the right adjacent word Information entropy is used as the degree of freedom of candidate words;
[0087] New word jud...
Embodiment 3
[0100] combine image 3 As shown, this embodiment discloses a specific implementation manner of an electronic device. The electronic device may include a processor 81 and a memory 82 storing computer program instructions.
[0101] Specifically, the above-mentioned processor 81 may include a central processing unit (CPU), or a specific integrated circuit (Application Specific Integrated Circuit, ASIC for short), or may be configured as one or more integrated circuits implementing the embodiments of the present application.
[0102] Among others, memory 82 may include mass storage for data or instructions. By way of example and not limitation, the memory 82 may include a Hard Disk Drive (HDD), a floppy disk drive, a Solid State Drive (SSD), a flash memory, an optical disk, a magneto-optical disk, a magnetic tape, or a Universal Serial Bus (Universal SerialBus, abbreviated as USB) drive or a combination of two or more of these. Memory 82 may include removable or non-removable ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com