Chinese word segmentation method based on hash table dictionary structure
A Chinese word segmentation and hash table technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as language information organization, achieve the effects of improving efficiency, improving matching efficiency, and increasing comparison speed
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0021] The present invention will be further described below in conjunction with the accompanying drawings.
[0022] Such as figure 1 As shown, first we need to establish a dictionary structure, and store the hash table in the present invention in the memory in the form of a linked list. At the same time, we also need to establish an index table to facilitate queries in subsequent programs.
[0023] In the preprocessing stage, what we need to do is to segment each sentence in the text to be processed with a period as the terminator, so as to reduce the complexity of the subsequent two-way scanning.
[0024] The next thing the system needs to do is to perform forward and reverse maximum matching for each text block to be processed. The basic process of the forward maximum matching method is: assuming that the length of the longest word in the word segmentation dictionary is n, each time a string s of length n is intercepted from the beginning of the string to be segmented, and...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com