Recursive and multilevel Chinese word segmentation method
A Chinese word segmentation, multi-level technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of large segmentation granularity, small segmentation granularity, long cycle, etc., to improve accuracy and eliminate ambiguity , to ensure the effect of segmentation granularity
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0015] Embodiments of the present invention will be specifically described below in conjunction with the accompanying drawings.
[0016] A recursive multi-level Chinese word segmentation method comprises the following steps:
[0017] Step 1, use the current dictionary tree to use the maximum matching algorithm to perform Chinese word segmentation on the input Chinese text, and generate the current word segmentation and the current word segmentation level;
[0018] Step 2, selectively masking the word segmentation generated in step 1 in the current dictionary tree;
[0019] Step 3, using the trie selectively masked in step 2 as the current trie;
[0020] Step 4, determine whether each Chinese word segmentation generated in the above step 1 has a non-single-character prefix word in the current dictionary tree, if there is a non-single-character prefix word in a word segment, then proceed to the above steps 1 to 3, if each If there is no non-single word prefix in the participle...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 