Chinese word segmentation method based on hash table dictionary structure
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- DALIAN UNIV
- Publication Date
- 2014-03-19
Description
Technical field
[0001] The invention relates to the technical field of Chinese information processing, and in particular to a Chinese word segmentation method based on a hash table dictionary structure.
Background art
[0002] Chinese word segmentation is the most basic and important problem in Chinese information processing. It is a key step in the automatic annotation of Chinese text, search engines, machine translation, speech recognition, and similar applications, and the quality of segmentation directly affects the accuracy of their results. Chinese differs from English in this respect: there is no formal delimiter between Chinese words, so a continuous sequence of Chinese characters can only be regrouped into words according to certain conventions of the language. The complexity and variability of Chinese sentence structure have therefore always made word segmentation a difficult point in Chinese information processing. The discovery of unregistered (out-of-vocabulary) words and the resolution of ambigui...