Four-layer Chinese text regularization system and its implementation
A four-layer text regularization technology, applied in special data processing applications, instruments, and electrical digital data processing, that addresses the problems of regularization rules being difficult to write, maintain, and generalize.
Embodiment Construction
[0018] The Chinese text regularization system proposed by the present invention comprises three parts: non-standard word recognition, non-standard word disambiguation, and standard pinyin generation. Together these form a Chinese text regularization system with a four-layer structure. First, finite automata recognize non-standard words in real text and assign each one a specific category label. Second, a conditional random field model, together with corresponding rules, assigns a sub-category to each ambiguous non-standard word. Third, an error-driven rule learning method constructs optimal rules to further refine the results of the previous stage. Finally, both basic and ambiguous non-standard words are passed to the last part, which generates the standard pronunciation. The whole system also provides web services in C/S (client/server) mode, and can support up ...
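The flow from recognition through disambiguation to pinyin generation can be illustrated with a minimal Python sketch. It is not the patented implementation: a regular expression stands in for the finite automata, a single hand-written context rule stands in for both the conditional random field model and the error-driven learned rules, and all names (`recognize`, `disambiguate`, `normalize`, the `DIGITS` and `CARDINAL` labels) are hypothetical. Only digit strings are handled, and cardinal readings cover 0 to 99 only.

```python
import re

# Tone-numbered pinyin for the digits 0-9.
DIGIT_PINYIN = ["ling2", "yi1", "er4", "san1", "si4",
                "wu3", "liu4", "qi1", "ba1", "jiu3"]

# Recognition layer: a regular expression stands in for the finite automata.
NUMBER = re.compile(r"[0-9]+")


def recognize(text):
    """Return (start, end, category) spans for non-standard words."""
    return [(m.start(), m.end(), "NUMBER") for m in NUMBER.finditer(text)]


def disambiguate(text, span):
    """Disambiguation layers: one hand-written rule replaces the CRF model
    and the learned rules (an assumption made for illustration only)."""
    start, end, _category = span
    # Assumed rule: a 4-digit number followed by 年 is a year, read digit by digit.
    if end - start == 4 and text[end:end + 1] == "年":
        return "DIGITS"
    return "CARDINAL"


def digits_pinyin(token):
    """Read a digit string one digit at a time."""
    return " ".join(DIGIT_PINYIN[int(ch)] for ch in token)


def cardinal_pinyin(token):
    """Read a digit string as a cardinal number (0-99 only in this sketch)."""
    n = int(token)
    if n > 99:
        # Simplification: larger values fall back to digit-by-digit reading.
        return digits_pinyin(token)
    if n < 10:
        return DIGIT_PINYIN[n]
    tens, ones = divmod(n, 10)
    # 10-19 are read "shi2 ..." without a leading "yi1".
    parts = ([] if tens == 1 else [DIGIT_PINYIN[tens]]) + ["shi2"]
    if ones:
        parts.append(DIGIT_PINYIN[ones])
    return " ".join(parts)


def normalize(text):
    """Run recognition, disambiguation, and pinyin generation in sequence."""
    results = []
    for span in recognize(text):
        token = text[span[0]:span[1]]
        sub_category = disambiguate(text, span)
        if sub_category == "DIGITS":
            reading = digits_pinyin(token)
        else:
            reading = cardinal_pinyin(token)
        results.append((token, sub_category, reading))
    return results


print(normalize("2008年有25人"))
# [('2008', 'DIGITS', 'er4 ling2 ling2 ba1'), ('25', 'CARDINAL', 'er4 shi2 wu3')]
```

The same digit string receives a different reading depending on its context, which is the ambiguity the middle layers are meant to resolve. In a fuller system the hand-written rule would be replaced by a trained sequence model and learned correction rules.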