Method for constructing input method word bank according to Chinese language model

A language model and input method technology, applied in the field of input method lexicon, can solve problems such as difficulty in managing the content of the lexicon, poor input experience of the recorder, difficulty in remembering, etc., so as to save computer resources, improve the input experience, and eliminate the effect of influence

Inactive Publication Date: 2017-08-01
杨文韬 +1
View PDF7 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] The purpose of the present invention is to solve the above-mentioned problems such as "small and incomplete, large and incomplete, difficult to memorize, difficult to manage the content of the thesaurus, poor input experience and low efficiency of the recorder", etc.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The basic idea of ​​the present invention is to use the Chinese language model to construct the input method lexicon and implement effective management on it. According to this basic thought, the corresponding module in the content of the present invention is further described below and elaborated as follows in conjunction with embodiment:

[0037] 1. Refining the Chinese language model

[0038] The Chinese language model is mainly refined from three dimensions.

[0039] The first is to refine according to the semantic integrity and speech pause law in language communication. Let me talk about semantic integrity first, which refers to the combination of words with complete meaning in a sentence. For example, if "eat, work, sing, pay money, and do homework" are respectively regarded as a semantic unit, then "after eating, after work, after singing, after paying money, and doing homework "After" should be regarded as a complete semantic unit corresponding to it. Accord...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention is applicable to the field of computer input methods and provides a method for constructing an input method word bank according to a Chinese language model. The method is composed of a Chinese language model module and a word creation module, wherein the Chinese language model module provides word formation information and word bank management information for the word creation module, the word creation module generates input method terms in batch by aid of database management software according to the word formation information provided by the Chinese language model, and addition / deletion, retrieval, ordering and other operation can be conveniently performed on the constructed word bank by use of the word bank management information provided by the Chinese language model module. The Chinese language model is customized according to the requirements of phonetic pause and semantic integrity in language communication and fully reflects characteristics of the Chinese language. Through the method, the key problems that traditional input method word banks are short of scientific word collection standards and low in word collection efficiency, word bank content is not comprehensive and not systematic and cannot be effectively managed, a user has no way of grasping the word bank content in an input process, the input speed is low, and language communication experience is lacked are effectively solved.

Description

technical field [0001] The invention relates to the field of computer input methods, in particular to an input method lexicon automatically generated according to a Chinese language model. Background technique [0002] In the field of Chinese input method, Chinese character encoding technology and thesaurus technology are two core technologies. After more than 30 years of development since the 1980s, Chinese character encoding technology has become mature and stable. At present, the space and potential for innovation and development of input methods have been concentrated in input method lexicon technology, but the current development status of input method lexicon technology Look, whether it is an input method developed for the standard keyboard of a desktop computer, an input method developed for a mobile terminal such as a mobile phone touch screen, or an input method developed for the field of speech recognition, there are five problems in its lexicon: [0003] One is t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F3/023
Inventor 杨文韬杨景玉
Owner 杨文韬
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products