User character recognition method in sentence-level Chinese character input method and machine learning system
A technology of Chinese character input and recognition methods, which is applied in the input/output process of data processing, instruments, electrical digital data processing, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
specific Embodiment approach 1
[0042] Specific embodiment one: the user's word recognition method in the Chinese input method described in the present embodiment is:
[0043] For the root c, the probability of the root c appearing in the word combination at the position rp is taken as the word-forming ability IWP(c,rp) of the root c:
[0044] IWP ( c , rp ) = C ( Word ( c , rp ) ) C ( c ) - - - ( 1 )
[0045] Wherein, C(Word(c,rp)) is the number of words that root c occurs with position rp in the corpus used for t...
specific Embodiment approach 2
[0071] Specific embodiment two: the online one-time learning method in the sentence-level input method described in this embodiment, the online learning method is:
[0072] Step 1. Align the output path cRoad[M] and the final candidate path wRoad[N] based on the length, and obtain the aligned output path cRoadA[L] and the final candidate path wRoadA[L]; M, N and L respectively represent the number of words contained in these two paths;
[0073] Step 2, set i=1;
[0074] Step 3. According to the information in the language model, calculate p(cRoadA[i]|cRoadA[i-1]) and p(wRoadA[i]|wRoadA[i-1]), and then use these two values to adopt Maximum a posteriori MAP (Maximum a Posterior) probability method to calculate the user adjustment value C with the largest posterior probability A ;Compare (wRoad[i-1],wRoad[i]) and the corresponding C A Added to the user language model library as a binary element;
[0075] Step 4: Set i=i+1, if i≤L, return to step 3; otherwise, one-time learn...
specific Embodiment approach 3
[0089] Specific Embodiment Three: The machine learning system in the sentence-level Chinese character input described in this embodiment is realized by using the user word recognition method described in Embodiment 1 and the online one-time learning method described in Embodiment 2. The system consists of a user word recognition module and an online one-time learning module, in which:
[0090] The user word identification module is used to identify whether the final output result obtained through user intervention in the sentence-level Chinese character input method is a user word, and encode the word judged as a user word, and then store the user word machine code into the sentence level In the user lexicon of the Chinese character input method;
[0091] The online one-time learning module is used for online one-time learning according to the optimal path output by the sentence-level Chinese character input method and the final path obtained through user intervention when the...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 