A method and system for updating input method lexicon

An input method lexicon technology, applied in the field of input methods, that addresses the problem of hot words (entries whose frequency of use rises temporarily) ranking low among the candidates and slowing down input, with the effect of increasing input speed and improving the performance and quality of the input method.

Active Publication Date: 2016-04-06
BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD
5 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

However, in practical applications some entries have the following characteristics. They exist in the system lexicon, but their average word frequency may be relatively low compared with other entries that share the same pronunciation, so when the user enters the corresponding coded string, the entry may rank relatively low among the candidates. Such entries may nevertheless go through a period of increased use (an entry in this phase is usually called a hot word). If candidates are still offered according to the current lexicon during that period, input speed suffers.
If the client waits for the server to generate a new lexicon, the long update cycle may mean that the entry's popularity has already passed; even if the word frequency of the entry has changed in the new lexicon, the change no longer serves any purpose.

Examples

Embodiment approach 1

[0066] In the first embodiment, the client obtains entry update information from the server. The entry update information indicates which entry or entries are to be updated. If the update involves a change in word frequency or in the strength of a multi-word relationship, it can also give the specific value of the word frequency or relationship strength after the change. If multi-word relationships are added, it can also indicate which entries the newly added relationships connect, and so on.
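The update described in [0066] can be sketched as follows. This is a minimal illustration, not the patent's implementation: the names `Entry`, `Lexicon`, `UpdateRecord`, and `apply_updates`, the record layout, and the sample words and frequencies are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class Entry:
    word: str
    code: str   # coded string, e.g. pinyin
    freq: int   # word frequency used for ranking


@dataclass
class Lexicon:
    entries: Dict[str, Entry] = field(default_factory=dict)
    # Strength of a relationship between an ordered pair of entries.
    relations: Dict[Tuple[str, str], int] = field(default_factory=dict)

    def candidates(self, code: str) -> List[str]:
        """Return the entries matching the code, highest frequency first."""
        hits = [e for e in self.entries.values() if e.code == code]
        return [e.word for e in sorted(hits, key=lambda e: -e.freq)]


@dataclass
class UpdateRecord:
    """One item of entry update information (hypothetical layout)."""
    word: str
    new_freq: Optional[int] = None       # word frequency after the change
    related_word: Optional[str] = None   # other entry in a relationship
    new_strength: Optional[int] = None   # relationship strength after the change


def apply_updates(lexicon: Lexicon, records: List[UpdateRecord]) -> None:
    """Update attributes of entries that already exist in the lexicon."""
    for r in records:
        entry = lexicon.entries.get(r.word)
        if entry is None:
            continue  # this sketch only touches existing entries
        if r.new_freq is not None:
            entry.freq = r.new_freq
        if r.related_word is not None and r.new_strength is not None:
            # Covers both changing an existing strength and adding a relationship.
            lexicon.relations[(r.word, r.related_word)] = r.new_strength


# Illustrative data: two entries sharing the code "shibo".
lexicon = Lexicon()
lexicon.entries["师伯"] = Entry("师伯", "shibo", 50)
lexicon.entries["世博"] = Entry("世博", "shibo", 10)
before = lexicon.candidates("shibo")
apply_updates(lexicon, [UpdateRecord("世博", new_freq=90)])
after = lexicon.candidates("shibo")
```

In this sketch the hot word moves to the front of the candidate list purely because its stored frequency was overwritten, so the ranking code itself does not need to know that hot words exist.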

[0067] In this case, the server can continuously collect hot words related to current events and, once they are collected, package the information to be updated into an update file for the client to download. The client then obtains the entry update information from the downloaded update file and performs the update according to its content. Of course, in practical applications, for the convenie...

Embodiment approach 2

[0075] In the first embodiment above, the attributes of an entry are modified by changing the word frequency of the entry or the multi-word relationships between entries (changing the strength of a relationship, or adding or deleting a relationship). In the second embodiment, the attributes can instead be changed by adding a hot word tag to the entry or to the multi-word relationship between entries. The client only needs to follow an agreed convention: when displaying candidates, it gives priority to entries that carry a hot word tag, or to phrases composed of entries whose multi-word relationship carries a hot word tag. In this case, the hot word update information does not need to state the updated word frequency or strength value; it simply marks the item as a hot word by means of a hot word tag.
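The tag-based variant in [0075] can be illustrated with a short sketch. It assumes a boolean `hot` field on each entry and a ranking rule that places tagged entries first and falls back to word frequency; the names `TaggedEntry`, `apply_hot_tags`, and `rank_candidates`, and the sample data, are hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable, List, Set


@dataclass
class TaggedEntry:
    word: str
    code: str          # coded string, e.g. pinyin
    freq: int          # stored word frequency, left untouched by tagging
    hot: bool = False  # hot word tag (hypothetical field name)


def apply_hot_tags(entries: Iterable[TaggedEntry], hot_words: Set[str]) -> None:
    """Mark the listed entries as hot words without changing their frequency."""
    for e in entries:
        if e.word in hot_words:
            e.hot = True


def rank_candidates(entries: Iterable[TaggedEntry], code: str) -> List[str]:
    """Tagged entries come first; ties are broken by descending frequency."""
    hits = [e for e in entries if e.code == code]
    hits.sort(key=lambda e: (not e.hot, -e.freq))
    return [e.word for e in hits]


# Illustrative data: the update information carries only the tag, no numbers.
entries = [
    TaggedEntry("师伯", "shibo", 50),
    TaggedEntry("世博", "shibo", 10),
]
before = rank_candidates(entries, "shibo")
apply_hot_tags(entries, {"世博"})
after = rank_candidates(entries, "shibo")
```

One consequence of this design, under the assumptions above, is that the original frequency survives the update, so clearing the tag later would restore the previous ordering without any further data from the server.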

[0076] That is to say, in this way, after learning the...

Embodiment approach 3

[0082] In the foregoing first and second embodiments, the server collects the hot words and produces a corresponding update file, and the client downloads it and updates the attributes of the entries. In the third embodiment, for some special hot words, the hot-word-related information can also be set on the client side: an attribute indicates the conditions under which the entry should be treated as a hot word, so that the client can obtain the hot word update information locally. Such special hot words are usually ones that recur at regular times, for example hot words related to festivals. For this type of entry, an attribute can be added directly in the input method lexicon to indicate when the entry is to be handled as a hot word. For example, during festivals such as "May 1st", "Dragon Boat Festival", and "New Year" every year, some entries related to tourism, travel, shopping, etc. may become hot words. Therefore, ho...
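A client-side condition of the kind described in [0082] might look like the sketch below. It assumes the attribute is an inclusive month/day window on the Gregorian calendar; festivals that follow the lunar calendar, such as the Dragon Boat Festival, would need a different condition. The names `SeasonalEntry`, `hot_window`, and `rank_candidates`, and the sample window, are hypothetical.

```python
from dataclasses import dataclass
from datetime import date
from typing import Iterable, List, Optional, Tuple

MonthDay = Tuple[int, int]  # (month, day)


@dataclass
class SeasonalEntry:
    word: str
    code: str
    freq: int
    # Inclusive (start, end) window in which the entry is treated as a hot
    # word, or None if the entry is never a seasonal hot word.
    hot_window: Optional[Tuple[MonthDay, MonthDay]] = None

    def is_hot(self, today: date) -> bool:
        if self.hot_window is None:
            return False
        start, end = self.hot_window
        current = (today.month, today.day)
        if start <= end:
            return start <= current <= end
        # Window wraps around the end of the year, e.g. Dec 28 to Jan 3.
        return current >= start or current <= end


def rank_candidates(entries: Iterable[SeasonalEntry], code: str,
                    today: date) -> List[str]:
    """Entries that are hot today come first, then descending frequency."""
    hits = [e for e in entries if e.code == code]
    hits.sort(key=lambda e: (not e.is_hot(today), -e.freq))
    return [e.word for e in hits]


# Illustrative data: a travel-related entry that is hot around May 1st.
entries = [
    SeasonalEntry("旅友", "lvyou", 50),
    SeasonalEntry("旅游", "lvyou", 10, hot_window=((4, 25), (5, 5))),
]
on_holiday = rank_candidates(entries, "lvyou", date(2016, 5, 1))
off_season = rank_candidates(entries, "lvyou", date(2016, 7, 1))
```

Because the condition is evaluated locally at lookup time, no update file has to be downloaded for entries of this kind.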

Abstract

The invention discloses a method and a system for updating an input method lexicon. The method comprises the following steps: obtaining entry update information; updating attributes of existing entries in the input method lexicon according to the entry update information; and providing candidates according to the updated entries. With this method and system, hot words can be updated in the lexicon on a relatively short cycle, which helps improve the performance of the input method system.

Description

Technical field

[0001] The invention relates to the technical field of input methods, in particular to a method and system for updating an input method lexicon.

Background

[0002] As an interface for human-computer dialogue, the input method system provides coding methods for entering various characters into computers or other devices (such as mobile phones). In other words, text that can only be entered through coding has to be entered into the computer by means of the input method system. The input method system therefore plays a pivotal role in human-computer interaction.

[0003] The input method system usually has its own lexicon. For Chinese, Japanese and other such text, the lexicon of the input method stores common entries and their corresponding coded strings (such as pinyin). After a coded string is input, the input method system can query the lexicon and display the entry corresponding to th...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F17/30, G06F3/023
Inventor: 查文
Owner: BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD