Method and system for revising user word bank

A user and thesaurus technology, applied in the field of input methods, can solve problems such as affecting user experience, occupying user word space, user input interference, etc., to expand the breadth and depth of applications, widely and accurately identify, and remove data noise.

Active Publication Date: 2013-04-17
BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD
View PDF9 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] 1. Record a large number of wrong inputs that do not need to be recorded as user words, occupying the user word space and reducing the efficiency of user word search and matching
[0008] 2. If the other entries that the user wants to input happen to have the same input codes (Pinyin, Wubi, etc.) as these spam entries, these entries will be ranked relatively high, which will inevitably bring interference to the user's input and affect user experience
First of all, the user's correction of the input may not be continuous, but intermittent; and, for application scenarios such as IM (instant messaging) and search engines, the original input cannot be edited, that is, the user cannot delete the original input characters , there is no delete operation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for revising user word bank
  • Method and system for revising user word bank
  • Method and system for revising user word bank

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0047] refer to figure 1 , which shows an embodiment of a method for correcting a user thesaurus according to the present invention, which may specifically include:

[0048] Step 101. Check whether the current input content is the same or similar to the input code of all or part of the user's completed input content, but the text is different; and / or check whether the current input content is the same as the user's completed input content. Part of it, the characters are the same but the input codes are different;

[0049] Step 102, if the conditions are met, correct the data in the user thesaurus based on the current input content and the error correction content; the error correction content is the part of the completed input co...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method and system for revising a user word bank. The method comprises checking whether current input contents are completely or partially same or similar with input contents on input codes and different on characters; and/or checking whether the current input contents are completely or partially same or similar with the input contents on the characters and different on the input codes; revising data in the user word bank based on the current input contents and error correction contents if conditions are met; and enabling the error correction contents to be a part of the input contents corresponding to the current input contents. The method and system can intelligently record user input information, avoids learn misinput words as much as possible and reduces data noise in the user word bank. The method and system does not need more limitation on user editing actions, greatly expand application range and depth of word bank revising, and can better remove the data noise which cannot be found in the prior art.

Description

technical field [0001] The invention relates to the technical field of input methods, in particular to a method and system for correcting user thesaurus. Background technique [0002] With the popularization and development of computer technology and Internet technology, input methods have become an important means for users to interact with computers. Users with different professional fields, different interests and usage habits have higher and higher requirements for the intelligence of input methods. [0003] Existing input methods generally improve the efficiency when users input characters by improving the update degree of entries in the system lexicon and the accuracy of word frequency information. [0004] The thesaurus installed on the user's machine along with the input method software installation package is often the basic thesaurus that meets the general input needs of ordinary users, and we call it the system thesaurus. However, for those personalized and non-u...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F3/023
Inventor 张扬王坚
Owner BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products