Phase level forecast inputting method based on personal corpus

A corpus and input method technology, applied in the field of phrase-level predictive input based on personal corpus, can solve the problems of difficulty in expansion, small vocabulary of idioms or idioms, poor flexibility, etc., and achieve the effect of improving input efficiency.

Active Publication Date: 2010-09-15
SAMSUNG ELECTRONICS CHINA R&D CENT +1
View PDF9 Cites 45 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] 2. Some input methods support the association of idioms or idioms, that is, after the user enters the first few characters of an idiom or idiom, the input method can provide the complete idiom or idiom as a candidate for the user, but this type of input The idiom or idiom library provided by the law can only be a common language habit of all users, and there are problems such as small idiom or idiom library vocabulary, poor flexibility, and difficult to expand
[0007] 3. Some input methods support the function of caching and matching the complete sentences entered by the user in the past, that is, to

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Phase level forecast inputting method based on personal corpus
  • Phase level forecast inputting method based on personal corpus
  • Phase level forecast inputting method based on personal corpus

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

[0026] What the present invention involves is to collect the input (such as short message, email or other text information) edited by the user in the past as a personal corpus, and perform word segmentation, phrase extraction, probability calculation and other preprocessing on it to form a specific probability file. When the user subsequently uses the input method to edit, after the user enters the initial Chinese characters or words, the words, phrases or sentences that the user may need to input later can be predicted.

[0027] figure 1 is a block diagram showing a method for phrase-level predictive input based on a personal corpus according to the present invention. The predictive input method at least includes the following parts: personal corpus processing module 108 , phrase processing module 109 , probability file formation and adjustment module 110 , ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a phase level forecast inputting method based on a personal corpus, comprising the following steps: collecting previous input of a user as the personal corpus; performing word segmentation on the previous input of the user by taking a sentence as a unit, and segmenting into characters and words having independent meanings; calculating the occurrence frequency of words or phases formed by words before and after, and calculating the conditional probability of the words or the phases occurring next to the previous words to form a probability file reflecting the unique language habit of the user; and forecasting the subsequent words, phrase or sentences expected to be input by the user after the words or the phrases at the beginning are input according to the probability file when the user input subsequently so as to facilitate selection and rapid input for the user. Therefore, the subsequent possible candidate words, phrases or sentences can be obtained when the user only inputs the beginnings of characters or words according to the probability file, thus improving input efficiency.

Description

technical field [0001] The present invention relates to a method for predicting input, and more specifically, relates to a method for performing phrase-level predictive input based on a personal corpus. Background technique [0002] Since there is no division between words in Chinese written sentences (different from English input, words are separated by spaces) and there is no clear definition of Chinese word division, so the earliest Chinese input method is to input with a single Chinese character. [0003] Most of the existing input methods can input words, but need to key in the corresponding pinyin or strokes, and then the input method prompts out corresponding alternative words or words for the user to choose. The problem that brings thereby is, when carrying out Chinese character word input, needs to key in too much information, and does not possess the associative function between words or phrases. [0004] Even if there are some improved input methods that have the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27G06F3/048G06F17/30G06F3/023
Inventor 万磊何亮叶松
Owner SAMSUNG ELECTRONICS CHINA R&D CENT
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products