Whole sentence generating method and device

A technology of candidate words and context, which is applied in the fields of instruments, computing, and electrical digital data processing, etc., can solve the problem of low accuracy, achieve high accuracy, and improve the effect of input experience

Active Publication Date: 2008-04-09
SHENZHEN SHI JI GUANG SU INFORMATION TECH
View PDF0 Cites 22 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But using this method, only the word with the highest word frequency can be selected. If the first candidate word is incorrect, the user has to re-select each phrase, and the accuracy rate is not high.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Whole sentence generating method and device
  • Whole sentence generating method and device
  • Whole sentence generating method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] The basic idea of ​​the present invention is to train the original text so that it contains co-occurring word frequency. Usually, the input method will have its own vocabulary, the way of dividing phrases is the word segmentation method, and the number of occurrences of each word is trained according to the original text, that is, word frequency. In the original text training process, the present invention not only counts the word frequency of a single word, but also the co-occurrence frequency of various phrases, that is, the co-occurring word frequency, and saves the statistical results in the vocabulary for future use. When the user enters text, select the candidate word currently input in pinyin with the highest probability of forming a complete sentence with the context, and generate a complete sentence output with the context.

[0039] The device of the present invention is shown in Figure 3, and the device includes: a word segmentation module, a statistical modul...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for generating a complete sentence. The method includes that segment the context on the both sides of candidate words; search each candidate word and the co-occurrence word frequency of the context in the word list; according to the co-occurrence word frequency, calculate the probability of compositing a complete sentence by each candidate word and the context, and select and output the candidate word with the highest probability to construct a complete sentence with the context. The invention also discloses a corresponding device, which comprises a query module, a first buffer area, a second buffer area, and a complete sentence output module; wherein, the first and the second buffer areas are used respectively to store the upper and the lower texts input by the current pinyin; the query module is used to search the word frequency of each candidate word and the co-occurrence word frequency of each candidate word and the context phrases; the complete sentence output module is used to calculate according to the condition probability of the co-occurrence of each candidate word and the context, and select the candidate word with the high condition probability to form and output the complete sentence with the context. The invention has a more high accuracy to output complete sentences.

Description

technical field [0001] The invention relates to Chinese character input technology, in particular to a method and device for generating a complete sentence. Background technique [0002] In the process of typing, it is often necessary to modify the input text, such as deleting individual words or sentences or inserting individual words and sentences, so that a new whole sentence needs to be generated according to the newly inserted words or words and sentences. Inserting a word or a sentence in the middle of a sentence in a traditional input method is no different from inputting in other occasions. The most commonly used method is the maximum probability method. The pinyin input method is taken as an example to describe in detail below. [0003] In the pinyin input method, one Chinese pinyin string can correspond to multiple candidate words. For example, the candidate words corresponding to the pinyin string of "dajia" may include: everyone, fighting, Dajia, cracking down...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F17/27
Inventor 张会鹏
Owner SHENZHEN SHI JI GUANG SU INFORMATION TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products