Text input method and device

A text input technology, applied in the field of input methods, that addresses the large memory usage and low accuracy of existing input methods, saving memory space and making prediction more accurate.

Pending Publication Date: 2019-12-13
PINGDINGSHAN UNIVERSITY

AI Technical Summary

Problems solved by technology

[0004] The purpose of the present invention is to provide a text input method that solves the low accuracy and large memory footprint of existing input methods, and to provide a text input device that solves the same problems in existing input devices.



Examples


Detailed Description of the Embodiments

[0028] Example of text input method:

[0029] In the text input method proposed in this embodiment, a given pinyin sequence p = {p_1, p_2, ..., p_n} consisting of n pinyin characters is input into the PS2CS model, which converts it into a Chinese character sequence c = {c_1, c_2, ..., c_m}. For example, as shown in Figure 1, for the pinyin sequence "shen_jing_wang_luo_mo_xing" (the connector "_" is not required in actual input; it is added here for readability), the PS2CS model should output "神经网络模型" ("neural network model").
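The relationship between the input and output sequences in this example can be sketched as follows. This is only an illustration of the data shapes, not the PS2CS model itself; the helper name `to_char_sequence` is hypothetical, and the target string is the Chinese for "neural network model".

```python
def to_char_sequence(pinyin):
    """Drop the readability connector "_" and return the pinyin characters p_1..p_n."""
    return [ch for ch in pinyin if ch != "_"]


# Input sequence p from the example above (connectors removed).
p = to_char_sequence("shen_jing_wang_luo_mo_xing")

# Target sequence c: the Chinese characters for "neural network model".
c = list("神经网络模型")

n, m = len(p), len(c)
print(n, m)  # prints: 21 6
```

The input has n = 21 pinyin characters while the output has m = 6 Chinese characters, so the model has to map between sequences of different lengths.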

[0030] The core of the present invention is the proposed self-attention model for the PS2CS task (the PS2CS model for short). Its key idea is to generate all the words of a Chinese sentence simultaneously by considering the entire pinyin character sequence. The most common approach in line with this idea is the Seq2Seq method for neural sequence modeling. Here, we use the self-attention mechanism ...
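Because the source text is truncated at this point, the exact attention layer used by the PS2CS model is not known. As a rough illustration of what "considering all pinyin characters at once" means, the following is a minimal sketch of generic scaled dot-product self-attention. It assumes identity query, key, and value projections (a real layer would learn these), and the function names are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(x):
    """Generic scaled dot-product self-attention over a sequence.

    x is a list of n vectors of equal dimension d. Assumption: queries,
    keys, and values are all x itself (no learned projections).
    """
    d = len(x[0])
    output = []
    for query in x:
        # Similarity of this position to every position in the sequence.
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(d) for key in x
        ]
        weights = softmax(scores)
        # Each output vector is a weighted mix of all input vectors.
        output.append(
            [sum(w * value[j] for w, value in zip(weights, x)) for j in range(d)]
        )
    return output


mixed = self_attention([[1.0, 0.0], [0.0, 1.0]])
```

Every output position is computed from all input positions in one pass, which is the property the paragraph above relies on.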


Abstract

The invention relates to a text input method and device, and belongs to the technical field of input methods. The text input method comprises: obtaining a pinyin sequence; inputting the pinyin sequence into a trained PS2CS model; and predicting, through the trained PS2CS model, the Chinese character sequence corresponding to the pinyin sequence. The training process of the PS2CS model comprises: converting each letter of a pinyin sequence sample in the training set data into a vector according to a lookup table, where the lookup table contains 28 characters (the 26 pinyin letters and 2 placeholder symbols) and the vector representation of each; outputting a prediction result through a prediction layer according to the pinyin sequence sample vector matrix; and comparing the prediction result with the corresponding standard Chinese character sequence sample in the training set data to compute a loss function. Because the original pinyin character sequence is vectorized directly and the corresponding Chinese character sequence is then predicted from it, memory space is greatly saved and prediction is more accurate.
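The lookup-table and loss steps of the training process can be sketched as below. This is a minimal sketch under several assumptions that the abstract does not state: the two placeholder symbols are named `<pad>` and `<unk>`, the embedding size is 4, the vectors are randomly initialized, and the loss is cross-entropy. The model between the vector matrix and the prediction layer is omitted entirely.

```python
import math
import random
import string

# Assumed names for the 2 placeholder symbols; the abstract does not name them.
PAD, UNK = "<pad>", "<unk>"
VOCAB = [PAD, UNK] + list(string.ascii_lowercase)  # 28 entries in total
EMBED_DIM = 4  # illustrative size, not taken from the source

# Lookup table: each of the 28 characters maps to a (here random) vector.
rng = random.Random(0)
LOOKUP = {ch: [rng.uniform(-0.1, 0.1) for _ in range(EMBED_DIM)] for ch in VOCAB}


def vectorize(pinyin, max_len):
    """Convert a pinyin string into a max_len x EMBED_DIM vector matrix."""
    symbols = [ch if ch in LOOKUP else UNK for ch in pinyin.lower()][:max_len]
    symbols += [PAD] * (max_len - len(symbols))
    return [LOOKUP[s] for s in symbols]


def softmax(logits):
    """Numerically stable softmax, as a prediction layer might apply."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(probs_per_step, target_ids):
    """Mean negative log-likelihood of the reference character at each step."""
    losses = [-math.log(probs[t]) for probs, t in zip(probs_per_step, target_ids)]
    return sum(losses) / len(losses)


matrix = vectorize("shen", max_len=6)

# Stand-in for prediction-layer output: uniform scores over 4 candidate characters.
uniform = softmax([0.0, 0.0, 0.0, 0.0])
loss = cross_entropy([uniform, uniform], [2, 0])
```

The memory saving claimed in the abstract follows from the table size: the input vocabulary has only 28 rows, regardless of how many pinyin syllables or words the language contains.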

Description

Technical field

[0001] The invention relates to a text input method and device, belonging to the technical field of input methods.

Background

[0002] An n-gram language model is generally used in Chinese pinyin input methods. Given the preceding n-1 words of a phrase (or sentence), the n-gram model estimates the probability of each candidate for the nth word. For computational efficiency, bigram or trigram models (n = 2 or 3) are used to model language sequences in practice. To model longer-distance word histories, some recent works therefore propose using continuous-space language models instead of n-gram models for pinyin input methods.

[0003] In fact, most modern pinyin input methods follow a serial process: 1) pinyin segmentation, which splits the input pinyin sequence into legal pinyin syllables; 2) generates candidate words for each Pinyin sy...
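The n-gram model mentioned in the background can be illustrated for the bigram case (n = 2). The following is a minimal sketch using maximum-likelihood counts with no smoothing; the three-sentence corpus and the function names are made up for illustration and do not come from the patent.

```python
from collections import Counter


def train_bigram(sentences):
    """Count how often each word occurs as a history and each (history, word) pair."""
    history_counts, pair_counts = Counter(), Counter()
    for words in sentences:
        for prev, cur in zip(words, words[1:]):
            history_counts[prev] += 1
            pair_counts[(prev, cur)] += 1
    return history_counts, pair_counts


def bigram_prob(history_counts, pair_counts, prev, cur):
    """Maximum-likelihood estimate of P(cur | prev); 0.0 for an unseen history."""
    if history_counts[prev] == 0:
        return 0.0
    return pair_counts[(prev, cur)] / history_counts[prev]


# Tiny made-up corpus of pre-segmented words.
corpus = [
    ["神经", "网络", "模型"],
    ["神经", "网络"],
    ["神经", "元"],
]
history_counts, pair_counts = train_bigram(corpus)
p_network = bigram_prob(history_counts, pair_counts, "神经", "网络")
```

Here "神经" is followed by "网络" in two of its three occurrences, so the estimate is 2/3. A bigram model only ever sees one word of history, which is the limitation that motivates the longer-range models discussed above.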


Application Information

IPC(8): G06F17/27, G06K9/62, G06N3/04, G06N3/08, G06F3/023
CPC: G06N3/08, G06F3/0233, G06N3/048, G06F18/214, Y02D10/00
Inventor: 熊蜀峰, 王丙坤, 娄鹏宇, 宁菲菲, 刘玉坤
Owner: PINGDINGSHAN UNIVERSITY