Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Statement-level Chinese and English mixed input method

A mixed input, Chinese and English technology, applied in the input/output process of data processing, instruments, electronic digital data processing, etc., can solve the problem of unable to guarantee sentence input, etc., to reduce the burden of thinking, improve input efficiency, and improve input habits Effect

Inactive Publication Date: 2012-06-27
HARBIN INST OF TECH
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The purpose of the present invention is to provide a sentence-level Chinese-English mixed input method to solve the problem that the existing Chinese-English mixed input method uses manual marking of which of the input letter strings needs to be converted into Chinese and which are English words, and continuous sentence input cannot be guaranteed.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Statement-level Chinese and English mixed input method
  • Statement-level Chinese and English mixed input method
  • Statement-level Chinese and English mixed input method

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment approach 1

[0009] Specific implementation mode one: the following combination Figure 1 to Figure 3 This embodiment will be described. It includes the following steps: 1. Input character strings with a keyboard; 2. Segment the character strings in Chinese and English by the input method management system; the Chinese-English segmentation is based on the automatic recognition of the Chinese pinyin database and the English thesaurus; 3. The input method management system performs corresponding Chinese and English character conversions on the segmented strings, and provides mixed sentence candidates and Chinese and English word candidates; the Chinese and English character conversions made are based on the Chinese thesaurus and Chinese Automatic recognition of English statistical database; 4. Select output content from the candidate list including mixed sentence candidates and Chinese and English word candidates through editing commands such as insert, delete, and select; 5. Output Chinese ...

specific Embodiment approach 2

[0019] Specific embodiment two: the difference between this embodiment and embodiment one is: it also includes step 6, according to the Chinese-English mixed sentence that step 5 outputs, construct Chinese unregistered vocabulary and English unregistered vocabulary and to English thesaurus, Chinese Thesaurus and Chinese and English statistical databases are updated.

[0020]The existing Chinese unregistered word recognition methods mainly include two methods based on statistics and based on rules. Statistics-based methods generally use statistical strategies to extract candidate strings, and then use language knowledge to exclude garbage strings that are not new words, which are suitable for processing large-scale corpus and are not limited by domains. However, its recall rate is low, it is sensitive to the problem of data sparsity, and there is a danger of "short word priority". The rule-based method usually builds a rule base, a professional thesaurus or a pattern base base...

specific Embodiment approach 3

[0022] Specific Embodiment Three: An example is specifically given below to illustrate the method of the present invention. Such as figure 1 As shown, the sentence-level Chinese-English mixed input first needs to be segmented into Chinese and English to recognize Chinese pinyin syllables and English words. After Chinese and English segmentation, according to the recognition result, the input method management system of the present invention performs corresponding Chinese and English character conversion on the segmented character strings, and provides mixed sentence candidates and Chinese and English word candidates. The user performs editing operations according to the Chinese and English segmentation results and candidates, and finally confirms the output. The present invention has fully considered Quanpin and Jianpin when Chinese syllables are segmented, and for the sake of convenience, Quanpin is mainly used in the process of illustration.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a statement-level Chinese and English mixed input method, which relates to an information input method, in particular to a Chinese and English mixed statement input method, and solves the problem that continuous sentence input cannot be ensured because which need to be converted into Chinese and which are English words in an input letter string are marked manually in the conventional Chinese and English mixed input method. The method comprises the following steps of: inputting a character string by using a keyboard; performing Chinese and English segmentation on the character string by using an input method management system, wherein the performed Chinese and English segmentation is automatic identification based on a Chinese pinyin library and an English word library; performing corresponding Chinese and English character conversion on the segmented character string and providing a mixed statement candidate and a Chinese and English word candidate by using theinput method management system, wherein the performed Chinese and English character conversion is the automatic identification based on a Chinese word library and a Chinese and English statistical library; selecting output contents from a candidate list finally by using the editing commands such as inserting, deleting, selecting and the like; and outputting a Chinese and English mixed statement. The method is used for character input.

Description

technical field [0001] The invention relates to an information input method, in particular to a Chinese-English mixed sentence input method. Background technique [0002] With the development of society and the gradual deepening of international exchanges, the phenomenon of mixed use of two languages ​​or even multiple languages ​​is becoming more and more common, which is always full of people's daily life. Among them, such as "This is OK!", "Ah Q Spirit" and other Chinese mixed with a small amount of English is the most common phenomenon. In order to meet the needs of the mixed use of Chinese and English, in the computer system, the input method is required to support the mixed input method of Chinese and English. At present, there are mainly two methods for Chinese and English mixed input: [0003] (1) Input Chinese and English independently by switching the Chinese and English input modes. This input method achieves the purpose of input by artificially splicing mixed ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F3/023
Inventor 王晓龙刘秉权汤步洲林磊刘远超王轩陈清财
Owner HARBIN INST OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products