Chinese character datamation input and output method

An input method and data-based technology, applied in the input/output process of data processing, electrical digital data processing, natural language data processing, etc., can solve problems such as increasing system overhead, making mistakes, and failing to consider the highest bit of character codes. , to achieve the effect of improving accuracy and safety and reliability

Pending Publication Date: 2022-03-15
史颖
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] Therefore, there are many problems in the computer internal code expression mode of Chinese characters bundled together with multiple bytes: the non-ASCII multi-byte Chinese character computer internal code only serves as the identification of Chinese characters, and has no computing functions such as digital sorting and retrieval.
The multi-byte high position "1" bundling mode will bring huge security risks: in some computer operating systems, the highest bit of the character code is not considered, and the character code system adopted by some operating systems is expanded ASCII characters, that is, 8-bit ASCII characters, in these operating systems, if the highest bit of a character is 1, it may be an extended ASCII character, or it may be a Chinese character with the highest bit set to 1. In this case, if the computer cannot make the correct distinction, there may be a headache of "garbled characters"
The existing Chinese character mode will bring operations such as non-stop transcoding transformation for Chinese computer processing, because the commonly used Chinese character input methods all need to use Chinese character input codes, such as pinyin codes, Wubi font codes, etc., and the input codes are accepted. Finally, it needs to be converted into an internal code by the "input code conversion module" of the Chinese character operating system before it can be stored and processed. This process will inevitably cause the alienation of the input code and the stored code, thus increasing the system overhead and inevitably will cause error

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese character datamation input and output method
  • Chinese character datamation input and output method
  • Chinese character datamation input and output method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0049] In this example, if figure 1 As shown, a kind of input method of Chinese character digitization is provided, and described method comprises:

[0050] Step S101, receiving and storing the input keyboard characters, the keyboard includes a physical keyboard or a virtual keyboard, that is, the keys encoded by this input method can use a commonly used physical keyboard, or a virtual keyboard, that is, a keyboard provided in a software program , such as keyboards in mobile phones and tablets, the stored input code characters can be used for computing and communication by computers.

[0051] Step S102, divide the English characters according to the coding rules, and determine the codes representing each Chinese character or word.

[0052] Step S103, query the encoding rule database obtained according to the encoding rules, determine the Chinese characters or words corresponding to the encoding, and output the Chinese characters corresponding to the encoding.

[0053] The en...

Embodiment 2

[0102] In this example, if figure 2 As shown, a kind of output method of Chinese speech data is provided, and described method comprises:

[0103] S201. Receive voice information.

[0104] S202. Identify syllables, tones, and sentence readings of the voice information.

[0105] S203, according to the syllables, tones and sentence readings of the voice information, determine the corresponding Chinese characters and sentence codes by calculating and querying the coding rule database, and then output the codes and / or Chinese with partial tones and sentence-reading marks corresponding to the codes text.

[0106] The encoding rule database may adopt the encoding rule described in Embodiment 1.

[0107] The calculation refers to precisely distinguishing the pronunciation difference between the suffix word and the word according to the syllable, tone and sentence reading information of the voice information, and accurately judging the sentence break.

[0108] For example, for th...

Embodiment 3

[0137] In this example, if image 3 As shown, a kind of output method of Chinese character encoding speech broadcast is provided, and described method comprises:

[0138] Step S301, receiving Chinese character codes with partial tones and sentence-reading marks.

[0139] Step S302, query the coding rule database to determine the pronunciation characteristics of the corresponding Chinese characters and sentences, and output the corresponding accurate and unambiguous voice information.

[0140] The encoding rule database may adopt the encoding rule described in Embodiment 1.

[0141] For example, for the sentence "this kind of talent is what we need", the difference in hyphenation and sentence segmentation during input causes this sentence to have two meanings. Differently, when inputting Chinese character codes according to two different meanings, the computer can analyze the codes according to the coding rules, determine the vocal characteristics of corresponding Chinese cha...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Chinese character datamation input and output method. According to the method, input keyboard characters are received and stored, the characters are divided according to coding rules, codes representing all Chinese characters or words are determined, a coding rule database is inquired to determine the Chinese characters or words corresponding to the codes, the Chinese characters corresponding to the codes are output, and target characters or statements can be input on electronic equipment. According to the coding rule, row, column, longitudinal and sequence four-digit western capital letter combinations are used for representing each Chinese character, a one-to-one corresponding recognizable mapping relation is established between the Chinese characters and ASCII codes, thorough ASCII coding of the Chinese characters is achieved, the Chinese characters can be sorted and retrieved, Chinese messy codes and system crash cannot be caused when a system processes information, and the system can be used for processing the information. The accuracy, safety and reliability of Chinese processing by a computer are greatly improved; according to the invention, each letter combination accurately represents a unique Chinese character and pronunciation thereof, the input code is a Chinese character built-in code, and keyboard touch typing input and card filling machine reading input of Chinese characters are realized.

Description

technical field [0001] The present application relates to the technical field of Chinese information processing, in particular to a method for inputting and outputting Chinese characters. Background technique [0002] The expression mode of language and characters in the computer and its network is very important. The data processed by the computer is actually binary data, that is, the computer can actually only recognize two states of 0 and 1. Therefore, in the process of computer development, people A very important problem to be solved is text processing, that is, how to convert text symbols into binary data, and how to assign unique binary codes to information or data. [0003] The world's first electronic computer was invented by Americans, and it was based on Western culture. English itself has only 26 letters, plus all the symbols Americans use every day, it will not exceed 100 indivual. Based on this, Americans have formulated a set of rules: American Standard Code...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F3/023G06F40/126
CPCG06F3/0233G06F40/126
Inventor 史颖
Owner 史颖
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products