Unlock instant, AI-driven research and patent intelligence for your innovation.

Technique for processing Chinese information words identification code in electronic computers

An electronic computer and identification code technology, applied in the fields of electronic digital data processing, calculation, special data processing applications, etc., can solve the problems that the machine cannot recognize, cannot be further analyzed, and the production of subject heading index cannot be automatically realized, etc., to achieve high efficiency. Effect

Inactive Publication Date: 2005-11-23
HUNAN UNIV
View PDF1 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] (1) The preservation of words (including storage disk and storage buffer) cannot be solved, so it is impossible to further analyze part of speech, meaning, and enter Chinese semantic understanding, and other language intelligence (such as translators), etc.
[0008] (2) The output of words (Chinese Pinyin) is difficult to solve, so the production of indexes such as keywords and subject words cannot be realized automatically
The problem is that the words expressed in Chinese characters cannot be recognized by the machine and cannot be processed automatically

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0070] Embodiment 1: such as the following sentences:

[0071] All cultural development is inseparable from invention and creation. Running the word segmentation technology developed by the previous invention (Luo Haiqing ) will produce:

[0072] All cultural development is inseparable from invention and creation.

[0073] The present invention is combined with the word extraction retrieval technology of previous invention again, can find out the linguistic boundary word in the sentence, and analyze sentence composition (digital identification), all information is by pinyin.exe program, come down with word identification code:

[0074] 1yiqiefwenhualfazhanm2lidbzkai3famingpyujchuangzaor.

[0075] 1-Subject 2-Verb 3-Object

[0076] This provides word meaning analysis, semantic understanding and the development of other intelligent software.

Embodiment 2

[0077] Embodiment 2: automatic indexing of documents.

[0078] In the case of a vocabulary provided by the user, the previous invention can be used to automatically extract words and compile an index with sentence page numbers, such as the index compilation of , which can be sorted at one time by the machine, and the pinyin words and corresponding Chinese characters can be combined Together, look up by pinyin words rather than syllables:

[0079] anleir-amines 343

[0080] baihel-white crane 004 526

[0081] baineilzhangz - cataract 036 087

[0082] baipitshu - white paper 338 386 544 875

[0083] baiselcezhis-white toilet paper 576

[0084] baiselwuranm - white pollution 348 349 350 560

[0085] banzganhanf - semi-arid 053 056 063 309 713 744

[0086] banzhanshengz-semi-arid 713

[0087] baochiqshuituh - water and soil conservation 077 712 713 716

[0088] baohuxhaiyangq-protect the ocean 031 247 24...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to an electronic computer Chinese information words identification code processing technique which is at the base of using general purpose Chinese phonetic to express Chinese characters. It uses the following method to entitle the suffix note identify code of the word: (1) establishing line and column structure of words' suffix note identify code meter as follows: b p m f d t k l j q h x z c s r; (2) transforming the line and column information into words' tone information wherein the characters of one, two, three, four lines separately represent one, two, three, four tones of words' first syllable, while the characters of one, two, three, four columns separately represent one, two, three, four tones of words' second syllable; (3) ascertaining the word which is located in the crossing of the line and the column by the said information changing rule as the word's tone identification code.

Description

technical field [0001] The invention relates to electronic computer Chinese information processing technology. Background technique [0002] Chinese information processing is divided into three levels according to the degree of language: (1) internal code word processing layer; (2) intermediate word processing layer; (3) pinyin word processing layer. [0003] Chinese information word processing technology is relatively mature. The internal code of the first and second level Chinese characters of the national standard contains a total of 6763 Chinese characters. Among them, there is a first-level Chinese character that contains the Chinese pinyin sequence, but it is not perfect, because the internal code is one word and one sound, and multi-phonetic characters cannot be accommodated, so it cannot be processed. . [0004] The middle character also corresponds to the Chinese characters of the first and second grades, but it contains 7585 Chinese characters including all sylla...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/28
Inventor 罗海清罗万
Owner HUNAN UNIV