Technique for processing Chinese information words identification code in electronic computers
An electronic computer and identification code technology, applied in the fields of electronic digital data processing, calculation, special data processing applications, etc., can solve the problems that the machine cannot recognize, cannot be further analyzed, and the production of subject heading index cannot be automatically realized, etc., to achieve high efficiency. Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Examples
Embodiment 1
[0070] Embodiment 1: such as the following sentences:
[0071] All cultural development is inseparable from invention and creation. Running the word segmentation technology developed by the previous invention (Luo Haiqing ) will produce:
[0072] All cultural development is inseparable from invention and creation.
[0073] The present invention is combined with the word extraction retrieval technology of previous invention again, can find out the linguistic boundary word in the sentence, and analyze sentence composition (digital identification), all information is by pinyin.exe program, come down with word identification code:
[0074] 1yiqiefwenhualfazhanm2lidbzkai3famingpyujchuangzaor.
[0075] 1-Subject 2-Verb 3-Object
[0076] This provides word meaning analysis, semantic understanding and the development of other intelligent software.
Embodiment 2
[0077] Embodiment 2: automatic indexing of documents.
[0078] In the case of a vocabulary provided by the user, the previous invention can be used to automatically extract words and compile an index with sentence page numbers, such as the index compilation of , which can be sorted at one time by the machine, and the pinyin words and corresponding Chinese characters can be combined Together, look up by pinyin words rather than syllables:
[0079] anleir-amines 343
[0080] baihel-white crane 004 526
[0081] baineilzhangz - cataract 036 087
[0082] baipitshu - white paper 338 386 544 875
[0083] baiselcezhis-white toilet paper 576
[0084] baiselwuranm - white pollution 348 349 350 560
[0085] banzganhanf - semi-arid 053 056 063 309 713 744
[0086] banzhanshengz-semi-arid 713
[0087] baochiqshuituh - water and soil conservation 077 712 713 716
[0088] baohuxhaiyangq-protect the ocean 031 247 24...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More