Named entity identification method and device for traditional Chinese medicine ancient book literature

A named entity recognition technology for ancient books of traditional Chinese medicine, applied in special data processing applications, instruments, electrical digital data processing, etc. It addresses the high cost of manual labeling and feature design and improves the level of automatic processing.

Status: Inactive
Publication Date: 2019-10-11
UNIV OF SCI & TECH BEIJING

AI Technical Summary

Problems solved by technology

Current methods all require either a large amount of manually labeled data or hand-designed features. Because labeling and feature design in the field of traditional Chinese medicine both require domain knowledge, the cost is relatively high.




Embodiment Construction

[0029] Embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0030] It should be clear that the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0031] For the convenience of description, the above devices are described by dividing their functions into various units / modules and describing them separately. Of course, when implementing the present invention, the functions of each unit / module can be implemented in one or more pieces of software and / or hardware.

[0032] As shown in Figure 1, a named entity recognition method for ancient Chinese medicine literature includes:

[0033] S1. Arranging entity words of at least one entity type to obtain a firs...



Abstract

The embodiment of the invention discloses a named entity recognition method and device for ancient books and documents of traditional Chinese medicine. The method comprises: arranging entity words of at least one entity type to obtain a first traditional Chinese medicine domain word list containing the entity types to be recognized; using the AutoPhrase automatic phrase mining technology to mine phrases from the ancient traditional Chinese medicine text corpus to obtain a second traditional Chinese medicine domain word list; marking the entities that appear in the corpus according to a preset back-marking strategy to obtain annotation data for the corpus; generating a training data set, a verification data set and a test data set, outputting the training data set to a training file, and outputting the verification data set and the test data set to a test file; reading data from the training file and the test file, training an automatic named entity recognition model on the data read in, and running prediction on the corpus to obtain a recognition result; and obtaining the recognized entities from that result.
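The marking and data set generation steps in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the excerpt does not describe the preset back-marking strategy, so forward longest-match dictionary lookup with BIO tags is swapped in as a common stand-in. The word list entries, the entity type names (HERB, FORMULA), the split ratios and the helper names back_label, split_dataset and to_conll are all hypothetical.

```python
import random

# Hypothetical word list mapping entity words to entity types.
# In the described method this would come from the curated and mined word lists.
VOCAB = {
    "人参": "HERB",
    "甘草": "HERB",
    "桂枝汤": "FORMULA",
}


def back_label(sentence, vocab):
    """Tag each character with a BIO label using forward longest-match lookup.

    Assumption: this stands in for the "preset back-marking strategy",
    which the excerpt does not specify.
    """
    max_len = max(map(len, vocab), default=0)
    tags = ["O"] * len(sentence)
    i = 0
    while i < len(sentence):
        matched = False
        # Try the longest candidate first so longer entities win over substrings.
        for n in range(min(max_len, len(sentence) - i), 0, -1):
            etype = vocab.get(sentence[i:i + n])
            if etype is not None:
                tags[i] = "B-" + etype
                for j in range(i + 1, i + n):
                    tags[j] = "I-" + etype
                i += n
                matched = True
                break
        if not matched:
            i += 1
    return list(zip(sentence, tags))


def split_dataset(samples, train=0.8, dev=0.1, seed=0):
    """Shuffle and split samples into training, verification and test sets.

    The 80/10/10 ratio is an illustrative default, not taken from the source.
    """
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    n_train = int(len(shuffled) * train)
    n_dev = int(len(shuffled) * dev)
    return (
        shuffled[:n_train],
        shuffled[n_train:n_train + n_dev],
        shuffled[n_train + n_dev:],
    )


def to_conll(samples):
    """Render labeled sentences as 'char tag' lines, blank line between sentences."""
    return "\n\n".join(
        "\n".join(f"{ch} {tag}" for ch, tag in sample) for sample in samples
    ) + "\n"
```

Calling back_label("桂枝汤加人参", VOCAB) tags the first three characters as one FORMULA entity, leaves 加 as O, and tags the last two characters as one HERB entity. The text returned by to_conll could then be written to the training file and the test file that the model training step reads.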

Description

Technical field

[0001] The invention relates to the field of Chinese processing, in particular to a named entity recognition method and device for ancient Chinese medicine documents.

Background technique

[0002] With the development of technology, it is necessary to perform named entity recognition on ancient Chinese medicine documents. The current methods all require a large amount of manual labeling data or design features. However, labeling and feature design in the field of traditional Chinese medicine require domain knowledge, so the cost is relatively high.

Contents of the invention

[0003] In view of this, the embodiments of the present invention provide a named entity recognition method and device for ancient Chinese medicine documents, which can improve the automation level of named entity recognition for ancient Chinese medicine documents.

[0004] A named entity recognition method for ancient Chinese medicine literature, including:

[0005] S1. Arranging ent...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/27
CPC: G06F40/295
Inventor: 谢永红, 夏超, 张德政, 阿孜古丽, 栗辉, 杨石兵
Owner: UNIV OF SCI & TECH BEIJING