Method and system for recognizing traditional Chinese medicine named entities in ancient Chinese medicine literature
A named entity recognition technology for ancient books of traditional Chinese medicine (TCM), applied in fields such as instruments, network data indexing, and other database retrieval. It addresses the problems that TCM named entity recognition is difficult, that existing approaches fail to obtain ideal results, and that manual labeling is difficult and expensive. Its stated effects are saving the cost of manual labeling, improving recognition results, and easy operation.
Examples
Embodiment 1
[0054] This embodiment provides a method for recognizing traditional Chinese medicine (TCM) named entities based on ancient Chinese medicine literature. Figure 1 is a schematic flow chart of the TCM named entity recognition method.
[0055] The named entity recognition described in this embodiment targets medical case records in ancient Chinese medicine literature, but the present invention is not limited to medical case records and can also be applied to other ancient Chinese medicine literature.
[0056] As shown in Figure 1, the TCM named entity recognition method based on ancient Chinese medicine literature comprises the following steps:
[0057] Step S1: obtain a medical case corpus from ancient Chinese medicine books.
[0058] Further, obtaining the medical case corpus from ancient Chinese medicine books specifically includes the following steps:
[0059] Step S11: use optical character recognition (OCR) to scan and recognize the existing p...
Embodiment 2
[0130] This embodiment provides a TCM named entity recognition system based on ancient TCM literature. The system comprises a corpus acquisition module, a data cleaning module, a language model pre-training module, a training set labeling module, a sequence labeling model training module, and an entity recognition module, wherein:
[0131] The corpus acquisition module is used to acquire the medical case corpus from ancient Chinese medicine books;
[0132] The data cleaning module is used to clean the acquired ancient TCM medical case corpus to be processed;
[0133] The language model pre-training module is used to pre-train a language model on the ancient TCM medical case corpus;
[0134] The training set labeling module is used to perform sequence labeling on the cleaned ancient TCM medical case corpus to form a training set for subsequent model...
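The module chain listed above can be illustrated with a minimal Python sketch. It covers only three of the six modules (data cleaning, training set labeling, and the decoding half of entity recognition); pre-training and sequence labeling model training are omitted. Everything concrete here is an assumption for illustration and is not taken from the patent: the function names, the cleaning rule, the character-level BIO tagging scheme, the entity types `SYMPTOM` and `FORMULA`, and the made-up sample sentence.

```python
import re
from typing import List, Tuple

# Hypothetical annotation format: (start, end_exclusive, entity_type).
Span = Tuple[int, int, str]


def clean_text(raw: str) -> str:
    """Data cleaning (assumed rule): drop whitespace and stray ASCII
    letters or digits, as might be left behind by OCR."""
    return re.sub(r"[\sA-Za-z0-9]+", "", raw)


def to_bio(text: str, spans: List[Span]) -> List[Tuple[str, str]]:
    """Training set labeling: turn entity spans into character-level BIO tags."""
    labels = ["O"] * len(text)
    for start, end, etype in spans:
        labels[start] = f"B-{etype}"
        for i in range(start + 1, end):
            labels[i] = f"I-{etype}"
    return list(zip(text, labels))


def decode_bio(pairs: List[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """Entity recognition output: collapse a BIO-tagged character
    sequence back into (entity_text, entity_type) pairs."""
    entities: List[Tuple[str, str]] = []
    current, ctype = "", None
    for ch, label in pairs:
        if label.startswith("B-"):
            if current:
                entities.append((current, ctype))
            current, ctype = ch, label[2:]
        elif label.startswith("I-") and current and label[2:] == ctype:
            current += ch
        else:
            if current:
                entities.append((current, ctype))
            current, ctype = "", None
    if current:
        entities.append((current, ctype))
    return entities


if __name__ == "__main__":
    # Made-up sample sentence, not from any real medical case record.
    cleaned = clean_text("患者 头痛，服桂枝汤 3")
    tagged = to_bio(cleaned, [(2, 4, "SYMPTOM"), (6, 9, "FORMULA")])
    print(decode_bio(tagged))
```

In a full system, the tags consumed by `decode_bio` would come from the trained sequence labeling model rather than from hand-written spans; the sketch only shows the data shapes that pass between the modules.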