NER-oriented Chinese clinical text data enhancement method and device
A text data and Chinese technology, applied in the field of Chinese clinical text data enhancement, can solve problems such as aggravated generation methods, violation of medical logic, ignoring semantic characteristics, etc., to achieve the effect of exploring the potential of the model and improving the difficulty
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0059] The specific embodiments of the present invention will be further described in detail below with reference to the accompanying drawings.
[0060] like figure 1 As shown, a kind of NER-oriented Chinese clinical text data enhancement method provided by the present invention, the main process and detailed description are as follows:
[0061] 1. Data preprocessing:
[0062] The data preprocessing process mainly includes word segmentation for unlabeled data and label linearization for labeled data.
[0063] For unlabeled data, it is mainly used for language model learning in the pre-training stage. Based on the existing medical dictionary, the unlabeled data is segmented by a combination of dictionary and rules.
[0064] For labeled data, it is mainly used for generative model training and optimization in the finetune stage. The main processing flow is as follows:
[0065] Entity segmentation:
[0066]Based on the existing medical dictionary and combined with the knowle...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com