Electronic medical record named entity recognition system and method
Named entity recognition technology for electronic medical records, applied in neural learning methods, electrical digital data processing, instruments, and related fields. It addresses problems such as excessive dependence on frameworks, failure to recognize nested named entities, and the high cost of labeling data, and achieves the effect of reducing coupling.
Examples
Embodiment 1
[0054] As shown in Figure 4, a named entity recognition system for electronic medical records is proposed, including:
[0055] A data cleaning unit, which cleans the original electronic medical record data to obtain standardized original data;
[0056] A rule pre-labeling unit, which applies labeling rules to the standardized original data to obtain rule pre-labeled data;
[0057] An algorithm pre-labeling unit, which applies a labeling algorithm to the rule pre-labeled data to obtain a pre-labeled data set;
[0058] A manual inspection and labeling unit, in which labelers correct and label the pre-labeled data set to generate a standard data set;
[0059] An input data construction unit, which classifies the standard data set and constructs inputs from it to obtain the input data;
[0060] A model construction unit, which builds the named entity recognition model for electronic medical records, that is, the first ...
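The units listed above form a chain in which each one consumes the output of the previous one. The following is a minimal sketch of that chaining only, not the patented implementation: the stage names, the `make_stage` helper, and the record layout are all hypothetical, and the model construction unit is left out because its description is truncated in the source.

```python
from typing import Callable, Iterable

# A stage takes a list of records and returns a new list of records.
Stage = Callable[[list], list]


def run_pipeline(records: list, stages: Iterable[Stage]) -> list:
    """Pass the records through each stage in order."""
    for stage in stages:
        records = stage(records)
    return records


def make_stage(name: str) -> Stage:
    """Hypothetical placeholder stage: it only appends its own name
    to each record's history so the order of processing is visible."""
    def stage(records: list) -> list:
        return [{**r, "history": r["history"] + [name]} for r in records]
    return stage


# Assumed stage names mirroring the units in paragraphs [0055] to [0059].
STAGE_NAMES = (
    "data_cleaning",
    "rule_pre_labeling",
    "algorithm_pre_labeling",
    "manual_inspection",
    "input_construction",
)
STAGES = [make_stage(name) for name in STAGE_NAMES]

result = run_pipeline([{"text": "...", "history": []}], STAGES)
print(result[0]["history"])
```

Because every stage shares the same signature, one unit can be swapped or tested without touching the others, which is one plausible way to read the stated goal of reducing coupling.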
Embodiment 2
[0074] A method for named entity recognition for electronic medical records, comprising the following steps:
[0075] 1. Data cleaning
[0076] Data cleaning of the original data mainly standardizes and unifies punctuation marks and English text.
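A minimal sketch of such a cleaning step follows. The source does not say which marks are unified or into what form, so the choices here are assumptions: full-width letters, digits, and punctuation are folded to their ASCII forms with Unicode NFKC normalization, and the small `PUNCT_MAP` table (hypothetical) covers marks that NFKC leaves unchanged.

```python
import unicodedata

# Assumed mapping for punctuation that NFKC does not fold to ASCII.
PUNCT_MAP = str.maketrans({"。": ".", "、": ","})


def clean_record(text: str) -> str:
    """Unify punctuation and full-width English characters in one record."""
    # NFKC turns full-width forms such as "ＣＴ" and "，" into "CT" and ",".
    text = unicodedata.normalize("NFKC", text)
    # Map the remaining CJK punctuation marks explicitly.
    text = text.translate(PUNCT_MAP)
    return text.strip()


# Sample sentence: "Patient CT examination, no abnormality found."
print(clean_record(" 患者ＣＴ检查，未见异常。 "))
```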
[0077] 2. Rule pre-labeling
[0078] For descriptions of time points and time periods in the electronic medical record, regular expressions are derived and collected into a regular expression library, time expressions following different patterns are classified, and the extracted entities are pre-labeled.
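The rule-based step can be illustrated with a small regular expression library. This is a sketch under stated assumptions: the two labels `TIME_POINT` and `TIME_PERIOD` and both patterns are hypothetical examples, not the patterns used in the patent, and a real library would need many more expressions.

```python
import re

# Hypothetical pattern library: label -> compiled regular expression.
# Earlier entries take priority when two matches overlap.
TIME_PATTERNS = {
    # Calendar dates such as 2020年3月5日 or 2020-03-05.
    "TIME_POINT": re.compile(r"\d{4}年\d{1,2}月\d{1,2}日|\d{4}-\d{1,2}-\d{1,2}"),
    # Durations such as 3天前 ("3 days ago") or 2个月 ("2 months").
    "TIME_PERIOD": re.compile(r"\d+(?:天|周|个月|年)(?:前|余|来)?"),
}


def pre_label_times(text: str) -> list:
    """Return sorted (start, end, label) spans for time expressions."""
    spans = []
    for label, pattern in TIME_PATTERNS.items():
        for m in pattern.finditer(text):
            # Skip a match that overlaps a span already claimed.
            if any(m.start() < end and start < m.end() for start, end, _ in spans):
                continue
            spans.append((m.start(), m.end(), label))
    return sorted(spans)


# Sample: "Patient had fever 3 days ago, admitted on 2020-03-05."
print(pre_label_times("患者3天前发热，2020年3月5日入院"))
```

Checking for overlap keeps the year inside a full date from also being labeled as a period, which is why the date pattern is listed first.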
[0079] 3. Algorithm pre-labeling
[0080] Corresponding entity dictionaries are constructed from standardized drug databases, disease databases, surgery databases, symptom databases, and other sources of standardized names. These dictionaries serve as proprietary entity names and need to be updated iteratively. Names shorter than 2 characters must be excluded from the dictionaries. Use the word segmentation package ...
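Dictionary-based pre-labeling of this kind can be sketched as follows. The source text is truncated before it names the word segmentation package, so this sketch substitutes a plain forward longest-match lookup instead; the function names, entity type names, and sample entries are all hypothetical.

```python
def build_dictionary(names_by_type: dict) -> dict:
    """Merge per-type name lists into one name -> entity type lookup,
    dropping names shorter than 2 characters."""
    lookup = {}
    for entity_type, names in names_by_type.items():
        for name in names:
            if len(name) >= 2:
                lookup[name] = entity_type
    return lookup


def dictionary_pre_label(text: str, lookup: dict) -> list:
    """Label text by forward longest match against the dictionary.
    Returns (start, end, entity_type) spans."""
    max_len = max(map(len, lookup), default=0)
    spans = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, down to length 2.
        for length in range(min(max_len, len(text) - i), 1, -1):
            candidate = text[i:i + length]
            if candidate in lookup:
                spans.append((i, i + length, lookup[candidate]))
                i += length
                break
        else:
            i += 1  # no dictionary entry starts here
    return spans


# Sample entries: aspirin, calcium (1 character, dropped), hypertension.
lookup = build_dictionary({"DRUG": ["阿司匹林", "钙"], "DISEASE": ["高血压"]})
# Sample: "Hypertension patient takes aspirin orally."
print(dictionary_pre_label("高血压患者口服阿司匹林", lookup))
```

Dropping single-character names avoids labeling common characters that happen to coincide with a short entity name.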