A method and system for removing privacy of medical texts based on stacking ensemble learning
An integrated learning and privacy-removing technology, applied in unstructured text data retrieval, text database clustering/classification, instruments, etc., can solve the problem of removing private information from medical texts
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0093] A medical text privacy removal system based on Stacking integrated learning, the technical solution adopted is as follows, and the system includes:
[0094] A text segmentation module used to segment the input text to obtain a processing unit token;
[0095] A feature extraction module for obtaining the relevant features of each processing unit token;
[0096] A rule-based PHI labeling module for building on training data and obtaining transformation-based rules automatically;
[0097] Used to build and obtain PHI labeling modules based on conditional random fields on the training data;
[0098] It is used to establish and obtain the PHI labeling module based on the neural network on the training data;
[0099] It is used to mark each processing unit token by using the PHI marking module, the conditional random field-based PHI marking module and the neural network-based PHI marking module, and identify the PHI entity recognition module of the PHI entity in each proces...
Embodiment 2
[0156] A kind of medical text de-privacy method based on Stacking integrated learning, the adopted technical scheme is as follows, and described method comprises:
[0157] A text segmentation step for segmenting the input text to obtain a processing unit token;
[0158] A feature extraction step for obtaining relevant features of each processing unit token;
[0159] An automatic acquisition step based on conversion rules for establishing and obtaining an automatic acquisition model based on conversion rules on the training data;
[0160] a conditional random field based learner step for building and obtaining a conditional random field based learner model on the training data;
[0161] For establishing and obtaining the neural network-based learner model on the training data based on the neural network learner step;
[0162] It is used to mark each processing unit token by using the conversion-based rule automatic acquisition model, the conditional random field-based learner...
PUM

Abstract
Description
Claims
Application Information

- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com