Method and device for data standardization processing of medical big data

A standardization-processing and big-data technology, applied in the field of medical entity recognition, that addresses the reliance on a single matching-based word segmentation method and the difficulty of standardizing medical big data.

Active Publication Date: 2017-07-04
易保互联医疗信息科技(北京)有限公司
Cites: 7; Cited by: 26

AI Technical Summary

Problems solved by technology

[0005] Embodiments of the present invention provide a data standardization processing method and device for medical big data. They address two problems in the prior art: there is no research on automatic term standardization for medical big data, and the existing matching-based word segmentation methods are limited to a single approach, which makes it difficult to standardize massive volumes of medical big data accurately.



Examples


Embodiment Construction

[0112] The technical solutions in the embodiments of the present invention are described below clearly and completely with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art based on these embodiments, without creative effort, fall within the protection scope of the present invention.

[0113] As shown in Figure 1, an embodiment of the present invention provides a data standardization processing method for medical big data, including:

[0114] Step 101: Acquire the sentences to be processed in the original data.

[0115] Step 102: Segment the sentence to be processed into individual characters, determining each character in the sentence.

[0116] Step 103: According to the pre-trained conditional random field (CRF) model, determine the...
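Steps 101 to 103 can be illustrated with a minimal sketch. The text of step 103 is cut off here, so what the CRF model outputs is not stated at this point. The abstract refers to an entity tag sequence, so the sketch assumes one tag per character in the common BIO scheme (`B-` begins an entity, `I-` continues it, `O` is outside). The function names, the `DIS` label, the sample sentence (meaning "the patient has hypertension") and the hard-coded tags are all hypothetical, and no CRF is actually trained or called.

```python
from typing import List, Optional, Tuple


def segment_characters(sentence: str) -> List[str]:
    """Step 102 (sketch): split a sentence into characters, skipping whitespace."""
    return [ch for ch in sentence if not ch.isspace()]


def decode_entities(chars: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Turn a per-character BIO tag sequence into (entity_text, entity_type) pairs.

    The BIO scheme is an assumption; the source only says that candidate
    entities are derived from an entity tag sequence.
    """
    if len(chars) != len(tags):
        raise ValueError("chars and tags must have the same length")

    entities: List[Tuple[str, str]] = []
    current: List[str] = []
    current_type: Optional[str] = None

    def flush() -> None:
        # Emit the entity collected so far, if any, and reset the buffer.
        nonlocal current, current_type
        if current and current_type is not None:
            entities.append(("".join(current), current_type))
        current, current_type = [], None

    for ch, tag in zip(chars, tags):
        if tag.startswith("B-"):
            flush()
            current, current_type = [ch], tag[2:]
        elif tag.startswith("I-") and current and tag[2:] == current_type:
            current.append(ch)
        else:
            # "O", or an "I-" tag that does not continue the open entity.
            flush()
    flush()
    return entities


# Hypothetical sentence and hypothetical tagger output (not from the source).
chars = segment_characters("患者有高血压")
tags = ["O", "O", "O", "B-DIS", "I-DIS", "I-DIS"]
candidates = decode_entities(chars, tags)
print(candidates)
```

In a real pipeline the `tags` list would come from the trained CRF model rather than being written by hand; only the decoding step is shown.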


Abstract

The invention provides a data standardization processing method and device for medical big data and relates to the technical field of medical entity recognition. The method comprises: determining a first group of candidate entities of a sentence to be processed according to the entity tag sequence of the sentence; extracting words from the sentence according to a preset medical ontology term extraction strategy to determine a second group of candidate entities; determining the entities in the sentence from the first and second groups of candidate entities; screening according to preset syntactic analysis and screening rules to determine the candidate standardized terms in the sentence; if a candidate standardized term matches a preset medical ontology term library, determining that candidate as the standardized term; and if the matching fails, either generating a matching-failure problem report or performing fuzzy matching on the unmatched candidate standardized terms that belong to the disease term type, so as to determine the standardized term.
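The extraction and matching stages summarized above can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented method: the source does not say how terms are extracted from the term library, how the two candidate groups are combined, or which fuzzy matching algorithm is used. The sketch substitutes forward maximum matching for extraction, an order-preserving union for combination, and `difflib.get_close_matches` from the Python standard library for fuzzy matching. The syntactic analysis and screening step is omitted. All function names, the sample vocabulary and the `cutoff` value are hypothetical.

```python
import difflib
from typing import Dict, List, Optional, Set


def extract_dictionary_terms(sentence: str, vocabulary: Set[str]) -> List[str]:
    """Second candidate group (sketch): forward maximum matching against a vocabulary."""
    max_len = max((len(term) for term in vocabulary), default=0)
    found: List[str] = []
    i = 0
    while i < len(sentence):
        # Try the longest possible span first, then shrink it.
        for length in range(min(max_len, len(sentence) - i), 0, -1):
            piece = sentence[i:i + length]
            if piece in vocabulary:
                found.append(piece)
                i += length
                break
        else:
            i += 1  # no term starts here; move one character forward
    return found


def merge_candidates(first: List[str], second: List[str]) -> List[str]:
    """Assumed combination rule: order-preserving union of both groups."""
    return list(dict.fromkeys(first + second))


def standardize(candidate: str, term_library: Dict[str, str],
                is_disease: bool, cutoff: float = 0.6) -> Optional[str]:
    """Exact match first; fuzzy fallback only for disease-type candidates.

    Returns None when matching fails, which a caller could record in a
    matching-failure report.
    """
    if candidate in term_library:
        return term_library[candidate]
    if is_disease:
        close = difflib.get_close_matches(candidate, list(term_library), n=1, cutoff=cutoff)
        if close:
            return term_library[close[0]]
    return None


# Hypothetical term library: surface form -> standardized term.
library = {"hypertension": "Hypertension", "diabetes": "Diabetes mellitus"}

second_group = extract_dictionary_terms("history of hypertension and diabetes", set(library))
merged = merge_candidates(["hypertension"], second_group)
print(merged)
print(standardize("hypertenson", library, is_disease=True))
print(standardize("asprin", library, is_disease=False))
```

Restricting the fuzzy fallback to disease-type candidates mirrors the abstract; a misspelled candidate of any other type simply fails to match and returns `None`.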

Description

Technical field

[0001] The invention relates to the technical field of medical entity recognition, in particular to a data standardization processing method and device for medical big data.

Background

[0002] In recent years, with the development of medical and health informatization, the medical and health field has entered the era of big data. The medical business process is also a process of accumulating medical big data, which has a huge impact on the medical and health industry. For example, analyzing and mining medical big data enables comparative effectiveness research on clinical operations, the construction of clinical decision support systems, research on health economics and treatment efficacy, and the analysis of disease patterns, thereby promoting the development of medicine and improving the quality of clinical medicine. The current medical big data includes clinical data (such as electronic medical records, healt...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F19/00
CPC: G06F19/32
Inventor: 金以东, 黄玉丽, 李雪莉
Owner: 易保互联医疗信息科技(北京)有限公司