Medical information data item name standardization method and system, equipment and medium

A data item name standardization technique for medical information technology, applied in the field of data standardization of medical data sources. It addresses problems such as a large amount of calculation and the time-consuming, labor-intensive handling of data from multiple data sources, and achieves strong adaptability with a reduced amount of calculation.

Pending Publication Date: 2021-12-28
PING AN TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

Differing data standards across sources make it difficult to fuse data from multiple data sources. Unifying and standardizing such data manually is time-consuming and labor-intensive. Existing standardization methods, on the one hand, vectorize the names and compare them; the amount of calculation is large, and different vectorization will bring devia...



Example Embodiment

[0043] The present invention is described in detail below in connection with specific embodiments, which explain the invention and do not limit it.

[0044] To make the objects, technical solutions, and advantages of the present invention clearer, the technical solutions in the embodiments of the present invention are described below. The described embodiments are some, not all, of the embodiments of the invention. All other embodiments obtained by those of ordinary skill in the art based on these embodiments, without creative effort, fall within the scope of protection of the present invention.

[0045] It should be noted that the embodiments of the present application and the features in those embodiments can be combined with each other provided they do not conflict.

[0046] The present invention may be described in the general context of computer-executable instr...


Abstract

The invention relates to data standardization of medical data sources, in particular to a medical information data item name standardization method and system, equipment and a medium. It automatically standardizes data from a plurality of data sources at the level of the literal description, is reasonable in design, simple in processing and highly adaptable, and greatly reduces manual effort and improves efficiency. The method comprises the following steps: unifying and de-duplicating, at the character level, the initial data item names acquired from a plurality of medical information data sources to obtain data items with distinct names; constructing an n-gram feature set for each data item according to the number of characters in its name; obtaining, from the n-gram feature sets, a character-level name similarity between every two data items and constructing a similarity matrix; and clustering the data items whose similarity in the similarity matrix exceeds a similarity threshold, and assigning the same standardized name to all data items in each cluster.
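The four steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not specify the unification rules, the n-gram sizes, the similarity measure, the clustering algorithm, or how the standardized name is chosen. The sketch assumes NFKC folding plus lowercasing and whitespace removal for unification, n-grams of size 1 up to `max_n` capped by the name length, Jaccard similarity, connected-component (single-link) clustering, and the shortest member as the cluster's standardized name. All function names and parameters (`normalize`, `ngram_set`, `jaccard`, `standardize`, `max_n`, `threshold`) are hypothetical.

```python
import unicodedata
from itertools import combinations


def normalize(name: str) -> str:
    """Step 1 (assumed rules): NFKC-fold, lowercase, and drop whitespace."""
    folded = unicodedata.normalize("NFKC", name).lower()
    return "".join(ch for ch in folded if not ch.isspace())


def ngram_set(name: str, max_n: int = 3) -> frozenset:
    """Step 2 (assumed): character n-grams for n = 1..min(max_n, len(name))."""
    top = min(max_n, len(name))
    return frozenset(
        name[i:i + n] for n in range(1, top + 1) for i in range(len(name) - n + 1)
    )


def jaccard(a: frozenset, b: frozenset) -> float:
    """Assumed similarity measure: |A ∩ B| / |A ∪ B|."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def standardize(raw_names, threshold: float = 0.5) -> dict:
    """Map each distinct normalized name to its cluster's standardized name."""
    # Step 1: unify and de-duplicate, keeping first-seen order.
    names = list(dict.fromkeys(normalize(n) for n in raw_names if n.strip()))
    feats = [ngram_set(n) for n in names]
    size = len(names)

    # Step 3: symmetric similarity matrix over all pairs of names.
    sim = [[1.0] * size for _ in range(size)]
    for i, j in combinations(range(size), 2):
        sim[i][j] = sim[j][i] = jaccard(feats[i], feats[j])

    # Step 4 (assumed): link pairs above the threshold with union-find,
    # so each connected component becomes one cluster.
    parent = list(range(size))

    def find(x: int) -> int:
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for i, j in combinations(range(size), 2):
        if sim[i][j] > threshold:
            parent[find(j)] = find(i)

    clusters = {}
    for idx in range(size):
        clusters.setdefault(find(idx), []).append(names[idx])

    # Assumed naming rule: shortest member, ties broken alphabetically.
    mapping = {}
    for members in clusters.values():
        standard = min(members, key=lambda m: (len(m), m))
        for member in members:
            mapping[member] = standard
    return mapping
```

With a threshold of 0.5, calling `standardize(["Patient Name", "patient  name", "Patient_Name", "Blood Pressure"])` places the normalized names `patientname` and `patient_name` under one standardized name, while `bloodpressure` stays in its own cluster. Because the comparison uses set operations on characters rather than learned vectors, the cost per pair stays small, which matches the abstract's stated aim of simple processing.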

Description

Technical Field

[0001] The invention relates to data standardization of medical data sources, in particular to a method, system, device and medium for standardizing names of medical information data items.

Background

[0002] With the advancement of informatization in various industries, massive amounts of data are stored electronically. For example, in the medical industry, more and more medical institutions use a Hospital Information System (HIS) to manage collected data. This type of information system improves data collection and management, but it also raises the problem of standardizing data that comes from different data sources.

[0003] The HIS of each medical institution has its own set of data standardization methods. However, the data standardization methods of different medical institutions are usually different, and it is very difficult to implement data standardization in multiple medical institut...


Application Information

IPC(8): G06F40/247, G06F40/216, G06K9/62
CPC: G06F40/247, G06F40/216, G06F18/23, G06F18/22
Inventor 唐蕊
Owner PING AN TECH (SHENZHEN) CO LTD