Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Medical data duplicate checking and associating method and system

A medical data and data technology, applied in the field of data processing, can solve problems such as inability to effectively check medical data, lack of medical data association, etc.

Inactive Publication Date: 2017-07-07
JIANGSU TODAYSOFT TECH
View PDF6 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] In view of the above problems, the present invention solves the problems in the prior art that it is impossible to effectively check a large amount of medical data with unstructured data and the lack of association between medical data through a method and system for checking and associating medical data. question

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Medical data duplicate checking and associating method and system
  • Medical data duplicate checking and associating method and system
  • Medical data duplicate checking and associating method and system

Examples

Experimental program
Comparison scheme
Effect test

experiment example

[0111] A group (Group A) of people with medical professional background screened 10,000 valid tumor medical records manually, and randomly selected 200 of these valid medical records. The people in group A used the extracted 200 medical records as a template, and modified some data except the patient's basic information through manual editing to obtain 200 new medical records A. Using the same 200 medical records of a group (group B) of people without a medical professional background as a template, a new 200 medical records B were obtained by manually editing and modifying some of the data except the basic information of the patient.

[0112] After mixing the original 10,000 valid medical records, 200 "duplicate" medical records A and 200 "duplicate" medical records B, the other four groups (group C, group D, group E and group F) with medical professional background and The other four groups (Group G, Group H, Group I, and Group J) who do not have medical professional backgro...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a medical data duplicate checking and associating method and system. The method comprises the following steps of: (1) extracting core data items in to-be-processed medical data; (2) classifying the core data items; (3) respectively carrying out preliminary screening on the data items in an exclusion array and a fuzzy array; (4) carrying out deep screening on the data items in the core data items; (5) setting a threshold value M2 of a suspected duplicated data similarity and / or a threshold value M3 of suspected associated data; and (6) after artificially checking and judging the suspected duplicated and / or associated data, inputting the data which is judged as non-duplicated data into a medical database, and endowing the data which is judged to be associated with one or more corresponding association labels. Compared with the prior art, the method and system provided by the invention have the characteristics of being low in missed judging rate, low in wrong judging rate and high in duplicate checking efficiency, and do not have high requirement for the profession degree of artificial checking, so that the operation costs of the duplicate checking and associating are remarkably reduced.

Description

technical field [0001] The present invention generally relates to data processing technology, and more particularly, relates to a medical data plagiarism checking and correlation processing method and system. Background technique [0002] In the practice of the medical data collection process, there is a possibility that the same data is collected multiple times and entered into the database, and there is also the possibility that the data is collected and entered into the database as different data after slight changes by professionals or non-professionals. In order to ensure the authenticity and validity of the data in the medical database, it is necessary to set up a plan. After the data is submitted and before it is formally reviewed and put into the database, it will be checked for duplicates, and duplicate data will be blocked from the gate of the database. Due to the large amount of unstructured data in medical data, such as symptom descriptions in medical records, tr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F19/00
CPCG06F16/24553G06F16/2462G06F16/2468G06F16/287G06F16/3349G06F16/358G06F19/32
Inventor 刘劲松王友柱饶江李广东李楠王东陈桂太
Owner JIANGSU TODAYSOFT TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products