Method for extracting data based on medical system crawler

A medical system and data extraction technology, applied in the field of medical image text recognition, that addresses the difficult, time-consuming, and cumbersome extraction of medical data. It achieves efficient extraction and sorting while saving human and material resources.

Pending Publication Date: 2020-04-28
KUNMING UNIV OF SCI & TECH +1
7 Cites, 2 Cited by

AI Technical Summary

Problems solved by technology

[0004] The invention provides a method for extracting data based on a medical system crawler, to solve the problem that medical data is difficult and time-consuming to extract.

Examples

Embodiment 1

[0023] Embodiment 1: As shown in Figures 1-3, a method for extracting data based on a medical system crawler comprises the following specific steps:

[0024] Step 1: Initialize the URL: use an HTTP library to send a request to the target medical data site (the hospital web page in the medical system) for medical data crawling. If the server responds, a Response is obtained from the hospital web page, containing the page's HTML (HyperText Markup Language) data and its JSON (lightweight data-interchange format) data;
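A minimal sketch of Step 1, assuming Python's standard `urllib` as the HTTP library (the patent does not name one). The function name `fetch`, the `opener` parameter, and the User-Agent string are illustrative assumptions, not part of the described method.

```python
import urllib.request
from urllib.error import URLError


def fetch(url, timeout=10.0, opener=urllib.request.urlopen):
    """Send one GET request; return (content_type, body_text), or None on no response.

    `opener` is a hypothetical injection point so the function can be
    exercised without a real hospital server.
    """
    request = urllib.request.Request(
        url, headers={"User-Agent": "medical-crawler-sketch/0.1"}  # assumed value
    )
    try:
        with opener(request, timeout=timeout) as response:
            content_type = response.headers.get("Content-Type", "")
            charset = response.headers.get_content_charset() or "utf-8"
            body = response.read().decode(charset, errors="replace")
            return content_type, body
    except (URLError, TimeoutError):
        # The server could not respond, so there is nothing to crawl.
        return None
```

The returned content type lets a caller decide whether the body should be treated as HTML or as JSON in the next step.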

[0025] Step 2: Analyze the URL queue: use regular expressions to parse the HTML data, and use the json module to parse the JSON data;
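Step 2 can be sketched as follows, assuming the URL queue is filled from `href` attributes found in the HTML. The pattern, the function names, and the sample markup are hypothetical; the actual hospital page structure is not given in the text.

```python
import json
import re

# Assumed pattern: collect the href of every anchor tag into the URL queue.
LINK_PATTERN = re.compile(r'<a\s[^>]*href="([^"]+)"', re.IGNORECASE)


def extract_links(html):
    """Return all link targets found in an HTML string, in document order."""
    return LINK_PATTERN.findall(html)


def parse_json(text):
    """Parse a JSON body; return None if the text is not valid JSON."""
    try:
        return json.loads(text)
    except json.JSONDecodeError:
        return None


sample_html = '<ul><li><a href="/record/1">A</a></li><li><a class="x" href="/record/2">B</a></li></ul>'
url_queue = extract_links(sample_html)
payload = parse_json('{"records": [{"id": 1}]}')
```

Regular expressions are adequate for narrow, predictable markup like this; irregular pages usually call for a real HTML parser.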

[0026] Step 3: Patient data crawling: an HTTP request is sent to the URL of each required piece of medical data, and the target medical data is crawled by matching on the patient's visit ID and d...
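A rough sketch of the matching in Step 3, following the abstract's description that records are matched by the patient's visit ID and doctor's order ID. The query parameter names `visit_id` and `order_id`, the record layout, and both function names are assumptions made for illustration.

```python
from urllib.parse import urlencode


def build_record_url(base_url, visit_id, order_id):
    """Build the URL for one piece of medical data (parameter names are assumed)."""
    query = urlencode({"visit_id": visit_id, "order_id": order_id})
    return f"{base_url}?{query}"


def match_records(records, visit_id, order_id):
    """Keep only the records whose visit ID and order ID both match."""
    return [
        record
        for record in records
        if record.get("visit_id") == visit_id and record.get("order_id") == order_id
    ]


records = [
    {"visit_id": "V001", "order_id": "O42", "item": "blood panel"},
    {"visit_id": "V001", "order_id": "O43", "item": "chest CT"},
    {"visit_id": "V002", "order_id": "O42", "item": "urinalysis"},
]
matched = match_records(records, "V001", "O42")
```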

Abstract

The invention relates to a method for extracting data based on a medical system crawler, and belongs to the technical field of medical image character recognition. The method comprises the following steps: first, initializing a URL in the medical system; analyzing the URL queue, parsing HTML data with regular expressions and JSON data with the json module; sending an HTTP request to the URL of each required piece of medical data, and crawling the target medical data by matching on the patient's visit ID and doctor's order ID; storing the crawled data in a medical database; checking the crawled patient data, and performing character recognition on PDF documents using the Baidu character recognition API; and performing word segmentation, text denoising, and key information extraction on the PDF document corpus processed by the Baidu character recognition API, then storing the result in the medical database. This solves the problems that medical data is difficult to extract and that extraction is time-consuming and tedious.
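The final stage of the pipeline (text denoising, key information extraction, and storage) might look roughly like the sketch below. It assumes the recognized text is already available as a string, skips the Baidu recognition call and word segmentation entirely, and uses an in-memory SQLite database as a stand-in for the medical database. The field labels, table name, and function names are all hypothetical.

```python
import re
import sqlite3


def denoise(text):
    """Drop control characters, collapse runs of spaces/tabs, remove empty lines."""
    text = re.sub(r"[\x00-\x08\x0b\x0c\x0e-\x1f]", "", text)
    lines = [re.sub(r"[ \t]+", " ", line).strip() for line in text.splitlines()]
    return "\n".join(line for line in lines if line)


# Assumed report layout: one "Label: value" pair per line.
FIELD_PATTERN = re.compile(r"^(Name|Diagnosis)\s*[::]\s*(.+)$", re.MULTILINE)


def extract_fields(text):
    """Return the labelled fields found in cleaned text as a dict."""
    return {label.lower(): value.strip() for label, value in FIELD_PATTERN.findall(text)}


def store(conn, visit_id, fields):
    """Insert one row per extracted field into a stand-in table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS report_fields (visit_id TEXT, field TEXT, value TEXT)"
    )
    conn.executemany(
        "INSERT INTO report_fields VALUES (?, ?, ?)",
        [(visit_id, field, value) for field, value in fields.items()],
    )
    conn.commit()


raw = "Name :  Zhang San\x07\n\n  Diagnosis:\tpneumonia  \n"
fields = extract_fields(denoise(raw))
conn = sqlite3.connect(":memory:")
store(conn, "V001", fields)
```

For Chinese-language reports, a word segmentation step would normally run between denoising and field extraction; it is left out here because it needs a tokenizer the text does not specify.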

Description

Technical field

[0001] The invention relates to a method for extracting data based on a crawler in a medical system, and belongs to the technical field of medical image character recognition.

Background

[0002] With the development of China's medical and health services, domestic hospitals have successively established systems such as HIS (hospital information system), PACS (medical image transmission and archiving system), and LIS (inspection information system). As these information systems have come into use, a long-neglected problem has gradually surfaced: data extraction. Data extraction has now become the bottleneck and weak point that limits the effectiveness of the various information systems, and its importance has become a focus of attention;

[0003] Data mining is a non-trivial process of extracting implicit, potentially valuable and ultimately understandable patterns from da...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/951, G06F16/9532, G16H50/70
CPC: G06F16/951, G06F16/9532, G16H50/70
Inventor: 马磊, 蒋卫丽, 陈振华, 王雄彬, 陈昊昱, 龙晨
Owner KUNMING UNIV OF SCI & TECH