Cleaning method of health data for atmospheric pollution health risk assessment

A health data cleaning technique for risk assessment, applied in data processing, instruments, computing, etc. It addresses ambiguous and inaccurate registered-residence and current-address information, and is described as scientifically conceived, highly operable, and of good practical value.

Inactive Publication Date: 2016-02-03
中国疾病预防控制中心环境与健康相关产品安全所
Cites: 1; Cited by: 2

AI Technical Summary

Problems solved by technology

[0006] To address the fuzzy and inaccurate registered-residence and current-address information in patient medical data collected from multiple medical institutions in the same area for air pollution health risk assessment, an automatic health data cleaning method based on decision tree learning is proposed. It distinguishes local permanent residents from patients who are only temporarily in the area for medical treatment.

Examples


Embodiment Construction

[0025] As shown in Figure 1, the present invention is a health data cleaning method for air pollution health risk assessment based on decision tree learning. It comprises five major steps:

[0026] Step 1: Draw a small sample from the health data and, based on the registered residence and current address recorded for each patient in the sample, manually judge whether the patient is a local permanent resident or a temporary visitor seeking medical treatment.

[0027] Step 2: Rule design for building the decision tree. The following six rules are designed.

[0028] 1. Search all the health data for the area for other records that contain the fuzzy, incomplete registered-residence and current-address information of the current record but whose own current residential address is complete, and determine whether the records matched in this way belong to the local permanent population or temporarily co...
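The matching part of rule 1 can be sketched as below. This is only an illustration under stated assumptions: the rule text is cut off in this excerpt, so the `Record` fields, the substring test used for matching, and the way a result is derived from the matches are all hypothetical, not the patent's definition.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Record:
    """Hypothetical record layout; the source does not specify fields."""
    patient_id: str
    address: str                     # free-text registered residence / current address
    is_complete: bool                # assumed flag: the address is judged complete
    is_local: Optional[bool] = None  # assumed known status for complete records


def find_matching_complete_records(current: Record, records: List[Record]) -> List[Record]:
    """Return other records with a complete address that contains the
    current record's (possibly fragmentary) address text."""
    fragment = current.address.strip()
    if not fragment:
        return []
    return [
        r for r in records
        if r is not current and r.is_complete and fragment in r.address
    ]


def rule1(current: Record, records: List[Record]) -> Optional[bool]:
    """Assumed outcome: True (local) or False (temporary) when all matched
    records agree, otherwise None (no match or conflicting matches)."""
    matches = find_matching_complete_records(current, records)
    statuses = {r.is_local for r in matches if r.is_local is not None}
    if len(statuses) == 1:
        return statuses.pop()
    return None
```

Substring matching is the simplest stand-in for "contains the fuzzy information"; a real implementation would likely need address normalization, which the excerpt does not describe.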

Abstract

The present invention provides a method for cleaning health data for atmospheric pollution health risk assessment. The method comprises five steps: (1) extracting a small sample from the health data and manually determining, from the registered residence and current address recorded for each patient in the sample, whether the patient is a local permanent resident or a temporary visitor seeking treatment; (2) designing six rules and constructing a decision tree; (3) applying the six rules to the sample data set and combining the results with the manual labels from step 1 to obtain the training data set for the decision tree; (4) constructing the decision tree from the training data set; and (5) applying the six rules in turn to the data to be cleaned, feeding the six results into the decision tree trained in step 4, and obtaining the final determination. The method is described as scientifically conceived, simple to compute, and widely applicable, with practical value for health data cleaning in atmospheric pollution health risk assessment and application prospects in routine public health and environmental health work.
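Steps 3 to 5 of the abstract can be sketched roughly as follows. The source says only "decision tree learning", so the ID3-style information-gain tree here is an assumption, and because the six rules are not disclosed in this excerpt, each sample is represented by six placeholder boolean rule outputs. The sample rows and labels are made up for illustration.

```python
import math
from collections import Counter

# Label convention (assumed): True = local permanent resident,
# False = temporary visitor seeking treatment.


def entropy(labels):
    """Shannon entropy of a list of labels, in bits."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())


def majority(labels):
    """Most common label in the list."""
    return Counter(labels).most_common(1)[0][0]


def build_tree(rows, labels, features):
    """ID3-style tree over boolean features (step 4)."""
    if len(set(labels)) == 1:
        return labels[0]
    if not features:
        return majority(labels)

    def gain(f):
        remainder = 0.0
        for value in (True, False):
            subset = [l for r, l in zip(rows, labels) if r[f] == value]
            if subset:
                remainder += len(subset) / len(labels) * entropy(subset)
        return entropy(labels) - remainder

    best = max(features, key=gain)
    rest = [f for f in features if f != best]
    node = {"feature": best, "default": majority(labels), "children": {}}
    for value in (True, False):
        idx = [i for i, r in enumerate(rows) if r[best] == value]
        if idx:
            node["children"][value] = build_tree(
                [rows[i] for i in idx], [labels[i] for i in idx], rest
            )
    return node


def classify(tree, row):
    """Feed six rule results into the trained tree (step 5)."""
    while isinstance(tree, dict):
        tree = tree["children"].get(row[tree["feature"]], tree["default"])
    return tree


# Step 3: six (placeholder) rule outputs per sampled record, paired with
# the manual labels from step 1. Illustrative values only.
train_rows = [
    (True, True, False, True, False, False),
    (True, False, True, False, False, True),
    (False, True, False, False, True, False),
    (False, False, True, True, False, True),
]
train_labels = [True, True, False, False]

tree = build_tree(train_rows, train_labels, list(range(6)))
```

A new record would then be cleaned by computing its six rule results and calling `classify(tree, results)`. With real data, a library implementation and a held-out check of the manual labels would be a sensible alternative to this hand-rolled tree.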

Description

Technical field

[0001] The invention relates to a health data cleaning method for air pollution health risk assessment. The method cleans health data from different medical institutions in the same city and, from data with differing structures and vague descriptions, extracts the dataset most likely to belong to the city's permanent population, which is then used to analyze and evaluate the relationship between air pollution and health risks in that city. It belongs to the technical field of public health and environmental health.

Background

[0002] Because meteorological characteristics and major air pollutants differ greatly between regions, the health impact of air pollution in a specific city or region falls mainly on the permanent population of that region. Therefore, the analysis and assessment of air pollution health risks in a specific city or region needs to be based on the health data of the resident po...

Application Information

IPC(8): G06Q50/22
Inventors: 孙庆华, 李湉湉
Owner: 中国疾病预防控制中心环境与健康相关产品安全所