Health insurance outpatient clinic big data extraction system and method based on the Hadoop platform

A big data extraction technology, applied in digital data processing, structured data retrieval, and special data processing applications. It addresses the problem that valuable medical data cannot be filtered, screened, or analyzed, and it aims to ensure data reliability and security, good scalability, and improved efficiency.

Inactive Publication Date: 2014-10-22
DAREWAY SOFTWARE
3 Cites, 80 Cited by
AI Technical Summary

Problems solved by technology

However, the Sqoop tool is limited to the mutual transfer of data in Hadoop and relational


Embodiment Construction

[0043] The invention is further described below with reference to the accompanying drawings and embodiments:

[0044] Here, OLAP (Online Analytical Processing) refers to online analytical processing;

[0045] HiveQL is a SQL-like query language that is compatible with most SQL syntax;

[0046] MapReduce is a programming model proposed by Google for parallel computation over large-scale data sets (larger than 1 TB).
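
To make the MapReduce definition above concrete, here is a minimal single-process sketch of the map, shuffle, and reduce phases. It is not code from the patent: the record layout (`patient_id`, `hospital_id`, `fee`), the sample values, and all function names are assumptions chosen for illustration, and a real Hadoop job would distribute these phases across the cluster.

```python
from collections import defaultdict

# Hypothetical outpatient records: (patient_id, hospital_id, fee).
# The field layout and values are illustrative assumptions, not patent data.
RECORDS = [
    ("p1", "h1", 120.0),
    ("p2", "h1", 80.0),
    ("p3", "h2", 45.5),
    ("p1", "h2", 30.0),
]


def map_phase(record):
    """Map: emit one (hospital_id, fee) pair per input record."""
    _patient_id, hospital_id, fee = record
    yield hospital_id, fee


def shuffle(pairs):
    """Shuffle: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(groups)


def reduce_phase(key, values):
    """Reduce: sum the fees collected for one hospital."""
    return key, sum(values)


def run_job(records):
    """Run map, shuffle, and reduce in sequence inside one process."""
    mapped = (pair for record in records for pair in map_phase(record))
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


totals = run_job(RECORDS)
print(totals)
```

Because each map call sees a single record and each reduce call sees a single key, both phases can be split across many machines, which is the property the definition relies on.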

[0047] A Hadoop-based medical insurance outpatient big data extraction system, as shown in Figure 1, consists of four parts: a data acquisition module, a data storage module, a data analysis and processing module, and a data display module.

[0048] The data acquisition module is mainly responsible for extracting medical insurance-related data from business data sources into HDFS. It uses the Flume log collection tool provided by Cloudera: the Flume agent uploads data from the data source, and the Flume collector is used to collect multiple ...
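
The agent and collector roles described above can be sketched as a small in-memory simulation. This does not use the Flume API or its configuration format: `Agent`, `Collector`, and the list that stands in for HDFS are hypothetical names. Because the source paragraph is truncated, the final hand-off to an HDFS-like sink is an assumption based on the statement that data is extracted into HDFS.

```python
from dataclasses import dataclass, field
from typing import Iterable, List, Tuple

Event = Tuple[str, str]  # (agent name, raw record line)


@dataclass
class Collector:
    """Stand-in for a Flume collector: gathers events from several agents."""

    buffer: List[Event] = field(default_factory=list)

    def receive(self, agent_name: str, line: str) -> None:
        self.buffer.append((agent_name, line))

    def flush(self, sink: List[Event]) -> int:
        """Move buffered events to the sink in arrival order; return the count."""
        count = len(self.buffer)
        sink.extend(self.buffer)
        self.buffer.clear()
        return count


@dataclass
class Agent:
    """Stand-in for a Flume agent: uploads lines from one data source."""

    name: str
    collector: Collector

    def upload(self, lines: Iterable[str]) -> int:
        sent = 0
        for line in lines:
            line = line.strip()
            if line:  # skip blank lines from the source
                self.collector.receive(self.name, line)
                sent += 1
        return sent


# A plain list plays the role of HDFS in this sketch (assumption).
hdfs_sink: List[Event] = []
collector = Collector()
clinic_a = Agent("clinic-a", collector)
clinic_b = Agent("clinic-b", collector)

clinic_a.upload(["visit,p1,120.0", "", "visit,p2,80.0"])
clinic_b.upload(["visit,p3,45.5"])
flushed = collector.flush(hdfs_sink)
print(flushed, hdfs_sink)
```

Keeping agents thin and funnelling their output through one collector means only the collector needs to know where the data finally lands.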

Abstract

The invention discloses a health insurance outpatient clinic big data extraction system and method based on the Hadoop platform. The system comprises a data acquisition module, a data storage module, a data cleaning module, a data analyzing and processing module, an HBase distributed database, and a data display module. The data acquisition module is connected with the data storage module, the data storage module is connected with the data analyzing and processing module through the data cleaning module, and a data query and analysis module is connected with both the HBase distributed database and the data display module. The advantages of the system and method are that a Hadoop cluster can be formed from thousands of inexpensive servers, so a distributed file system cluster is built on large numbers of cheap machines, the cost of data extraction and analysis is greatly reduced, and outpatient clinic big data can be processed in parallel. In addition, the replica storage strategy of HDFS helps guarantee the reliability and security of the data.
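
The reliability claim in the abstract rests on HDFS keeping several copies (replicas) of each data block on different machines; the default replication factor in HDFS is 3. The sketch below illustrates the idea only. It uses a simple round-robin placement, which is an assumption made for brevity (real HDFS placement is rack-aware), and the node and block names are hypothetical.

```python
from typing import Dict, Iterable, List


def place_replicas(
    block_ids: List[str], nodes: List[str], replication: int = 3
) -> Dict[str, List[str]]:
    """Assign each block to `replication` distinct nodes, round-robin.

    Simplified placement for illustration; not the HDFS placement policy.
    """
    if replication > len(nodes):
        raise ValueError("replication factor exceeds number of nodes")
    placement = {}
    for i, block in enumerate(block_ids):
        placement[block] = [
            nodes[(i + k) % len(nodes)] for k in range(replication)
        ]
    return placement


def surviving_replicas(
    placement: Dict[str, List[str]], failed_nodes: Iterable[str]
) -> Dict[str, List[str]]:
    """Return, per block, the replicas that remain after some nodes fail."""
    failed = set(failed_nodes)
    return {
        block: [node for node in holders if node not in failed]
        for block, holders in placement.items()
    }


nodes = ["n1", "n2", "n3", "n4"]
blocks = ["b0", "b1", "b2"]
placement = place_replicas(blocks, nodes)

# Even with two of the four nodes down, every block keeps at least one copy.
remaining = surviving_replicas(placement, ["n3", "n4"])
print(remaining)
```

With three replicas on distinct nodes, a block becomes unavailable only if all three of its holders fail at once, which is why the scheme tolerates failures of individual inexpensive servers.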

Description

Technical field

[0001] The invention relates to a system and method for extracting medical insurance outpatient big data based on the Hadoop platform.

Background

[0002] With the development of medical informatization and the nationwide expansion of the medical insurance system, the volume of medical insurance data has grown massively, and these data often require a long storage period. For example, the basic information of insured persons may need to be stored for 70 to 80 years, or even longer. As the population grows, the demand for storage space will increase further, and a traditional relational database may not be able to meet it. Moreover, these massive data must be analyzed and processed to obtain the useful information they contain. However, most traditional large-scale data processing uses distributed high-performance computing, grid computing, and similar technologies, which require ...

Application Information

IPC(8): G06F17/30
CPC: G06F16/27, G06F16/254
Inventor: 孔兰菊, 宋婷婷, 闫中敏, 李庆忠
Owner: DAREWAY SOFTWARE