Multi-source heterogeneous data fusion system and method

A multi-source heterogeneous data and fusion method technology, applied in the field of big data analysis in the aviation industry, can solve problems such as inconsistent data quality, uneven data quality, and large data volume

Inactive Publication Date: 2018-05-11
中国南方航空股份有限公司
View PDF6 Cites 28 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006](3) Large amount of data: Basically, the amount of data on each platform is very large;
[0007](4) Data quality is uneven: the data quality of different platforms is inconsistent

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-source heterogeneous data fusion system and method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0037] see figure 1 As shown, the present invention discloses a multi-source heterogeneous data fusion system, which is used for multi-source heterogeneous data fusion in the aviation industry, which includes a data source layer, a calculation layer, a data layer and an analysis layer.

[0038] The data source layer is used to acquire a collection of heterogeneous data sources, and the acquired data sources include structured data, unstructured data and real-time streaming data.

[0039] The computing layer is used for collecting, cleaning, storing and computing the data sources, and includes a memory computing framework, a stream computing framework, a data warehouse, a data mining engine, a distributed computing framework and a file system. The memory computing framework is used to implement memory-based data computing, such as the calculation of website visitor loss model; the stream computing framework is used for real-time reception and calculation of aviation PNR data; t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multi-source heterogeneous data fusion system which comprises a data source layer, a computing layer, a data layer and an analysis layer. The computing layer comprises a memory computing frame, a flow computing frame, a data warehouse, a data mining engine, a distributed type computing frame and a file system; the data layer comprises an SQL system, a NoSQL system and a caching system, and the analysis layer comprises a semantic layer and an OLAP engine. The invention further discloses a multi-source heterogeneous data fusion method which comprises the steps of S1, anairline company official website is transformed, and a user zipper table representing the user unique identity is obtained; S2, multi-source heterogeneous data are obtained, merged together and stored on a large data platform in a single user data manner; S3, application support is performed: the merged multi-source heterogeneous data are used for forming a user image; the formed user image is stored onto the large data platform in the NOSQL representation manner. According to the multi-source heterogeneous data fusion system and method, merging of the multi-source heterogeneous data is realized, and support is provided for scientific decision of the airline company.

Description

technical field [0001] The invention relates to big data analysis in the aviation industry, in particular to a multi-source heterogeneous data fusion system and method. Background technique [0002] In the aviation industry, due to the relatively early informatization construction, each airline now has its own passenger information database, electronic ticket database and departure record database. With the rise of the e-commerce industry, more and more passengers pass through the first Three-party OTA agencies, airlines' official websites or APPs purchase tickets. Due to different information construction times and different structures, airline companies generate a large amount of multi-source heterogeneous data. [0003] Multi-source heterogeneous data has the following characteristics: [0004] (1) Hybrid data: including structured and unstructured data; [0005] (2) Data discreteness: data is distributed in different systems or platforms; [0006] (3) Large amount of ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06Q30/02G06Q50/30
CPCG06Q30/0283G06Q50/30G06F16/2465G06F16/258
Inventor 彭向晖黄文强卢春邱文辉黄瑞辉
Owner 中国南方航空股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products