Method for integrating and exchanging data on basis of unique identification

A technology of data integration and identification, applied in the information field, can solve the problems of no unified specification standard, no solution, and unique identifiers that cannot reveal any characteristics of documents

Active Publication Date: 2015-02-11
KARAMAY HONGYOU SOFTWARE
View PDF4 Cites 91 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] At present, multi-source heterogeneous data sharing mainly faces the following difficulties: ①Achievability refers to the difficulty for users to obtain data; due to the complexity and variety of multi-source heterogeneous data structures, the workload of data transmission is relatively large, and users can only To obtain multi-source heterogeneous data resources
In the past, a large amount of application data was developed for a single machine or a local area network, which resulted in a large number of legacy data resources that cannot be directly accessed on the Internet. How to access these resources on the Internet needs to be considered, and how to bridge multi-source heterogeneous data How to use the Internet protocol to transmit multi-source heterogeneous data; how users can find the system on the Internet and access legacy multi-source heterogeneous data through this system; the format of multi-source heterogeneous data obtained by users How, whether it can be applied directly or after conversion, there is no effective solution yet
②Interoperability refers to the difficulty for users to understand data; due to differences in product development and business strategies, different application data have clear boundaries, making it difficult for users to understand and use multi-source heterogeneous data
The key to multi-source heterogeneous data interoperability is to solve the heterogeneous problem of multi-source heterogeneous data, and data has syntax and semantics, how to discuss the problem of data heterogeneity in layers, and solve the problem of syntactic differences, semantic differences and fusion in the Internet environment difference problem, there is currently no effective solution
③ Ease of use refers to how easy it is for users to process multi-source heterogeneous data; many multi-source heterogeneous data products provide a secondary development platform for users to construct their own applications to meet various needs; applications in the Internet environment The construction method has also expanded from the single-machine single-task mode to the multi-task distributed computing mode. The potential user market cannot be monopolized by a few manufacturers, and it is difficult to provide services for specific applications. This requires an open data processing framework to provide data elements and services. elements, and then complete the task through the integration and application of the elements. There is no effective solution yet
[0008] The management of traditional data centers has the following defects: ① low utilization rate and poor flexibility; ② poor scalability; ③ chimney management; ④ high cost and increased energy consumption
[0011] Compared with foreign countries, the application of domestic unique identifiers is still in its infancy, and there are mainly the following defects: ①The role of domestic custom unique identifiers is only the unique number of digital objects, and the formulation and use of unique identifiers lack specifications. The unique identifiers used by literature manufacturers are different, and there is no unified standard; ②The unique identifiers only function within the scope of their respective resources, and once they are separated from their respective databases, their unique identifiers cannot reveal any characteristics of the literature;③ The application level of the unique identifier is relatively low, and its role is limited to the identification of internal digital objects. The analysis system and management mechanism related to the application of the unique identifier have not been established, and the resource sharing of various digital document manufacturers cannot be realized; ④ Unique There is no hierarchical relationship in the identification, and a unified identification method is adopted for all data, which cannot reflect the level and relationship between data
2) The materialization processing method is mainly to establish a central database and copy the data of each data source to the data center. Its advantage is that it is easy to obtain better integrated query performance, but it cannot flexibly adapt to changes in requirements
②Unified representation of data objects. Due to the differentiation of data structures, there are many ways to represent data objects, which makes the data integration process complex and diverse.
[0016] Currently there is no data integration and exchange method to effectively solve the above problems

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for integrating and exchanging data on basis of unique identification
  • Method for integrating and exchanging data on basis of unique identification
  • Method for integrating and exchanging data on basis of unique identification

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0082] In order to better understand the technical problems solved by the present invention and the technical solutions provided, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. The specific embodiments described here are only used to explain the implementation of the present invention, but not to limit the present invention.

[0083] The design of the present invention is mainly to solve data integration, conversion, fusion and sharing services between heterogeneous databases, such as figure 1 As shown, the purpose is to shield attribute information such as the underlying database type, data management mode, data access method, database physical structure, and the name of the database access entity.

[0084] In a preferred embodiment, figure 2 It exemplarily shows a flow chart of a unique identification-based data integration and exchange method; including:

[0085] ①Establish the data exchange ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the technical field of information, and particularly discloses a method for integrating and exchanging data on the basis of unique identification. The method includes building data element management models for business required to be integrated, exchanged and shared, and uniquely identifying each data item in each data element management model; mapping the identification in the data element management models with fields of multi-source heterogeneous databases; generating corresponding target SQL (structured query language) statements according to the fields corresponding to the identification; accessing the multi-source heterogeneous databases, executing the target SQL statements and returning result sets; fusing and processing the result sets by the aid of fusion algorithms. The method has the advantages that conflict examples in heterogeneous data sources can be effectively recognized and fused by the aid of the method, and accordingly data integration / data fusion effects can be improved on high level; the data exchange and integration accuracy can be effectively improved, and the data integration and exchange efficiency can be greatly enhanced.

Description

technical field [0001] The invention relates to the field of information technology, in particular to a method for data integration and exchange based on unique identification. Background technique [0002] Data sharing: It is the common goal of every information system construction. It can enable more people to use existing data resources more fully, reduce duplication of labor and corresponding costs such as data collection and data collection, and focus on developing new ones. application and system integration. [0003] In summary, traditional data sharing technologies have the following characteristics: ①Only support the mapping of basic geometric elements between data models, and generally only support simple point and line concepts; ②Traditional attribute (non-graphic) data is processed separately; ③Undefined metadata , even if metadata is defined, it is an application based on direct mapping of shared data, such as data resource catalogs and registration management ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/24532G06F16/25
Inventor 谭远华张建涛朱平夏东梅
Owner KARAMAY HONGYOU SOFTWARE
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products