Multi-source data granulation fusion and index classification layering processing method

A technology of multi-source data and processing methods, applied in the field of big data processing, can solve the problems of too long index names, redundant indicators, and difficulty in unified storage, and achieve the effect of reducing data redundancy, ensuring uniqueness, and reducing complexity

Active Publication Date: 2021-03-12
HEBEI INST OF SCI & TECH INFORMATION HEBEI INST OF SCI & TECH INNOVATION STRATEGY
View PDF12 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] 1) The diversity of multi-source data makes it difficult to deal with it uniformly
Since multi-source data comes from different sources, these data are of various types, and their index names and styles are different. Most of the existing technologies adopt different storage and expression schemes for different tabular data, and it is difficult to achieve unified storage and Express
[0005] 2) The redundancy of multi-source indicators makes it difficult to store them uniformly
Due to the diversity of two-dimensional spreadsheets, the same indicator name appears in different data tables at the same time. The existing technology uses separate storage methods, resulting in redundant indicators and cannot ensure the uniqueness of indicator names.
[0006] 3) The diversity of application scenarios of multi-source indicators makes it difficult to retrieve them uniformly
Because the existing processing technology does not store the hierarchical relationship and application scenarios between indicators and indicator classification descriptions, it is impossible to perform unified retrieval for different application scenarios of multi-source indicators
[0007] 4) The direct combination of XY (or YX) is difficult to accurately express the meaning of the data
Due to the defects in the storage method of the existing processing technology, the indicators are presented in the form of combined indicators. There are many defects such as too long indicator names, irregularities, inaccuracies, and poor readability. It is difficult to accurately express the meaning of the data.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-source data granulation fusion and index classification layering processing method
  • Multi-source data granulation fusion and index classification layering processing method
  • Multi-source data granulation fusion and index classification layering processing method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0076] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0077] The present invention proposes a multi-source data granulation fusion and index classification and layering processing method, by classifying a large number of diverse and redundant indexes in the table, and establishing a storage library, and then unifying various forms of tables into Standard form, identifying title area and value area, respectively extracting index units, index classification description units, application scenarios of index units and index classification description units, source library tables and other hidden attributes, numerical items and related attribute information, thus forming granules Then, by constructing an index index structure, according to the user's individual selection, the numerical items, the indicators and levels corresponding to the values, the index classification descriptions and level...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multi-source data granulation fusion and index classification layering processing method, which comprises the following steps of: classifying a large number of diversified redundant indexes in a table, establishing a storage library, unifying various forms of tables into a standard table, identifying a title area and a numerical value area, and respectively extracting related attribute information of numerical values to form granulated and standardized data; and then, constructing an index access structure, according to personalized selection of a user, presenting numerical items, indexes and hierarchies corresponding to numerical values, index classification description and hierarchies, application scenes of index and index classification description and other implicit attributes in a spreadsheet form, so that accurate expression of two-dimensional table data is realized. According to the method, the multi-source data can be normalized and stored, so that theindex name uniqueness, the hierarchical relationship multisource, the index classification description multisource and the application scene multisource are realized, and a basis is provided for flexible, diversified and rapid presentation of the multi-source data.

Description

technical field [0001] The invention belongs to the technical field of big data processing, in particular to a multi-source data granular fusion and index classification and layered processing method. Background technique [0002] In the process of big data governance, the sources of data are diverse, and tabular data files are one of the common types. Typical two-dimensional spreadsheet data includes business (survey) and yearbooks. For business (survey) data, due to its diversity, most existing technologies use the form of sub-database storage, separate query, and separate expression. Unified solution; For yearbook data, although the existing technology has basically realized the unified storage and joint query of yearbook data, most of them adopt the direct combination of X axis and Y axis, that is, XY (or YX) combination index method, and there are index names Too long, irregular, inaccurate, poor readability and many other defects. [0003] The reasons are mainly refl...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/22G06F16/28G06F16/215
CPCG06F16/215G06F16/2228G06F16/2282G06F16/285
Inventor 李银生聂永川张朝宗王红吴峰任雁刘淼张金龙陈娟张碟蒋倩男张聪高原高银珍毋鹏杰
Owner HEBEI INST OF SCI & TECH INFORMATION HEBEI INST OF SCI & TECH INNOVATION STRATEGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products