A multi-source data granular fusion and index classification and hierarchical processing method

A technology of multi-source data and processing methods, applied in the field of big data processing, can solve the problems of redundant multi-source indicators, too long and inaccurate indicator names, etc., and achieve the effect of clear indicator expression, various combination methods, and flexible presentation methods.

Active Publication Date: 2021-08-27
HEBEI INST OF SCI & TECH INFORMATION HEBEI INST OF SCI & TECH INNOVATION STRATEGY
View PDF12 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] 1) The diversity of multi-source data makes it difficult to deal with it uniformly
Since multi-source data comes from different sources, these data are of various types, and their index names and styles are different. Most of the existing technologies adopt different storage and expression schemes for different tabular data, and it is difficult to achieve unified storage and Express
[0005] 2) The redundancy of multi-source indicators makes it difficult to store them uniformly
Due to the diversity of two-dimensional spreadsheets, the same indicator name appears in different data tables at the same time. The existing technology uses separate storage methods, resulting in redundant indicators and cannot ensure the uniqueness of indicator names.
[0006] 3) The diversity of application scenarios of multi-source indicators makes it difficult to retrieve them uniformly
Because the existing processing technology does not store the hierarchical relationship and application scenarios between indicators and indicator classification descriptions, it is impossible to perform unified retrieval for different application scenarios of multi-source indicators
[0007] 4) The direct combination of XY (or YX) is difficult to accurately express the meaning of the data
Due to the defects in the storage method of the existing processing technology, the indicators are presented in the form of combined indicators. There are many defects such as too long indicator names, irregularities, inaccuracies, and poor readability. It is difficult to accurately express the meaning of the data.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A multi-source data granular fusion and index classification and hierarchical processing method
  • A multi-source data granular fusion and index classification and hierarchical processing method
  • A multi-source data granular fusion and index classification and hierarchical processing method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0076] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0077] The present invention proposes a multi-source data granulation fusion and index classification and layering processing method, by classifying a large number of diverse and redundant indexes in the table, and establishing a storage library, and then unifying various forms of tables into Standard form, identifying title area and value area, respectively extracting index units, index classification description units, application scenarios of index units and index classification description units, source library tables and other hidden attributes, numerical items and related attribute information, thus forming granules Then, by constructing an index index structure, according to the user's individual selection, the numerical items, the indicators and levels corresponding to the values, the index classification descriptions and level...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a multi-source data granulation fusion and index classification and layering processing method, by classifying a large number of diverse and redundant indexes in the table, and establishing a storage library, and then unifying various forms of tables into a standard table , Identify the title area and the value area, and extract the relevant attribute information of the value respectively, so as to form granular and standardized data; then, by constructing the index index structure, according to the user's personalized choice, the value item and the value are presented in the form of a spreadsheet Corresponding indicators and levels, indicator classification descriptions and levels, indicators and indicator classification description application scenarios and other hidden attributes, to achieve accurate expression of two-dimensional tabular data. The present invention can normalize and store multi-source data, so that the uniqueness of index names, multi-sources of hierarchical relationships, multi-sources of index classification descriptions, and multi-sources of application scenarios provide flexibility, diversity, and Rapid rendering provides the foundation.

Description

technical field [0001] The invention belongs to the technical field of big data processing, in particular to a multi-source data granular fusion and index classification and layered processing method. Background technique [0002] In the process of big data governance, the sources of data are diverse, and tabular data files are one of the common types. Typical two-dimensional spreadsheet data includes business (survey) and yearbooks. For business (survey) data, due to its diversity, most existing technologies use the form of sub-database storage, separate query, and separate expression. Unified solution; For yearbook data, although the existing technology has basically realized the unified storage and joint query of yearbook data, most of them adopt the direct combination of X axis and Y axis, that is, XY (or YX) combination index method, and there are index names Too long, irregular, inaccurate, poor readability and many other defects. [0003] The reasons are mainly refl...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/22G06F16/28G06F16/215
CPCG06F16/215G06F16/2228G06F16/2282G06F16/285
Inventor 李银生聂永川张朝宗王红吴峰任雁刘淼张金龙陈娟张碟蒋倩男张聪高原高银珍毋鹏杰
Owner HEBEI INST OF SCI & TECH INFORMATION HEBEI INST OF SCI & TECH INNOVATION STRATEGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products