Data management service system based on large data

A data management and service system technology, applied in the field of computer and network technology applications, that addresses the difficulty and low accuracy of precise search, improving precision and recall while offering low cost and strong operability.

Active Publication Date: 2015-02-11
BEIHANG UNIV

AI Technical Summary

Problems solved by technology

Miscellaneous data is worthless for massive data, a



Embodiment Construction

[0061] As shown in Figure 1, the system as a whole is divided into three layers: the data management layer, the application layer, and the presentation layer.

[0062] The data management layer manages the physical units of data storage. Built on the HDFS of the Lingyun platform, it uses HBase as the storage system for localized data. During storage, the physical placement of data is managed uniformly by HBase (a characteristic of HBase itself). Because physical storage is transparent, a well-designed rowkey is needed to improve storage efficiency. The data management layer is also the base layer and directly supports the two layers above it; in other words, the efficiency of matching and searching for data requests depends closely on the underlying data design. Accordingly, based on HBase's own characteristics and the data standards, a unified rowkey naming rule and attribute naming rule standard have been formulated for data...
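The role of a unified rowkey naming rule can be illustrated with a minimal sketch. The excerpt does not show the patent's actual rule, so everything below is an assumption made for illustration: the field order (`source`, `category`, `record_id`), the `|` separator, and the hash-derived salt prefix (a common HBase practice for spreading writes across regions). The helper names `make_rowkey` and `scan_prefixes` are hypothetical.

```python
import hashlib
from typing import List

# Assumed conventions, NOT taken from the patent text.
SEPARATOR = "|"
SALT_BUCKETS = 16


def make_rowkey(source: str, category: str, record_id: str,
                buckets: int = SALT_BUCKETS) -> bytes:
    """Compose a rowkey of the form <salt>|<source>|<category>|<record_id>."""
    for part in (source, category, record_id):
        if SEPARATOR in part:
            raise ValueError(f"field {part!r} must not contain {SEPARATOR!r}")
    logical_key = SEPARATOR.join((source, category, record_id))
    # A stable hash keeps the same logical key in the same bucket on every call.
    digest = hashlib.md5(logical_key.encode("utf-8")).digest()
    salt = digest[0] % buckets
    return f"{salt:02d}{SEPARATOR}{logical_key}".encode("utf-8")


def scan_prefixes(source: str, category: str,
                  buckets: int = SALT_BUCKETS) -> List[bytes]:
    """Return one scan prefix per salt bucket for a given source and category."""
    return [
        f"{b:02d}{SEPARATOR}{source}{SEPARATOR}{category}{SEPARATOR}".encode("utf-8")
        for b in range(buckets)
    ]


if __name__ == "__main__":
    key = make_rowkey("sensor", "traffic", "0001")
    print(key)
    print(len(scan_prefixes("sensor", "traffic")), "prefixes to scan")
```

Under these assumptions, rows from one source and category share a predictable prefix within each bucket, so a request can be served by a small number of prefix scans rather than a full table scan. This is the sense in which the paragraph ties search efficiency to the underlying key design.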


Abstract

The invention relates to a data management service system based on big data. The system comprises a heterogeneous data normalized-description module, a data semantization module, a data storage performance module, a data logic-management module, a data scenarization and service matching module, and a data display module. This scene-based data management service system addresses three problems. First, storage: data volumes are now large, data varieties are many, and data comes from many sources in rich categories and formats, which makes storage difficult. Second, describing heterogeneous data: multi-source big data forms data islands; each data source has its own data structures and its own naming system, so even homogeneous data cannot interoperate. Third, data matching: because data categories are structured differently, precision and recall are low and query cost is high.

Description

Technical field

[0001] The invention relates to big data management services and belongs to the field of computer and network technology applications.

Background technique

[0002] According to IDC estimates, data has been growing at a rate of 50% per year, which means it doubles every two years (the "Moore's Law" of big data). In other words, the amount of data generated by humans in the last two years is equivalent to the total amount generated before that. It is estimated that by 2020 the world will hold a total of 3.5 billion GB of data, an increase of nearly 30 times over 2010. This is not simply a matter of more data; it is a completely new problem.

[0003] "Big data" is a data set with a particularly large volume and variety of data categories, one that cannot be captured, managed, and processed with traditional database tools. It is characterized by large data volumes, meaning large-scale data sets generally around 10 TB in size. ...
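The growth figures quoted in paragraph [0002] can be checked with a short compounding calculation. The sketch below uses only the numbers given there (50% annual growth, a roughly 30-fold increase from 2010 to 2020). The function names are hypothetical, and reading "nearly 30 times" as a factor of 30 over 10 years is an assumption.

```python
import math


def growth_factor(annual_rate: float, years: float) -> float:
    """Total multiplication factor after compounding annual_rate for years."""
    return (1.0 + annual_rate) ** years


def doubling_time(annual_rate: float) -> float:
    """Years needed for the quantity to double at the given annual rate."""
    return math.log(2.0) / math.log(1.0 + annual_rate)


def implied_annual_rate(total_factor: float, years: float) -> float:
    """Constant annual rate that yields total_factor over the given years."""
    return total_factor ** (1.0 / years) - 1.0


if __name__ == "__main__":
    print(f"factor after 2 years at 50%/yr: {growth_factor(0.50, 2):.2f}")
    print(f"doubling time at 50%/yr: {doubling_time(0.50):.2f} years")
    print(f"annual rate implied by 30x in 10 years: {implied_annual_rate(30, 10):.1%}")
```

At 50% per year the volume grows by a factor of 2.25 in two years, so "doubles every two years" is a rounded statement; a 30-fold increase over a decade corresponds to roughly 40% per year, which is the same order of magnitude.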


Application Information

IPC(8): G06F17/30
CPC: G06F3/067, G06F16/284
Inventor: 姜骁, 熊桂喜, 杜博文, 詹俊峰, 肖道锐
Owner: BEIHANG UNIV