Big data storage system

A big data storage technology, applied in the field of big data, that addresses the single-function limitation of traditional storage systems and the high speed and low value density of big data, improving analysis efficiency and making stored data convenient to invoke.

Pending Publication Date: 2020-06-23
TSINGHUA UNIV

AI Technical Summary

Problems solved by technology

[0002] With the explosive growth and accumulation of information, the era of big data is approaching. The basic characteristics of big data are large data volume, variety of types, low value density, high speed, and high timeliness. For big data, correlation is more important than causality; this is one of the conclusions of the big data era. In the face of massive amounts of information, how to analyze the correlation between certain types of inf


Examples

Embodiment Construction

[0027] In order to make the objects and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0028] As shown in Figure 1, an embodiment of the present invention provides a big data storage system comprising:

[0029] A target data acquisition module, which collects target data through multiple ports and sends the collected data to the data preprocessing module;

[0030] A data preprocessing module, which removes redundant data based on the ISODATA algorithm;

[0031] A data feature extraction module, which uses an attribute reduction algorithm based on attribute importance to extract feature data from the target data;

[0032] A data classification module, which, based on the naive Bayes classification algorithm, realizes the cl...
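The attribute reduction step in [0031] is not spelled out in the text available here. As a minimal sketch, assuming the common rough-set reading of "attribute importance" (the gain in how much of the data a set of attributes classifies consistently), a greedy reduction could look like the following. The function names, the dictionary row layout, and the sample attributes `port`, `fmt`, and `size` are illustrative assumptions, not taken from the patent.

```python
from collections import defaultdict


def dependency(rows, attrs, decision):
    """Fraction of rows whose decision value is fully determined by `attrs`.

    Rows are grouped by their values on `attrs`; a group counts as
    consistent when every row in it carries the same decision value.
    """
    decisions = defaultdict(set)
    sizes = defaultdict(int)
    for row in rows:
        key = tuple(row[a] for a in attrs)
        decisions[key].add(row[decision])
        sizes[key] += 1
    consistent = sum(sizes[k] for k, seen in decisions.items() if len(seen) == 1)
    return consistent / len(rows)


def reduce_attributes(rows, attrs, decision):
    """Greedy forward selection: repeatedly add the attribute that raises
    the dependency the most, until the selected subset matches the
    dependency of the full attribute set."""
    target = dependency(rows, attrs, decision)
    selected = []
    while dependency(rows, selected, decision) < target:
        remaining = [a for a in attrs if a not in selected]
        best = max(remaining, key=lambda a: dependency(rows, selected + [a], decision))
        selected.append(best)
    return selected


if __name__ == "__main__":
    # Hypothetical records: only `fmt` is needed to decide `label`.
    sample = [
        {"port": 1, "fmt": "csv", "size": "small", "label": "table"},
        {"port": 2, "fmt": "csv", "size": "large", "label": "table"},
        {"port": 1, "fmt": "json", "size": "small", "label": "doc"},
        {"port": 2, "fmt": "json", "size": "large", "label": "doc"},
    ]
    print(reduce_attributes(sample, ["port", "fmt", "size"], "label"))
```

The greedy search is a heuristic: it returns a subset that preserves the dependency of the full attribute set, but not necessarily the smallest such subset.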


Abstract

The invention discloses a big data storage system comprising: a target data acquisition module for acquiring target data through multiple ports and sending the acquired data to a data preprocessing module; the data preprocessing module, for removing redundant data based on the ISODATA algorithm; a data feature extraction module, for extracting feature data from the target data using an attribute reduction algorithm based on attribute importance; a data classification module, for classifying the target data according to the feature data based on a naive Bayes classification algorithm; and a data positioning module, for finding a suitable position in the database for the target data according to the classification result of the data classification module, finding similar data points for the target data, and establishing a relationship between the target data and the similar data points. The invention automatically organizes and stores big data while converting large and diverse volumes of data into usable data with informational and commercial value.
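The classification and positioning modules described in the abstract can be illustrated with a small sketch. It assumes categorical features, Laplace smoothing, one storage bucket per class, and a similarity measure that counts matching feature values; none of these details are stated in the source, and the names `NaiveBayes`, `PositionedStore`, and `similarity` are hypothetical.

```python
import math
from collections import Counter, defaultdict


class NaiveBayes:
    """Categorical naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, samples, labels):
        self.class_counts = Counter(labels)
        self.value_counts = defaultdict(Counter)  # (label, feature index) -> value counts
        self.values = defaultdict(set)            # feature index -> values seen in training
        for features, label in zip(samples, labels):
            for i, value in enumerate(features):
                self.value_counts[(label, i)][value] += 1
                self.values[i].add(value)
        return self

    def predict(self, features):
        total = sum(self.class_counts.values())

        def log_posterior(label):
            score = math.log(self.class_counts[label] / total)
            for i, value in enumerate(features):
                num = self.value_counts[(label, i)][value] + 1
                den = self.class_counts[label] + len(self.values[i])
                score += math.log(num / den)
            return score

        return max(self.class_counts, key=log_posterior)


def similarity(a, b):
    """Number of positions where two records hold the same value (assumed measure)."""
    return sum(1 for u, v in zip(a, b) if u == v)


class PositionedStore:
    """Places each record in the bucket of its predicted class and links it
    to the most similar record already stored in that bucket."""

    def __init__(self, classifier):
        self.classifier = classifier
        self.buckets = defaultdict(list)  # class label -> stored records
        self.links = []                   # (new record, similar record) pairs

    def insert(self, record):
        label = self.classifier.predict(record)
        bucket = self.buckets[label]
        neighbour = max(bucket, key=lambda r: similarity(r, record), default=None)
        if neighbour is not None:
            self.links.append((record, neighbour))
        bucket.append(record)
        return label, neighbour
```

In this sketch, `insert` returns the predicted class together with the linked neighbour, or `None` when the record is the first in its bucket. A production system would replace the linear scan over the bucket with an index suited to the chosen similarity measure.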

Description

Technical field

[0001] The invention relates to the field of big data, in particular to a big data storage system.

Background technique

[0002] With the explosive growth and accumulation of information, the era of big data is approaching. The basic characteristics of big data are large data volume, variety of types, low value density, high speed, and high timeliness. For big data, correlation is more important than causality; this is one of the conclusions of the big data era. In the face of massive amounts of information, analyzing the correlation between certain types of information, and the hidden value behind that information, can reveal the value of the data at a higher and deeper level. However, with such a massive amount of data, it is very difficult to analyze the relationships among the data quickly and accurately.

[0003] At present, the traditional data storage system generally has the disadvantages of single function and...


Application Information

IPC(8): G06F16/901, G06F16/906
CPC: G06F16/901, G06F16/906
Inventor: 林波
Owner: TSINGHUA UNIV