Method and system for classification and pre-processing of big data under Internet environment

A classification preprocessing, Internet technology, applied in electronic digital data processing, special data processing applications, network data retrieval, etc., can solve problems such as the inability to meet the requirements of big data classification

Inactive Publication Date: 2016-10-26
INST OF SCI & TECHN INFORMATION OF CHINA
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, traditional storage and classification algorithms cannot meet the classification requirements of big data in the Internet application environment

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for classification and pre-processing of big data under Internet environment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] In order to further illustrate the purpose and advantages of the present invention, the present invention will be described below in conjunction with the accompanying drawings and specific embodiments.

[0024] The big data classification preprocessing method under the Internet environment in the present embodiment comprises the following steps:

[0025] Step 1, the data acquisition of the big data classification preprocessing method in the Internet environment.

[0026] Collect different types of network data in the Internet and perform dimension reduction processing.

[0027] Step 2. Preprocessing of big data classification preprocessing methods in the Internet environment to form data that can be directly processed by the system

[0028] The preprocessing includes noise removal.

[0029] The preprocessing system based on the above-mentioned big data classification preprocessing method in the Internet environment, its structural framework is as follows figure 1 As ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method and system for classification and pre-processing of big data and especially relates to the method for the classification and the pre-processing of the big data under an Internet environment. The method and the system belong to the field of data exaction. The method provided by the invention comprises the steps that multiple types of network data in the Internet is used to compose a complete pre-processing basic dataset, and the data is simplified through operations such as dimension reduction; and then, the different types of data in the dataset is analyzed and pre-processed respectively, and a dataset used for classification is obtained, so that a data preparation is made for further classification.

Description

technical field [0001] The invention relates to a big data classification preprocessing method and system, in particular to a big data classification preprocessing method under the Internet environment, which belongs to the field of data mining. Background technique [0002] With the continuous progress of modern society, especially the rapid development of the Internet, the number of various network resources presents the characteristics of huge quantity, variety and rapid change. The Internet has entered the era of big data. In addition to the huge amount of big data in the current Internet application environment, unstructured data accounts for an increasing proportion, and the number of resources increases linearly. Only 10% of the data in such a variety of network resources can really be used. Therefore, quickly locating valid data and realizing automatic classification of resources is one of the key methods to solve this problem. However, traditional storage and cla...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06K9/62
CPCG06F16/951G06F18/24
Inventor 张晓丹梁冰王莉白海燕
Owner INST OF SCI & TECHN INFORMATION OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products