Unlock instant, AI-driven research and patent intelligence for your innovation.

Data sampling method and device

A data sampling and data technology, applied in the field of data processing, can solve the problems of random seed evaluation, difference in stability, and no consideration of the influence of outlier extreme values ​​on sampling results, so as to improve precision and reduce the influence of sampling precision Effect

Inactive Publication Date: 2018-12-18
AGRICULTURAL BANK OF CHINA
View PDF6 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The existing stratification method does not consider the impact of outlier extreme values ​​on the sampling results, especially for financial data, there will be abnormal value production due to system reasons, human negligence and other reasons, and the extraction of such abnormal values ​​will affect the sampling results. The precision of the result affects
In the process of systematic sampling after stratification, there is no evaluation of the rationality of the random seed, the stability of different seed extraction will be different, and the random seed also affects the accuracy of the sampling result

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data sampling method and device
  • Data sampling method and device
  • Data sampling method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0047] see figure 1 , this embodiment discloses a data sampling method, including:

[0048] S101: Determine the stratification type of the data to be sampled, delete the outlier extreme value in each stratification, and the remaining data in each stratification form the target data;

[0049] The data to be sampled is data that needs to be sampled, and may be any type of data.

[0050] In order to improve the sampling accuracy as much as possible and reduce t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present application provides a data sampling method, which comprises the steps of determining a hierarchical type of data to be sampled, deleting an outlier extreme value in each hierarchical type, and composing target data from remaining data of each hierarchy; determining a sampling interval according to the number of target data and the number of target samples; dividing each hierarchical data in the target data into a plurality of cells according to the sampling interval; determining a plurality of random seeds, respectively sampling in each hierarchy of the target data according to asystematic sampling method according to each random seed and the sampling interval, and obtaining a plurality of sets of sample data; selecting a set of sample data as the final target sample data inthe plurality of sets of sample data according to a preset rule. The outlier extreme value is processed after stratification to effectively shield the influence of the outlier extreme value on the sampling precision, and the stratified sampling, the systematic sampling and the whole cluster sampling are combined and applied at the same time, so that the sampling precision is improved.

Description

technical field [0001] The present invention relates to the technical field of data processing, and more specifically, to a data sampling method and device. Background technique [0002] In the case of limited system resources and a certain time, the computer can only process a limited amount of data, so the advantages of sampling are reflected. Through the sampling of data, it not only meets the requirements of time performance, but also represents the overall data through sample data. To achieve the effect of seeing the whole leopard at a glance. However, sampling also has certain limitations and is characterized by instability. Therefore, how to extract enough good sample data to make the sample data fully representative and random, and what method to use for sampling needs in-depth research. [0003] Stratified sampling is applicable when the population is known to be composed of several parts with obvious differences. In order to make the sample more objectively refle...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30G06Q40/00
CPCG06Q40/00
Inventor 高晓鹏李乾张怡康
Owner AGRICULTURAL BANK OF CHINA