Sample extraction method and device

A sample extraction technology based on samples and sample groups, applied in the field of data processing, that addresses the problem of time-consuming sample extraction.

Active Publication Date: 2020-08-11
TENCENT TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

[0008] The embodiments of the present invention provide a sample extraction method and device to at least solve the technical problem that prior-art sample extraction methods tend to take a long time.

Examples


Embodiment 1

[0025] According to an embodiment of the present invention, a method embodiment that can be executed by the device embodiment of the present application is provided. It should be noted that the steps shown in the flowcharts of the accompanying drawings can be executed in a computer system as a set of computer-executable instructions. Although a logical order is shown in the flowcharts, in some cases the steps shown or described may be performed in a different order.

[0026] According to an embodiment of the present invention, a sample extraction method is provided.

[0027] Optionally, in this embodiment, the above sample extraction method can be applied to the hardware environment formed by the terminal 102 and the server 104 shown in Figure 1. As shown in Figure 1, the terminal 102 is connected to the server 104 through a network or a data line, and the above-mentioned network includes but is not limited to: a wi...

Embodiment approach

[0038] The embodiment of the present invention also provides an implementation for determining, according to the position D(a_xy), whether to save the position D(a_xy) to the target sample set, specifically as follows:

[0039] First, the target array is traversed to compare the position D(a_xy) with the positions D(a_x'1). Before the target array is traversed, the positions D(a_x'1) are sorted so that the positions D(a_x'1) saved in the target array are ordered.

[0040] Secondly, when the comparison shows that the position D(a_xy) is less than the minimum value among the positions D(a_x'1), it is determined that the position D(a_xy) is to be saved to the target sample set. Since the sample sequence is ordered, a position D(a_xy) that is less than the minimum value among the positions D(a_x'1) means that the random sample drawn in the current round lies before every sample that has already been drawn and saved, then the random sample drawn in the current round must be a sam...
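
The comparison in paragraphs [0039] and [0040] can be sketched in Python as follows. This is only an illustration under stated assumptions, not the patented implementation: the function name `save_if_before_all` and the list `target_positions` are hypothetical, positions are plain integers, and an empty target array is treated as "save directly". Only the case visible in the text (the new position is smaller than every saved position) is handled; any other case is reported back to the caller as not decided here.

```python
def save_if_before_all(target_positions, position):
    """Save `position` if it is smaller than every saved position.

    `target_positions` is a list of saved positions kept in ascending
    order (hypothetical stand-in for the target array). Returns True
    when the position was saved, False when this case does not apply.
    """
    # Assumption: with nothing saved yet, the first draw is always saved.
    if not target_positions:
        target_positions.append(position)
        return True

    # Because the list is sorted, its first element is the minimum.
    if position < target_positions[0]:
        # Inserting at the front keeps the list in ascending order.
        target_positions.insert(0, position)
        return True

    # Other cases are outside the text shown here; leave them to the caller.
    return False


saved = [5, 9, 12]
print(save_if_before_all(saved, 3), saved)
print(save_if_before_all(saved, 7), saved)
```

Keeping the saved positions sorted is what makes the check cheap: the minimum is always the first element, so no scan of the whole array is needed for this case.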

Embodiment 2

[0054] According to an embodiment of the present invention, a sample extraction device for implementing the above sample extraction method is also provided. The device is mainly used to carry out the sample extraction method described in the above embodiment. The sample extraction device provided by the embodiment of the present invention is described in detail below:

[0055] Figure 3 is a schematic diagram of a sample extraction device according to an embodiment of the present invention. As shown in Figure 3, the sample extraction device mainly includes a first sorting unit 10, a first storage unit 20, a repeat execution unit 30, an extraction unit 40, a first acquisition unit 50, a first judgment unit 60, a processing unit 70 and a second storage unit 80, where:

[0056] The first sorting unit 10 is used to sort the total amount of samples to obtain a sample sequence including n sample ...

Abstract

The invention discloses a sample extraction method and apparatus. The sample extraction method comprises: sorting the full set of samples to obtain a sample sequence including n sample groups; storing the sample count len(A_i) of each sample group A_i and the position D(a_i1) of the sample a_i1 in the sample sequence; and calculating, through inverse computation from the positions of random samples in the sample sequence, the positions of the samples that actually need to be stored in a target sample set. The sample extraction method and apparatus solve the problem of relatively high time consumption in prior-art sample extraction, thereby reducing time consumption while lowering the repetition rate of the extracted samples.
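
The first two steps of the abstract (sorting into groups, then storing a count and a position per group) can be illustrated with a short Python sketch. Several things here are assumptions rather than details from the patent: the function name `build_group_index`, the use of a `key` function to decide which samples belong to the same group, 0-based positions, and the reading of a_i1 as the first sample of group A_i. The inverse computation step is not sketched because the text shown here does not describe it.

```python
from itertools import groupby


def build_group_index(samples, key):
    """Sort samples and record (group key, len(A_i), D(a_i1)) per group.

    Assumption: a group is a run of samples with equal `key` after
    sorting, and D(a_i1) is the 0-based position of its first sample.
    """
    ordered = sorted(samples, key=key)
    index = []
    position = 0
    for group_key, members in groupby(ordered, key=key):
        size = len(list(members))          # len(A_i)
        index.append((group_key, size, position))  # position is D(a_i1)
        position += size                   # next group starts right after
    return ordered, index


ordered, index = build_group_index(["b", "a", "b", "c", "a", "b"], key=lambda s: s)
print(ordered)
print(index)
```

Storing only the count and the first position per group keeps the index proportional to the number of groups n rather than to the total number of samples.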

Description

Technical field

[0001] The present invention relates to the field of data processing, in particular to a sample extraction method and device.

Background technique

[0002] To evaluate indicators of massive data (such as the quality of web page results), it is generally necessary to extract m samples from a total of N samples. For sample extraction, the prior art offers the following two solutions:

[0003] Solution 1: Use a distributed cluster (Hadoop) to divide the overall sample into n blocks and randomly select m/n samples from each block. Depending on the size of the cluster, the data size of each block is generally about N/1000, and the blocks are extracted in parallel to improve the extraction speed.

[0004] Solution 2: Extract directly without dividing into blocks. For each extraction, if the currently extracted sample is a new sample, the number of extraction results will be increased by 1 and then t...
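
As a rough illustration of the direct, block-free extraction described in paragraph [0004], the following Python sketch draws at random and counts a draw only when it yields a new sample. It covers only the part of the procedure that is visible in the text. The function name `draw_until_distinct`, the `rng` parameter, and the returned draw counter are hypothetical additions for illustration.

```python
import random


def draw_until_distinct(population, m, rng=None):
    """Draw with replacement until m distinct samples are collected.

    Returns the distinct samples and the total number of draws made,
    so the cost of repeated (wasted) draws is visible.
    """
    rng = rng or random.Random()
    if m > len(set(population)):
        raise ValueError("m exceeds the number of distinct samples")

    seen = set()
    result = []
    draws = 0
    while len(result) < m:
        candidate = rng.choice(population)
        draws += 1
        if candidate not in seen:
            # Only a new sample increases the number of results.
            seen.add(candidate)
            result.append(candidate)
    return result, draws


samples, draws = draw_until_distinct(list(range(1000)), 900, random.Random(0))
print(len(samples), draws)
```

As m approaches N, most draws hit samples that were already taken, so the number of draws grows well beyond m. This is the kind of time cost the invention aims to reduce.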

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F9/448
Inventor: 张壮
Owner: TENCENT TECH (SHENZHEN) CO LTD