
Method and device for generating feature database

A feature library generation technology, applied in special data processing applications, instruments, electrical digital data processing, etc. It addresses the lack of publicly documented methods for generating feature libraries with random step sizes, and achieves reduced performance overhead, improved analysis efficiency, and the required sampling accuracy.

Active Publication Date: 2018-02-27
SHANGHAI DAMENG DATABASE

AI Technical Summary

Problems solved by technology

[0005] Because data in a database is unevenly distributed, database management system vendors mostly use the above-mentioned random sampling method to obtain feature records and generate feature libraries. However, currently available public information does not describe how to effectively use random step sizes to generate a feature library.


Examples

Embodiment 1

[0030] Figure 1a is a schematic flowchart of a method for generating a feature library according to Embodiment 1 of the present invention. The method may be executed by an apparatus for generating a feature library, implemented in hardware and/or software, which may generally be integrated in a server capable of providing a feature record collection service.

[0031] The method includes steps 110 to 170.

[0032] Step 110: Determine the target set size and the number of feature records in the target set according to the preset initial set size and sampling percentage.

[0033] Specifically, it can include the following two steps:

[0034] Calculate the product of the preset initial set size and the preset sampling percentage;

[0035] When the product of the initial set size and the sampling percentage is less than 1, expand the initial set size until the calculated product of the expanded ...
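
Step 110 can be illustrated with a minimal Python sketch. The source text is truncated before it states how the initial set size is expanded and when expansion stops, so the doubling rule and the stopping condition (product of at least 1) below are assumptions, as are the function name `target_set_parameters` and the reading of the sampling percentage as a value between 0 and 100.

```python
from fractions import Fraction
import math


def target_set_parameters(initial_set_size, sampling_percent):
    """Return (target_set_size, feature_record_count) for step 110.

    Hypothetical helper: the name, the doubling rule and the stopping
    condition are illustrative assumptions, not taken from the patent.
    """
    if initial_set_size <= 0:
        raise ValueError("initial_set_size must be positive")
    # Exact arithmetic avoids floating-point surprises such as 0.29 * 100.
    rate = Fraction(str(sampling_percent)) / 100
    if not 0 < rate <= 1:
        raise ValueError("sampling_percent must be in (0, 100]")

    size = initial_set_size
    # [0035]: while the product is below 1, expand the set size.
    while size * rate < 1:
        size *= 2  # assumed expansion rule
    # Assumed: the record count is the product rounded down.
    return size, math.floor(size * rate)


print(target_set_parameters(100, 5))  # (100, 5): product is already >= 1
print(target_set_parameters(10, 5))   # (20, 1): 10 * 5% = 0.5, so expand once
```

Under these assumptions, a set of 10 records sampled at 5% is expanded to 20 records so that at least one feature record can be taken from each set.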

Embodiment 2

[0057] Figure 2a is a schematic flowchart of a method for generating a feature library according to Embodiment 2 of the present invention. The method includes steps 210 to 290.

[0058] Step 210: Determine the target set size and the number of feature records in the target set according to the preset initial set size and sampling percentage.

[0059] The specific operations described for step 110 of the foregoing embodiment also apply to this step and are not repeated here.

[0060] Step 220: Using a preset random seed, generate as many random numbers as there are feature records in the target set, and save the generated random numbers as an initial random step size array. Each random number ranges from 0 to the target set size.

[0061] Step 230: Calculate the sum of the random numbers included in the initial random step size array.

[0062] Step 240: Determine whether the sum of the random numbers included in the initial random step size ...
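
Steps 220 and 230 can be sketched in Python as follows. This is a minimal illustration, not the patented implementation: the function name `initial_step_array`, the use of Python's `random.Random`, the example seed, and the choice to treat the range 0 to the target set size as inclusive at both ends are all assumptions.

```python
import random


def initial_step_array(seed, record_count, target_set_size):
    """Step 220: seeded generation of the initial random step size array.

    Hypothetical helper; the inclusive range [0, target_set_size] is an
    assumption, since the source does not say whether the ends are included.
    """
    rng = random.Random(seed)  # a preset seed makes the array reproducible
    return [rng.randint(0, target_set_size) for _ in range(record_count)]


# Example parameters (assumed): 5 feature records per set of 100 records.
steps = initial_step_array(seed=42, record_count=5, target_set_size=100)

# Step 230: sum of the random numbers in the initial array.
step_sum = sum(steps)
```

Because the generator is seeded, calling `initial_step_array` again with the same arguments returns the same array, which is what allows one array to be generated once and reused.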

Embodiment 3

[0096] Figure 3 is a schematic structural diagram of an apparatus for generating a feature library according to Embodiment 3 of the present invention. The apparatus includes: a parameter configuration module 310, an initial random step size array generation module 320, a target random step size array generation module 330 and a feature library generation module 340.

[0097] The parameter configuration module 310 is used to determine the target set size and the number of feature records in the target set according to the preset initial set size and sampling percentage; the initial random step size array generation module 320 is used to generate, with the preset random seed, as many random numbers as there are feature records in the target set, and to save the generated random numbers as an initial random step size array, each random number ranging from 0 to the target set size; the target rand...


Abstract

The invention provides a method and device for generating a feature library. The method comprises: determining the target element size and the number of feature records in the target element; using a preset random seed to generate as many random numbers as there are feature records in the target element, and storing them as the initial random step size array; modifying the initial random step size array at least twice to obtain a target random step size array, the modification being efficient; dividing the whole set of target records according to the target element size; acquiring the corresponding feature records in each element according to the target random step size array to form that element's sample library; and combining the sample libraries of all elements into the feature library of the whole set of target records. Because the target random step size array is generated once and applied to all elements, the method reduces the performance cost of acquiring feature records in a database management system, improves the efficiency of CBO analysis, and meets the requirements for sampling randomness and sampling rate accuracy.
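
The divide, sample and combine stages described in the abstract can be sketched in Python. This is a hedged illustration only: the function name `build_feature_library` is hypothetical, and the interpretation of each step as the gap to the next sampled record (so that running totals give positions inside an element) is an assumption, because the visible text does not define how a step is applied or how the array is modified.

```python
from itertools import accumulate


def build_feature_library(records, element_size, target_steps):
    """Apply one step array to every element and merge the samples.

    Assumption: each step is the gap to the next sampled record, so the
    running total minus 1 is a 0-based offset inside an element.
    """
    if element_size <= 0:
        raise ValueError("element_size must be positive")
    # Offsets are computed once and reused for every element.
    offsets = [total - 1 for total in accumulate(target_steps)]

    library = []
    for start in range(0, len(records), element_size):
        element = records[start:start + element_size]
        # The last element may be short, so out-of-range offsets are skipped.
        library.extend(element[o] for o in offsets if 0 <= o < len(element))
    return library


# 20 records, elements of 10 records, 3 feature records per element.
sample = build_feature_library(list(range(20)), 10, [3, 4, 3])
print(sample)  # [2, 6, 9, 12, 16, 19]
```

The offsets are derived once from the step array and reused for each element, which mirrors the abstract's point that a single array serves all elements.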

Description

Technical field

[0001] The embodiments of the present invention relate to data sampling technology for database management systems, and in particular to a method and device for generating a feature library.

Background

[0002] A database management system generates an execution plan for each SQL (Structured Query Language) statement input by the user. Most database management systems have introduced a cost-based optimizer (CBO): the database management system obtains all the information related to the execution plan, calculates and analyzes that information to obtain all feasible execution plans, and uses the execution plan with the least cost as the final execution plan, improving the execution efficiency of the database management system. The sampling, calculation and analysis of database records is the cornerstone of the CBO.

[0003] Computational analysis of all records can certainly...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F17/30
Inventor: 朱仲颖, 张钦, 张黎敏
Owner: SHANGHAI DAMENG DATABASE