Unlock instant, AI-driven research and patent intelligence for your innovation.

Data access method and device, equipment and storage medium

A data access and data technology, applied in the field of data processing, can solve the problems of low effective utilization of memory, incomplete loading of training data, waste of memory resources, etc., so as to avoid the waste of memory resources.

Pending Publication Date: 2021-07-02
TENCENT TECH (SHENZHEN) CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, as the scale of data used for model training becomes larger and larger, this approach gradually becomes unfeasible, because the training data of tens or even hundreds of GB may not be fully loaded into the memory
In addition, only a small part of the entire training data set is actually used in each step of model training, which makes the effective utilization of memory very low and wastes a lot of memory resources

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data access method and device, equipment and storage medium
  • Data access method and device, equipment and storage medium
  • Data access method and device, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] Embodiments of the present application are described below in conjunction with the accompanying drawings.

[0037] In some related technologies, the entire training data set is read into the memory, and then the training data in the training data set is randomly shuffled before each round of training, and a small part of the training data is sequentially read in the shuffled training data set. data entry model, see figure 1 shown.

[0038] Among them, e indicates that the current number of training rounds is the e-th round, and e starts counting from 0. When e=0, it means entering the first round of training, and E indicates the total number of training rounds. When e=0 (see S101 ) after the training starts, the entire training data set is read into the memory (see S102 ). If e

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a data access method and device, equipment and a storage medium, and the method comprises the steps: obtaining a training data set for training a target deep learning model, determining a target index file according to the training data set, and carrying out random access of each piece of training data in the training data set based on an index. on this basis, converting operations such as random sorting on training data into operations on index information, namely, in the process of training the target deep learning model in each round, randomly upsetting the sequence of the index information in the target index file to acquire a target index file in which the index information is arranged according to a second sequence and acquire target index file according to the second sequence; and reading target training data corresponding to the target index information from the training data set, and inputting the target training data into the target deep learning model for the round of training. Therefore, only the index information with small volume and a small amount of training data required by each step in training need to be stored in the memory, the use of the memory is remarkably reduced, and the high efficiency of reading the training data can be ensured.

Description

technical field [0001] The present application relates to the field of data processing, in particular to a data access method, device, device and storage medium. Background technique [0002] In recent years, deep learning technology has made great progress, and has achieved performance close to or even surpassing that of humans in many fields such as natural language processing, computer vision, and speech recognition. A deep learning model with high training accuracy and good effect is inseparable from massive training data. [0003] Currently, deep learning models are mainly trained based on the stochastic gradient descent algorithm. In most model training scenarios, it is necessary to first read the entire training data set into the memory, and then randomly shuffle the training data in the training data set before each round of training, and read sequentially in the shuffled training data set Feed a small portion of the training data into the model. [0004] However,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06N20/00
CPCG06N20/00
Inventor 唐晶廖阔
Owner TENCENT TECH (SHENZHEN) CO LTD