Unlock instant, AI-driven research and patent intelligence for your innovation.

Data processing method and device, electronic equipment and storage medium

A technology for data processing and state data, applied in the fields of electronic equipment and storage media, data processing methods, and devices, and can solve the problems that the model environment is harmful, time-consuming, and expensive.

Pending Publication Date: 2022-03-25
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, training a reinforcement learning model is time-consuming, because the reinforcement learning model needs to interact with the environment a lot during the training process to determine the matching actions, and these costs are high
At the same time, the simple exploration strategy slows down the learning speed of the model and even causes the model to make actions that are harmful to the environment

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data processing method and device, electronic equipment and storage medium
  • Data processing method and device, electronic equipment and storage medium
  • Data processing method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] Exemplary embodiments of the present disclosure are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present disclosure to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the disclosure. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0028] The data processing method, device, electronic device, and storage medium of the embodiments of the present disclosure are described below with reference to the accompanying drawings.

[0029] figure 1 It is a schematic flowchart of a data processing method provided by an embodiment of the present disclosure.

[0030] The execution subject of the data processin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a data processing method and device, electronic equipment and a storage medium, and relates to the technical field of artificial intelligence, in particular to the technical field of deep learning. According to the scheme, the method comprises the steps of obtaining state data of an environment where a target object is located, inputting the state data into a strategy network of an enhancement model, sampling from an action set to obtain multiple actions corresponding to the state data, inputting the multiple actions and the state data into a guide network of the enhancement model, and outputting a target matching degree between each action and the state data, and according to the target matching degree of each action, determining a target action of the target object from the plurality of actions obtained by sampling. The subsequent processing efficiency is improved by sampling the multiple actions from the action set output by the strategy network, the state data and the multiple actions are combined to be input into the guide network, the target matching degree corresponding to each action is calculated, the target matching degree indicates the higher relevance between each action and the state data, and the action matching efficiency is improved. Therefore, the accuracy of target action determination is improved.

Description

technical field [0001] The present disclosure relates to the technical field of artificial intelligence, in particular to the technical field of deep learning, and in particular to a data processing method, device, electronic device, and storage medium. Background technique [0002] In recent years, reinforcement learning has been applied in many fields, such as games, robots, recommendation systems, etc. However, training a reinforcement learning model is time-consuming, because the reinforcement learning model needs to interact with the environment a lot during the training process to determine the matching actions, and these costs are high. At the same time, the simple exploration strategy slows down the learning speed of the model and even causes the model to make actions that are harmful to the environment. Therefore, how to improve the accuracy of action determination is a technical problem to be solved urgently. Contents of the invention [0003] The disclosure pr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N3/04G06N3/08G06V10/74G06V10/80G06V10/82
CPCG06N3/08G06N3/045G06F18/22G06F18/253
Inventor 李旭黄泰然孙明明李平
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD