Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

90 results about "Learning agent" patented technology

A deep learning agent is any autonomous or semi-autonomous AI-driven system that uses deep learning to perform and improve at its tasks. Systems (agents) that use deep learning include chatbots, self-driving cars, expert systems, facial recognition programs and robots.

Multi-machine collaborative air combat planning method and system based on deep reinforcement learning

ActiveCN112861442ASolve hard-to-converge problemsMake up for the shortcomings of poor exploratoryDesign optimisation/simulationNeural architecturesEngineeringNetwork model
According to the multi-aircraft cooperative air combat planning method and system based on deep reinforcement learning provided by the invention, a combat aircraft is regarded as an intelligent agent, a reinforcement learning agent model is constructed, and a network model is trained through a centralized training-distributed execution architecture, so that the defect that the exploratory performance of a network model is not strong due to low action distinction degree among different entities during multi-aircraft cooperation is overcome; and by embedding expert experience in the reward value, the problem that a large amount of expert experience support is needed in the prior art is solved. Through an experience sharing mechanism, all agents share one set of network parameters and experience playback library, and the problem that the strategy of a single intelligent agent is not only dependent on the feedback of the own strategy and the environment, but also influenced by the behaviors and cooperation relationships of other agents is solved. By increasing the sampling probability of the samples with large absolute values of the advantage values, the samples with extremely large or extremely small reward values can influence the training of the neural network, and the convergence speed of the algorithm is accelerated. The exploration capability of the intelligent agent is improved by adding the strategy entropy.
Owner:NAT UNIV OF DEFENSE TECH

Adjustable hardware aware pruning and mapping framework based on ReRAM neural network accelerator

The invention provides an adjustable hardware perception pruning and mapping framework based on a ReRAM neural network accelerator. The pruning and mapping framework comprises a DDPG agent and the ReRAM neural network accelerator. The DDPG agent is composed of a behavior decision module Actor and an evaluation module Critic, and the behavior decision module Actor is used for making a pruning decision for the neural network; the ReRAM neural network accelerator is used for mapping a model formed under a pruning decision generated by the behavior decision module Actor, and feeding back a performance parameter mapped by the model under the pruning decision to the evaluation module Critic as a signal; the performance parameters comprise energy consumption, delay and model accuracy of the simulator; the judgment module Critic updates a reward function value according to the fed back performance parameters and guides a pruning decision of the behavior decision module Actor in the next stage;according to the method, a pruning scheme which is most matched with hardware and user requirements and is most efficient is made by utilizing a reinforcement learning DDPG agent, so that the delay performance and the energy consumption performance on the hardware are improved while the accuracy is ensured.
Owner:ZHEJIANG LAB +1

Reinforcement learning-based Simulink software testing method

PendingCN114706762AWork around the constraints of shortagesAddressing the Shortcoming of Lack of GuidanceSoftware testing/debuggingNeural architecturesEngineeringMATLAB
The invention discloses a reinforcement learning-based Simulink software testing method. The reinforcement learning-based Simulink software testing method comprises two parts: a use case generation part and a use case testing part, the case generation part comprises the following steps: (1) selecting an initial model in a test case library, (2) inputting the state characteristics of the initial model into a reinforcement learning agent, (3) selecting the next action to be executed by the model in an action library by the agent according to the input, and (4) outputting an action index to the model, and executing the action by the model. A case test part: (5) carrying out compiling test on the model after action execution by MATLAB, (6) repairing compiling errors if compiling is not passed, (7) carrying out differential test on the model after compiling is passed, (8) judging whether a test result is equivalent in function or not, if the test result is equivalent, considering that no bug is found, if a difference exists, considering that the bug is found, and (9) based on the test result, judging that the bug is not found. And updating the reinforcement learning agent, so that the reinforcement learning agent tends to generate a model of the easily triggered bug.
Owner:DALIAN MARITIME UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products