49 results about "Inverse reinforcement learning" patented technology

Automatic driving system and method based on relative-entropy deep and inverse reinforcement learning

The invention relates to an automatic driving system based on relative-entropy deep inverse reinforcement learning. The system comprises a client, a driving basic data collection sub-system and a storage module. The client displays a driving strategy; the driving basic data collection sub-system collects road information; the storage module is connected to both the client and the driving basic data collection sub-system and stores the collected road information. In operation, the driving basic data collection sub-system collects the road information and transmits it to the client and the storage module. The storage module receives the road information, stores each continuous segment of road information as a historical route, analyzes the historical routes to simulate a driving strategy, and transmits the driving strategy to the client so that a user can select it. The client receives the road information and implements automatic driving according to the user's selection. Because the system adopts the relative-entropy deep inverse reinforcement learning algorithm, automatic driving is achieved under model-free conditions.
Owner:POLIXIR TECH LTD

Vehicle following system and method for simulating driving style based on deep inverse reinforcement learning

The invention belongs to the technical field of intelligent driving, and discloses a vehicle following system and method for simulating a driving style based on deep inverse reinforcement learning. The vehicle following system comprises a millimeter-wave radar, which collects the longitudinal distance, lateral distance, relative speed and azimuth angle between the vehicle and the vehicle in front; a vehicle speed collection device, which collects the speed of the vehicle; and a vehicle-mounted industrial personal computer. A vehicle following data processor in the vehicle-mounted industrial personal computer processes the information acquired by the millimeter-wave radar and the vehicle speed collection device, extracts the vehicle following data fragments required for model training, and trains on those fragments to obtain a vehicle following strategy model. The vehicle following system is simple in structure. A reward function is learned from a driver's historical vehicle following data through the deep inverse reinforcement learning method, and the driver's vehicle following strategy is then solved from that reward function with a reinforcement learning method. The resulting vehicle following model can simulate the driving styles of different drivers, captures each driver's preferences during vehicle following, and generates human-like vehicle following behavior.
Owner:CHANGAN UNIV

Mobile robot navigation method and device, computer equipment and storage medium

The invention relates to a mobile robot navigation method and device, computer equipment and a storage medium. The method comprises the following steps: extracting features of a target point image and a scene image through a feature extraction module to obtain state features of the current state; solving a preset expert trajectory through an inverse reinforcement learning module to obtain a reward function; outputting a predicted action of the robot through the policy network of an A3C reinforcement learning network and obtaining a predicted value function through the value network, and, after the action leads to a next state, calculating a TD error from the current state, the next state and the action to obtain a first loss function; obtaining an expert reward value from the state features and a weight parameter, and obtaining a second loss function from the network reward value and the expert reward value; and training the A3C reinforcement learning network and a reward network to obtain a trained mobile robot navigation model for navigation. According to the invention, the accuracy and efficiency of indoor robot navigation can be improved, and the generalization ability is high.
Owner:NAT UNIV OF DEFENSE TECH
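
The two loss terms named in the abstract above can be illustrated with a minimal Python sketch. The abstract does not give the exact formulas, so everything here is an assumption: a one-step TD error with discount factor gamma, a squared-error form for both losses, and a linear expert reward (weights dotted with state features). All function names are hypothetical.

```python
def td_error(reward, value_s, value_next, gamma=0.99, done=False):
    """One-step TD error: r + gamma * V(s') - V(s); no bootstrap at episode end."""
    target = reward + (0.0 if done else gamma * value_next)
    return target - value_s


def first_loss(td):
    """Assumed value loss: half the squared TD error."""
    return 0.5 * td ** 2


def expert_reward(features, weights):
    """Assumed linear expert reward: dot product of weights and state features."""
    return sum(w * f for w, f in zip(weights, features))


def second_loss(network_reward, expert_r):
    """Assumed reward loss: squared gap between network and expert reward."""
    return (network_reward - expert_r) ** 2


# Usage with made-up numbers: one transition and a two-dimensional feature vector.
delta = td_error(reward=1.0, value_s=0.5, value_next=2.0, gamma=0.5)
total = first_loss(delta) + second_loss(1.5, expert_reward([1.0, 2.0], [0.5, 0.25]))
```

In an actual A3C setup the values and the network reward would come from neural networks and the losses would be averaged over a batch; the sketch only shows how the quantities relate to each other.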

Automatic driving method and device, electronic equipment and storage medium

The invention discloses an automatic driving method and device, electronic equipment and a computer readable storage medium. The method comprises the following steps: acquiring multi-modal perception information and driving behavior data of a driving environment; extracting multi-scale features of the multi-modal perception information with a convolutional neural network, and fusing the multi-scale features with a Transformer to obtain fused feature data; combining the fused feature data and the driving behavior data into expert demonstration data, and modeling the automatic driving process as a Markov decision process; obtaining a reward function of the automatic driving process from the expert demonstration data with maximum entropy inverse reinforcement learning, and optimizing a driving strategy model with deep reinforcement learning; and outputting the optimized driving strategy model to the client, so that the client realizes automatic driving with the optimized driving strategy model according to the environmental perception information. The reliability of the automatic driving perception data is ensured, and the reasonableness of decision planning in the automatic driving process is improved.
Owner:INSPUR BEIJING ELECTRONICS INFORMATION IND
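
To make the maximum entropy inverse reinforcement learning step in the abstract above concrete, here is a minimal tabular sketch in Python. It is not the patented method: the four-state chain environment, the one-hot state features, the single demonstration, and all names are hypothetical, and the deep reinforcement learning stage is left out. The sketch follows the standard recipe of a soft backward pass, a forward pass for expected state visits, and a gradient step that matches expert and model visit counts.

```python
import math

N_STATES = 4        # hypothetical chain of states 0..3
ACTIONS = (-1, +1)  # move left or right; moves are clamped at both ends


def step(s, a):
    """Deterministic transition on the chain."""
    return min(max(s + a, 0), N_STATES - 1)


def soft_policy(reward, horizon):
    """Backward pass: time-indexed policy with pi_t(a|s) proportional to exp(Q_t(s, a))."""
    v = [0.0] * N_STATES  # value after the final step
    policy = []
    for _ in range(horizon - 1):
        new_v, pi_t = [], []
        for s in range(N_STATES):
            q = [reward[step(s, a)] + v[step(s, a)] for a in ACTIONS]
            m = max(q)
            lse = m + math.log(sum(math.exp(x - m) for x in q))  # soft maximum
            new_v.append(lse)
            pi_t.append([math.exp(x - lse) for x in q])
        v = new_v
        policy.append(pi_t)
    policy.reverse()  # policy[0] now applies at time step 0
    return policy


def expected_visits(policy, start):
    """Forward pass: expected number of visits to each state under the policy."""
    d = [0.0] * N_STATES
    d[start] = 1.0
    visits = d[:]
    for pi_t in policy:
        nxt = [0.0] * N_STATES
        for s in range(N_STATES):
            for i, a in enumerate(ACTIONS):
                nxt[step(s, a)] += d[s] * pi_t[s][i]
        d = nxt
        visits = [v + x for v, x in zip(visits, d)]
    return visits


def maxent_irl(demos, lr=0.05, iters=300):
    """Gradient ascent on the demonstration log-likelihood.

    Assumes all demonstrations share the same start state and length.
    With one-hot features the gradient is expert visits minus model visits.
    """
    horizon = len(demos[0])
    expert = [0.0] * N_STATES
    for traj in demos:
        for s in traj:
            expert[s] += 1.0 / len(demos)
    reward = [0.0] * N_STATES
    for _ in range(iters):
        model = expected_visits(soft_policy(reward, horizon), demos[0][0])
        reward = [r + lr * (e - m) for r, e, m in zip(reward, expert, model)]
    return reward


# Hypothetical expert: drive right from state 0 and stay at state 3.
demos = [[0, 1, 2, 3, 3, 3]]
learned = maxent_irl(demos)
```

In this toy setting the recovered reward should rank state 3, where the expert spends most of its time, above the other states. In the pipeline described by the abstract, the state features would instead come from the fused perception data, and the learned reward would be handed to a deep reinforcement learning algorithm to optimize the driving strategy model.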